COVID-19 Word Cloud

COVID-19 Dataset

We use version 4 of the COVID-19 Twitter dataset, which contains tweets from 1st January 2020 to 5th April 2020. The dataset is regularly updated, and collects tweets for several languages (English, French, Spanish and German) based on COVID-19 keywords.

We focus on analysing tweets with at least 10 reactions for our rumour detection task. Thus, we first filter out the source tweets with less than 10 reactions.