Analysis of COVID-related symptoms from a database of Italian tweets
| Contribuinte(s) |
Remondini, Daniel Durazzi, Francesco |
|---|---|
| Data(s) |
03/12/2021
|
| Resumo |
Twitter is a highly popular social media which on one hand allows information transmission in real time and on the other hand represents a source of open access homogeneous text data. We propose an analysis of the most common self-reported COVID symptoms from a dataset of Italian tweets to investigate the evolution of the pandemic in Italy from the end of September 2020 to the end of January 2021. After manually filtering tweets actually describing COVID symptoms from the database - which contains words related to fever, cough and sore throat - we discuss usefulness of such filtering. We then compare our time series with the daily data of new hospitalisations in Italy, with the aim of building a simple linear regression model that accounts for the delay which is observed from the tweets mentioning individual symptoms to new hospitalisations. We discuss both the results and limitations of linear regression given that our data suggests that the relationship between time series of symptoms tweets and of new hospitalisations changes towards the end of the acquisition. |
| Formato |
application/pdf |
| Identificador |
http://amslaurea.unibo.it/24671/1/Damiani_Kim_Maria_tesi.pdf Damiani, Kim Maria (2021) Analysis of COVID-related symptoms from a database of Italian tweets. [Laurea], Università di Bologna, Corso di Studio in Fisica [L-DM270] <http://amslaurea.unibo.it/view/cds/CDS9244/> |
| Idioma(s) |
en |
| Publicador |
Alma Mater Studiorum - Università di Bologna |
| Relação |
http://amslaurea.unibo.it/24671/ |
| Direitos |
cc_by_sa4 info:eu-repo/semantics/embargoedAccess end:2022-11-19 |
| Palavras-Chave | #machine learning,twitter,language processing,COVID #Fisica [L-DM270] |
| Tipo |
PeerReviewed info:eu-repo/semantics/bachelorThesis |