eprintid: 3519
rev_number: 14
eprint_status: archive
userid: 6
dir: disk0/00/00/35/19
datestamp: 2016-07-19 09:42:52
lastmod: 2016-07-20 07:36:58
status_changed: 2016-07-19 09:42:52
type: monograph
metadata_visibility: show
creators_name: Biorci, Grazia
creators_name: Emina, Antonella
creators_name: Puliga, Michelangelo
creators_name: Sella, Lisa
creators_name: Vivaldo, Gianna
creators_id:
creators_id:
creators_id: michelangelo.puliga@imtlucca.it
creators_id:
creators_id: gianna.vivaldo@imtlucca.it
title: Tweet-tales: moods of socio-economic crisis?
ispublished: pub
subjects: HB
subjects: HM
divisions: EIC
full_text_status: public
monograph_type: imt_eic_working_paper
keywords: Big data, social media, Twitter, hierarchical clustering, unemployment. JEL codes: C4; C49; C55; C81; E24
abstract: The widespread adoption of highly interactive social media such as Twitter, Facebook and other platforms allows users to communicate moods and opinions to their social network. These platforms represent an unprecedented source of information about human habits and socio-economic interactions. Several recent studies have started to exploit the potential of these big data as fingerprints of economic and social interactions. The present analysis explores the informative power of indicators derived from social media activity, with the aim of tracing some preliminary guidelines for investigating the possible correspondence between social media indices and available labour market indicators at a territorial level. The study is based on a large dataset of about 4 million Italian-language tweets collected from October 2014 to December 2015, filtered by a set of specific keywords related to the labour market. Using machine learning techniques and user geolocation, we were able to subset the tweets on specific topics in all Italian provinces. The corpus of tweets is then analyzed with linguistic tools and hierarchical clustering analysis.
A comparison with traditional economic indicators suggests a strong need for further cleaning procedures, which are then developed in detail. As data from social networks are easy to obtain, this represents a first attempt to evaluate their informative power in the Italian context, which is of potentially high importance in economic and social research.
date: 2016-07
date_type: published
number: 4
publisher: IMT School for Advanced Studies Lucca
pages: 12
institution: IMT School for Advanced Studies Lucca
issn: 2279-6894
citation: Biorci, Grazia and Emina, Antonella and Puliga, Michelangelo and Sella, Lisa and Vivaldo, Gianna Tweet-tales: moods of socio-economic crisis? EIC working paper series #4/2016 IMT School for Advanced Studies Lucca ISSN 2279-6894.
document_url: http://eprints.imtlucca.it/3519/1/EIC_WP_4_2016.pdf