SND-ID: SND 1037
Time period(s) investigated:
2015-06-01 — 2016-05-31
Unit of analysis:
Swedish Research Council — Ref. 2012-5659
Vasiliki Simaki, Carita Paradis, Maria Skeppstedt, Magnus Sahlgren, Kostiantyn Kucher, and Andreas Kerren. Annotating speaker stance in discourse: the Brexit Blog Corpus. In Corpus Linguistics and Linguistic Theory, 2017. De Gruyter, published electronically before print. https://doi.org/10.1515/cllt-2016-0060
If you have published anything based on these data, please notify us with a reference to your publication(s).
Andreas Kerren, Carita Paradis. Linnaeus University, Department of Computer Science (2017). Brexit Blog Corpus (BBC). Swedish National Data Service. Version 1.0. https://doi.org/10.5878/002925
The BBC is a collection of texts from blog sources. The corpus texts are thematically related to the 2016 UK referendum concerning whether the UK should remain members of the European Union or not. The texts were extracted from the Internet from June to August 2015. With the Gavagai API (https://developer.gavagai.se), the texts were detected using seed words, such as Brexit, EU referendum, pro-Europe, europhiles, eurosceptics, United States of Europe, David Cameron, or Downing Street. The retrie... Show more..
Data format / data structure:
Time period(s) for data collection: 2015-06-01 — 2016-05-31
Source of the data: Research data
Number of individuals/objects: