Texts from the Swedish Competition Agency

SND-ID: EXT 0337

This study is part of the collection Parallel Texts from Public Agencies

Creator/Principal investigator(s)

Simon Dahlberg - Institute for Language and Folklore

Institute for Language and Folklore, Language Council of Sweden

Description

Texts collected from the Swedish Competition Authority's website around March 2018. The texts are yearly reviews and other public information from the authority.

Language

English

Swedish

Protection and ethical review
Method and time period

Sampling procedure

Longer texts from the Swedish Competition Agency that were available in both Swedish and English.
Language resources

Resource type

Corpus

Foreseen use

NLP application

Text corpus

Geographic coverage
Topic and keywords

Research area

ECONOMICS, SOCIAL WELFARE POLICY AND SYSTEMS (CESSDA Topic Classification)
Economics, Business Administration, Law and Society, Languages and Literature (The Swedish standard of fields of research 2011)

Publications
Dataset
Parallel texts from the Swedish Competition Agency

Description

Parallel texts downloaded from the agency's website.
The downloaded files were PDF files. The available .txt files were produced by running the PDF files through the pdftotext command in an Ubuntu shell.
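The conversion step described above could be reproduced roughly as follows. This is a minimal sketch, not the script used for this dataset: it assumes pdftotext (from poppler-utils) is installed and on the PATH, and the function names and the directory argument are hypothetical. The record does not say which pdftotext options were used, so none are passed here.

```python
from pathlib import Path
import subprocess


def pdftotext_command(pdf_path: Path) -> list[str]:
    """Build a pdftotext call that writes a .txt file next to the PDF."""
    txt_path = pdf_path.with_suffix(".txt")
    # Assumption: no options are passed, since the record does not state any.
    return ["pdftotext", str(pdf_path), str(txt_path)]


def convert_all(pdf_dir: Path) -> list[Path]:
    """Convert every PDF in pdf_dir (hypothetical folder) to plain text."""
    outputs = []
    for pdf_path in sorted(pdf_dir.glob("*.pdf")):
        # check=True raises CalledProcessError if pdftotext exits with an error.
        subprocess.run(pdftotext_command(pdf_path), check=True)
        outputs.append(pdf_path.with_suffix(".txt"))
    return outputs
```

Calling convert_all(Path("downloads")) would convert each PDF in a folder named downloads, leaving the .txt files beside the originals. Output produced this way may differ from the published .txt files if a different pdftotext version or different options were used.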

Data format / data structure

Text

Creator/Principal investigator(s)

Simon Dahlberg - Institute for Language and Folklore

Institute for Language and Folklore, Language Council of Sweden

Data collection

  • Mode of collection: Self-administered questionnaire: web based
  • Time period(s) for data collection: 2018-03-01–2018-03-31