Texts from the Swedish Competition Agency

This study is part of the collection Parallel Texts from Public Agencies

Creator/Principal investigator(s)

Institute for Language and Folklore, Language Council of Sweden

Simon Dahlberg - Institute for Language and Folklore

Description

Texts collected from the Swedish Competition Authority's website around March 2018. The texts are yearly reviews and other public information from this authority.

Subject area

ECONOMICS, SOCIAL WELFARE POLICY AND SYSTEMS (CESSDA Topic Classification)
Economics, Business Administration, Law and Society, Languages and Literature (The Swedish standard of fields of research 2011)

Creator/Principal investigator(s)

Institute for Language and Folklore, Language Council of Sweden

Simon Dahlberg - Institute for Language and Folklore

Contributor(s)

Institute for Language and Folklore, Language Council of Sweden

Description

Texts collected from the Swedish Competition Authority's website around March 2018. The texts are yearly reviews and other public information from this authority.

Language

English

Swedish

Sampling procedure

Longer texts from the Swedish Competition Agency that were available in both Swedish and English.

Subject area

ECONOMICS, SOCIAL WELFARE POLICY AND SYSTEMS (CESSDA Topic Classification)
Economics, Business Administration, Law and Society, Languages and Literature (The Swedish standard of fields of research 2011)

Language resources

Resource type

Corpus

Foreseen use

NLP application

Text corpus

  • Linguality

    Bilingual
  • Language

    • English (eng)

      Texts: 15

    • Swedish (swe)

      Texts: 15

    More..
  • Modality

    Written Language
  • Size

    Words: 479760 (TOT)

    Texts: 30 (TOT)

    Words: 217870 (swe)

    Texts: 15 (swe)

  • Original source

    konkurrensverket
    http://www.konkurrensverket.se/publikationer/

Parallel texts from the Swedish Competition Agency

Creator/Principal investigator(s)

Institute for Language and Folklore, Language Council of Sweden

Simon Dahlberg - Institute for Language and Folklore

Description

Parallel texts downloaded from the agency's website.
What was downloaded were pdf files. The txt files that are available are the result of running the pdf files through the pdftotext command from an ubuntu shell.

Data format / data structure

Text

Data collection

Mode of collection: Self-administered questionnaire: web based

Time period(s) for data collection: 2018-03-01 — 2018-03-31

Published: 2020-03-30