Texts from the Swedish Competition Agency

This study is part of the collection Parallel Texts from Public Agencies

Creator/Principal investigator(s):

Institute for Language and Folklore, Language Council of Sweden

Simon Dahlberg - Institute for Language and Folklore

Description:

Texts collected from the Swedish Competition Authority's website around March 2018. The texts are yearly reviews and other public information from this authority.

Responsible department/unit:

Institute for Language and Folklore, Language Council of Sweden

Contributor(s):

Institute for Language and Folklore, Language Council of Sweden

Identifiers:

SND-ID: EXT 0337

URL: http://liljeholmen.sprakochfolkminnen.se/sprakresurser/version/20190124/myndighetsdata/texter/

URL: http://liljeholmen.sprakochfolkminnen.se/sprakresurser/version/20190124/myndighetsdata/texter/Konkurrensverket

Language:

English

Swedish

Language resources

Resource type

Corpus

Foreseen use

NLP application

Text corpus

  • Linguality

    Bilingual
  • Language

    • English (eng)

      Texts: 15

    • Swedish (swe)

      Texts: 15
  • Modality

    Written Language
  • Size

    Words: 479760 (TOT)

    Texts: 30 (TOT)

    Words: 217870 (swe)

    Texts: 15 (swe)

  • Original source

    konkurrensverket
    http://www.konkurrensverket.se/publikationer/

Parallel texts from the Swedish Competition Agency

Creator/Principal investigator(s):

Institute for Language and Folklore, Language Council of Sweden

Simon Dahlberg - Institute for Language and Folklore

Description:

Parallel texts downloaded from the agency's website.
The downloaded files were PDFs. The available TXT files were produced by running the PDFs through the pdftotext command in an Ubuntu shell.
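
The conversion step described above could be reproduced roughly as follows. This is a minimal sketch, not the procedure actually used: it assumes the pdftotext command-line tool is installed and on the PATH, that it was run with default options (the record does not say which options were used), and that each text file is written next to its PDF. The function names and the directory name are hypothetical.

```python
import subprocess
from pathlib import Path


def pdftotext_command(pdf_path: Path) -> list[str]:
    """Build a pdftotext call that writes <name>.txt next to <name>.pdf."""
    # Assumption: default options; the options actually used are not documented.
    txt_path = pdf_path.with_suffix(".txt")
    return ["pdftotext", str(pdf_path), str(txt_path)]


def convert_directory(directory: Path) -> list[Path]:
    """Convert every PDF in `directory` and return the text files written."""
    written = []
    for pdf_path in sorted(directory.glob("*.pdf")):
        # check=True raises CalledProcessError if pdftotext fails on a file.
        subprocess.run(pdftotext_command(pdf_path), check=True)
        written.append(pdf_path.with_suffix(".txt"))
    return written
```

Calling `convert_directory(Path("Konkurrensverket"))` on a local copy of the downloaded PDFs would then produce one TXT file per PDF; the directory name here is only an illustration.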

Data format / data structure:

Text

Data collection:

Mode of collection: Self-administered questionnaire: web based

Time period(s) for data collection: 2018-03-01 — 2018-03-31

Data collector:

Published: 2020-03-30