Texts from the Swedish Consumer Agency

This study is part of the collection Parallel Texts from Public Agencies

Creator/Principal investigator(s):

Institute for Language and Folklore, Language Council of Sweden

Simon Dahlberg - Institute for Language and Folklore, Language Council of Sweden

Description:

Parallel texts downloaded from the website, hallåkonsument.se, run by of the Swedish Consumer Agency.

Subject area:

consumption and consumer behaviour, TRADE, INDUSTRY AND MARKETS (CESSDA Topic Classification)
Economics and Business, Law and Society, Languages and Literature (The Swedish standard of fields of research 2011)

Responsible department/unit:

Institute for Language and Folklore, Language Council of Sweden

Creator/Principal investigator(s):

Institute for Language and Folklore, Language Council of Sweden

Simon Dahlberg - Institute for Language and Folklore, Language Council of Sweden

Contributor(s):

Institute for Language and Folklore, Language Council of Sweden

Description:

Parallel texts downloaded from the website, hallåkonsument.se, run by of the Swedish Consumer Agency.

Geographic spread:

Geographic location: Sweden

Subject area:

consumption and consumer behaviour, TRADE, INDUSTRY AND MARKETS (CESSDA Topic Classification)
Economics and Business, Law and Society, Languages and Literature (The Swedish standard of fields of research 2011)

Language resources

Resource type

Corpus

Foreseen use

NLP application

Text corpus

  • Linguality

    Multilingual
  • Language

    • Swedish (swe)

      Texts: 42

    • English (eng)

      Texts: 42

    • French (fra)

      Texts: 31

    • Spanish (spa)

      Texts: 31

    • German (deu)

      Texts: 31

    • Polish (pol)

      Texts: 31

    • Finnish (fin)

      Texts: 31

    • Arabic (ara)

      Texts: 42

    • Persian (fas)

      Texts: 42

    • Somali (som)

      Texts: 6

    • Albanian (sqi)

      Texts: 31

    • Tigrinya (tir)

      Texts: 6

    • Central Kurdish (ckb)

      Texts: 37

    • Croatian (hrv)

      Texts: 31

    More..
  • Modality

    Written Language
  • Size

    Words: 190126 (TOT)

    Texts: 434 (TOT)

    Words: 21535 (swe)

    Texts: 42 (swe)

  • Original source

    konsumentverket
    hallåkonsument.se

License:

Creative Commons License

Texts from the Swedish Consumer Agency

Creator/Principal investigator(s):

Institute for Language and Folklore, Language Council of Sweden

Simon Dahlberg - Institute for Language and Folklore, Language Council of Sweden

Description:

The texts have been downloaded using the command 'w3m -dump' from an ubuntu shell, whereafter the resulting text files were stripped to contain only the interesting text (no menus and such).

Data format / data structure:

Text

Data collection:

Mode of collection: Self-administered writings and/or diaries: web-based

Time period(s) for data collection: 2019-01-01 — 2019-01-31

Published: 2020-03-30