Texts from the Swedish Work Environment Authority

This study is part of the collection Parallel Texts from Public Agencies

Creator/Principal investigator(s)

Institute for Language and Folklore, Language Council of Sweden

Simon Dahlberg - Institute for Language and Folklore, Language Council of Sweden

Description

Parallel texts downloaded from the websites of the Swedish Work Environment Authority.

Subject area

Responsible department/unit

Institute for Language and Folklore, Language Council of Sweden

Creator/Principal investigator(s)

Institute for Language and Folklore, Language Council of Sweden

Simon Dahlberg - Institute for Language and Folklore, Language Council of Sweden

Contributor(s)

Institute for Language and Folklore, Language Council of Sweden

Description

Parallel texts downloaded from the websites of the Swedish Work Environment Authority.

Language

English

Swedish

Geographic spread

Geographic location: Sweden

Subject area

Language resources

Resource type

Corpus

Foreseen use

NLP application

Text corpus

  • Linguality

    Multilingual
  • Language

    • Swedish (swe)

      Texts: 21

    • English (eng)

      Texts: 19

    • Bulgarian (bul)

      Texts: 2

    • Czech (ces)

      Texts: 2

    • German (deu)

      Texts: 3

    • Estonian (est)

      Texts: 3

    • Finnish (fin)

      Texts: 1

    • Hungarian (hun)

      Texts: 1

    • Latvian (lav)

      Texts: 3

    • Lithuanian (lit)

      Texts: 3

    • Polish (pol)

      Texts: 4

    • Romanian (ron)

      Texts: 3

    • Spanish (spa)

      Texts: 2

    • Chinese (zho)

      Texts: 2

    • Russian (rus)

      Texts: 3

    • Arabic (ara)

      Texts: 1

    • Turkish (tur)

      Texts: 2

    • Thai (tha)

      Texts: 1

    • Hindi (hin)

      Texts: 1

    More..
  • Modality

    Written Language
  • Size

    Words: 166367 (swe)

    Texts: 21 (swe)

    Words: 432133 (TOT)

    Texts: 78

  • Original source

    arbetsmiljöverket
    www.av.se

License

Creative Commons License

Parallel texts from the Swedish Work Environment Authority

Creator/Principal investigator(s)

Institute for Language and Folklore, Language Council of Sweden

Simon Dahlberg - Institute for Language and Folklore, Language Council of Sweden

Description

Parallel texts downloaded from the websites of the Swedish Work Environment Authority. The txt files that are available are the result of running the pdf files through the pdftotext command from an ubuntu shell.

Data format / data structure

Text

Data collection

Mode of collection: Self-administered writings and/or diaries: web-based

Time period(s) for data collection: 2017-01-01 — 2017-01-31

Published: 2020-03-30
Last updated: 2020-05-15