Texts from Swedish Public Employment Service

SND-ID: EXT 0338

This study is part of the collection Parallel Texts from Public Agencies


Creator/Principal investigator(s)

Simon Dahlberg - Institute for Language and Folklore, Language Council of Sweden

Institute for Language and Folklore, Language Council of Sweden

Description

Parallel texts downloaded from the websites of the Swedish Public Employment Service.
Protection and ethical review
Method

Sampling procedure

Multilingual parallel material.
Language resources

Resource type

Corpus

Foreseen use

NLP application

Text corpus

  • Linguality

    Multilingual
  • Language

    • Swedish (swe)

    • English (eng)

    • Finnish (fin)

    • Spanish (spa)

    • French (fra)

    • German (deu)

    • Romanian (ron)

  • Modality

    Written Language
  • Size

    Words: 43207 (swe)

    Texts: 39 (swe)

    Words: 152928 (TOT)

    Texts: 152 (TOT)

  • Original source

    Arbetsförmedlingen
    www.arbetsformedlingen.se
Geographic coverage

Geographic spread

Geographic location: Sweden

Topic and keywords

Subject area

Migration, Employment, Unemployment, Social welfare policy, Social welfare systems/structures, Specific social services: use and availability, Labour and employment policy (CESSDA Topic Classification)
Public Administration Studies, Other Social Sciences not elsewhere specified, General Language Studies and Linguistics (The Swedish standard of fields of research 2011)

Keywords

labour policy

Publications
Dataset
Texts from Swedish Public Employment Service

Description

Parallel texts downloaded from the website of the Swedish Public Employment Service.
The original downloads were PDF files. The TXT files provided here were produced by running those PDF files through the pdftotext command in an Ubuntu shell.
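
The PDF-to-text conversion mentioned in this description could be reproduced roughly as follows. This is a minimal sketch, not the script actually used: it assumes pdftotext (shipped in the poppler-utils package on Ubuntu) is installed and on the PATH, and that it was invoked without extra options, since the record does not state which flags were used. The function names and the "downloads" directory are hypothetical.

```python
import subprocess
from pathlib import Path


def pdftotext_command(pdf_path: Path) -> list[str]:
    """Build the argument list for: pdftotext <name>.pdf <name>.txt"""
    txt_path = pdf_path.with_suffix(".txt")
    # No options are passed; the flags used for the dataset are not documented.
    return ["pdftotext", str(pdf_path), str(txt_path)]


def convert_directory(directory: Path, run=subprocess.run) -> list[Path]:
    """Convert every PDF in `directory` and return the expected TXT paths.

    `run` defaults to subprocess.run and can be replaced for a dry run.
    """
    outputs = []
    for pdf_path in sorted(directory.glob("*.pdf")):
        # check=True raises CalledProcessError if pdftotext exits non-zero.
        run(pdftotext_command(pdf_path), check=True)
        outputs.append(pdf_path.with_suffix(".txt"))
    return outputs


if __name__ == "__main__":
    # Hypothetical directory holding the downloaded PDF files.
    for txt in convert_directory(Path("downloads")):
        print(txt)
```

Each TXT file is written next to its source PDF under the same base name, which keeps the parallel language versions easy to pair up afterwards.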

Data format / data structure

Text

Creator/Principal investigator(s)

Simon Dahlberg - Institute for Language and Folklore, Language Council of Sweden

Institute for Language and Folklore, Language Council of Sweden

Data collection

  • Mode of collection: Self-administered writings and/or diaries: web-based
  • Time period(s) for data collection: 2017-01-01–2017-01-31
Published: 2020-03-30