Texts from the Swedish Crime Victim Compensation and Support Agency

SND-ID: EXT 0335

This study is part of the collection Parallel Texts from Public Agencies


Creator/Principal investigator(s)

Simon Dahlberg - Institute for Language and Folklore, Language Council of Sweden

Institute for Language and Folklore, Language Council of Sweden

Description

Parallel texts downloaded from the website of the Swedish Crime Victim Compensation and Support Agency.

Language

German

English

Spanish

French

Croatian

Protection and ethical review
Method and time period

Sampling procedure

Swedish texts that have one or more translations.
Language resources

Resource type

Corpus

Foreseen use

NLP application

Text corpus

  • Linguality

    Multilingual
  • Language

    • (swe)

      Texts: 7

    • (eng)

      Texts: 7

    • (hrv)

      Texts: 6

    • (spa)

      Texts: 1

    • (deu)

      Texts: 1

    • (fra)

      Texts: 1

    • (fin)

      Texts: 5

    • (rus)

      Texts: 1

    • (ara)

      Texts: 5

    • (tur)

      Texts: 1

    • (som)

      Texts: 1

    • (fas)

      Texts: 6

    • (bos)

      Texts: 6

    • (srp)

      Texts: 6

  • Modality

    Written Language
  • Size

    Words: 290151 (TOT)

    Texts: 54 (TOT)

    Words: 57201 (swe)

    Texts: 7 (swe)

  • Original source

    brottsoffermyndigheten
    https://www.brottsoffermyndigheten.se/
Geographic coverage

Geographic spread

Geographic location: Sweden

Topic and keywords

Research area

Legislation and legal systems, Crime and law enforcement, Social welfare policy and systems (CESSDA Topic Classification)
Social Sciences, Law, Languages and Literature (The Swedish standard of fields of research 2011)

Publications
Dataset
Parallel texts from the Swedish Crime Victim Compensation and Support Agency

Description

Parallel texts downloaded from the agency's website. The files originally downloaded were PDF files. The available txt files were produced by running those PDF files through the pdftotext command in an Ubuntu shell.
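
For readers who want to redo the PDF-to-text step on their own copies of the PDF files, the following is a minimal sketch in Python. It assumes pdftotext was called with its default options (the record does not say which options, if any, were used) and that each txt file is written next to its PDF. The helper names `pdftotext_command` and `convert_directory` are hypothetical and not part of the dataset.

```python
import shutil
import subprocess
from pathlib import Path


def pdftotext_command(pdf_path: Path) -> list[str]:
    """Build a default pdftotext call; the .txt output sits next to the PDF.

    Assumption: no extra options (such as -layout) are passed, because the
    dataset description does not mention any.
    """
    txt_path = pdf_path.with_suffix(".txt")
    return ["pdftotext", str(pdf_path), str(txt_path)]


def convert_directory(directory: Path) -> list[Path]:
    """Convert every *.pdf in `directory`; return the .txt paths written."""
    if shutil.which("pdftotext") is None:
        # On Ubuntu, pdftotext is provided by the poppler-utils package.
        raise RuntimeError("pdftotext was not found on PATH")
    written = []
    for pdf_path in sorted(directory.glob("*.pdf")):
        command = pdftotext_command(pdf_path)
        subprocess.run(command, check=True)  # raises if the conversion fails
        written.append(Path(command[-1]))
    return written
```

Calling `convert_directory(Path("pdfs"))` on a folder of downloaded PDF files (the folder name is only an example) would write one txt file per PDF. The result may differ from the published txt files if a different pdftotext version or different options were used.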

Data format / data structure

Text

Creator/Principal investigator(s)

Simon Dahlberg - Institute for Language and Folklore, Language Council of Sweden

Institute for Language and Folklore, Language Council of Sweden

Data collection

  • Mode of collection: Self-administered writings and/or diaries: web-based
  • Time period(s) for data collection: 2018-01-01–2018-01-31