Texts from the Swedish Crime Victim Compensation and Support Agency

This study is part of the collection Parallel Texts from Public Agencies

Creator/Principal investigator(s):

Institute for Language and Folklore, Language Council of Sweden

Simon Dahlberg - Institute for Language and Folklore, Language Council of Sweden

Description:

Parallel texts downloaded from the website of the Swedish Crime Victim Compensation and Support Agency.

Subject area:

legislation and legal systems, crime and law enforcement, SOCIAL WELFARE POLICY AND SYSTEMS (CESSDA Topic Classification)
Social Sciences, Law, Languages and Literature (The Swedish standard of fields of research 2011)

Keywords:

Responsible department/unit:

Institute for Language and Folklore, Language Council of Sweden

Creator/Principal investigator(s):

Institute for Language and Folklore, Language Council of Sweden

Simon Dahlberg - Institute for Language and Folklore, Language Council of Sweden

Contributor(s):

Institute for Language and Folklore, Language Council of Sweden

Description:

Parallel texts downloaded from the website of the Swedish Crime Victim Compensation and Support Agency.

Language:

German

English

Spanish

French

Croatian

Swedish

Geographic spread:

Geographic location: Sweden

Subject area:

legislation and legal systems, crime and law enforcement, SOCIAL WELFARE POLICY AND SYSTEMS (CESSDA Topic Classification)
Social Sciences, Law, Languages and Literature (The Swedish standard of fields of research 2011)

Keywords:

Language resources

Resource type

Corpus

Foreseen use

NLP application

Text corpus

  • Linguality

    Multilingual
  • Language

    • Swedish (swe)

      Texts: 7

    • English (eng)

      Texts: 7

    • Croatian (hrv)

      Texts: 6

    • Spanish (spa)

      Texts: 1

    • German (deu)

      Texts: 1

    • French (fra)

      Texts: 1

    • Finnish (fin)

      Texts: 5

    • Russian (rus)

      Texts: 1

    • Arabic (ara)

      Texts: 5

    • Turkish (tur)

      Texts: 1

    • Somali (som)

      Texts: 1

    • Persian (fas)

      Texts: 6

    • Bosnian (bos)

      Texts: 6

    • Serbian (srp)

      Texts: 6

    More..
  • Modality

    Written Language
  • Size

    Words: 290151 (TOT)

    Texts: 54 (TOT)

    Words: 57201 (swe)

    Texts: 7 (swe)

  • Original source

    brottsoffermyndigheten
    https://www.brottsoffermyndigheten.se/

Parallel texts from the Swedish Crime Victim Compensation and Support Agency

Creator/Principal investigator(s):

Institute for Language and Folklore, Language Council of Sweden

Simon Dahlberg - Institute for Language and Folklore, Language Council of Sweden

Description:

Parallel texts downloaded from ageny's website. What was actually downloaded were pdf files. The txt files that are available are the result of running the pdf files through the pdftotext command from an ubuntu shell.

Data format / data structure:

Text

Data collection:

Mode of collection: Self-administered writings and/or diaries: web-based

Time period(s) for data collection: 2018-01-01 — 2018-01-31

Published: 2020-03-30