Texts from SIDA

SND-ID: EXT 0326

This study is part of the collection Parallel Texts from Public Agencies

Description Data and documentation

Creator/Principal investigator(s)

Simon Dahlberg - Institute for Language and Folklore, Language Council of Sweden

Institute for Language and Folklore, Language Council of Sweden

Description

Parallel texts collected from the SIDAs website.

Language

English

Swedish

Research principal, contributors, and funding
Protection and ethical review
Method

Sampling procedure

Texts that are available in Swedish and at least one other language.
Language resources

Resource type

Corpus

Foreseen use

NLP application

Text corpus

Geographic coverage

Geographic spread

Geographic location: Sweden

Topic and keywords

Subject area

Economic systems and development, International politics and organisations, Equality, inequality and social exclusion, Social change (CESSDA Topic Classification)
Political Science, Social and Economic Geography, Languages and Literature (The Swedish standard of fields of research 2011)

Publications
Dataset
Parallel texts from SIDA

Description

Parallel texts downloaded from SIDAs website. What was actually downloaded were pdf files. The txt files that are available are the result of running the pdf files through the pdftotext command from an ubuntu shell.

Data format / data structure

Text

Creator/Principal investigator(s)

Simon Dahlberg - Institute for Language and Folklore, Language Council of Sweden

Institute for Language and Folklore, Language Council of Sweden

Time period(s) investigated

2018-08-01–2018-08-31

Data collection

  • Mode of collection: Self-administered writings and/or diaries: web-based
  • Time period(s) for data collection: 2018-08-01–2018-08-31
Published: 2020-03-30