metashareToCmdi.xsl
2020-03-30
clarin.eu:cr1:p_1361876010571
Parallella texter från Migrationsverket
Parallel texts from the Swedish Migration Agency
Parallel texts from the Swedish Migration Agency
Parallel texts downloaded with "w3m -dump" from an ubuntu shell, from the website of the Swedish Migration Agency.
The texts have been downloaded using the command 'w3m -dump' from an ubuntu shell, whereafter the resulting text files were stripped to contain only the interesting text (no menus and such).
Parallella texter nedladdade med hjälp av "w3m -dump" från ett ubuntu-skal, ifrån Migrationsverkets webbplats.
Texterna har laddats ner med hjälp av kommando 'w3m -dump' ifrån ett ubuntu-skal, varpå resulterande textfiler har skalats av till att innehålla endast den intressanta texten (inga menyer och dylikt).
http://liljeholmen.sprakochfolkminnen.se/sprakresurser/version/20190124/myndighetsdata/texter
http://liljeholmen.sprakochfolkminnen.se/sprakresurser/version/20190124/myndighetsdata/texter/Migrationsverket
ext0329-1-1
available-unrestrictedUse
contactPerson
Eriksson
Gunnar
gunnar.eriksson@sprakochfolkminnen.se
affiliation
2020-03-30
English
Swedish
en
sv
nlpApplications
terminologyExtraction
languageModelsTraining
machineTranslation
resourceCreator
Dahlberg
Simon
simon.dahlberg@sprakochfolkminnen.se
affiliation
Institute for Language and Folklore
Institutet för språk och folkminnen
Language Council of Sweden
Språkrådet
resourceCreator
Institute for Language and Folklore
Institutet för språk och folkminnen
Language Council of Sweden
Språkrådet
IsPartOf
http://liljeholmen.sprakochfolkminnen.se/sprakresurser/version/20190124/myndighetsdata/texter
corpus
text
multilingual
swe
Swedish
sizePerLanguage
33
texts
amh
Amharic
sizePerLanguage
23
texts
ara
Arabic
sizePerLanguage
33
texts
aze
Azerbaijani
sizePerLanguage
27
texts
ckb
Central Kurdish
sizePerLanguage
29
texts
eng
English
sizePerLanguage
33
texts
fas
Persian
sizePerLanguage
32
texts
hrv
Croatian
sizePerLanguage
23
texts
hye
Armenian
sizePerLanguage
24
texts
kat
Georgian
sizePerLanguage
1
texts
kmr
Northern Kurdish
sizePerLanguage
28
texts
mon
Mongolian
sizePerLanguage
25
texts
prs
Dari
sizePerLanguage
28
texts
pus
Pushto
sizePerLanguage
28
texts
rom
Romany
sizePerLanguage
24
texts
dialect
Arli
rus
Russian
sizePerLanguage
33
texts
som
Somali
sizePerLanguage
29
texts
spa
Spanish
sizePerLanguage
31
texts
sqi
Albanian
sizePerLanguage
27
texts
tha
Thai
sizePerLanguage
4
texts
tir
Tigrinya
sizePerLanguage
29
texts
tur
Turkish
sizePerLanguage
2
texts
uzb
Uzbek
sizePerLanguage
25
texts
zho
Chinese
sizePerLanguage
3
texts
fra
French
sizePerLanguage
31
texts
writtenLanguage
29008 (swe)
words
33 (swe)
texts
438614 (TOT)
words
580 (TOT)
texts
SE
Migrationsverket, www.migrationsverket.se