Swedish Treebank

SND-ID: EXT 0368

Creator/Principal investigator(s)

Joakim Nivre - Uppsala University

Beáta Megyesi - Uppsala University

Bengt Dahlqvist - Uppsala University

Anna Sågvall Hein - Uppsala University, Department of Linguistics and Philology

Johan Hall - Uppsala University

Description

The Swedish Treebank is a syntactically annotated corpus. The annotation covers word and sentence boundaries, morphological information (such as word classes), and syntactic information (phrases and grammatical functions, as well as dependency structure). The treebank is based on two earlier corpora, Talbanken and the Stockholm-Umeå Corpus (SUC), which have been harmonised. It contains approximately 350,000 tokens.

Language

Swedish

Research principal, contributors, and funding

Research principal

Uppsala University

Contributor(s)

Filip Salomonsson - SolarWinds, Pingdom

Protection and ethical review

Data contains personal data

No

Method
Language resources

Resource type

Corpus
Geographic coverage

Geographic spread

Geographic location: Sweden

Topic and keywords
Publications

Dataset
Swedish Treebank

Data format / data structure

Text

Published: 2020-10-14