Skip to main content
Nasjonalbiblioteket

Norwegian UD Treebank

Description

Universal Dependencies (UD) is a framework for annotating grammar consistently in different languages. The grammatical annotations include tokenization, part-of-speech tags (POS), morphological features, and dependency relations.

For more information about the annotation standard and guidelines, see the official UD documentation (https://universaldependencies.org/guidelines.html).

UD for Bokmål (https://universaldependencies.org/treebanks/no_bokmaal/index.html) and Nynorsk (https://universaldependencies.org/treebanks/no_nynorsk/index.html) are based on the Norwegian Dependency Treebank (NDT).

The annotations have been automatically converted to UDs standard with Grew (https://grew.fr/). The conversion scripts are publicly available in the Github repo grew_ndt2ud (https://github.com/Sprakbanken/grew_ndt2ud).

Distributions
1

Nameless distribution
  • zip
Download

APIs providing this dataset
0

No registered APIs provide this dataset.

Similar datasets

SCARRIE LexiconNasjonalbiblioteket
Public access
Grapheme-to-Phoneme Models for NorwegianNasjonalbiblioteket
Public access
Translation Memories from Semantix ASNasjonalbiblioteket
Public access
NST Pronunciation Lexicon for SwedishNasjonalbiblioteket
Public access
Texts from Norwegian WikipediaNasjonalbiblioteket
Public access