Nasjonalbiblioteket

NST N-gram - Norwegian Bokmål

Description

These n-grams are derived from parts of the Text Corpus from Nordic Language Technology AS (NST). The source material consists of 510 million words of running text.

The n-grams are also available as an overview listing only the 1000 most frequent n-grams (n=1-6).

In the full version, all the derived n-grams (n=1-6) are provided in two orderings: sorted alphabetically and sorted by frequency. Frequency lists (unigrams) are also available separately.
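
As a rough illustration of what such lists contain, the following minimal Python sketch counts n-grams (n=1-6) over a token sequence and produces both orderings, plus a top-k overview. It is not the procedure used to build the NST n-grams: the whitespace tokenization, the tie-breaking rule, the sample sentence, and the function names (`count_ngrams`, `sort_alphabetically`, `sort_by_frequency`, `top_k`) are all assumptions made for this example.

```python
from collections import Counter


def count_ngrams(tokens, max_n=6):
    """Count every n-gram of length 1..max_n in a list of tokens."""
    counts = Counter()
    for n in range(1, max_n + 1):
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    return counts


def sort_alphabetically(counts):
    """Return (ngram, count) pairs ordered by the n-gram's tokens."""
    return sorted(counts.items(), key=lambda kv: kv[0])


def sort_by_frequency(counts):
    """Return (ngram, count) pairs, most frequent first.

    Ties are broken alphabetically (an assumption of this sketch).
    """
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))


def top_k(counts, k=1000):
    """Return the k most frequent n-grams, as in an overview list."""
    return sort_by_frequency(counts)[:k]


# Hypothetical sample text; whitespace splitting is an assumed tokenizer.
tokens = "det er en bok og det er en avis".split()
counts = count_ngrams(tokens, max_n=6)

for ngram, freq in top_k(counts, k=5):
    print(" ".join(ngram), freq)
```

A real corpus of this size would not fit comfortably in a single in-memory counter, so the sketch only shows the shape of the data, not a scalable way to produce it.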

Distributions
1

Download
Description: Not provided
Access URL: https://hdl.handle.net/21.11146/3
Direct download:
API: Not provided
Documentation: Not provided
License:
Conforms to: Not provided

APIs providing this dataset
0

No registered APIs provide this dataset.

Similar datasets

Norsk Ordbank - Norwegian Nynorsk 2005-2012
Nasjonalbiblioteket
Public access