Skip to main content
Nasjonalbiblioteket

NB N-gram

  • Datasets
  • Public access 

    Publicly available to everyone. Access may still require registration and an API key request, as long as anyone can request such registration and/or API keys.

    Read more about access levels here

  • Open data 

    The dataset is classified as public access and has at least one distribution with an approved open license.

Description

NB N-gram is a search service that lets you find and compare the frequencies of words, for example how often words occur in a historical perspective. The service is based on digitized books and newspapers at the National Library of Norway, spanning the time period 1810-2021. The size of the material is some 122 billion tokens (words and punctuation).

NB N-gram is updated regurarly, usually once a year. The next update is scheduled for summer 2022.

Distributions
1

Nameless distribution
  • html
Description:
Not provided
Access URL:
https://hdl.handle.net/21.11146/42
Direct download:
API:
Not provided
Documentation:
Not provided
License:
Conforms to:
Not provided
Download

APIs providing this dataset
0

No registered APIs provide this dataset.

Similar datasets

NST Pronunciation Lexicon for SwedishNasjonalbiblioteket
Public access
Grapheme-to-Phoneme Models for NorwegianNasjonalbiblioteket
Public access
SCARRIE LexiconNasjonalbiblioteket
Public access
ONOMASTICA Pronunciation LexiconNasjonalbiblioteket
Public access
N-grams from NBdigital 2021Nasjonalbiblioteket
Public access