Skip to main content
Nasjonalbiblioteket

Hyphenations from the National Library

Description

This database is a frequency-based list containing different ways of hyphenating words in Norwegian. It is based on a corpus of 26,344 digitized public domain texts and books at the National Library of Norway.

The data are in csv format, on the following form: frequency count, word, first part, second part. This is illustrated below with three examples (lines) from the database:

18162,imidlertid,imid,lertid 17747,ogsaa,og,saa 17534,mellom,mel,lom

Distributions
1

Download
Description:
Not provided
Access URL:
https://hdl.handle.net/21.11146/39
Direct download:
API:
Not provided
Documentation:
Not provided
License:
Conforms to:
Not provided

APIs providing this dataset
0

No registered APIs provide this dataset.

Similar datasets

Norsk Ordbank - Norwegian Nynorsk 2005-2012Nasjonalbiblioteket
Public access
ONOMASTICA Pronunciation Lexicon 2Nasjonalbiblioteket
Public access
Translation Memories from Semantix ASNasjonalbiblioteket
Public access
NST Pronunciation Lexicon for SwedishNasjonalbiblioteket
Public access
spaCy for Norwegian NynorskNasjonalbiblioteket
Public access