Skip to main content
Nasjonalbiblioteket

Norwegian Conversation Speech Corpus

Distributions 
1
APIs 
0
No registered APIs provide this dataset.
  • Datasets
  • Public access 

    Publicly available to everyone. Access may still require registration and an API key request, as long as anyone can request such registration and/or API keys.

    Read more about access levels here

  • Open data 

    The dataset is classified as public access and has at least one distribution with an approved open license.

OverviewDistributions & APIs 
1
DetailsDiscussions 
0
RDF

Description

NB Samtale is a speech corpus made by the Language Bank at the National Library of Norway. The corpus contains orthographically transcribed speech from podcasts and recordings of live events at the National Library. The corpus is intended as an open source dataset for Automatic Speech Recognition (ASR) development, and is specifically aimed at improving ASR systems' handle on conversational speech.

The corpus consists of 12,080 segments, a total of 24 hours transcribed speech from 69 speakers. The corpus ensures both gender and dialect variation, and speakers from five broad dialect areas are represented. Both Bokmål and Nynorsk transcriptions are present in the corpus, with Nynorsk making up approximately 25% of the transcriptions.

We greatly appreciate feedback and suggestions for improvements. PLease contact us at sprakbanken@nb.no.

Distributions
1

APIs providing this dataset
0

No registered APIs provide this dataset.

Similar datasets

SCARRIE LexiconNasjonalbiblioteket
Public access
Grapheme-to-Phoneme Models for NorwegianNasjonalbiblioteket
Public access
Translation Memories from Semantix ASNasjonalbiblioteket
Public access
NST Pronunciation Lexicon for SwedishNasjonalbiblioteket
Public access
Texts from Norwegian WikipediaNasjonalbiblioteket
Public access

Distributions
1

APIs providing this dataset
0

No registered APIs provide this dataset.

Contact information

Contact point:
Not provided
Website:
https://www.nb.no/sprakbanken/
Email:
sprakbanken@nb.no
Telephone:
Not provided

About the data

Language:
Content providers:
Not provided
Provenance:
Not provided
Update frequency:
Not provided
First issued:

This date indicates when the data in this dataset was first released. It may have happened before the dataset was published on data.norge.no.

July 1, 2022
Last updated:
August 18, 2023
Accuracy:
Not provided
Availability:
Not provided
Completeness:
Not provided
Currentness:
Not provided
Relevance:
Not provided
Geographical scope:
Not provided
Temporal scope:
Not provided
Conforms to:

Reference to an implementation rule or other specification that forms the basis for the dataset.

Not provided

Legal basis

Not provided

Concepts used in the dataset

Not provided

References

Not provided

About this dataset

Publisher:
Nasjonalbiblioteket
Published:

This date indicates when the dataset was harvested by data.norge.no. It may have been available earlier elsewhere.

Read more about harvesting here

March 3, 2026
Last updated:
March 13, 2026
Landing page:
Not provided
Documentation:
Not provided
Dataset type:
Not provided
Metadata Quality:

Metadata quality is an indicator of how well the datasets are described using metadata.

Read more about metadata quality here

Good (59%)
URI:

Themes

Keywords

Not provided

Discussions on Datalandsbyen
0

No discussions found

What is Datalandsbyen?

Datalandsbyen is our online forum where you can request data, share experiences, and ask for advice related to data sharing and information management.