ONOMASTICA Pronunciation Lexicon

Datasets
Public access
Publicly available to everyone. Access may still require registration and an API key request, as long as anyone can request such registration and/or API keys.
Read more about access levels here
Open data
The dataset is classified as public access and has at least one distribution with an approved open license.

Description

ONOMASTICA is a database containing original data from the Norwegian part of the ONOMASTICA project, a European research project aiming at producing pronunciation lexica of proper names for various European languages. The data include first names, family names, company names, street names, place names and foreign names.

The database contains a total of 556,499 transcribed names. The data was automatically transcribed, but partially checked manually by trained phoneticians. The material is transcribed using SAMPA. See the documentation files for details.

The database is published with permission from Telenor, and may be used freely and distributed without compensation. Telenor must be credited when the database is used or distributed.

Note that an updated and more user-frendly version of the database in csv format has been published by the Language Bank. Type "sbr-67" in the search bar to find the updated version.

Distributions
1

Nameless distribution

gtar

Description:

Not provided

Access URL:

https://hdl.handle.net/21.11146/38

Status:

Not provided

Direct download:

API:

Not provided

Documentation:

Not provided

License:

https://creativecommons.org/licenses/by/4.0/

Conforms to:

Not provided

Rights for use:

Not provided

Download

APIs providing this dataset
0

No registered APIs provide this dataset.

Similar datasets

SCARRIE Lexicon	Nasjonalbiblioteket	Public access
Grapheme-to-Phoneme Models for Norwegian	Nasjonalbiblioteket	Public access
Translation Memories from Semantix AS	Nasjonalbiblioteket	Public access
NST Pronunciation Lexicon for Swedish	Nasjonalbiblioteket	Public access
Texts from Norwegian Wikipedia	Nasjonalbiblioteket	Public access

ONOMASTICA Pronunciation Lexicon

Description

Distributions
1

APIs providing this dataset
0

Similar datasets

Distributions
1

APIs providing this dataset
0

Contact information

About the data

Legal basis

Concepts used in the dataset

References

About this dataset

Themes

Keywords

Discussions on Datalandsbyen
0

What is Datalandsbyen?

Resource Description Framework (RDF)
All URLs to resources on data.norge.no can provide RDF metadata in various formats, depending on the Accept header sent with the request.
Read more about RDF and the formats we support here

Did you find what you were looking for?

ONOMASTICA Pronunciation Lexicon

Description

Distributions1

APIs providing this dataset0

Similar datasets

Distributions1

APIs providing this dataset0

Contact information

About the data

Legal basis

Concepts used in the dataset

References

About this dataset

Themes

Keywords

Discussions on Datalandsbyen0

What is Datalandsbyen?

Resource Description Framework (RDF)All URLs to resources on data.norge.no can provide RDF metadata in various formats, depending on the Accept header sent with the request.Read more about RDF and the formats we support here

Did you find what you were looking for?

Distributions
1

APIs providing this dataset
0

Distributions
1

APIs providing this dataset
0

Discussions on Datalandsbyen
0

Resource Description Framework (RDF)
All URLs to resources on data.norge.no can provide RDF metadata in various formats, depending on the Accept header sent with the request.
Read more about RDF and the formats we support here