Setting up the data source that data.norge.no retrieves the description from
Now that we have the complete description, we need to make it available from a data source. The data source (or endpoint) must provide (English: serve) the file in a valid RDF format, such as Turtle. This allows data.norge.no to harvest, or "retrieve," the catalog and display its content on data.norge.no.
Note: If your organization already has a data catalog that data.norge.no harvests from, you do not need to register a separate endpoint for your dataset description. Just make sure that your dataset description is included in the organization's data catalog.
We upload the example file with the description of the AI overview to Github with the filename ai_projects_norwegian_state_dataset_description.ttl
. The file can then be accessed online at the URL https://raw.githubusercontent.com/Informasjonsforvaltning/ai-project-service/main/ai_projects_norwegian_state_dataset_description.ttl
.
We will use this URL when registering the data source.
Registering the data source in the administration portal
The next step is to register the data source in the data.norge.no administration portal. You log in via the page https://data.norge.no/publishing where you click "Log in to manage harvesting."
Note: You must have the correct permissions and rights to register the data source on behalf of your organization. Learn more about how to get access to data.norge.no's solutions.
Once you are in the administration portal, click the "Add data source" button.
Add data source
Check that the publisher is correct; this should be the organization publishing the dataset.
Note: If you do not see your organization here, check that you have the correct permissions and rights on behalf of your organization.
The Catalog should be "Dataset" and Data Source Type should be "DCAT-AP-NO."
The format (in our case) is Turtle, but here you select the RDF format you provide the description in.
The name of the data source is used only in the administration portal's display of all data sources. It is only meant for the organization to keep track of its data sources and is not displayed publicly.
The URL to the data source should be the URL where you provide the data description from. In our case, it is the URL https://raw.githubusercontent.com/Informasjonsforvaltning/ai-project-service/main/ai_projects_norwegian_state_dataset_description.ttl
.
We have no authentication for this data source, so we leave those fields empty:
Then click "Save."
The data source should now appear in the list of data sources. Ours looks like this:
Note: You can validate the description by clicking "Validate." You will be taken to data.norge.no's validation tool for descriptions.
Start harvesting
You can start a harvest by clicking "Harvest"; this will prompt data.norge.no to retrieve and process the description before displaying it on data.norge.no. This process may take a few minutes.
Tip: The description of the AI overview dataset is located here: TODO:add-link