DBpedia Development Wiki devilopment bible

Edit this page on Github

Databus Archivo

Information

Archivo is an online ontology interface and augmented archive, that discovers, crawls, versions and archives ontologies on the DBpedia Databus. Each Databus Artifact represents one certain ontology and each version represents a new version of the ontology.

Usage

Finding a ontology on Databus Archivo

There are a few options to find a ontology:

Accessing a Archivo Ontology

Generally Ontologies on Archivo can be accessed by querying the Databus SPARQL endpoint at https://databus.dbpedia.org/repo/sparql. You can test your query at https://databus.dbpedia.org/yasgui/.

The latest version of an ontology (example):

PREFIX dataid: <http://dataid.dbpedia.org/ns/core#>
PREFIX dct:    <http://purl.org/dc/terms/>
PREFIX dcat:   <http://www.w3.org/ns/dcat#>
PREFIX db:     <https://databus.dbpedia.org/>
PREFIX rdf:    <http://www.w3.org/1999/02/22-rdf-syntax-ns#>    
PREFIX rdfs:   <http://www.w3.org/2000/01/rdf-schema#>
PREFIX dataid-cv: <http://dataid.dbpedia.org/ns/cv#>
PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>

SELECT DISTINCT ?file WHERE {
  VALUES ?art { <ONTOLOGY-ARTIFACT> } .
  	    ?dataset dataid:account db:ontologies . 
  	    ?dataset dataid:artifact ?art .
  	    ?dataset dcat:distribution ?distribution .
  	    ?distribution dataid-cv:type 'parsed'^^xsd:string . 	
  	    ?distribution dataid:formatExtension 'FORMAT'^^xsd:string . 
  	    ?distribution dcat:downloadURL ?file .
  	    ?dataset dct:hasVersion ?latestVersion .
  	        {
	    SELECT DISTINCT ?art (MAX(?v) as ?latestVersion) WHERE {
  		    ?dataset dataid:account db:ontologies .
	        ?dataset dataid:artifact ?art .
	        ?dataset dct:hasVersion ?v .
	    }
        }
}

This query retrieves the URL of a parsed ontology, with the parameters:

  • ONTOLOGY-ARTIFACT as the artifact of your choosen ontology, e.g https://databus.dbpedia.org/ontologies/xmlns.com/foaf–0–1

  • FORMAT as the desired format. Archivo provides the ontologies as Turtle (ttl), XML-RDF (owl) and N-Triples (nt).

Archivo Documentation

Files

Archivo provides for each version different files:

type-values sub-cvs Explaination
orig   Snapshot of the original ontology file
parsed   Files parsed by rapper, available as owl, nt and ttl
OOPS   OOPS-report
generatedDocu   human readable documentation by LODE
profile   a profile check done by profilechecker
shaclReport validates={minLicense, goodLicense, lodeMetadata} shacl-report as turtle file
meta   a JSON file containing some meta info
pelletInfo imports={FULL,NONE} the pellet info report, with and without imports
pelletConsistency imports={FULL,NONE} the pellet consistency report, with and without imports
diff triples={adds,deletes} These file contain the added/deleted triples

Stars

Archivo provides a basic star-rating (not to be confused with the 5 stars of linked data).

Baseline:

  • ★: The Ontology parses (Errors and warnings are ok, but rapper must be able to retrieve at least some triples)

  • ★★: The Ontology contains some form of license, given by the most popular properties: dct:license, cc:license or xhv:license

If the ontology fulfills the baseline, it can earn two further stars by using good practises:

  • ★★★: The license is given by dct:license and is an IRI.

  • ★★★★: The ontology is consistent (tested by pellet, with or without imports).

Ontology Sources

Archivo uses four different sources of potential ontologies:

  1. Ontology Repositories: e.g ontologies listed in LOV
  2. Subjects, Predicates and Objects of Ontologies: Every SPO in an Ontology can lead to a potential new ontology, so Archivo can discover new vocabularies by analyzing already listed ontologies
  3. VOID Data: Search new ontologies by looking for rdfs:isDfinedBy triples in classes and properties used in the whole Databus
  4. [User Suggestions](http://archivo.dbpedia.org/add

Prefixes The prefixes used in this description:

  • PREFIX dct: <http://purl.org/dc/terms/>

  • PREFIX dc: <http://purl.org/dc/elements/1.1/>

  • PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>

  • PREFIX cc: <http://creativecommons.org/ns#>

  • PREFIX xhv: <http://www.w3.org/1999/xhtml/vocab#>