Last update: 
Thursday, January 22, 2009 - 09:45
Authors: 
Jun Zhao
Keith Alexander
Abstract: 
The Vocabulary of Interlinked Datasets (voiD) is a vocabulary and a set of instructions that enable the discovery and usage of linked datasets. A linked dataset is a collection of data, published and maintained by a single provider, available as RDF on the Web, where at least some of the resources in the dataset are identified by dereferenceable URIs.

Note
For a deeper understanding of how to use voiD, we advise users to consult the voiD guide.

In addition to the classes and properties listed below, voiD defines the following instances of type scovo:Dimension:

  • void:numberOfTriples - indicates that a scovo:Item represents a number of triples in the RDF sense.
  • void:numberOfResources - indicates that a scovo:Item represents a number of resources in the RDF sense, for example instances of foaf:Person. Each of these resources will typically be identified with a distinct URI.
  • void:numberOfDocuments - indicates that a scovo:Item represents a number of data-carrying documents, such as RDF/XML documents or RDFa-enabled web pages.
  • void:numberOfDistinctSubjects - indicates that a scovo:Item represents a number of distinct subjects, that is, the number of resources that appear in the S position of RDF statements S P O, with duplicates removed.
  • void:numberOfDistinctObjects - indicates that a scovo:Item represents a number of distinct objects, that is, the number of resources that appear in the O position of RDF statements S P O, with duplicates removed.
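
As an illustration of what these dimensions measure, the following minimal Python sketch computes three of them for a small in-memory list of triples. The function name count_dimensions and the example triples are hypothetical and not part of voiD; a real dataset would normally be counted with an RDF library or a SPARQL query.

```python
from typing import Dict, Iterable, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)


def count_dimensions(triples: Iterable[Triple]) -> Dict[str, int]:
    """Compute three voiD dimensions for an in-memory collection of triples."""
    # An RDF graph is a set of triples, so a repeated statement counts once.
    unique = set(triples)
    return {
        "void:numberOfTriples": len(unique),
        "void:numberOfDistinctSubjects": len({s for s, _, _ in unique}),
        "void:numberOfDistinctObjects": len({o for _, _, o in unique}),
    }


# Hypothetical example data, written with prefixed names for brevity.
EXAMPLE_TRIPLES = [
    ("ex:alice", "foaf:knows", "ex:bob"),
    ("ex:alice", "foaf:knows", "ex:carol"),
    ("ex:bob", "foaf:knows", "ex:carol"),
    ("ex:alice", "foaf:knows", "ex:bob"),  # duplicate statement
]

stats = count_dimensions(EXAMPLE_TRIPLES)
```

Here the duplicate statement is counted only once, so the example yields three triples, two distinct subjects and two distinct objects.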

voiD at a glance

voiD classes and properties

Dataset

Label: 
dataset

A dataset is a collection of data, published and maintained by a single provider, available as RDF on the Web, where at least some of the resources in the dataset are identified by dereferenceable URIs.

Note that void:Dataset rdfs:subClassOf http://purl.org/dc/dcmitype/Dataset holds.

Linkset

Label: 
linkset
Superclass: 
Dataset

Describes the links between two datasets. Modeling the interlinking as a class makes it possible to attach further characteristics, that is, to describe the interlinking in greater detail (with statistics, for example).

TechnicalFeature

Label: 
technical feature

Expresses a technical feature of a dataset, such as a supported format (availability in RDF/XML, for example).

statItem

Label: 
statistical item
Domain: 
Dataset

States a quantitative measure of a dataset. Note that the range is http://purl.org/NET/scovo#Item.
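
A statistical item could be attached to a dataset roughly as sketched below. The sketch renders a Turtle snippet from Python; the helper stat_item_turtle and the dataset name :ExampleDataset are hypothetical. It assumes that the scovo:Item carries its dimension through scovo:dimension and its number through rdf:value, which come from the SCOVO vocabulary rather than from this document.

```python
PREFIXES = (
    "@prefix void: <http://rdfs.org/ns/void#> .\n"
    "@prefix scovo: <http://purl.org/NET/scovo#> .\n"
    "@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .\n"
    "@prefix : <http://example.org/datasets#> .\n"  # hypothetical namespace
)


def stat_item_turtle(dataset: str, dimension: str, value: int) -> str:
    """Render one void:statItem statement as a Turtle snippet.

    Assumption: the scovo:Item is a blank node with one scovo:dimension
    and one rdf:value, following SCOVO conventions.
    """
    if value < 0:
        raise ValueError("a count cannot be negative")
    return (
        f"{dataset} void:statItem [\n"
        f"    scovo:dimension {dimension} ;\n"
        f"    rdf:value {value}\n"
        f"] .\n"
    )


# Hypothetical dataset with a hypothetical triple count.
snippet = PREFIXES + "\n" + stat_item_turtle(
    ":ExampleDataset", "void:numberOfTriples", 1000
)
```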

feature

Label: 
feature
Domain: 
Dataset
Range: 
TechnicalFeature

Defines which technical features a dataset supports.

subset

Label: 
has subset
Domain: 
Dataset
Range: 
Dataset

Defines a logical subset of a dataset (a linkset or a dataset containing only certain resources of a domain, for example).

target

Label: 
has target
Domain: 
Linkset
Range: 
Dataset

States that a linkset has a certain dataset as its target.

sparqlEndpoint

Label: 
has a SPARQL endpoint at
Domain: 
Dataset

Announces a SPARQL endpoint through which the dataset can be queried.

Note that the range is rdfs:Resource (a URI rather than a string).
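
Once a client has read the endpoint URI from a void:sparqlEndpoint statement, it could build a query request as in the following sketch. It assumes the endpoint accepts the SPARQL protocol's GET binding, in which the query text is passed in a query parameter; the function name sparql_get_url and the endpoint URI are hypothetical.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def sparql_get_url(endpoint: str, query: str) -> str:
    """Build the GET URL that sends `query` to a SPARQL endpoint."""
    parts = urlsplit(endpoint)
    # Keep any parameters the endpoint URI already carries.
    params = parse_qsl(parts.query, keep_blank_values=True)
    params.append(("query", query))
    return urlunsplit(parts._replace(query=urlencode(params)))


# Hypothetical endpoint URI, as it might appear in a voiD description.
url = sparql_get_url("http://example.org/sparql", "ASK { ?s ?p ?o }")
```

The sketch only builds the URL; sending the request and choosing a result format are left to the client.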

linkPredicate

Label: 
a link predicate
Domain: 
Linkset

States the RDF predicate that the interlinking is based on. Note that the range is rdf:Property.

exampleResource

Label: 
example resource of dataset
Domain: 
Dataset

Gives an example of a representative resource of a dataset. Note that the range is rdfs:Resource.

vocabulary

Label: 
vocabulary
Domain: 
Dataset
Comment: 
A vocabulary that is used in the dataset.

This property links a void:Dataset to an RDFS vocabulary or OWL ontology whose classes and properties are used in the dataset.

Classes and properties are typically linked to their defining vocabulary using the rdfs:isDefinedBy property. The same URI that is used with rdfs:isDefinedBy to identify the vocabulary should also be used in the void:vocabulary statement.

subjectsTarget

Label: 
has target where subjects are located
Domain: 
Linkset
Range: 
Dataset
Comment: 
The source target of an interlinking

For RDF properties such as foaf:based_near, where the direction of the link must be stated explicitly, this property indicates the source of the interlinking, that is, the dataset in which the subjects of the links are located.

Functional Property: 
Functional Property
Superproperty: 
target

objectsTarget

Label: 
has target where objects are located
Domain: 
Linkset
Range: 
Dataset
Comment: 
The sink target of an interlinking

For RDF properties such as foaf:based_near, where the direction of the link must be stated explicitly, this property indicates the sink of the interlinking, that is, the dataset in which the objects of the links are located.

Functional Property: 
Functional Property
Superproperty: 
target
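
The roles of subjectsTarget and objectsTarget can be illustrated with a short Python sketch that inspects the link triples of a linkset and reports which dataset holds the subjects and which holds the objects. Identifying a dataset by a URI prefix is a simplifying assumption made for this example only, and the names locate, link_direction, DATASETS and LINKS are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)


def locate(uri: str, datasets: Dict[str, str]) -> Optional[str]:
    """Return the name of the dataset whose URI prefix matches `uri`."""
    for name, prefix in datasets.items():
        if uri.startswith(prefix):
            return name
    return None


def link_direction(links: List[Triple], datasets: Dict[str, str]) -> Dict[str, str]:
    """Determine the subjects target and objects target of a linkset."""
    subject_sets = {locate(s, datasets) for s, _, _ in links}
    object_sets = {locate(o, datasets) for _, _, o in links}
    # Both properties are functional, so each side must resolve to one dataset.
    if len(subject_sets) != 1 or len(object_sets) != 1:
        raise ValueError("links do not run between exactly two datasets")
    if None in subject_sets or None in object_sets:
        raise ValueError("a link refers to a resource outside the known datasets")
    return {
        "void:subjectsTarget": subject_sets.pop(),
        "void:objectsTarget": object_sets.pop(),
    }


# Hypothetical datasets and links.
DATASETS = {
    ":People": "http://people.example.org/",
    ":Places": "http://places.example.org/",
}
LINKS = [
    ("http://people.example.org/alice", "foaf:based_near", "http://places.example.org/galway"),
    ("http://people.example.org/bob", "foaf:based_near", "http://places.example.org/oxford"),
]

direction = link_direction(LINKS, DATASETS)
```

In this example the subjects are located in :People and the objects in :Places, so :People would be stated as the subjects target and :Places as the objects target.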

dataDump

Label: 
has data dump at
Domain: 
Dataset
Comment: 
Announcement of an RDF dump of the dataset.

If an RDF dump of the dataset is available, then its location can be announced using this property.

Note that the range is rdfs:Resource.

uriLookupEndpoint

Label: 
has an URI look-up endpoint at
Domain: 
Dataset
Comment: 
Defines a simple URI look-up protocol for accessing a dataset.

Defines a simple URI look-up protocol for accessing a dataset, based on a URI endpoint.

Note that the range is rdfs:Resource.
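
This document does not spell out how a look-up request is formed. The sketch below assumes one plausible convention, in which the percent-encoded URI of the resource is appended to the endpoint URI; the function name lookup_url and both URIs are hypothetical, and the voiD guide should be consulted for the actual protocol.

```python
from urllib.parse import quote


def lookup_url(endpoint: str, resource_uri: str) -> str:
    """Append the percent-encoded resource URI to the look-up endpoint.

    Assumption: the endpoint URI ends where the encoded URI should start,
    for example with "?uri=".
    """
    # safe="" also encodes ":" and "/", so the URI travels as a single value.
    return endpoint + quote(resource_uri, safe="")


# Hypothetical endpoint and resource.
request_url = lookup_url("http://lookup.example.org/?uri=", "http://example.org/id/1")
```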

uriRegexPattern

Label: 
has URI regular expression pattern
Domain: 
Dataset
Comment: 
Defines a regular expression pattern matching URIs in the dataset.

A regular expression pattern that matches one or more URIs in the dataset.

Note that the value should use the same regular expression syntax as SPARQL, which uses the syntax definition of XML Schema 2: Regular Expressions (http://www.w3.org/TR/xmlschema-2/#regexs).
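
A consumer could use such a pattern to decide whether a given URI belongs to a dataset, as in the following sketch. Python's re module is used here as a stand-in: its syntax is not identical to the XML Schema syntax that SPARQL builds on, although simple patterns like the one shown behave the same way. The function name matches_dataset, the pattern and the URIs are hypothetical.

```python
import re


def matches_dataset(pattern: str, uri: str) -> bool:
    """Check whether `uri` matches a dataset's URI pattern.

    re.search is used because the SPARQL regex function matches anywhere
    in the string unless the pattern is anchored with ^ or $.
    """
    return re.search(pattern, uri) is not None


# Hypothetical pattern, as it might appear in a void:uriRegexPattern statement.
PATTERN = r"^http://example\.org/resource/"

in_dataset = matches_dataset(PATTERN, "http://example.org/resource/Berlin")
not_in_dataset = matches_dataset(PATTERN, "http://other.example.org/resource/Berlin")
```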