DBpedia domains and URIs

From DBpedia Mappings

Revision as of 23:45, 14 May 2012

Long-term goals

  • All languages use IRIs, not URIs
Items | IRI pattern
Ontology classes and properties | http://dbpedia.org/ontology/Xyz
Datatypes | http://dbpedia.org/datatype/xyz
Resource IRI for http://xx.wikipedia.org/wiki/Xyz | http://xx.dbpedia.org/resource/Xyz
Properties extracted by generic template extractor from http://xx.wikipedia.org/ pages | http://xx.dbpedia.org/property/xyz
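
The resource and property patterns above can be sketched in a few lines of Python. This is only an illustration, not the extraction framework's code: the function names (language_and_title, resource_iri, property_iri) are made up for this example, and the sketch assumes that the page title is copied into the IRI unchanged.

```python
from urllib.parse import urlsplit

WIKIPEDIA_SUFFIX = ".wikipedia.org"
WIKI_PATH_PREFIX = "/wiki/"


def language_and_title(wikipedia_url: str) -> tuple[str, str]:
    """Split http://xx.wikipedia.org/wiki/Xyz into ("xx", "Xyz")."""
    parts = urlsplit(wikipedia_url)
    host = parts.hostname or ""
    path = parts.path
    if not host.endswith(WIKIPEDIA_SUFFIX) or not path.startswith(WIKI_PATH_PREFIX):
        raise ValueError(f"not a Wikipedia page URL: {wikipedia_url}")
    lang = host[: -len(WIKIPEDIA_SUFFIX)]
    title = path[len(WIKI_PATH_PREFIX):]
    if not lang or not title:
        raise ValueError(f"missing language or title: {wikipedia_url}")
    return lang, title


def resource_iri(wikipedia_url: str) -> str:
    """Hypothetical helper: resource IRI for a Wikipedia page URL."""
    lang, title = language_and_title(wikipedia_url)
    # Assumption: the title is used as-is, without further encoding.
    return f"http://{lang}.dbpedia.org/resource/{title}"


def property_iri(lang: str, name: str) -> str:
    """Hypothetical helper: IRI of a property found by the generic template extractor."""
    return f"http://{lang}.dbpedia.org/property/{name}"
```

For example, resource_iri("http://de.wikipedia.org/wiki/Berlin") yields http://de.dbpedia.org/resource/Berlin. Ontology classes, ontology properties and datatypes are not language-specific and stay under http://dbpedia.org/.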

Deviations from these rules

In the past, we did not follow these rules. For backwards compatibility, we will allow some deviations for a while. Might be a long while. :-)

  • English and French use URIs, not IRIs. (DBpedia Berlin will publish additional datasets with IRIs, but the main datasets will use URIs.)
Items | Deviating IRI pattern
Resource URI for http://en.wikipedia.org/wiki/Xyz | http://dbpedia.org/resource/Xyz
Properties extracted by generic template extractor from http://en.wikipedia.org/ pages | http://dbpedia.org/property/xyz

Notes

The main DBpedia release and a DBpedia chapter must use the same syntax for equivalent IRIs.

We used to map page titles from non-English Wikipedias to IRIs using the inter-language link to the English Wikipedia. If there was no such inter-language link, we did not extract any data from the non-English page. That was because we used URIs like http://dbpedia.org/resource/Xyz for all languages and had to 'normalize' the URIs. We will not do that anymore; instead, we will use http://xx.dbpedia.org/resource/Xyz IRIs, where xx is the language code of the Wikipedia edition.
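
To make the difference concrete, here is a minimal Python sketch of the two behaviours. It is not the framework's code: the function names and the langlinks dictionary (language code to page title) are assumptions made for this example, and the sketch leaves out any URI encoding of titles.

```python
from typing import Optional


def old_resource_uri(langlinks: dict[str, str]) -> Optional[str]:
    """Old behaviour: name the resource after the linked English page.

    `langlinks` is a hypothetical mapping from language code to page title,
    standing in for the inter-language links of a non-English page.
    """
    english_title = langlinks.get("en")
    if english_title is None:
        # No inter-language link to English: the page yielded no data.
        return None
    return f"http://dbpedia.org/resource/{english_title}"


def new_resource_iri(lang: str, title: str) -> str:
    """New behaviour: every page gets an IRI in its own language domain."""
    return f"http://{lang}.dbpedia.org/resource/{title}"
```

With the old scheme, a German page without an English counterpart is dropped (old_resource_uri({}) returns None). With the new scheme, the same page still gets an IRI, e.g. new_resource_iri("de", "Xyz") gives http://de.dbpedia.org/resource/Xyz.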

Implementation details

During the extraction, the framework will use http://xx.dbpedia.org/ IRIs for all languages, even English. Different serializers will then serialize them differently, according to the following rules:

  • convert en.dbpedia.org to dbpedia.org
  • convert IRIs to URIs for some languages (in subjects, predicates and objects)
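
The two rules above could look roughly like this in Python. This is a sketch under stated assumptions, not the framework's serializer: the function names are invented for this example, the IRI-to-URI step is plain UTF-8 percent-encoding of non-ASCII characters (which may differ from the exact encoding DBpedia uses), and every term of a triple is treated as an IRI, ignoring literals.

```python
from urllib.parse import quote, urlsplit, urlunsplit

ENGLISH_PREFIX = "http://en.dbpedia.org/"
MAIN_PREFIX = "http://dbpedia.org/"

# Characters left untouched when converting; "%" is kept so that
# already-encoded sequences are not encoded a second time.
SAFE_CHARS = "/%:@!$&'()*+,;="


def shorten_english_domain(iri: str) -> str:
    """Rule 1: convert en.dbpedia.org to dbpedia.org."""
    if iri.startswith(ENGLISH_PREFIX):
        return MAIN_PREFIX + iri[len(ENGLISH_PREFIX):]
    return iri


def iri_to_uri(iri: str) -> str:
    """Rule 2 (assumed encoding): percent-encode non-ASCII characters as UTF-8."""
    parts = urlsplit(iri)
    path = quote(parts.path, safe=SAFE_CHARS)
    query = quote(parts.query, safe=SAFE_CHARS + "?")
    fragment = quote(parts.fragment, safe=SAFE_CHARS + "?")
    return urlunsplit((parts.scheme, parts.netloc, path, query, fragment))


def serialize_term(iri: str, *, use_uris: bool) -> str:
    """Apply rule 1 always, and rule 2 only for languages that use URIs."""
    shortened = shorten_english_domain(iri)
    return iri_to_uri(shortened) if use_uris else shortened


def serialize_triple(triple: tuple[str, str, str], *, use_uris: bool) -> tuple[str, ...]:
    """Apply the rules to subject, predicate and object alike."""
    return tuple(serialize_term(term, use_uris=use_uris) for term in triple)
```

A serializer for a URI-based dataset would call serialize_triple(..., use_uris=True), while one for an IRI-based dataset would pass use_uris=False. Because the extraction itself always produces http://xx.dbpedia.org/ IRIs, the deviations stay confined to this last step.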