A Statistical and Schema Independent Approach to Identify Equivalent Properties on Linked Data

Kalpa Gunaratna, Krishnaprasad Thirunarayan, Prateek Jain, Amit Sheth, Sanjaya Wijeratne

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Linked Open Data (LOD) cloud has gained significant attention in the Semantic Web community recently. Currently it consists of approximately 295 interlinked datasets with over 50 billion triples including 500 million links, and continues to expand in size. This vast source of structured information has the potential to have a significant impact on knowledge-based applications. However, a key impediment to the use of LOD cloud is limited support for data integration tasks over concepts, instances, and properties. Efforts to address this limitation over properties have focused on matching data-type properties across datasets; however, matching of object-type properties has not received similar attention. We present an approach that can automatically match object-type properties across linked datasets, primarily exploiting and bootstrapping from entity co-reference links such as owl:sameAs. Our evaluation, using sample instance sets taken from Freebase, DBpedia, LinkedMDB, and DBLP datasets covering multiple domains shows that our approach matches properties with high precision and recall (on average, F measure gain of 57% - 78%).

Original languageAmerican English
Title of host publicationI-SEMANTICS '13: Proceedings of the 9th International Conference on Semantic Systems
PublisherPubl by ACM
Pages33-40
Number of pages8
ISBN (Print)9781450319720
DOIs
StatePublished - Sep 4 2013
Event9th International Conference on Semantic Systems - Graz, Austria
Duration: Sep 4 2013Sep 6 2013
Conference number: 9

Conference

Conference9th International Conference on Semantic Systems
Abbreviated titleI-SEMANTICS 2013
Country/TerritoryAustria
CityGraz
Period9/4/139/6/13

ASJC Scopus Subject Areas

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications

Keywords

  • Linked Open Data
  • Property Alignment
  • Relationship Identication
  • Statistical Equivalence

Disciplines

  • Bioinformatics
  • Communication
  • Communication Technology and New Media
  • Computer Sciences
  • Databases and Information Systems
  • Life Sciences
  • OS and Networks
  • Physical Sciences and Mathematics
  • Science and Technology Studies
  • Social and Behavioral Sciences

Cite this