SAKey: Scalable almost key discovery in RDF data

dc.contributor.authorSymeonidou, Danai
dc.contributor.authorArmant, Vincent
dc.contributor.authorPernelle, Nathalie
dc.contributor.authorSais, Fatiha
dc.contributor.funderScience Foundation Irelanden
dc.date.accessioned2016-04-20T15:50:48Z
dc.date.available2016-04-20T15:50:48Z
dc.date.issued2014-10
dc.date.updated2016-01-11T14:14:54Z
dc.description.abstractExploiting identity links among RDF resources allows applications to efficiently integrate data. Keys can be very useful to discover these identity links. A set of properties is considered as a key when its values uniquely identify resources. However, these keys are usually not available. The approaches that attempt to automatically discover keys can easily be overwhelmed by the size of the data and require clean data. We present SAKey, an approach that discovers keys in RDF data in an efficient way. To prune the search space, SAKey exploits characteristics of the data that are dynamically detected during the process. Furthermore, our approach can discover keys in datasets where erroneous data or duplicates exist (i.e., almost keys). The approach has been evaluated on different synthetic and real datasets. The results show both the relevance of almost keys and the efficiency of discovering them.en
dc.description.sponsorshipScience Foundation Ireland (Grant No. 12/RC/2289)en
dc.description.statusPeer revieweden
dc.description.urihttp://iswc2014.semanticweb.org/en
dc.description.versionAccepted Versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.citationSymeonidou, D., Armant, V., Pernelle, N. and Sais, F. (2014) "SAKey: Scalable almost key discovery in RDF data", 13th International Semantic Web Conference, ISWC 2014. Riva del Garda, Trento, Italy, 19-23 October, 2014. Springer: The Semantic Web – ISWC 2014, pp. 33-49. DOI: 10.1007/978-3-319-11964-9_3en
dc.identifier.doi10.1007/978-3-319-11964-9_3
dc.identifier.endpage49en
dc.identifier.isbn978-331911963-2
dc.identifier.issn03029743
dc.identifier.journaltitleLecture Notes in Computer Scienceen
dc.identifier.startpage33en
dc.identifier.urihttps://hdl.handle.net/10468/2471
dc.identifier.volume8796en
dc.language.isoenen
dc.publisherSpringer International Publishingen
dc.relation.ispartof13th International Semantic Web Conference, ISWC 2014. Riva del Garda, Trento, Italy, 19-23 October, 2014
dc.relation.urihttp://link.springer.com/chapter/10.1007/978-3-319-11964-9_3
dc.rights© 2014 Springer International Publishing. The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-11964-9_3en
dc.rights.urihttp://www.springer.com/gp/rights-permissions/obtaining-permissions/882en
dc.subjectData linkingen
dc.subjectIdentity linksen
dc.subjectKeysen
dc.subjectOWL2en
dc.subjectRDFen
dc.titleSAKey: Scalable almost key discovery in RDF dataen
dc.typeConference itemen
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
iswc14.pdf
Size:
934.5 KB
Format:
Adobe Portable Document Format
Description:
Accepted Version
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.71 KB
Format:
Item-specific license agreed upon to submission
Description: