Search | VHL Regional Portal

semQA: SPARQL with Idempotent Disjunction.

Shironoshita, E Patrick; Jean-Mary, Yves R; Bradley, Ray M; Kabuka, Mansur R.

IEEE Trans Knowl Data Eng ; 21(3): 401-414, 2009 Mar 01.

Article in English | MEDLINE | ID: mdl-19915690

ABSTRACT

The SPARQL LeftJoin abstract operator is not distributive over Union; this limits the algebraic manipulation of graph patterns, which in turn restricts the ability to create query plans for distributed processing or query optimization. In this paper, we present semQA, an algebraic extension for the SPARQL query language for RDF, which overcomes this issue by transforming graph patterns through the use of an idempotent disjunction operator Or as a substitute for Union. This permits the application of a set of equivalences that transform a query into distinct forms. We further present an algorithm to derive the solution set of the original query from the solution set of a query where Union has been substituted by Or. We also analyze the combined complexity of SPARQL, proving it to be NP-complete. It is also shown that the SPARQL query language is not, in the general case, fixed-parameter tractable. Experimental results are presented to validate the query evaluation methodology presented in this paper against the SPARQL standard to corroborate the complexity analysis and to illustrate the gains in processing cost reduction that can be obtained through the application of semQA.

Ontology Matching with Semantic Verification.

Jean-Mary, Yves R; Shironoshita, E Patrick; Kabuka, Mansur R.

Web Semant ; 7(3): 235-251, 2009 Sep 01.

Article in English | MEDLINE | ID: mdl-20186256

ABSTRACT

ASMOV (Automated Semantic Matching of Ontologies with Verification) is a novel algorithm that uses lexical and structural characteristics of two ontologies to iteratively calculate a similarity measure between them, derives an alignment, and then verifies it to ensure that it does not contain semantic inconsistencies. In this paper, we describe the ASMOV algorithm, and then present experimental results that measure its accuracy using the OAEI 2008 tests, and that evaluate its use with two different thesauri: WordNet, and the Unified Medical Language System (UMLS). These results show the increased accuracy obtained by combining lexical, structural and extensional matchers with semantic verification, and demonstrate the advantage of using a domain-specific thesaurus for the alignment of specialized ontologies.

semCDI: a query formulation for semantic data integration in caBIG.

Shironoshita, E Patrick; Jean-Mary, Yves R; Bradley, Ray M; Kabuka, Mansur R.

J Am Med Inform Assoc ; 15(4): 559-68, 2008.

Article in English | MEDLINE | ID: mdl-18436897

ABSTRACT

OBJECTIVES: To develop mechanisms to formulate queries over the semantic representation of cancer-related data services available through the cancer Biomedical Informatics Grid (caBIG). DESIGN: The semCDI query formulation uses a view of caBIG semantic concepts, metadata, and data as an ontology, and defines a methodology to specify queries using the SPARQL query language, extended with Horn rules. semCDI enables the joining of data that represent different concepts through associations modeled as object properties, and the merging of data representing the same concept in different sources through Common Data Elements (CDE) modeled as datatype properties, using Horn rules to specify additional semantics indicating conditions for merging data. Validation In order to validate this formulation, a prototype has been constructed, and two queries have been executed against currently available caBIG data services. DISCUSSION: The semCDI query formulation uses the rich semantic metadata available in caBIG to build queries and integrate data from multiple sources. Its promise will be further enhanced as more data services are registered in caBIG, and as more linkages can be achieved between the knowledge contained within caBIG's NCI Thesaurus and the data contained in the Data Services. CONCLUSION: semCDI provides a formulation for the creation of queries on the semantic representation of caBIG. This constitutes the foundation to build a semantic data integration system for more efficient and effective querying and exploratory searching of cancer-related data.

Subject(s)

Information Management/methods , Information Storage and Retrieval/methods , Information Systems/organization & administration , Medical Oncology/organization & administration , Semantics , Biomedical Research/organization & administration , Cancer Care Facilities/organization & administration , Computational Biology , Humans , Internet , Vocabulary, Controlled

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL