Published in: BMC Medical Informatics and Decision Making 1/2012

Open Access 01-12-2012 | Research article

Querying phenotype-genotype relationships on patient datasets using semantic web technology: the example of cerebrotendinous xanthomatosis

Authors: María Taboada, Diego Martínez, Belén Pilo, Adriano Jiménez-Escrig, Peter N Robinson, María J Sobrido

Abstract

Background

Semantic Web technology can considerably catalyze translational genetics and genomics research in medicine, where the interchange of information between basic research and clinical levels becomes crucial. This exchange involves mapping abstract phenotype descriptions from research resources, such as knowledge databases and catalogs, to unstructured datasets produced through experimental methods and clinical practice. This is especially true for the construction of mutation databases. This paper presents a way of harmonizing abstract phenotype descriptions with patient data from clinical practice, and querying this dataset about relationships between phenotypes and genetic variants, at different levels of abstraction.

Methods

Because ontological and terminological resources that have already reached some consensus in biomedicine are now available, a reuse-based ontology engineering approach was followed. The proposed approach uses the Web Ontology Language (OWL) to represent the phenotype ontology and the patient model, the Semantic Web Rule Language (SWRL) to bridge the gap between phenotype descriptions and clinical data, and the Semantic Query-Enhanced Web Rule Language (SQWRL) to query relevant phenotype-genotype bidirectional relationships. The work tests the use of semantic web technology in the biomedical research domain of cerebrotendinous xanthomatosis (CTX), using a real dataset and ontologies.
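As a rough illustration of the idea described above, the following Python sketch mimics Horn-like rules that map raw clinical findings to phenotype concepts, and then queries phenotype-genotype relationships in both directions and at different levels of abstraction. It is not the authors' OWL/SWRL/SQWRL implementation: the hierarchy, the finding names, the rules, and the variant labels are all hypothetical placeholders chosen for the example.

```python
from dataclasses import dataclass, field

# Hypothetical phenotype hierarchy (child -> parent); illustrative only.
PARENT = {
    "TendonXanthoma": "Xanthomatosis",
    "Xanthomatosis": "PhenotypicAbnormality",
    "Cataract": "EyeAbnormality",
    "EyeAbnormality": "PhenotypicAbnormality",
}


@dataclass
class Patient:
    pid: str
    variant: str      # placeholder variant label, not a real mutation
    findings: dict    # raw clinical findings as recorded in practice
    phenotypes: set = field(default_factory=set)


# Horn-like rules: a condition on the findings (body) implies a phenotype (head).
RULES = [
    (lambda f: f.get("lens_opacity") is True, "Cataract"),
    (lambda f: f.get("achilles_tendon_mass") is True, "TendonXanthoma"),
]


def apply_rules(patients):
    """Add every phenotype whose rule body holds for the patient's findings."""
    for p in patients:
        for body, head in RULES:
            if body(p.findings):
                p.phenotypes.add(head)


def ancestors(term):
    """Return the term together with all of its more abstract ancestors."""
    out = {term}
    while term in PARENT:
        term = PARENT[term]
        out.add(term)
    return out


def variants_with(patients, phenotype):
    """Phenotype -> genotype: variants of patients showing the phenotype or a subtype."""
    return sorted({p.variant for p in patients
                   if any(phenotype in ancestors(t) for t in p.phenotypes)})


def phenotypes_of(patients, variant):
    """Genotype -> phenotype: phenotypes inferred for patients carrying the variant."""
    found = set()
    for p in patients:
        if p.variant == variant:
            found |= p.phenotypes
    return sorted(found)


patients = [
    Patient("p1", "variant_A", {"lens_opacity": True}),
    Patient("p2", "variant_B", {"achilles_tendon_mass": True}),
    Patient("p3", "variant_A", {"achilles_tendon_mass": True, "lens_opacity": False}),
]
apply_rules(patients)

print(variants_with(patients, "Cataract"))       # specific level
print(variants_with(patients, "Xanthomatosis"))  # more abstract level
print(phenotypes_of(patients, "variant_A"))      # reverse direction
```

Querying for `Xanthomatosis` also returns patients annotated only with the more specific `TendonXanthoma`, which is the kind of abstraction-level flexibility that, in the paper's setting, comes from the ontology's subclass hierarchy.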

Results

A framework to query relevant phenotype-genotype bidirectional relationships is provided. Phenotype descriptions and patient data were harmonized by defining 28 Horn-like rules in terms of the OWL concepts. In total, 24 patterns of SQWRL queries were designed following the initial list of competency questions. Because the approach is based on OWL, the semantics of the framework adopt the standard logical model of the open world assumption.
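The practical effect of the open world assumption can be shown with a small Python sketch. It is only an analogy for the OWL semantics, with hypothetical function and phenotype names: a fact that is not asserted is treated as unknown rather than false, whereas a closed-world lookup would answer "no".

```python
from enum import Enum


class Answer(Enum):
    YES = "yes"
    NO = "no"
    UNKNOWN = "unknown"


def open_world_has(present, absent, phenotype):
    """Open world: a missing assertion means 'unknown', not 'no'."""
    if phenotype in present:
        return Answer.YES
    if phenotype in absent:
        return Answer.NO
    return Answer.UNKNOWN


def closed_world_has(present, phenotype):
    """Closed world: anything not recorded as present is taken to be absent."""
    return Answer.YES if phenotype in present else Answer.NO


# Hypothetical patient record: one phenotype observed, one explicitly ruled out.
present = {"Cataract"}
absent = {"Epilepsy"}

print(open_world_has(present, absent, "Cataract"))
print(open_world_has(present, absent, "Epilepsy"))
print(open_world_has(present, absent, "TendonXanthoma"))
print(closed_world_has(present, "TendonXanthoma"))
```

Under this reading, a phenotype that was never examined does not count as evidence against a phenotype-genotype relationship, which fits datasets where the clinical picture is only partially recorded.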

Conclusions

This work demonstrates how semantic web technologies can be used to support the flexible representation and computational inference mechanisms required to query patient datasets at different levels of abstraction. The open world assumption is especially well suited to describing phenotype-genotype relationships that are only partially known, in a way that is easily extensible. In the future, this type of approach could offer researchers a valuable resource to infer new data from patient data for statistical analysis in translational research. In conclusion, phenotype description formalization and mapping to clinical data are two key elements for interchanging knowledge between basic and clinical research.
Metadata
Title
Querying phenotype-genotype relationships on patient datasets using semantic web technology: the example of cerebrotendinous xanthomatosis
Authors
María Taboada
Diego Martínez
Belén Pilo
Adriano Jiménez-Escrig
Peter N Robinson
María J Sobrido
Publication date
01-12-2012
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2012
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/1472-6947-12-78
