Skip to main content
Top
Published in: BMC Medical Informatics and Decision Making 1/2012

Open Access 01-12-2012 | Technical advance

A repository based on a dynamically extensible data model supporting multidisciplinary research in neuroscience

Authors: Luca Corradi, Ivan Porro, Andrea Schenone, Parastoo Momeni, Raffaele Ferrari, Flavio Nobili, Michela Ferrara, Gabriele Arnulfo, Marco M Fato

Published in: BMC Medical Informatics and Decision Making | Issue 1/2012

Login to get access

Abstract

Background

Robust, extensible and distributed databases integrating clinical, imaging and molecular data represent a substantial challenge for modern neuroscience. It is even more difficult to provide extensible software environments able to effectively target the rapidly changing data requirements and structures of research experiments. There is an increasing request from the neuroscience community for software tools addressing technical challenges about: (i) supporting researchers in the medical field to carry out data analysis using integrated bioinformatics services and tools; (ii) handling multimodal/multiscale data and metadata, enabling the injection of several different data types according to structured schemas; (iii) providing high extensibility, in order to address different requirements deriving from a large variety of applications simply through a user runtime configuration.

Methods

A dynamically extensible data structure supporting collaborative multidisciplinary research projects in neuroscience has been defined and implemented. We have considered extensibility issues from two different points of view. First, the improvement of data flexibility has been taken into account. This has been done through the development of a methodology for the dynamic creation and use of data types and related metadata, based on the definition of “meta” data model. This way, users are not constrainted to a set of predefined data and the model can be easily extensible and applicable to different contexts. Second, users have been enabled to easily customize and extend the experimental procedures in order to track each step of acquisition or analysis. This has been achieved through a process-event data structure, a multipurpose taxonomic schema composed by two generic main objects: events and processes. Then, a repository has been built based on such data model and structure, and deployed on distributed resources thanks to a Grid-based approach. Finally, data integration aspects have been addressed by providing the repository application with an efficient dynamic interface designed to enable the user to both easily query the data depending on defined datatypes and view all the data of every patient in an integrated and simple way.

Results

The results of our work have been twofold. First, a dynamically extensible data model has been implemented and tested based on a “meta” data-model enabling users to define their own data types independently from the application context. This data model has allowed users to dynamically include additional data types without the need of rebuilding the underlying database. Then a complex process-event data structure has been built, based on this data model, describing patient-centered diagnostic processes and merging information from data and metadata. Second, a repository implementing such a data structure has been deployed on a distributed Data Grid in order to provide scalability both in terms of data input and data storage and to exploit distributed data and computational approaches in order to share resources more efficiently. Moreover, data managing has been made possible through a friendly web interface. The driving principle of not being forced to preconfigured data types has been satisfied. It is up to users to dynamically configure the data model for the given experiment or data acquisition program, thus making it potentially suitable for customized applications.

Conclusions

Based on such repository, data managing has been made possible through a friendly web interface. The driving principle of not being forced to preconfigured data types has been satisfied. It is up to users to dynamically configure the data model for the given experiment or data acquisition program, thus making it potentially suitable for customized applications.
Appendix
Available only for authorised users
Literature
2.
go back to reference Amari S, Beltrame F, Bjaalie J, Dalkara T, Schutter ED, Egan G, Goddard N, Gonzalez C, Grillner S, Herz A, Hoffmann K, Jaaskelainen I, Koslow S, Lee S, Matthiessen L, Miller P, Silva FD, Novak M, Ravindranath V, Ritz R, Ruotsalainen U, Sebestra V, Subramaniam S, Tang Y, Toga A, Usui S, Pelt JV, Verschure P, Willshaw D, Wrobel A: Neuroinformatics: the integration of shared databases and tools towards integrative neuroscience. J Integr Neurosci. 2002, 1 (2): 117-128. 10.1142/S0219635202000128.CrossRefPubMed Amari S, Beltrame F, Bjaalie J, Dalkara T, Schutter ED, Egan G, Goddard N, Gonzalez C, Grillner S, Herz A, Hoffmann K, Jaaskelainen I, Koslow S, Lee S, Matthiessen L, Miller P, Silva FD, Novak M, Ravindranath V, Ritz R, Ruotsalainen U, Sebestra V, Subramaniam S, Tang Y, Toga A, Usui S, Pelt JV, Verschure P, Willshaw D, Wrobel A: Neuroinformatics: the integration of shared databases and tools towards integrative neuroscience. J Integr Neurosci. 2002, 1 (2): 117-128. 10.1142/S0219635202000128.CrossRefPubMed
3.
go back to reference Phan J, Quo C, Wang M: Functional genomics and proteomics in the clinical neurosciences: data mining and bioinformatics. Progress in brain research. 2006, 158: 83-108.CrossRefPubMed Phan J, Quo C, Wang M: Functional genomics and proteomics in the clinical neurosciences: data mining and bioinformatics. Progress in brain research. 2006, 158: 83-108.CrossRefPubMed
7.
go back to reference Jones A, Miller M, Aebersold R, Apweiler R, Ball C, Brazma A, DeGreef J, Hardy N, Hermjakob H, Hubbard S, Hussey P, Igra M, Jenkins H, Julian R, Laursen K, Oliver S, Paton N, Sansone S, Sarkans U, Stoeckert C, Taylor C, Whetzel P, White J, Spellman P, Pizarro A: The Functional Genomics Experiment model (FuGE): an extensible framework for standards in functional genomics. Nat Biotech. 2007, 25 (10): 1127-1133. 10.1038/nbt1347. [http://dx.doi.org/10.1038/nbt1347],CrossRef Jones A, Miller M, Aebersold R, Apweiler R, Ball C, Brazma A, DeGreef J, Hardy N, Hermjakob H, Hubbard S, Hussey P, Igra M, Jenkins H, Julian R, Laursen K, Oliver S, Paton N, Sansone S, Sarkans U, Stoeckert C, Taylor C, Whetzel P, White J, Spellman P, Pizarro A: The Functional Genomics Experiment model (FuGE): an extensible framework for standards in functional genomics. Nat Biotech. 2007, 25 (10): 1127-1133. 10.1038/nbt1347. [http://​dx.​doi.​org/​10.​1038/​nbt1347],CrossRef
8.
go back to reference Perez-Rey D, Maojo V, Garcia-Remesal M, Alonso-Calvo R: Biomedical ontologies in post-genomic information systems. Fourth IEEE Symposium on Bioinformatics and Bioengineering BIBE 2004. Proceedings. 2004, 207-214.CrossRef Perez-Rey D, Maojo V, Garcia-Remesal M, Alonso-Calvo R: Biomedical ontologies in post-genomic information systems. Fourth IEEE Symposium on Bioinformatics and Bioengineering BIBE 2004. Proceedings. 2004, 207-214.CrossRef
10.
go back to reference Smith B, Shburner M, Rosse C, Bard J, Bug W, Ceusters W, Goldberg L, Eilbeck K, Ireland A, Mungall C, Leontis N, Rocca-Serra P, Ruttenberg A, Sansone S, Scheuermann R, Shah N, Whetzel P, Lewis S: The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration. Nat Biotech. 2007, 25 (11): 1251-1255. 10.1038/nbt1346. [http://dx.doi.org/10.1038/nbt1346],CrossRef Smith B, Shburner M, Rosse C, Bard J, Bug W, Ceusters W, Goldberg L, Eilbeck K, Ireland A, Mungall C, Leontis N, Rocca-Serra P, Ruttenberg A, Sansone S, Scheuermann R, Shah N, Whetzel P, Lewis S: The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration. Nat Biotech. 2007, 25 (11): 1251-1255. 10.1038/nbt1346. [http://​dx.​doi.​org/​10.​1038/​nbt1346],CrossRef
12.
go back to reference Whetzel PL, Brinkman RR, Causton HC, Fan L, Field D, Fostel J, Fragoso G, Gray T, Heiskanen M, Hernandez-Boussard T, Morrison N, Parkinson H, Rocca-Serra P, Sansone SA, Schober D, Smith B, Stevens R, Stoeckert CJ, Jr CT, White J, Wood A, Group FW: Development of FuGO: An Ontology for Functional Genomics Investigations. OMICS. 2006, 10 (2): 199-204. 10.1089/omi.2006.10.199.CrossRefPubMedPubMedCentral Whetzel PL, Brinkman RR, Causton HC, Fan L, Field D, Fostel J, Fragoso G, Gray T, Heiskanen M, Hernandez-Boussard T, Morrison N, Parkinson H, Rocca-Serra P, Sansone SA, Schober D, Smith B, Stevens R, Stoeckert CJ, Jr CT, White J, Wood A, Group FW: Development of FuGO: An Ontology for Functional Genomics Investigations. OMICS. 2006, 10 (2): 199-204. 10.1089/omi.2006.10.199.CrossRefPubMedPubMedCentral
15.
go back to reference Marenco L, Tosches T, Crasto C, Shepherd G, Miller P, Nadkarni P: Achieving evolvable web-database bioscience applications using the EAV/CR framework: recent advances. J Am Med Inform Assoc. 2003, 10: 444-453. 10.1197/jamia.M1303.CrossRefPubMedPubMedCentral Marenco L, Tosches T, Crasto C, Shepherd G, Miller P, Nadkarni P: Achieving evolvable web-database bioscience applications using the EAV/CR framework: recent advances. J Am Med Inform Assoc. 2003, 10: 444-453. 10.1197/jamia.M1303.CrossRefPubMedPubMedCentral
16.
go back to reference Hastings S, Oster S, Langella S, Kurc T, Pan T, Catalyurek U, Saltz J: A Grid-Based Image Archival and Analysis System. J Am Med Inform Assoc. 2005, 12 (3): 286-295. 10.1197/jamia.M1698.CrossRefPubMedPubMedCentral Hastings S, Oster S, Langella S, Kurc T, Pan T, Catalyurek U, Saltz J: A Grid-Based Image Archival and Analysis System. J Am Med Inform Assoc. 2005, 12 (3): 286-295. 10.1197/jamia.M1698.CrossRefPubMedPubMedCentral
17.
go back to reference Fernández M, Kadiyska Y, Suciu D, Morishima A, Tan W: SilkRoute: A framework for publishing relational data in XML. ACM Transactions on Database Systems. 2002, 27 (4): 438-493. 10.1145/582410.582413.CrossRef Fernández M, Kadiyska Y, Suciu D, Morishima A, Tan W: SilkRoute: A framework for publishing relational data in XML. ACM Transactions on Database Systems. 2002, 27 (4): 438-493. 10.1145/582410.582413.CrossRef
18.
go back to reference Bui A, Weinger G, Barretta S, Dionisio J, Kangarloo H: An XML Gateway to Patient Data for Medical Research Applications. Annals New York Academy Sciences. 2002, 980: 236-246. 10.1111/j.1749-6632.2002.tb04900.x.CrossRef Bui A, Weinger G, Barretta S, Dionisio J, Kangarloo H: An XML Gateway to Patient Data for Medical Research Applications. Annals New York Academy Sciences. 2002, 980: 236-246. 10.1111/j.1749-6632.2002.tb04900.x.CrossRef
19.
go back to reference Marcus D, Olsen T, Ramaratnam M, Buckner R: The Extensible Neuroimaging Archive Toolkit: an informatics platform for managing, exploring, and sharing neuroimaging data. Neuroinformatics. 2007, 5: 11-34.CrossRefPubMed Marcus D, Olsen T, Ramaratnam M, Buckner R: The Extensible Neuroimaging Archive Toolkit: an informatics platform for managing, exploring, and sharing neuroimaging data. Neuroinformatics. 2007, 5: 11-34.CrossRefPubMed
20.
go back to reference Ozyurt WDKDBPSGBGG BI, Grethe JS: Federated Web-accessible clinical data management within an extensible neuroimaging database. Neuroinformatics. 2010, 8 (4): 231-249. 10.1007/s12021-010-9078-6.CrossRef Ozyurt WDKDBPSGBGG BI, Grethe JS: Federated Web-accessible clinical data management within an extensible neuroimaging database. Neuroinformatics. 2010, 8 (4): 231-249. 10.1007/s12021-010-9078-6.CrossRef
21.
go back to reference Maojo V, Tsiknakis M: Biomedical informatics and healthGRIDs: a European perspective. IEEE Eng Med Biol Mag. 2007, 26 (3): 34-41.CrossRefPubMed Maojo V, Tsiknakis M: Biomedical informatics and healthGRIDs: a European perspective. IEEE Eng Med Biol Mag. 2007, 26 (3): 34-41.CrossRefPubMed
29.
go back to reference Ueng W, Chen H: Grid Interoperation: SRM-iRODS interface Development. Proceedings International Symposium on Grids and Clouds (ISGC 2011), 19-25 March 2011. Taipei, Taiwan Ueng W, Chen H: Grid Interoperation: SRM-iRODS interface Development. Proceedings International Symposium on Grids and Clouds (ISGC 2011), 19-25 March 2011. Taipei, Taiwan
Metadata
Title
A repository based on a dynamically extensible data model supporting multidisciplinary research in neuroscience
Authors
Luca Corradi
Ivan Porro
Andrea Schenone
Parastoo Momeni
Raffaele Ferrari
Flavio Nobili
Michela Ferrara
Gabriele Arnulfo
Marco M Fato
Publication date
01-12-2012
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2012
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/1472-6947-12-115

Other articles of this Issue 1/2012

BMC Medical Informatics and Decision Making 1/2012 Go to the issue