Skip to main content
Top
Published in: International Journal of Diabetes in Developing Countries 1/2011

01-02-2011 | Original Article

Application of data mining techniques on diabetes related proteins

Authors: R. Bhramaramba, Appa Rao Allam, Vakula Vijay Kumar, G. R. Sridhar

Published in: International Journal of Diabetes in Developing Countries | Issue 1/2011

Login to get access

Abstract

Genomic Data is growing very rapidly with the sequencing of genomes of various forms of life. To understand the overwhelming data and to obtain meaningful information, Data Mining techniques such as Principal Component Analysis and Discriminant Analysis are used for the purpose. Data Mining is basically used when the data is vast and there is need to extract the hidden knowledge in the form of useful patterns. The data set taken into consideration is protein data pertaining to diabetes mellitus obtained from a database. The task at hand was to find out in which species most of the diabetes related proteins exist. It so happened that most of these proteins were prevalent in Human Beings, House Mice and Norway Rat as they are all mammals and Human Beings have orthologs as House Mice and Norway Rat. Both these techniques prove that human beings show a variation from those of House Mice and Norway Rat which are similar in terms of the variation of protein attributes. This can also be inferred from statistical analysis by using histograms and bivariate plots. Other Data Mining Techniques such as Regression and Clustering can be used to further explore the above inference.
Appendix
Available only for authorised users
Literature
1.
go back to reference Ewens WJ, Grant GR. Statistical methods in bioinformatics an introduction. New Delhi: Springer Verlag; 2004. Ewens WJ, Grant GR. Statistical methods in bioinformatics an introduction. New Delhi: Springer Verlag; 2004.
2.
go back to reference Nagabhushana S. Datawarehousing OLAP and data mining. New Delhi: New Age International Publishers; 2006. p. 252. Nagabhushana S. Datawarehousing OLAP and data mining. New Delhi: New Age International Publishers; 2006. p. 252.
3.
go back to reference Mount DW. Bioinformatics sequence and genome analysis. CBS Publishers and Distributors; 2003. pp. 534. Mount DW. Bioinformatics sequence and genome analysis. CBS Publishers and Distributors; 2003. pp. 534.
4.
go back to reference Sridhar GR, Murali G. Technology Spectrum. 2008;2:69–72. Sridhar GR, Murali G. Technology Spectrum. 2008;2:69–72.
5.
go back to reference Allam AR, Ravi B, Sridhar GR. Mathematical analysis of diabetes related proteins having high sequence complexity, ictai, Proc. 18th IEEE International Conference on Tools with Artificial Intelligence. 2006; pp. 810–21 Allam AR, Ravi B, Sridhar GR. Mathematical analysis of diabetes related proteins having high sequence complexity, ictai, Proc. 18th IEEE International Conference on Tools with Artificial Intelligence. 2006; pp. 810–21
6.
go back to reference Rubin EM, Barsh GS. Biological insights through genomics: mouse to man. J Clin Invest. 1996;97:275–80.PubMedCrossRef Rubin EM, Barsh GS. Biological insights through genomics: mouse to man. J Clin Invest. 1996;97:275–80.PubMedCrossRef
7.
go back to reference Huynen MA, Nimwegen EV. The frequency distribution of gene family sizes in complete genomes. Mol Biol Evol. 1998;15:583–9.PubMed Huynen MA, Nimwegen EV. The frequency distribution of gene family sizes in complete genomes. Mol Biol Evol. 1998;15:583–9.PubMed
8.
go back to reference Mouse Genome Sequencing Consortium. Initial sequencing and comparative analysis of the human genome. Nature. 2002;420:520–62.CrossRef Mouse Genome Sequencing Consortium. Initial sequencing and comparative analysis of the human genome. Nature. 2002;420:520–62.CrossRef
9.
go back to reference Zhang L, Pavlovic V, Cantor CR, et al. Human-mouse gene identification by comparative evidence integration and evolutionary analysis. Genome Res. 2003;13:1190–120.PubMedCrossRef Zhang L, Pavlovic V, Cantor CR, et al. Human-mouse gene identification by comparative evidence integration and evolutionary analysis. Genome Res. 2003;13:1190–120.PubMedCrossRef
10.
go back to reference Foucan L, Vaillant J. Hypertension in the metabolic syndrome among Caribbean non Diabetic Subjects. Arch Mal Coeur Vaiss. 2007;100:649–53.PubMed Foucan L, Vaillant J. Hypertension in the metabolic syndrome among Caribbean non Diabetic Subjects. Arch Mal Coeur Vaiss. 2007;100:649–53.PubMed
11.
go back to reference Nandi T, B-Rao C, Ramachandran S. Comparative genomics using data mining tools. J Biosci. 2002;27 Suppl 1:15–25.PubMedCrossRef Nandi T, B-Rao C, Ramachandran S. Comparative genomics using data mining tools. J Biosci. 2002;27 Suppl 1:15–25.PubMedCrossRef
Metadata
Title
Application of data mining techniques on diabetes related proteins
Authors
R. Bhramaramba
Appa Rao Allam
Vakula Vijay Kumar
G. R. Sridhar
Publication date
01-02-2011
Publisher
Springer-Verlag
Published in
International Journal of Diabetes in Developing Countries / Issue 1/2011
Print ISSN: 0973-3930
Electronic ISSN: 1998-3832
DOI
https://doi.org/10.1007/s13410-010-0001-3

Other articles of this Issue 1/2011

International Journal of Diabetes in Developing Countries 1/2011 Go to the issue
Live Webinar | 27-06-2024 | 18:00 (CEST)

Keynote webinar | Spotlight on medication adherence

Live: Thursday 27th June 2024, 18:00-19:30 (CEST)

WHO estimates that half of all patients worldwide are non-adherent to their prescribed medication. The consequences of poor adherence can be catastrophic, on both the individual and population level.

Join our expert panel to discover why you need to understand the drivers of non-adherence in your patients, and how you can optimize medication adherence in your clinics to drastically improve patient outcomes.

Prof. Kevin Dolgin
Prof. Florian Limbourg
Prof. Anoop Chauhan
Developed by: Springer Medicine
Obesity Clinical Trial Summary

At a glance: The STEP trials

A round-up of the STEP phase 3 clinical trials evaluating semaglutide for weight loss in people with overweight or obesity.

Developed by: Springer Medicine

Highlights from the ACC 2024 Congress

Year in Review: Pediatric cardiology

Watch Dr. Anne Marie Valente present the last year's highlights in pediatric and congenital heart disease in the official ACC.24 Year in Review session.

Year in Review: Pulmonary vascular disease

The last year's highlights in pulmonary vascular disease are presented by Dr. Jane Leopold in this official video from ACC.24.

Year in Review: Valvular heart disease

Watch Prof. William Zoghbi present the last year's highlights in valvular heart disease from the official ACC.24 Year in Review session.

Year in Review: Heart failure and cardiomyopathies

Watch this official video from ACC.24. Dr. Biykem Bozkurt discusses last year's major advances in heart failure and cardiomyopathies.