Skip to main content
Top
Published in: BMC Medical Informatics and Decision Making 1/2020

Open Access 01-12-2020 | Research article

Identifying essential proteins in dynamic protein networks based on an improved h-index algorithm

Authors: Caiyan Dai, Ju He, Kongfa Hu, Youwei Ding

Published in: BMC Medical Informatics and Decision Making | Issue 1/2020

Login to get access

Abstract

Background

The essential proteins in protein networks play an important role in complex cellular functions and in protein evolution. Therefore, the identification of essential proteins in a network can help to explain the structure, function, and dynamics of basic cellular networks. The existing dynamic protein networks regard the protein components as the same at all time points; however, the role of proteins can vary over time.

Methods

To improve the accuracy of identifying essential proteins, an improved h-index algorithm based on the attenuation coefficient method is proposed in this paper. This method incorporates previously neglected node information to improve the accuracy of the essential protein search. Based on choosing the appropriate attenuation coefficient, the values, such as monotonicity, SN, SP, PPV and NPV of different essential protein search algorithms are tested.

Results

The experimental results show that, the algorithm proposed in this paper can ensure the accuracy of the found proteins while identifying more essential proteins.

Conclusions

The described experiments show that this method is more effective than other similar methods in identifying essential proteins in dynamic protein networks. This study can better explain the mechanism of life activities and provide theoretical basis for the research and development of targeted drugs.
Literature
1.
go back to reference Qiao S, Yan B, Li J. Ensemble learning for protein multiplex subcellular localization prediction based on weighted KNN with different features. Appl Intell. 2018;48(7):1813–24.CrossRef Qiao S, Yan B, Li J. Ensemble learning for protein multiplex subcellular localization prediction based on weighted KNN with different features. Appl Intell. 2018;48(7):1813–24.CrossRef
2.
go back to reference Zhao B, Wang J, Li M, Wu FX, Pan Y. Prediction of essential proteins based on overlapping essential modules. IEEE Transact Nano Biosci. 2014;13(4):415–24.CrossRef Zhao B, Wang J, Li M, Wu FX, Pan Y. Prediction of essential proteins based on overlapping essential modules. IEEE Transact Nano Biosci. 2014;13(4):415–24.CrossRef
3.
go back to reference Peng W, Wang JX, Wang W, et al. Iteration method for predicting essential proteins based on orthology and protein-protein interaction networks. BMC Syst Biol. 2012;6(1):87.CrossRef Peng W, Wang JX, Wang W, et al. Iteration method for predicting essential proteins based on orthology and protein-protein interaction networks. BMC Syst Biol. 2012;6(1):87.CrossRef
4.
go back to reference Wang S, Cuomo S, Mei S, Cheng W, Xu N. Efficient method for identifying influential vertices in dynamic networks using the strategy of local detection and updating. Futur Gener Comput Syst. 2019;91:10–24.CrossRef Wang S, Cuomo S, Mei S, Cheng W, Xu N. Efficient method for identifying influential vertices in dynamic networks using the strategy of local detection and updating. Futur Gener Comput Syst. 2019;91:10–24.CrossRef
5.
go back to reference Wan S, Mak MW, Kung SY. Sparse regressions for predicting and interpreting subcellular localization of multi-label proteins. BMC Bioinform. 2016;17(1):97.CrossRef Wan S, Mak MW, Kung SY. Sparse regressions for predicting and interpreting subcellular localization of multi-label proteins. BMC Bioinform. 2016;17(1):97.CrossRef
6.
go back to reference Li M, Lu Y, Xiang N, Pan W. Identification of essential proteins by using complexes and interaction network. Bioinform Res Appl. 2014:255–65. Li M, Lu Y, Xiang N, Pan W. Identification of essential proteins by using complexes and interaction network. Bioinform Res Appl. 2014:255–65.
7.
go back to reference Luo J, Qi Y. Identification of essential proteins based on a new combination of local interaction density and protein complexes. PLoS One. 2015;10(6):e0131418.CrossRef Luo J, Qi Y. Identification of essential proteins based on a new combination of local interaction density and protein complexes. PLoS One. 2015;10(6):e0131418.CrossRef
8.
go back to reference Hu P, Mei T. Ranking influential nodes in complex networks with structural holes. Phys A: Stat Mech Appl. 2018;490:624–31.CrossRef Hu P, Mei T. Ranking influential nodes in complex networks with structural holes. Phys A: Stat Mech Appl. 2018;490:624–31.CrossRef
9.
go back to reference Wang Z, Zhao Y, Xia J, Du C. Fast ranking influential nodes in complex networks using a k-shell iteration factor. Phys A: Stat Mech Appl. 2016;461:171–81.CrossRef Wang Z, Zhao Y, Xia J, Du C. Fast ranking influential nodes in complex networks using a k-shell iteration factor. Phys A: Stat Mech Appl. 2016;461:171–81.CrossRef
10.
go back to reference Lei X, Yang X, Fujita H. Random walk based method to identify essential proteins by integrating network topology and biological characteristics. Knowl-Based Syst. 1671;2019:53–67. Lei X, Yang X, Fujita H. Random walk based method to identify essential proteins by integrating network topology and biological characteristics. Knowl-Based Syst. 1671;2019:53–67.
11.
go back to reference Zhang W, Xu J, Li Y, Zou X. Detecting Essential Proteins Based on Network Topology, Gene Expression Data, and Gene Ontology Information. IEEE/ACM Trans Comput Biol Bioinform. 2018;15(1):109–16.CrossRef Zhang W, Xu J, Li Y, Zou X. Detecting Essential Proteins Based on Network Topology, Gene Expression Data, and Gene Ontology Information. IEEE/ACM Trans Comput Biol Bioinform. 2018;15(1):109–16.CrossRef
12.
go back to reference Lei X, Wang S, Wu FX. Identification of essential proteins based on improved HITS algorithm. IEEE/ACM Trans Comput Biol Bioinform Genes. 2019;10(2):177. Lei X, Wang S, Wu FX. Identification of essential proteins based on improved HITS algorithm. IEEE/ACM Trans Comput Biol Bioinform Genes. 2019;10(2):177.
13.
go back to reference Mistry D, Wise R, Dickerson J. DiffSLC : a graph centrality method to detect essential proteins of a protein-protein interaction network. PloS One. 2017. Mistry D, Wise R, Dickerson J. DiffSLC : a graph centrality method to detect essential proteins of a protein-protein interaction network. PloS One. 2017.
14.
go back to reference Zaki N, Berengueres J, Efimov D. Detection of protein complexes using a protein ranking algorithm. Proteins: Structure, Function, and Bioinformatics. 2012;80(10):2459–68. Zaki N, Berengueres J, Efimov D. Detection of protein complexes using a protein ranking algorithm. Proteins: Structure, Function, and Bioinformatics. 2012;80(10):2459–68.
15.
go back to reference Jensen LJ, Gupta R, Staerfeldt HH, Brunak S. Prediction of human protein function according to gene ontology categories. Bioinformatics. 2003;19(5):635–42.CrossRef Jensen LJ, Gupta R, Staerfeldt HH, Brunak S. Prediction of human protein function according to gene ontology categories. Bioinformatics. 2003;19(5):635–42.CrossRef
16.
go back to reference Wan S, Mak MW, Kung SY g-LEN. Interpretable prediction of subcellular multi-localization of gram-positive and gram-negative bacterial proteins. Chemom Intell Lab Syst. 2017;162:1–9.CrossRef Wan S, Mak MW, Kung SY g-LEN. Interpretable prediction of subcellular multi-localization of gram-positive and gram-negative bacterial proteins. Chemom Intell Lab Syst. 2017;162:1–9.CrossRef
17.
go back to reference Schlicker A, Domingues FS, Rahnenführer J, Lengauer T. A new measure for functional similarity of gene products based on Gene Ontology. BMC Bioinform. 2006;7(1):302.CrossRef Schlicker A, Domingues FS, Rahnenführer J, Lengauer T. A new measure for functional similarity of gene products based on Gene Ontology. BMC Bioinform. 2006;7(1):302.CrossRef
18.
go back to reference Masic I, Begic E. Scientometric dilemma: is H-index adequate for scientific validity of Academic’s work? ACTA INFORM MED. 2016;24(4):228–32.CrossRef Masic I, Begic E. Scientometric dilemma: is H-index adequate for scientific validity of Academic’s work? ACTA INFORM MED. 2016;24(4):228–32.CrossRef
19.
go back to reference Xiao Y, Lv Y, Zhao H, Gong Y. Predicting the Functions of Long Noncoding RNAs Using RNA-Seq Based on Bayesian Network. BioMed Res Int. 2015;3:1–14. Xiao Y, Lv Y, Zhao H, Gong Y. Predicting the Functions of Long Noncoding RNAs Using RNA-Seq Based on Bayesian Network. BioMed Res Int. 2015;3:1–14.
20.
go back to reference Lo YS, Chen YC, Yang JM. 3D-interologs: an evolution database of physical protein-protein interactions across multiple genomes. 2010;11(Suppl 3):S7. Lo YS, Chen YC, Yang JM. 3D-interologs: an evolution database of physical protein-protein interactions across multiple genomes. 2010;11(Suppl 3):S7.
21.
go back to reference Fleur J, Claire J, Andreas H. Integrated web visualizations for protein-protein interaction databases. BMC Bioinformatics. 2015;195. Fleur J, Claire J, Andreas H. Integrated web visualizations for protein-protein interaction databases. BMC Bioinformatics. 2015;195.
22.
go back to reference Luo J, Li G, Dan S, Cheng L. Integrating Functional and Topological Properties to Identify Biological Network Motif in Protein Interaction Networks. J Comput Theor Nanosci. 2014;744–50. Luo J, Li G, Dan S, Cheng L. Integrating Functional and Topological Properties to Identify Biological Network Motif in Protein Interaction Networks. J Comput Theor Nanosci. 2014;744–50.
23.
go back to reference Zhang R, Lin Y. DEG 5.0, a database of essential genes in both prokaryotes and eukaryotes[J]. Nucleic Acids Res. 2009;7:D455–D458. Zhang R, Lin Y. DEG 5.0, a database of essential genes in both prokaryotes and eukaryotes[J]. Nucleic Acids Res. 2009;7:D455–D458.
24.
go back to reference Zhang X, Xiao W, Hu X. Predicting essential proteins by integrating orthology, gene expressions, and PPI networks. Plos One. 2018;13(4):e0195410. Zhang X, Xiao W, Hu X. Predicting essential proteins by integrating orthology, gene expressions, and PPI networks. Plos One. 2018;13(4):e0195410.
25.
go back to reference Bae J, Kim S. Identifying and ranking influential spreaders in complex networks by neighborhood coreness. Phys A Stat Mech Appl. 2014;395:549–59.CrossRef Bae J, Kim S. Identifying and ranking influential spreaders in complex networks by neighborhood coreness. Phys A Stat Mech Appl. 2014;395:549–59.CrossRef
26.
go back to reference Lü L, Zhou T, Zhang QM, Stanley HE. The H-index of a network node and its relation to degree and coreness. Nat Commun. 2016;7:10168.CrossRef Lü L, Zhou T, Zhang QM, Stanley HE. The H-index of a network node and its relation to degree and coreness. Nat Commun. 2016;7:10168.CrossRef
27.
go back to reference Wang J, Li C, Xia C. Improved centrality indicators to characterize the nodal spreading capability in complex networks. Appl Math Comput. 2018;334:388–400. Wang J, Li C, Xia C. Improved centrality indicators to characterize the nodal spreading capability in complex networks. Appl Math Comput. 2018;334:388–400.
28.
go back to reference Zhong J, Wang J, Peng W, Zhang Z, Pan Y. Prediction of essential proteins based on gene expression programming. BMC Genomics. 2013;14(4):S7.CrossRef Zhong J, Wang J, Peng W, Zhang Z, Pan Y. Prediction of essential proteins based on gene expression programming. BMC Genomics. 2013;14(4):S7.CrossRef
29.
go back to reference Wan S, Mak M, Kung SY. Ensemble linear neighborhood propagation for predicting subchloroplast localization of multi-location proteins. J Proteome Res. 2016;15(12):4755–62.CrossRef Wan S, Mak M, Kung SY. Ensemble linear neighborhood propagation for predicting subchloroplast localization of multi-location proteins. J Proteome Res. 2016;15(12):4755–62.CrossRef
30.
go back to reference Chou KC, Wu ZC, Xiao X. iLoc-Euk: a multi-label classifier for predicting the subcellular localization of singleplex and multiplex eukaryotic proteins. PloS One. 2011;6(3):e18258.CrossRef Chou KC, Wu ZC, Xiao X. iLoc-Euk: a multi-label classifier for predicting the subcellular localization of singleplex and multiplex eukaryotic proteins. PloS One. 2011;6(3):e18258.CrossRef
Metadata
Title
Identifying essential proteins in dynamic protein networks based on an improved h-index algorithm
Authors
Caiyan Dai
Ju He
Kongfa Hu
Youwei Ding
Publication date
01-12-2020
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2020
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/s12911-020-01141-x

Other articles of this Issue 1/2020

BMC Medical Informatics and Decision Making 1/2020 Go to the issue