Skip to main content
Top
Published in: BMC Medical Informatics and Decision Making 1/2015

Open Access 01-12-2015 | Research article

Grid multi-category response logistic models

Authors: Yuan Wu, Xiaoqian Jiang, Shuang Wang, Wenchao Jiang, Pinghao Li, Lucila Ohno-Machado

Published in: BMC Medical Informatics and Decision Making | Issue 1/2015

Login to get access

Abstract

Background

Multi-category response models are very important complements to binary logistic models in medical decision-making. Decomposing model construction by aggregating computation developed at different sites is necessary when data cannot be moved outside institutions due to privacy or other concerns. Such decomposition makes it possible to conduct grid computing to protect the privacy of individual observations.

Methods

This paper proposes two grid multi-category response models for ordinal and multinomial logistic regressions. Grid computation to test model assumptions is also developed for these two types of models. In addition, we present grid methods for goodness-of-fit assessment and for classification performance evaluation.

Results

Simulation results show that the grid models produce the same results as those obtained from corresponding centralized models, demonstrating that it is possible to build models using multi-center data without losing accuracy or transmitting observation-level data. Two real data sets are used to evaluate the performance of our proposed grid models.

Conclusions

The grid fitting method offers a practical solution for resolving privacy and other issues caused by pooling all data in a central site. The proposed method is applicable for various likelihood estimation problems, including other generalized linear models.
Appendix
Available only for authorised users
Literature
1.
go back to reference Ohno-Machado L, Agha Z, Bell DS, Dahm L, Day ME, Doctor JN, et al. pSCANNER team: patient-centered Scalable National Network for Effectiveness Research. J Am Med Informatics Assoc. 2014; 21:amiajnl–2014. doi:10.1136/amiajnl-2014-002751 Ohno-Machado L, Agha Z, Bell DS, Dahm L, Day ME, Doctor JN, et al. pSCANNER team: patient-centered Scalable National Network for Effectiveness Research. J Am Med Informatics Assoc. 2014; 21:amiajnl–2014. doi:10.1136/amiajnl-2014-002751
2.
go back to reference Crandall W, Kappelman MD, Colletti RB, Leibowitz I, Grunow JE, Ali S, et al. ImproveCareNow: The development of a pediatric inflammatory bowel disease improvement network. Inflamm Bowel Dis. 2011;17:450–7. doi:10.1002/ibd.21394.CrossRefPubMed Crandall W, Kappelman MD, Colletti RB, Leibowitz I, Grunow JE, Ali S, et al. ImproveCareNow: The development of a pediatric inflammatory bowel disease improvement network. Inflamm Bowel Dis. 2011;17:450–7. doi:10.1002/ibd.21394.CrossRefPubMed
3.
4.
go back to reference Kennedy RL, Burton AM, Fraser HS, McStay LN, Harrison RF. Early diagnosis of acute myocardial infarction using clinical and electrocardiographic data at presentation: derivation and evaluation of logistic regression models. Eur Hear J. 1996;17:1181–91.CrossRef Kennedy RL, Burton AM, Fraser HS, McStay LN, Harrison RF. Early diagnosis of acute myocardial infarction using clinical and electrocardiographic data at presentation: derivation and evaluation of logistic regression models. Eur Hear J. 1996;17:1181–91.CrossRef
5.
go back to reference Boxwala AA, Kim J, Grillo JM, Ohno-Machado L. Using statistical and machine learning to help institutions detect suspicious access to electronic health records. J Am Med Inf Assoc. 2011;18:498–505.CrossRef Boxwala AA, Kim J, Grillo JM, Ohno-Machado L. Using statistical and machine learning to help institutions detect suspicious access to electronic health records. J Am Med Inf Assoc. 2011;18:498–505.CrossRef
6.
go back to reference Wu Y, Jiang X, Kim J, Ohno-Machado L. Grid Binary LOgistic REgression (GLORE): building shared models without sharing data. J Am Med Inform Assoc. 2012;2012:758–64. doi:10.1136/amiajnl-2012-000862.CrossRef Wu Y, Jiang X, Kim J, Ohno-Machado L. Grid Binary LOgistic REgression (GLORE): building shared models without sharing data. J Am Med Inform Assoc. 2012;2012:758–64. doi:10.1136/amiajnl-2012-000862.CrossRef
7.
go back to reference Wang S, Jiang X, Wu Y, Cui L, Cheng S, Ohno-Machado L. EXpectation Propagation LOgistic REgRession ( EXPLORER ): Distributed Privacy-Preserving Online Model Learning. J Biomed Inform. 2013;46:480–96.CrossRefPubMedPubMedCentral Wang S, Jiang X, Wu Y, Cui L, Cheng S, Ohno-Machado L. EXpectation Propagation LOgistic REgRession ( EXPLORER ): Distributed Privacy-Preserving Online Model Learning. J Biomed Inform. 2013;46:480–96.CrossRefPubMedPubMedCentral
8.
go back to reference McCullagh P. Regression Models for Ordinal Data. J Royal Stat Soc Series B. 1980;42:109–42. McCullagh P. Regression Models for Ordinal Data. J Royal Stat Soc Series B. 1980;42:109–42.
10.
go back to reference Bradley AP. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognit. 1997;30:1145–59.CrossRef Bradley AP. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognit. 1997;30:1145–59.CrossRef
11.
go back to reference Hand DJ, Till RJ. A simple generalisation of the area under the ROC curve for multiple class classification problems. Mach Learn. 2001;45:171–86.CrossRef Hand DJ, Till RJ. A simple generalisation of the area under the ROC curve for multiple class classification problems. Mach Learn. 2001;45:171–86.CrossRef
12.
go back to reference Van Calster B, Van Belle V, Vergouwe Y, Steyerberg EW. Discrimination ability of prediction models for ordinal outcomes: Relationships between existing measures and a new measure. Biometrical J. 2012;54:674–85.CrossRef Van Calster B, Van Belle V, Vergouwe Y, Steyerberg EW. Discrimination ability of prediction models for ordinal outcomes: Relationships between existing measures and a new measure. Biometrical J. 2012;54:674–85.CrossRef
13.
go back to reference Yang H, Carlin D. ROC surface: a generalization of ROC curve analysis. J Biopharm Stat. 2000;10:183–96.CrossRefPubMed Yang H, Carlin D. ROC surface: a generalization of ROC curve analysis. J Biopharm Stat. 2000;10:183–96.CrossRefPubMed
14.
go back to reference Dreiseitl S, Ohno-Machado L, Binder M. Comparing three-class diagnostic tests by three-way ROC analysis. Med Decis Mak. 2000;20:323–31.CrossRef Dreiseitl S, Ohno-Machado L, Binder M. Comparing three-class diagnostic tests by three-way ROC analysis. Med Decis Mak. 2000;20:323–31.CrossRef
15.
go back to reference Brant R. Assessing Proportionality in the Proportional Odds Model for Ordinal Logistic Regression. Biometrics. 1990;46:1171–8.CrossRefPubMed Brant R. Assessing Proportionality in the Proportional Odds Model for Ordinal Logistic Regression. Biometrics. 1990;46:1171–8.CrossRefPubMed
16.
go back to reference Williams R. Generalized ordered logit/partial proportional odds models for ordinal dependent variables. Stata J. 2006;6:58–82. Williams R. Generalized ordered logit/partial proportional odds models for ordinal dependent variables. Stata J. 2006;6:58–82.
17.
go back to reference Fagerland MW, Hosmer DW. A goodness-of-fit test for the proportional odds regression model. Stat Med. 2013;32:2235–49.CrossRefPubMed Fagerland MW, Hosmer DW. A goodness-of-fit test for the proportional odds regression model. Stat Med. 2013;32:2235–49.CrossRefPubMed
18.
go back to reference Fagerland MW, Hosmer DW, Bofin AM. Multinomial goodness-of-fit tests for logistic regression models. Stat Med. 2008;27:4238–53.CrossRefPubMed Fagerland MW, Hosmer DW, Bofin AM. Multinomial goodness-of-fit tests for logistic regression models. Stat Med. 2008;27:4238–53.CrossRefPubMed
19.
go back to reference Lyles RH. Regression Methods in Biostatistics: Linear, Logistic, Survival, and Repeated Measures Models. J Am Stat Assoc. 2006;101:403–4.CrossRef Lyles RH. Regression Methods in Biostatistics: Linear, Logistic, Survival, and Repeated Measures Models. J Am Stat Assoc. 2006;101:403–4.CrossRef
20.
go back to reference Lu C, Wang S, Ji Z, Wu Y, Xiong L, Jiang X, et al. WebDISCO: a Web service for DIStributed COx model learning without patient-level data sharing. In: Translational Bioinformatics Conference (accepted). 2014. Lu C, Wang S, Ji Z, Wu Y, Xiong L, Jiang X, et al. WebDISCO: a Web service for DIStributed COx model learning without patient-level data sharing. In: Translational Bioinformatics Conference (accepted). 2014.
21.
go back to reference Jiang W, Li P, Wang S, Wu Y, Xue M, Ohno-Machado L, et al. WebGLORE: a web service for Grid LOgistic REgression. Bioinformatics. 2013;29:3238–40. doi: 10.1093/bioinformatics/btt559.CrossRefPubMedPubMedCentral Jiang W, Li P, Wang S, Wu Y, Xue M, Ohno-Machado L, et al. WebGLORE: a web service for Grid LOgistic REgression. Bioinformatics. 2013;29:3238–40. doi: 10.1093/bioinformatics/btt559.CrossRefPubMedPubMedCentral
Metadata
Title
Grid multi-category response logistic models
Authors
Yuan Wu
Xiaoqian Jiang
Shuang Wang
Wenchao Jiang
Pinghao Li
Lucila Ohno-Machado
Publication date
01-12-2015
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2015
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/s12911-015-0133-y

Other articles of this Issue 1/2015

BMC Medical Informatics and Decision Making 1/2015 Go to the issue