Machine Learning for Survival Analysis: A Survey

Authors:
Ping Wang, Virginia Tech, Arlington, VA (ORCID: 0000-0002-0379-9183)
Yan Li, University of Michigan, Ann Arbor, MI
Chandan K. Reddy, Virginia Tech, Arlington, VA

ACM Computing Surveys, Volume 51, Issue 6, Article No. 110, pp. 1–36. https://doi.org/10.1145/3214306

Published: 27 February 2019

Abstract

Survival analysis is a subfield of statistics whose goal is to analyze and model data in which the outcome is the time until an event of interest occurs. One of the main challenges in this context is the presence of instances whose event outcomes become unobservable after a certain time point, or that do not experience any event during the monitoring period. This phenomenon, called censoring, can be handled most effectively using survival analysis techniques. Traditionally, many statistical approaches have been developed in the literature to overcome the issue of censoring. In addition, many machine learning algorithms have been adapted to deal with such censored data and to tackle other challenging problems that arise in real-world data. In this survey, we provide a comprehensive and structured review of the statistical methods typically used and the machine learning techniques developed for survival analysis, along with a detailed taxonomy of the existing methods. We also discuss several topics that are closely related to survival analysis and describe several successful applications in a variety of real-world domains. We hope that this article will give readers a more comprehensive understanding of recent advances in survival analysis and offer some guidelines for applying these approaches to new problems involving censored data.
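To make the notion of censoring concrete, the following is a minimal sketch of how right-censored observations can enter a nonparametric survival estimate, using the standard Kaplan-Meier product-limit estimator. It is an illustration only, not code from the article or its supplemental material; the function name `kaplan_meier` and the toy data are assumptions made for this example.

```python
from collections import Counter


def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct time where an event was observed.

    times  : observed time per instance (event time or censoring time)
    events : 1 if the event was observed at that time, 0 if censored
    """
    # Number of observed events at each distinct time.
    deaths = Counter(t for t, e in zip(times, events) if e == 1)
    # Every instance, censored or not, leaves the risk set at its observed time.
    exits = Counter(times)

    at_risk = len(times)
    survival = 1.0
    curve = []
    for t in sorted(exits):
        d = deaths.get(t, 0)
        if d > 0:
            # Product-limit step: multiply by the fraction surviving past t.
            survival *= 1.0 - d / at_risk
            curve.append((t, survival))
        # Censored instances shrink the risk set without lowering S(t).
        at_risk -= exits[t]
    return curve


# Hypothetical toy data: five subjects, two of them censored (event = 0).
times = [2, 3, 3, 5, 8]
events = [1, 1, 0, 1, 0]
curve = kaplan_meier(times, events)
for t, s in curve:
    print(f"t={t}: S(t)={s:.3f}")
```

In this toy data the estimate steps down only at times 2, 3, and 5, to 4/5, 3/5, and 3/10 respectively. The subject censored at time 3 still counts as at risk up to that time and then leaves the risk set without being treated as an event, which is the information a naive analysis would lose by discarding censored instances.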

Supplemental Material

wang.zip (89.6 KB): Supplemental movie, appendix, image and software files for Machine Learning for Survival Analysis: A Survey

Published in
ACM Computing Surveys Volume 51, Issue 6
November 2019
786 pages
ISSN:0360-0300
EISSN:1557-7341
DOI:10.1145/3303862
Editor:
Sartaj Sahni
Department of Computer and Information Science and Engineering
Copyright © 2019 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 27 February 2019
- Accepted: 1 April 2018
- Revised: 1 December 2017
- Received: 1 November 2016
Author Tags
Cox model
Machine learning
censoring
concordance index
hazard rate
regression
survival analysis
survival data
Qualifiers
- survey
- Research
- Refereed
Article Metrics
- Total Citations: 344
- Total Downloads: 7,605
- Downloads (Last 12 months): 1,485
- Downloads (Last 6 weeks): 226