Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Spatial Habitat Features Derived from Multiparametric Magnetic Resonance Imaging Data Are Associated with Molecular Subtype and 12-Month Survival Status in Glioblastoma Multiforme

  • Joonsang Lee,

    Affiliation Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas, United States of America

  • Shivali Narang,

    Affiliation Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas, United States of America

  • Juan Martinez,

    Affiliation Department of Neurosurgery, The University of Texas MD Anderson Cancer Center, Houston, Texas, United States of America

  • Ganesh Rao,

    Affiliation Department of Neurosurgery, The University of Texas MD Anderson Cancer Center, Houston, Texas, United States of America

  • Arvind Rao

    aruppore@mdanderson.org

    Affiliation Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas, United States of America

Abstract

One of the most common and aggressive malignant brain tumors is Glioblastoma multiforme. Despite the multimodality treatment such as radiation therapy and chemotherapy (temozolomide: TMZ), the median survival rate of glioblastoma patient is less than 15 months. In this study, we investigated the association between measures of spatial diversity derived from spatial point pattern analysis of multiparametric magnetic resonance imaging (MRI) data with molecular status as well as 12-month survival in glioblastoma. We obtained 27 measures of spatial proximity (diversity) via spatial point pattern analysis of multiparametric T1 post-contrast and T2 fluid-attenuated inversion recovery MRI data. These measures were used to predict 12-month survival status (≤12 or >12 months) in 74 glioblastoma patients. Kaplan-Meier with receiver operating characteristic analyses was used to assess the relationship between derived spatial features and 12-month survival status as well as molecular subtype status in patients with glioblastoma. Kaplan-Meier survival analysis revealed that 14 spatial features were capable of stratifying overall survival in a statistically significant manner. For prediction of 12-month survival status based on these diversity indices, sensitivity and specificity were 0.86 and 0.64, respectively. The area under the receiver operating characteristic curve and the accuracy were 0.76 and 0.75, respectively. For prediction of molecular subtype status, proneural subtype shows highest accuracy of 0.93 among all molecular subtypes based on receiver operating characteristic analysis. We find that measures of spatial diversity from point pattern analysis of intensity habitats from T1 post-contrast and T2 fluid-attenuated inversion recovery images are associated with both tumor subtype status and 12-month survival status and may therefore be useful indicators of patient prognosis, in addition to providing potential guidance for molecularly-targeted therapies in Glioblastoma multiforme.

Introduction

Glioblastoma multiforme (GBM) is one of the most common and malignant brain tumors in humans. The general treatment of GBM generally involves surgical resection followed by combination of radiation therapy and chemotherapy (temozolomide). Even with multimodality treatment, the median survival rate of GBM patients is between 12 ~ 15 months [1].

Analysis of tumor heterogeneity has the potential to aid treatment of GBM, as different tumor subpopulations may respond differently [24]. It has been suggested that heterogeneity in gray-level intensity in magnetic resonance imaging (MRI) may be indicative of such distinct tumor habitats [5]. Such heterogeneity is often measured by texture analysis, which provides information about the spatial arrangement of gray-level intensities and quantifies the local characteristic patterns in an image. Texture analyses using modalities such as MRI, positron emission tomography (PET), and computed tomography (CT) have been shown to be promising for cancer prognosis [69].

Aside from texture analysis, recently, spatial heterogeneity of tumor with distinctly different MRI intensity characteristics was investigated using the concept of radiologically defined tumor regions (habitats) within the tumor [3]. Each of these regions or habitats can characterize regional variations in blood flow, cell density, and necrosis within the tumor, based on multiparametric measurements. In this study, we extended this investigation by treating such multiparametric tumor habitats as “multitype spatial point patterns,” a commonly used paradigm in ecological and spatial statistics. Apart from the abundance of these habitats, the spatial extents and proximity of these habitats might have physiologic and clinical relevance for assessment of treatment response.

Spatial point pattern analysis is the study of the spatial arrangements of points that describe the geometric structure and relationship of patterns in two-dimensional space [10]. Spatial point pattern analysis produces several diversity and proximity measurements such as the Simpson index and the Shannon index, which are quantitative measures of abundance of point types and their relative spatial proximities/relationships within a spatial region. Diversity analysis of spatial point patterns has been applied to understand spatial relationships among point patterns in ecology, astronomy and material science [11, 12]. To the best of our knowledge, this is the first study to use point process theory to study the spatial relationships of tumor habitats and assess their relationship with molecular status and clinical outcome. We sought to determine the association between the spatial habitat features derived from point patterns of tumor habitats and overall survival (OS) in GBM. Our results show that spatial habitat features derived from point patterns of MR tumor habitats are associated with 12-month OS status as well as molecular subtype status in GBM patients.

Materials and Methods

Data

We identified 74 GBM patients from The Cancer Genome Atlas on the basis of available post-contrast T1-weighted and T2-weighted Fluid Attenuated Inversion Recovery (FLAIR) image data from The Cancer Imaging Archive (TCIA): https://public.cancerimagingarchive.net/ncia/login.jsf as well as companion genomic information (Glioblastoma Multiforme TCGA, Provisional). Molecular information (describing gene expression—based molecular classification of GBM such as classical, mesenchymal, neural, and proneural subtype) [13] and survival data were obtained from the cBioPortal for Cancer Genomics (http://www.cbioportal.org). The patients’ TCGA IDs are listed in Table A in S1 Appendix and the information about frequent gene mutations in all the patients in this study is listed in Table B in S1 Appendix. Patients were assigned to one of two survival classes on the basis of their OS: ≤ 12 months or > 12 months. Thus, 31 patients had an OS of 12 months or less, and 43 patients had an OS of more than 12 months. This 12-month OS cut-off was based on the typical median survival time (12~15 months) in GBM, also used in other studies [14, 15]. The descriptive statistics regarding patient demographics are summarized in Table 1.

Image Preprocessing

Image preprocessing procedures such as registration, non-uniformity intensity correction, reslicing, and intensity scaling were performed prior to multiparametric image analysis. First, 3-dimensional rigid registration was performed by spatially registering the T2 FLAIR image to the T1 post-contrast image as the reference image using Medical Image Processing, Analysis, and Visualization (MIPAV) software [16]. Subsequently, nonparametric nonuniform intensity normalization (N3) correction in MIPAV [17] was performed to correct MRI artifacts such as radiofrequency field nonuniformity, main magnetic field (B0) nonuniformity, and shading artifacts in MRI. Image reslicing was performed using the NIFTI toolbox in MATLAB v8.0 (The MathWorks Inc., Natick, MA) to obtain isotropic pixel resolution with 1 mm pixel size. Image intensities are quantized to 256 gray levels (Zhou et al., 2014) [3]. Fig 1 shows a T1 post-contrast image and a T2 FLAIR image after preprocessing.

thumbnail
Fig 1. T1 post-contrast image (left) and T2 FLAIR image (right) after preprocessing.

Arrows indicate the enhanced tumor area.

https://doi.org/10.1371/journal.pone.0136557.g001

Identification of Tumor habitats

A tumor region of interest (ROI) was segmented semi-automatically by experts using the Medical Image Interaction Toolkit (www.mitk.org). The maximum tumor area slice in a T1 post-contrast image and the corresponding slice in a T2 FLAIR image were selected for this study.

Following intensity scaling of the segmented tumor region [3], the ROI was divided into two pixel groups based on its constituent pixel intensities using Gaussian mixture modeling [18, 19]. The intensities are assumed to follow a mixture of two Gaussian distributions, following previous work (Zhou et al., 2014) [3]. A representative intensity histogram for a T1 post-contrast signal is shown in Fig 2. Similar grouping of pixels was done with the linearly scaled T2 FLAIR signal. Each pixel within an ROI in a T1 post contrast image or T2 FLAIR image was assigned to a low- or high-intensity group (“habitat”) based on the cluster membership of that pixel. The threshold between the two Gaussian groups was determined by calculating the average of the means of the two Gaussian populations underlying the low-intensity pixel group and high-intensity pixel group within a tumor.

thumbnail
Fig 2. A representative 2D histogram of the ROI of a T1 post-contrast image.

Intensity values are binned into two groups based on pixel membership to a Gaussian mixture model. Solid red line represents each Gaussian population; dotted green line represents total intensity distribution of the ROI.

https://doi.org/10.1371/journal.pone.0136557.g002

Coordinates of the centroids

Based on the aforementioned clusters identified from the T1 post contrast and FLAIR image data, each pixel (from the tumor ROI of the T1 post-contrast and T2 FLAIR image) is assigned to a low- or high-intensity habitat based on its intensity relative to the threshold. This results in four non-overlapping subpopulations of pixels: a low-intensity T1 habitat, a high-intensity T1 habitat, a low-intensity T2 habitat, and a high-intensity T2 habitat (the T1 and T2 habitats can overlap spatially, however). A spatial point pattern was derived from these habitat regions by overlaying a two-dimensional rectangular grid on the ROIs (Fig 3). Within each grid box, a centroid was estimated from the set of pixels belonging to a specific habitat. The set of all centroids from each of the four habitats, across all grid boxes, specifies a spatial “multitype” point pattern within the tumor region/ROI, t(xi) ∈ T = {1, …, S}, S being the set of four habitats, that can then be statistically analyzed [20]. Here, xi is the co-ordinate of the point and t(xi) is its label, ‘type’ or ‘mark’, representing the specific habitat that pixel belongs to. More details are listed in the following section. In Fig 3 the spatial point pattern from an example tumor ROI is indicated as an illustration. Fig 4 shows the flowchart that represents the various processing steps to obtain a spatial point pattern map.

thumbnail
Fig 3. A sample ROI spatial map.

A spatial map was combined with the low-intensity T1 post-contrast ROI, high-intensity T1 post-contrast ROI, low-intensity T2 FLAIR ROI, and high-intensity T2 FLAIR ROI. Each binary mask is equally divided into several bounding boxes and the size of each bounding box is 8×8 pixels. The coordinates of each centroid indicating the center of mass of each group inside of the small bounding grid box were calculated, and coordinates from all four regions were combined into a spatial map. The centroids of each subpopulation are designated by four different shapes based on group to illustrate the spatial relations between the four groups. Different gray levels of ROI represent each habitat and overlaps of habitats.

https://doi.org/10.1371/journal.pone.0136557.g003

thumbnail
Fig 4. Flowchart.

This flowchart represents the global clustering steps for the spatial point pattern map.

https://doi.org/10.1371/journal.pone.0136557.g004

Feature analysis

Spatial point pattern analysis is a study of the spatial relationships between multiple point patterns. The simplest form of such a point pattern is a point set X = {xi: i = 1, …} located randomly in Rd (here, d = 2, since we are looking at a 2D ROI, like in Fig 3). In our study, there are four different “types/marks” (borrowing from terminology in multitype spatial point pattern statistics) of points, t(xi) ∈ T = {1, …, S}, ‘S’ representing the four habitats. These represent the T1 low-, T1 high-, T2 low-, and T2 high-intensity habitats, respectively, that can be used to label each point. Thus, the subset of points of type τ can be expressed as Xτ = {xiXi: t(xi) = τ}.

For an individual point xi and point type τ, we used the following definition for determining the abundance of neighboring points [21]: (1) (2) (3) where xixj indicates that xj is a neighbor of xi. δτ(xi) and δ= (xi) represent the abundance of specific types of neighbors and the abundance of the same ‘type’ of neighbors as xi, respectively. 1(.) denotes an indicator variable that is 1 if its argument is true, else it is 0. Based on this framework, we obtained 27 distinct spatial habitat features to characterize the spatial distribution of these four intensity habitats within a tumor.

In addition to the aforementioned features, we derived spatial-association (spatial proximity) features from multitype point pattern analysis such as marked K-function, marked J-function, marked G-function, and inhomogeneous F-function [22]. In this study, we used four point “types”, or “marks”, high- and low-intensity T1 post-contrast and high- and low-intensity T2 FLAIR. The marked G-function, Guv, also called the “marked nearest neighbor distance function,” estimates the distribution of the distance from a typical point in type u to the nearest point of type v [23]. Similar to the G-function, the F-function, Fuv, also called the “empty space function” or the “spherical contact distribution,” is defined as probability distribution of the radius of a sphere when it first makes contact with a point in a point process [24]. The marked K-function, Kuv, also called the “reduced second moment function,” estimates the expected number of points of type v within a given distance from a typical point in type u. Each of these features quantifies the spatial dependencies (intra-species or inter-species) of these four habitats within the tumor region.

These features were subsequently examined for their association with patient survival rate and GBM molecular subtype status (classical, mesenchymal, neural and proneural). The detailed equations of diversity indices and summary statistics such as F, G, J, and K functions for spatial habitat features are described in the supporting information file (S1 Appendix).

Statistical analysis

Each of the spatial point features described above is a function of spatial distance r (either for the definition of the ‘neighborhood’, or explicitly in the definition of the proximity/dependence function) between any two points in a spatial pattern. Thus, we can extract their means and standard deviations as summaries of their variation across r. A total of 27 features that consist of the mean and standard deviation of 12 spatial point pattern feature functions (individual species area relationship (ISAR), mingling index, Shannon index, Simpson index, mean composite information (MCI), marked K for T1 post-contrast and T2 FLAIR, marked J for T1 and T2, marked G for T1 and T2, marked K for all pairs of the four types of spatial point patterns (corresponding to the four habitats), marked J for all pairs, marked G for all pairs, and inhomogeneous F), and three original spatial point pattern features (ISAR, Shannon index, and Simpson index) were computed from the coordinates of the centroids from the spatial map. For marked point pattern functions, we used both point patterns (T1 post-contrast and T2 FLAIR) individually as well as in pairs for all the four types of spatial point patterns (corresponding to the four distinct habitats). These spatial features (Table 2) were used to predict 12-month overall survival status as well as GBM molecular subtype status (classical, mesenchymal, neural and proneural) using a classification framework. Relationships between spatial habitat features and OS were assessed by Kaplan-Meier and receiver operating characteristic (ROC) analyses [25]. Each feature was dichotomized based on an optimum (cutoff) value derived from ROC analysis. Survival differences between the induced groups were assessed by the K-Adaptive Partitioning (KAP) algorithm [26]. To determine the predictive accuracy of the spatial point pattern features in predicting 12-month survival status and GBM molecular subtype status, a symbolic regression approach [27] was employed, using threefold cross-validation. Specifically, the top 5 features (std-dev K multi, mean K multi, std-dev J multi, mean J multi, std-dev J multi for T1 and T2) based on the coefficient of variation were used to predict both survival status (either > 12 months or ≤12 months) and subtype status. The subtype status is a binary response (one for a specific subtype, zero for other subtypes). The respective classifier’s performance was assessed via ROC curves.

Results

Kaplan-Meier survival curves for 14 of the 27 spatial features differed significantly (p < 0.05) at the ROC-derived cutoff (Table 3). The mean ISAR and the standard deviation of the Shannon index had p-values of less than 0.001. For the prediction of OS status based on the top 5 features (based on CoV), the ROC is indicated in Fig 5. The x-axis represents the false-positive rate (FPR or 1 –specificity) and the y-axis represents the true-positive rate (TPR or sensitivity). The optimal threshold was determined by maximizing the sum of sensitivity and specificity. The area under the ROC curve was 0.76. The TPR (sensitivity) and true-negative rate (TNR or specificity) were 0.86 and 0.66, respectively. The accuracy was 0.75 that calculated by using Eq 4 for the optimal model, (4) where TP, FP, TN, and FN represent true-positive, false-positive, true-negative, and false-negative predictions, respectively. The same procedure was followed to obtain a classifier to predict GBM molecular subtype status using the top five features based on CoV. Accuracy for classical, mesenchymal, neural, and proneural subtypes were 0.88, 0.70, 0.85, and 0.93, respectively, in the ROC analysis for subtype status prediction. TPR, TNR, and accuracy for each subtype were listed in Table 4.

thumbnail
Table 3. p-value of survival difference between feature-induced groups (from ROC analysis).

https://doi.org/10.1371/journal.pone.0136557.t003

thumbnail
Table 4. ROC analysis for subtypes (the TPR/TNR/ACC values were obtained at an operating point by maximizing the sum of sensitivity and specificity).

https://doi.org/10.1371/journal.pone.0136557.t004

thumbnail
Fig 5. ROC curve for the prediction of survival at the 12-month time point.

The x-axis is the false- positive rate, 1 –specificity; the y-axis is the true- positive rate, sensitivity. The area under the ROC curve is 0.76.

https://doi.org/10.1371/journal.pone.0136557.g005

Discussion

In this study, we showed that the spatial features obtained using a multitype spatial point pattern representations of T1 post-contrast and T2-FLAIR MRI data are associated with 12-month survival status and molecular subtypes of GBM. Spatial point pattern analysis with habitat features is widely to assess the abundance, evenness, and spatial interactions of species or point types in a specified area. Some studies have shown that the spatial heterogeneity of CT and MRI intensity values within a tumor is associated with intratumor heterogeneity, tumor malignancy, and patient survival rates [3, 5, 2830]. In this study, we used the concept of radiologically defined tumor habitats within the tumor based on work by Zhou et al. [3] for clustering pixels based on low- and high-intensity of T1 post-contrast and FLAIR MR images. To our knowledge, this is the first study in which multitype spatial point pattern analysis has been used to obtain spatial features associated with the proximity of intensity habitats within the tumor, and to assess the association of these features measurements with clinical outcomes and molecular subtypes. Therefore, such features can provide a valuable insight into the extent and spatial relationship of tumor sub-regions which could be indicative of tumor heterogeneity. This is an important factor for understanding and characterizing the cause and progression of such tumors. Another key point is that although this study focuses on only two MRI sequences, the concepts and methods presented here are applicable to any number of MRI sequences, thus creating a scalable framework for data that include modalities like T1, T2, FLAIR, dynamic contrast-enhanced and dynamic susceptibility contrast MRI. These methods are also applicable to other disease sites for which multiparametric MRI is used (e.g., prostate, head and neck, etc.).

Our results from the ROC curve show that spatial features from point pattern analysis can discriminate the 12-month survival status of GBM patients with a reasonably high accuracy of 0.75. Also, the corresponding area under the ROC curve value was 0.76, which represents the ability to discriminate between the survival classes. Further, we have high sensitivity (also called the TPR) of 0.86, which represents the high percentage of actual positives that are correctly identified as high survival patients (> 12 months) and have relatively high specificity (also called the TNR) of 0.66, which represents the high percentage of negatives that are correctly identified as low survival patients (≤ 12 months).

This retrospective analysis was performed using a publicly available (TCIA) database, so the scanning protocols and MRI systems are variable across patients; pixel resolutions (256×256 or 512×512), slice thickness (1.4 ~ 5.0mm), repetition time (4.9 ~ 3285.6ms for T1 and 400 ~ 11000ms for T2), and echo time (2.1 ~ 20ms for T1 and 14 ~ 155ms for T2). In this study, we performed intensity scaling as a preprocessing step (based on similar work with TCIA data by Zhou et al., 2014) to minimize the effects of potential variations in MR images from various patients across different scanning or acquisition protocols. However, the effect of these variations in MR images on spatial diversity indices needs to be examined more systematically. Although the level of intensity variation within a tumor has been characterized via tumor habitats, their spatial relationships have not been adequately investigated. The objective of this study is to study the spatial relationship between tumor habitats and assess their relationship with molecular status and clinical outcome. An important direction for follow-up research is the examination of tumor features (cellular morphology, gene expression, metabolism, blood flow, and metastatic potential) that correlate with such spatial features. Such data could enable generation of valuable hypotheses for the investigation of genotype-phenotype relationships based on public domain datasets like TCGA. Further evaluation on larger standardized patient cohorts, along with suitable correction for multiple hypothesis testing, given the number of features examined in this study (twenty seven), is essential to strengthen evidence for the clinical translation of this finding.

In general, the quality of the MR images is very important and key for image analysis. In the study, we investigated the spatial heterogeneity of tumor using the concept of radiologically defined tumor habitats and studied their spatial relationships. Our analysis is based on habitats covered by an 8×8 spatial grid box instead of on a pixel-by-pixel basis, where the grid size was picked empirically in order to trade-off quadrat area and the computational cost of estimating these indices for each grid box. The habitats are defined using a pixel clustering procedure, similar to Zhou et. al., 2014, intended to create only two intensity groups based on pixel intensities (low/high habitats). This process provides some robustness to pixel intensity variations within the image. However, a systematic examination of the nature of index variation with grid size, acquisition/scanning parameters and pixel intensities will be an important consideration for future investigation. Finally, extension of these analyses of spatial diversity indices to 3D tumor regions will also be an important next step.

On the basis of our findings, we conclude that spatial habitat features based on point pattern analysis in MRI-derived tumor habitats may be a promising clinical prognostic tool in radiology studies. Future research should focus on characterizing the biological meaning of these spatial features and exploring a multivariate model such as a multivariate proportional hazards regression analysis incorporating the spatial diversity indices along with clinical variables like age, tumor volume, and Karnofsky performance score to identify combinations of factors with the highest accuracy in predicting disease progression.

Supporting Information

S1 Appendix. Diversity Indicies.

Listing of the case IDs for 74 GBM patients (Table A). Top mutated genes in the dataset from eBioPortal (Table B).

https://doi.org/10.1371/journal.pone.0136557.s001

(DOCX)

Acknowledgments

We would like to thank scientific editors, Markeda Wade, Tamara Locke, and Arthur Gelmis, for their help with manuscript editing and suggestions.

Author Contributions

Conceived and designed the experiments: AR JL. Performed the experiments: AR JL GR JM. Analyzed the data: AR JL SN. Contributed reagents/materials/analysis tools: SN JL AR. Wrote the paper: JL AR.

References

  1. 1. Johnson DR, O'Neill BP. Glioblastoma survival in the United States before and during the temozolomide era. J Neuro-Oncol. 2012;107(2):359–64.
  2. 2. Bedard PL, Hansen AR, Ratain MJ, Siu LL. Tumour heterogeneity in the clinic. Nature. 2013;501(7467):355–64. pmid:24048068
  3. 3. Zhou M, Hall L, Goldgof D, Russo R, Balagurunathan Y, Gillies R, et al. Radiologically defined ecological dynamics and clinical outcomes in glioblastoma multiforme: preliminary results. Translational oncology. 2014;7(1):5–13. pmid:24772202
  4. 4. Inda MD, Bonavia R, Mukasa A, Narita Y, Sah DWY, Vandenberg S, et al. Tumor heterogeneity is an active process maintained by a mutant EGFR-induced cytokine circuit in glioblastoma. Gene Dev. 2010;24(16):1731–45. pmid:20713517
  5. 5. Gatenby RA, Grove O, Gillies RJ. Quantitative imaging in cancer evolution and ecology. Radiology. 2013;269(1):8–15. pmid:24062559
  6. 6. Miles KA, Ganeshan B, Griffiths MR, Young RC, Chatwin CR. Colorectal cancer: texture analysis of portal phase hepatic CT images as a potential marker of survival. Radiology. 2009;250(2):444–52. pmid:19164695
  7. 7. Held K, Rota Kops E, Krause BJ, Wells WM 3rd, Kikinis R, Muller-Gartner HW. Markov random field segmentation of brain MR images. IEEE transactions on medical imaging. 1997;16(6):878–86. pmid:9533587
  8. 8. Chen CC, Daponte JS, Fox MD. Fractal feature analysis and classification in medical imaging. IEEE transactions on medical imaging. 1989;8(2):133–42. pmid:18230510
  9. 9. Cheng NM, Fang YH, Chang JT, Huang CG, Tsan DL, Ng SH, et al. Textural features of pretreatment 18F-FDG PET/CT images: prognostic significance in patients with advanced T-stage oropharyngeal squamous cell carcinoma. Journal of nuclear medicine: official publication, Society of Nuclear Medicine. 2013;54(10):1703–9.
  10. 10. Gatrell AC, Bailey TC, Diggle PJ, Rowlingson BS. Spatial point pattern analysis and its application in geographical epidemiology. T I Brit Geogr. 1996;21(1):256–74.
  11. 11. Moeur M. Characterizing Spatial Patterns of Trees Using Stem-Mapped Data. Forest Sci. 1993;39(4):756–75.
  12. 12. Illian J, Penttinen A, Stoyan H, Stoyan D. Statistical analysis and modelling of spatial point patterns: John Wiley & Sons; 2008.
  13. 13. Verhaak RG, Hoadley KA, Purdom E, Wang V, Qi Y, Wilkerson MD, et al. Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1. Cancer cell. 2010;17(1):98–110. pmid:20129251
  14. 14. Mazurowski MA, Desjardins A, Malof JM. Imaging descriptors improve the predictive power of survival models for glioblastoma patients. Neuro Oncol. 2013;15(10):1389–94. pmid:23396489
  15. 15. Affronti ML, Heery CR, Herndon JE, Rich JN, Reardon DA, Desjardins A, et al. Overall Survival of Newly Diagnosed Glioblastoma Patients Receiving Carmustine Wafers Followed by Radiation and Concurrent Temozolomide Plus Rotational Multiagent Chemotherapy. Cancer. 2009;115(15):3501–11. pmid:19514083
  16. 16. McAuliffe MJ, Lalonde FM, McGarry D, Gandler W, Csaky K, Trus BL. Medical Image Processing, Analysis & Visualization in clinical research. Comp Med Sy. 2001:381–6.
  17. 17. Sled JG, Zijdenbos AP, Evans AC. A nonparametric method for automatic correction of intensity nonuniformity in MRI data. IEEE transactions on medical imaging. 1998;17(1):87–97. pmid:9617910
  18. 18. Figueiredo MAT, Jain AK. Unsupervised learning of finite mixture models. Ieee T Pattern Anal. 2002;24(3):381–96.
  19. 19. Reynolds D. Gaussian mixture models. Encyclopedia of Biometrics. 2009:659–63.
  20. 20. Jiao Y, Berman H, Kiehl TR, Torquato S. Spatial organization and correlations of cell nuclei in brain tumors. PloS one. 2011;6(11):e27323. pmid:22110626
  21. 21. Rajala T, Illian J. A family of spatial biodiversity measures based on graphs. Environ Ecol Stat. 2012;19(4):545–72.
  22. 22. Baddeley A, Turner R. Spatstat: an R package for analyzing spatial point patterns. Journal of statistical software. 2005;12(6):1–42.
  23. 23. Van Lieshout MNM, Baddeley AJ. Indices of dependence between types in multivariate point patterns. Scand J Stat. 1999;26(4):511–32.
  24. 24. Baddeley A. Spatial point processes and their applications. Lect Notes Math. 2007;1892:1–75.
  25. 25. Eo S-H, Hong S-M, Cho H. K-Adaptive Partitioning for Survival Data with an Application to SEER: The kaps Add-on Package for R. arXiv preprint arXiv:13064615. 2013.
  26. 26. Eo S-H, Hong S-M, Cho H. K-Adaptive Partitioning for Survival Data: The kaps Add-on Package for R. arXiv preprint arXiv:13064615. 2013.
  27. 27. Schmidt M, Lipson H. Distilling free-form natural laws from experimental data. Science. 2009;324(5923):81–5. pmid:19342586
  28. 28. Gonzalez-Garcia I, Sole RV, Costa J. Metapopulation dynamics and spatial heterogeneity in cancer. Proc Natl Acad Sci U S A. 2002;99(20):13085–9. pmid:12351679
  29. 29. Aerts HJ, Velazquez ER, Leijenaar RT, Parmar C, Grossmann P, Cavalho S, et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nature communications. 2014;5:4006. pmid:24892406
  30. 30. Grove O, Berglund AE, Schabath MB, Aerts HJ, Dekker A, Wang H, et al. Quantitative computed tomographic descriptors associate tumor shape complexity and intratumor heterogeneity with prognosis in lung adenocarcinoma. PloS one. 2015;10(3):e0118261. pmid:25739030