Introduction

Pancreatic ductal adenocarcinoma (PDAC), one of the most common malignant neoplasms of digestive system, is generally associated with poor prognosis and high mortality. The overall 5-year survival rate among PDAC patients is less than 5%1. In recent years, the morbidity of PDAC increased year by year. In 2010, the global incidence of PDAC was up to 292,471 cases2. According to World Cancer Report 2014, the United States had 46,420 (male: 23,530 vs female: 22,890) new cases of PDAC in 2014, and an estimated 39,590 patients died from this disease3. While in China, the new cases of 2015 were 90,100, and 79,400 patients died. The number of new cases ranked 9th and deaths ranked 6th among the 10 most common cancers. In addition, from 2000 to 2011, the morbidity and mortality of PDAC in China have taken on overall upward trends4.

It is known that prognostic evaluation is the basis of personalized cancer treatment. However, there are some intrinsic demerits in the traditional prognostic evaluation methods. TNM staging system is one of the most widely used methods in the prognostic evaluation of cancer; nevertheless, it only takes histologic metastasis of tumor into account and doesn’t incorporate many other important prognostic factors, such as age, gender, tumor size, and tumor marker. In this sense, the traditional staging system is an inaccurate method for prognostic evaluation. Therefore, a more accurate and comprehensive tool is needed. Nomogram is one such tool.

Nomogram is a visualization method of regression, it is primarily based on logistic and cox regression model. Compared to the traditional TNM staging system, nomogram can integrate more significant prognostic factors, which enables it to make more accurate evaluation. While compared to the traditional regression model, nomogram is more user-friendly due to the visualization function, and even clinicians lacking statistical expertise can easily read it. As a result, nomogram is being widely used in clinical prognostic evaluation, and may serve as a potential surrogate for the traditional staging system5.

In the field of PDAC, several nomograms have been already proposed. However, to our knowledge, a nomogram specific to Chinese PDAC patients, especially advanced patients, has not been reported. Besides, most of the proposed nomograms were constructed for specific population and derived from retrospectively collected data, so the clinical utility was limited. In this setting, the current study aimed to develop and validate a widely applicable prognostic nomogram for advanced PDAC patients via a large prospective multicenter cohort study in China.

Patients and Methods

Study population

From January 2004 to December 2013, we conducted a multicenter dynamic cohort study. Consecutive patients from three participating hospitals (Shanghai Cancer Center, Changhai Hospital, Ruijin Hospital) were prospectively included between January 1, 2004 and December 31, 2011. Inclusion criteria were as follows: pathologically proven PDAC; no history of other malignancies; treatment-naive patients of stage III and stage IV, who received palliative chemotherapy, and the first-line anticancer treatment was gemcitabine-based chemotherapy; met the requirements of follow-up and signed the informed consent. All the patients were first reviewed by two doctors at each participating hospital, and then the clinical data was transferred to Shanghai Cancer Center. Two experienced doctors at Shanghai Cancer Center went through these clinical materials to determine inclusion or exclusion of the patients. If divergent views arose, a third doctor would be consulted for the final assessment of inclusion or exclusion.

This study was carried on in accordance with the precepts of the Helsinki Declaration. Approvals were obtained from the Ethics Committee for Medical Research, School of Public Health, Fudan University. During the hospital stay, every included patient or their guardians signed the informed consent in view of prospective research of the clinical data.

Data collection

Patient demographics (age and gender), smoking status, alcohol consumption, tumor stage, tumor size, tumor site, serum ALT (alanine aminotransferase), AST (aspartate aminotransferase), ALB (albumin), HBV infection status, and CA 19-9 were obtained at the diagnosis.

Based on the five indices in HBV profile (including HBsAg, HBsAb, HBeAg, HBeAb, HBcAb) and HBV-DNA level, HBV infection in this study was categorized into four statuses, namely, non infection, chronic HBV infection, inactive HBV carrier and resolved HBV infection. “Non infection” was defined as being negative for HBsAg and anti-HBc. “Chronic HBV infection” was defined as being HBsAg and anti-HBc positive, and either HBeAg positive or HBV DNA positive. “Inactive HBV carrier” was defined as HBsAg positive, and both HBeAg and HBV-DNA negative. “Resolved HBV infection” was defined as being HBsAg negative and either anti-HBe or anti-HBc positive6, 7.

Follow up

The follow-up started at the time of diagnosis and ended when the patients were dead or censored. All the patients in this study were followed up regularly. Survival conditions were actively obtained once a month within the first year and then every 3 months thereafter. The follow-up ended on December 31, 2013.

Statistical analysis

The statistical analyses were performed using R software version 3.3.28 (R Development Core Team; http://www.r-project.org) with the survival, rms, and survivalROC package9,10,11. The significant level was set at 0.05 and all tests were two sided.

For nomogram construction and validation, we randomly assigned two thirds of the patients to the training set and one third to the validation set. The characteristics of the two sets were described and compared using chi-square test or one sample t test. Variables to develop the nomogram were selected by the stepwise selection method using Akaike information criterion (AIC) in the Cox proportional hazards (PH) model. The PH assumption was examined using Schoenfeld residual plots and multicollinearity was examined using variance inflation factor (VIF). Based on the predictive model with identified prognostic factors, a nomogram predicting median survival time (MST) and 1-year overall survival (OS) was constructed. After the construction of the nomogram, internal and external validation were performed in the training set and validation set respectively. Nomogram validation consisted of two parts, discrimination and calibration. Discrimination was evaluated using a concordance index (C-index). The value of the C-index ranged from 0.5 (no discrimination at all) to 1.0 (perfect discrimination). Calibration was performed by comparing the means of predicted survival with those of actual survival with observed Kaplan-Meier estimates after grouping of the nomogram predicted survival by decile. To reduce bias, bootstraps with 1,000 resamples were used for these activities. For clinical use of the model, the total points of each patient were calculated based on the proposed nomogram. Receiver operating characteristic curve analysis with censored survival data was performed to calculate the optimal cutoff values that were determined by maximizing the Youden index (ie, sensitivity + specificity − 1). Accuracy of the optimal cutoff value was assessed by the sensitivity, specificity, predictive values, and likelihood ratios. In addition, to further examine the discrimination of the proposed nomogram, we categorized the patients of the training set into four groups by the quartiles of nomogram total points and then calculated each group’s MST and plot each group’s Kaplan-Meier survival curve. These curves were compared using log-rank test.

Data Availability

The dataset analyzed during this study is included in its Supplementary Information files.

Results

Cohort characteristics

In total, 1,526 consecutive patients with advanced PDAC were identified. In the training set (n = 1,017), the median follow-up was 6.8 months (range: 0.4 to 110.2 months). During this period, 912 patients (89.7%) died, and 105 patients (10.3%) were censored. The MST was 7.2 months (95%CI: 6.7 to 7.9 months). The 3-, 6- and 12-month survival rates were 84.2%, 57.3% and 27.2% respectively. While in the validation set (n = 509), the median follow-up was 6.7 months (range: 0.5 to 108.6 months). During this period, 450 patients (88.4%) died, and 59 patients (11.6%) were censored. The MST was 7.1 months (95%CI: 6.6 to 7.9 months). The 3-, 6- and 12-month survival rates were 86.3%, 58.1% and 30.8% respectively.

The patient characteristics were listed in Table 1. There were no significant differences among the following variables between the training set and validation set.

Table 1 Characteristics of the training set and validation set.

Independent prognostic factors in the training set

The results of multivariate cox regression were listed in Table 2. Older age (≥60 vs. <60, P < 0.001), stage IV (vs. III, P < 0.001), larger tumor size (P < 0.001), elevated ALT (vs. normal, P < 0.001), low ALB (vs. normal, P < 0.001), and elevated CA 19-9 (vs. normal, P < 0.001) were associated with poor prognosis. As for HBV infection status, chronic HBV infection (P = 0.007) was associated with better prognosis compared to non infection.

Table 2 Multivariate cox regression of the training set.

Prognostic nomogram for MST and OS

The prognostic nomogram that integrated all the variables selected by multivariate cox regression was shown in Fig. 1. The nomogram illustrated tumor size as sharing the largest contribution to prognosis, followed by tumor stage, HBV infection status, ALB, CA 19-9, ALT, and age. Each subtype within these variables was assigned a score on the point scale. By adding up the total score and locating it on the total point scale, we can easily draw a straight line down to determine the estimated median survival time and 1-year survival probability.

Figure 1
figure 1

Nomogram for predicting median survival time and 1-year survival probability of advanced PDAC patients. (To use the nomogram, an individual patient’s value is located on each variable axis, and a line is drawn upward to determine the number of points received for each variable value. The sum of these numbers is located on the Total Points axis, and a line is drawn downward to the survival axes to determine the estimated median survival time and 1-year survival probability). Abbreviations: CHB = Chronic HBV infection, Carrier = Inactive HBV carrier, None = Non infection, Resolved = Resolved HBV infection.

Validation of the nomogram

The calibration curves for predicting 1-year survival probability in the training set and validation set were shown in Fig. 2 and Fig. 3. Both of the curves showed good agreement between the nomogram prediction and actual observation for 1-year OS.

Figure 2
figure 2

The calibration curve for predicting 1-year survival probability of advanced PDAC patients in the training set.

Figure 3
figure 3

The calibration curve for predicting 1-year survival probability of advanced PDAC patients in the validation set.

In the training set, the C-index was 0.720 (95%CI: 0.703 to 0.737), which was significantly higher than that of the staging system (C-index = 0.613, 95%CI: 0.597 to 0.629; P < 0.001). In the validation set, the C-index was 0.696 (95%CI: 0.670 to 0.722), which was also significantly higher than that of the staging system (P < 0.001). The bias-corrected C-indexes in the training set and validation set were 0.716 and 0.696 respectively.

The area under ROC curve (AUC) of the training set and validation set were 0.792 and 0.732 respectively. The optimal cutoff value of nomogram total points was determined to be 216 in the training set and 214 in the validation set. The sensitivity, specificity, positive predictive value, negative predictive value, positive likelihood ratio, and negative likelihood ratio for predicting 1-year survival probability were as listed in Table 3.

Table 3 Accuracy of the proposed nomogram for predicting 1-year survival probability.

To determine the performance of the proposed nomogram in stratifying risk of patients, we categorized the patients of the training set into four subgroups based on the quartiles of nomogram total points (ie, lowest to 194, 195 to 245, 246 to 299, 300 to highest). The MST of each group was 13.4 months (95%CI: 12.0 to 14.4 months), 8.6 months (95%CI: 8.2 to 9.5 months), 5.7 months (95%CI: 5.4 to 6.2 months), 3.7 months (95%CI: 3.2 to 4.5 months), respectively. Fig. 4 illustrated the Kaplan–Meier survival curves according to the nomogram-based groupings. The survival times were significantly differentiated between the subgroups (P < 0.001).

Figure 4
figure 4

Kaplan-Meier survival curves of training set categorized by the quartiles of proposed nomogram total points.

Discussion

Pancreatic ductal adenocarcinoma is heterogeneous in regard to survival of individual patients; therefore, prognostic evaluation solely based on the traditional staging system is imprecise. Despite several previously reported nomograms among PDAC patients, a nomogram has not been developed for Chinese advanced PDAC patients. Thus, we sought to develop and validate such a prognostic nomogram to predict overall survival of this population. Favorable discrimination and calibration could be found in the proposed nomogram derived from prospectively collected data on 1526 patients from three hospitals in China. Superior to the existing TNM staging system, nomogram would facilitate the popularization of patient counselling and personalized treatment. Additionally, the proposed nomogram was constructed with a large prospective multicenter cohort study, rendering it more widely applicable than the previous reported nomograms.

In this study, we identified seven independent prognostic factors for advanced PDAC, namely, age, tumor stage, tumor size, ALT, ALB, CA 19-9, and HBV infection status. Our previous study12 revealed that HBV status is a significant factor affecting the progression of advanced PDAC, and that chronic hepatitis B infection may be a protective factor for these patients. As is generally recognized, long-term persistence of HBV infection can cause an inflammatory microenvironment in the liver13, which may trigger enhanced immune defense14. The boosted immune responses were assumed to be beneficial in the inhibition of tumor progression among PDAC patients. In addition, HBV replication may boost tumor necrosis factor α (TNF-α) secretion via regulating hepatocytes and immune cells residing in the liver15. In view that HBV can also replicate in pancreas16, we speculate that HBV also plays a role in the biological behavior of PDAC, thereby attenuating tumor invasiveness. These underlying effects altogether prolonged overall survival of PDAC patients. This result of our study was consistent with Qian HG et al.’s study17 among colorectal cancer patients. However, Wei et al.18 found that HBV infection increased synchronous liver metastasis incidence. The prognostic role of HBV infection for advanced PDAC patients and the underlying mechanisms need to be further explored via rigorously designed RCTs and biomedical experiments.

CA 19-9 was first isolated from a colorectal cell line and has since become the most widely used tumor marker for PDAC19. Because serum CA 19-9 level is intimately associated with the overall tumor burden of PDAC, patients with elevated CA 19-9 level may be at more advanced stage, compared to those without elevated CA 19-920, 21. Perioperative CA 19-9 level has been studied to determine the efficacy from radical surgery22,23,24. Several studies reported that preoperative CA 19-9 was linked with resectability and postoperative prognosis25,26,27. Moreover, postoperative CA 19-9 level can predict overall survival and disease-free survival after cancer resection and adjuvant chemotherapy28, 29. Our study confirmed the results from other studies. Saad et al.30 found that pretreatment CA19-9 level was an independent predictor of OS in PDAC patients who once had received gemcitabine chemotherapy. On the contrary, in a study with the sample size of 67 patients, Sezgin et al.31 identified performance status was the only independent prognostic factor of OS in locally advanced or metastatic PDAC patients who had undergone gemcitabine treatment and CA 19-9 level had nothing to do with the treatment response to gemcitabine. Whether CA 19-9 could be served as a reliable and applicable prognostic marker of advanced PDAC needs to be verified in large cohort studies. With around 10-year follow-up and a larger sample size, the current multi-center study suggested that CA 19-9 was a confirmative independent prognostic factor for advanced PDAC.

In addition to serum CA 19-9, decreased ALB level was found as a poor prognostic factor for advanced PDAC in this study. Serum ALB is the most abundant blood protein in mammals. For advanced PDAC, decreased ALB level often means relatively later stage of the disease, which can be caused by insufficient intake of nutrition, malabsorption, and accelerated decomposition of food32. Clinically, ALB supplementation during the disease course may improve nutrition status and thus help extend survival of advanced PDAC patients, but this needs to be verified by well-designed clinical trials.

Serum ALT and AST are commonly used as clinical indicators for liver function33. However, they have been rarely taken as prognostic factors for advanced PDAC. Our study suggested that elevated ALT level was a poor prognostic factor for advanced PDAC. It may be explained that elevated ALT reflects the consequences of cancer cell’s invasion into the liver and liver injury. Most of the time, without liver metastasis or liver injury, ALT or AST may not elevate. Thus, maintenance of ALT and AST levels to the normal ranges could prolong survival of advanced PDAC patients.

Besides the mentioned factors above, age at diagnosis, tumor stage, and tumor size were also found to be associated with prognosis of advanced PDAC. Most previous studies34,35,36 illustrated the prognostic roles of these factors in advanced PDAC and the results of our study were consistent with their findings.

There are also some limitations in the present study. Firstly, we randomly divided the patients into two subgroups, 2/3 for construction and 1/3 for validation of the nomogram. Although this is a generally accepted method of nomogram build and validation, yet it is still an alternative when external cohort is not available. Hence, our proposed nomogram needs to be validated in other population derived from an external cohort. Secondly, not all the potential prognostic factors are included in the nomogram, so it can’t make absolutely accurate prediction. However, the results of validation demonstrated good fitness of the present nomogram based on the seven variables for survival prediction. Thirdly, tumor stage was included into the nomogram as dichotomous variable (stage III or IV) rather than subdivision of TNM stage, which may result in less accuracy for prediction. Fourthly, none of included patients received FOLFIRINOX. FOLFIRINOX is a combination chemotherapy regimen consisting of oxaliplatin, irinotecan, fluorouracil, and leucovorin, which has been shown to prolong survival of patients with metastatic pancreatic adenocarcinoma compared to gemcitabine as first-line therapy37. FOLFIRINOX was first published in 2011, while our study started at the year of 2004 and gemcitabine-based chemotherapy was the standard regimen at that time. Besides, FOLFIRINOX is relatively expensive and patients who receive FOLFIRINOX may experience more severe side effects than those who receive gemcitabine alone, so this combination is usually given to patients who are healthy enough to tolerate the potential side effects37, 38. Therefore, FOLFIRINOX is not widely used for the treatment of advanced pancreatic cancer in China. In spite of this, FOLFIRINOX may become a prospective and promising chemotherapy for PDAC in the coming future in China because of its notable treatment response. Accordingly, we are undertaking to take it into clinical practice and planning to further study this regimen in future research.

In conclusion, the proposed nomogram can accurately predict overall survival of patients with advanced PDAC. Compared to the traditional staging system, clinicians can promote individual-oriented cancer therapy and disease management by using this tool.