--
WilliamDupont - 04 Mar 2022
Tumor Mutation Burden and Survival in Cancer Patients
We (1) have written a paper on the effects of tumor mutation burden (TMB), age, cancer type, stage and grade on survival. We found that survival is affected by this burden through a non-linear function of the number of
short base substitution and insertion-deletion
mutations (SM and copy number alteration (CNA) fractions. These results are based on survival among 10,844 patients from the
Pan-Cancer Atlas who were diagnosed with 32 different cancers. I have posted a
RStudio Shiny app that estimates hazard ratios for patients with specific SM counts and CNA fractions, and that estimates 15-year survival plots given a patient’s TMB, cancer-type, age, stage and grade.
TMB data is available on 91% of evaluated patients in the Pan-Cancer Atlas. Hence, our results concerning the effects of TMB on survival are unlikely to be badly affected by missing values. The estimation of survival curves is affected by missing values of age, stage and grade. Survival curves cannot be drawn for some combination of these variables when all patients with the cancer of interest is missing one of these values. For some cancers and combinations of variables these missing values make sense. For example, hematological cancers do not have a grade. In other cancers, we do not know why these values were not entered into the Pan-Cancer Atlas. Table 1 shows the variables that may be completely missing for some cancers. When this happens, the Shiny app gives the message “Pan-cancer Atlas does not contain the data needed for this survival curve” For example, the app will generate a survival curve for a breast cancer patient whose age and stage are known. The program fails when age, stage and grade are given because none of the breast cancer patients in the Pan-Cancer Atlas have a recorded grade.
Table 1
|
Cancers that have the indicated prognostic variable for some patients |
------------------------ Cancer --------------------- |
SM count |
CNA fraction |
Age |
Stage |
Grade |
Adrenocortical Carcinoma |
Yes |
Yes |
Yes |
Yes |
No |
Bladder Cancer |
Yes |
Yes |
Yes |
Yes |
Yes |
Breast Cancer |
Yes |
Yes |
Yes |
Yes |
No |
Cervical Squamous Cell Carcinoma |
Yes |
Yes |
Yes |
Yes |
Yes |
Cholangiocarcinoma |
Yes |
Yes |
Yes |
Yes |
Yes |
Colorectal Adenocarcinoma |
Yes |
Yes |
Yes |
Yes |
Yes |
Diffuse Large B-Cell Lymphoma |
Yes |
Yes |
Yes |
No |
No |
Esophageal Adenocarcinoma |
Yes |
Yes |
Yes |
Yes |
Yes |
Glioblastoma Multiforme |
Yes |
Yes |
Yes |
No |
No |
Head and Neck Squamous Cell Carcinoma |
Yes |
Yes |
Yes |
Yes |
Yes |
Kidney Chromophobe |
Yes |
Yes |
Yes |
Yes |
No |
Kidney Renal Clear Cell Carcinoma |
Yes |
Yes |
Yes |
Yes |
Yes |
Kidney Renal Papillary Cell Carcinoma |
Yes |
Yes |
Yes |
Yes |
No |
Acute Myeloid Leukemia |
Yes |
Yes |
No |
No |
No |
Brain Lower Grade Glioma |
Yes |
Yes |
Yes |
No |
Yes |
Liver Hepatocellular Carcinoma |
Yes |
Yes |
Yes |
Yes |
Yes |
Lung Adenocarcinoma |
Yes |
Yes |
Yes |
Yes |
No |
Lung Squamous Cell Carcinoma |
Yes |
Yes |
Yes |
Yes |
No |
Mesothelioma |
Yes |
Yes |
Yes |
Yes |
No |
Ovarian Serous Cystadenocarcinoma |
Yes |
Yes |
Yes |
No |
Yes |
Pancreatic Adenocarcinoma |
Yes |
Yes |
Yes |
Yes |
Yes |
Pheochromocytoma and Paraganglioma |
Yes |
Yes |
Yes |
No |
No |
Prostate Adenocarcinoma |
Yes |
Yes |
Yes |
No |
No |
Sarcoma |
Yes |
Yes |
Yes |
No |
No |
Skin Cutaneous Melanoma |
Yes |
Yes |
Yes |
Yes |
No |
Stomach Adenocarcinoma |
Yes |
Yes |
Yes |
Yes |
Yes |
Testicular Germ Cell Tumors |
Yes |
Yes |
Yes |
Yes |
No |
Thyroid Carcinoma |
Yes |
Yes |
Yes |
Yes |
No |
Thymoma |
Yes |
Yes |
Yes |
No |
No |
Uterine Corpus Endometrial Carcinoma |
Yes |
Yes |
Yes |
No |
Yes |
Uterine Carcinosarcoma |
Yes |
Yes |
Yes |
No |
No |
Uveal Melanoma |
Yes |
Yes |
No |
No |
No |
Table 2 gives the proportion of patients with one or more missing covariates in the models evaluated by the Shiny app. It evaluates seven different models in which survival is regressed against seven different combinations of covariates. The sets of covariates in these models are labeled A through G in Table 2. They are
A: SM count and CNA fraction,
B: SM count, CNA fraction and cancer
C: SM count CNA fraction, cancer and age at diagnosis,
D: SM count CNA fraction, cancer and stage,
E: SM count CNA fraction, cancer, age at diagnosis and stage,
F: SM count CNA fraction, cancer, stage and grade, and
G: SM count CNA fraction, cancer, age at diagnosis, stage and grade.
The percent of patients with each cancer who are missing one or more of the covariates in these models is given in Table 2. These percentages vary considerably from cancer to cancer and model to model. For many cancers and models the proportion of patients with the needed covariates is high. For a few, the proportion of patients with the required covariates is low but not zero. For example, only 44% of patients with Glioblastoma Multiforme have recorded values of the TMB covariates and age at diagnosis.
Table 2
Supplemental Table 3. Percentage of patients with missing covariates for each cancer and model evaluated by the Shiny app.
Models Evaluated*
Cancer A B C D E F G
Adrenocortical Carcinoma 97.8 97.8 97.8 95.6 95.6 0.0 0.0
Bladder Cancer 98.3 98.3 98.3 97.8 97.8 97.3 97.3
Breast Cancer 91.7 91.7 91.7 90.0 90.0 0.0 0.0
Cervical Squamous Cell Carcinoma 93.3 93.3 93.3 72.1 72.1 66.2 66.2
Cholangiocarcinoma 100.0 100.0 100.0 100.0 100.0 100.0 100.0
Colorectal Adenocarcinoma 87.0 87.0 87.0 84.9 84.9 84.7 84.7
Diffuse Large B-Cell Lymphoma 76.6 76.6 76.6 0.0 0.0 0.0 0.0
Esophageal Adenocarcinoma 100.0 100.0 100.0 87.9 87.9 70.3 70.3
Glioblastoma Multiforme 64.3 64.3 44.0 0.0 0.0 0.0 0.0
Head and Neck Squamous Cell Carcinoma 95.0 95.0 95.0 82.8 82.8 79.7 79.7
Kidney Chromophobe 100.0 100.0 100.0 100.0 100.0 0.0 0.0
Kidney Renal Clear Cell Carcinoma 68.6 68.6 68.6 68.6 68.6 67.3 67.3
Kidney Renal Papillary Cell Carcinoma 96.8 96.8 96.1 87.9 87.1 0.0 0.0
Acute Myeloid Leukemia 95.4 95.4 0.0 0.0 0.0 0.0 0.0
Brain Lower Grade Glioma 98.4 98.4 98.4 0.0 0.0 0.0 0.0
Liver Hepatocellular Carcinoma 94.8 94.8 94.8 89.9 89.9 89.3 89.3
Lung Adenocarcinoma 96.2 96.2 94.2 95.8 93.8 0.0 0.0
Lung Squamous Cell Carcinoma 96.2 96.2 94.8 95.6 94.6 0.0 0.0
Mesothelioma 94.1 94.1 94.1 94.1 94.1 0.0 0.0
Ovarian Serous Cystadenocarcinoma 69.5 69.5 59.5 0.0 0.0 0.0 0.0
Pancreatic Adenocarcinoma 94.5 94.5 94.5 93.4 93.4 91.8 91.8
Pheochromocytoma and Paraganglioma 90.4 90.4 90.4 0.0 0.0 0.0 0.0
Prostate Adenocarcinoma 98.4 98.4 98.4 0.0 0.0 0.0 0.0
Sarcoma 90.6 90.6 90.6 0.0 0.0 0.0 0.0
Skin Cutaneous Melanoma 98.1 98.1 98.1 90.4 90.4 0.0 0.0
Stomach Adenocarcinoma 98.6 98.6 97.8 96.4 95.7 94.2 93.5
Testicular Germ Cell Tumors 95.5 95.5 95.5 58.6 58.6 0.0 0.0
Thyroid Carcinoma 96.4 96.4 96.4 96.0 96.0 0.0 0.0
Thymoma 100.0 100.0 100.0 0.0 0.0 0.0 0.0
Uterine Corpus Endometrial Carcinoma 95.4 95.4 95.1 0.0 0.0 0.0 0.0
Uterine Carcinosarcoma 98.2 98.2 48.2 0.0 0.0 0.0 0.0
Uveal Melanoma 100.0 100.0 0.0 0.0 0.0 0.0 0.0
* Proprotional hazards regression of survival against covariates from different models as described in the text. The covariates in the different regression models are as follows:
A: SM count and CNA fraction,
B: SM count, CNA fraction and cancer
C: SM count CNA fraction, cancer and age at diagnosis,
D: SM count CNA fraction, cancer and stage,
E: SM count CNA fraction, cancer, age at diagnosis and stage,
F: SM count CNA fraction, cancer, stage and grade, and
G: SM count CNA fraction, cancer, age at diagnosis, stage and grade.