Description
Cancer researchers have traditionally used a handful of markers to understand the origin of tumors and to predict therapeutic response. Additionally, performing machine learning activities on disparate data sources of varying quality is fraught with inherent bias. The Caris Life Sciences Molecular Database (CMD) is an immense resource for discovery as it contains over 215,000 molecular profiles of tumors with consistently gathered clinical grade molecular data along with immense amounts of clinical outcomes data. This resource was leveraged to generate two artificial intelligence algorithms aiding in diagnosis and one for therapy selection.
The Molecular Disease Classifier (MDC) was trained on 34,352 cases and tested on 15,473 unambiguously diagnosed cases. The MDC predicted the correct tumor type out of thirteen possibilities in the labeled data set with sensitivity, specificity, PPV, and NPV of 90.5%, 99.2%, 90.5% and 99.2% respectively when considering up to 5 predictions for a case.
The availability of whole transcriptome data in the CMD prompted its inclusion into a new platform called MI GPSai (MI Genomic Prevalence Score). The algorithm trained on genomic data from 34,352 cases and genomic and transcriptomic data from 23,137 cases and was validated on 19,555 cases. MI GPSai can predict the correct tumor type out of 21 possibilities on 93% of cases with 94% accuracy. When considering the top two predictions for a case, the accuracy increases to 97%.
Finally, a 67 gene molecular signature predictive of efficacy of oxaliplatin-based chemotherapy in patients with metastatic colorectal cancer was developed - FOLFOXai. The signature was predictive of survival in an independent real-world evidence (RWE) dataset of 412 patients who had received FOLFOX/BV in 1st line and inversely predictive of survival in RWE data from 55 patients who had received 1st line FOLFIRI. Blinded analysis of TRIBE2 samples confirmed that FOLFOXai was predictive of OS in both oxaliplatin-containing arms (FOLFOX HR=0.629, p=0.04 and FOLFOXIRI HR=0.483, p=0.02).
The Molecular Disease Classifier (MDC) was trained on 34,352 cases and tested on 15,473 unambiguously diagnosed cases. The MDC predicted the correct tumor type out of thirteen possibilities in the labeled data set with sensitivity, specificity, PPV, and NPV of 90.5%, 99.2%, 90.5% and 99.2% respectively when considering up to 5 predictions for a case.
The availability of whole transcriptome data in the CMD prompted its inclusion into a new platform called MI GPSai (MI Genomic Prevalence Score). The algorithm trained on genomic data from 34,352 cases and genomic and transcriptomic data from 23,137 cases and was validated on 19,555 cases. MI GPSai can predict the correct tumor type out of 21 possibilities on 93% of cases with 94% accuracy. When considering the top two predictions for a case, the accuracy increases to 97%.
Finally, a 67 gene molecular signature predictive of efficacy of oxaliplatin-based chemotherapy in patients with metastatic colorectal cancer was developed - FOLFOXai. The signature was predictive of survival in an independent real-world evidence (RWE) dataset of 412 patients who had received FOLFOX/BV in 1st line and inversely predictive of survival in RWE data from 55 patients who had received 1st line FOLFIRI. Blinded analysis of TRIBE2 samples confirmed that FOLFOXai was predictive of OS in both oxaliplatin-containing arms (FOLFOX HR=0.629, p=0.04 and FOLFOXIRI HR=0.483, p=0.02).
Details
Title
- Leveraging Artificial Intelligence to Find Previously Undiscovered Patterns in Tumor Molecular Data to Aid in Diagnosis and Therapy Selection
Contributors
- Abraham, Jim (Author)
- Spetzler, David (Thesis advisor)
- Frasch, Wayne (Thesis advisor)
- Lake, Douglas (Committee member)
- Compton, Carolyn (Committee member)
- Arizona State University (Publisher)
Date Created
The date the item was original created (prior to any relationship with the ASU Digital Repositories.)
2020
Subjects
Resource Type
Collections this item is in
Note
- Doctoral Dissertation Molecular and Cellular Biology 2020