There are two principal challenges of applying ML in cancer progression modeling: Having enough phenotypically rich data to train models and professional validation of generated insights. So we had all the ingredients to deal with the scientific problem and achieve the objective.
We created a predictive model to forecast the risk of a disease progression (e.g., relapse, protracted clinical course) for patients with lung cancer and lymphoma: After transforming raw data to the OMOP CDM data standard, we defined the required cohorts. Based on it, we applied CNN-based deep learning models, predicting cancer outcomes to data collected for patients during the "time at risk" window.
As a result, we contributed to the early diagnosis of lung cancer and lymphoma with an ML-driven cancer progression predictive model.