Main Session
Sep 28
PQA 02 - Lung Cancer/Thoracic Malignancies, Patient Reported Outcomes/QoL/Survivorship, Pediatric Cancer

2399 - Phenotyping Chronological Accumulation of Comorbidities for Prostate Cancer Patients: A Computational Approach

04:45pm - 06:00pm PT
Hall F
Screen: 25
POSTER

Presenter(s)

Matthew Kim, BS - University of California, San Francisco, San Francisco, CA

M. J. Kim1, M. Zhao2, R. Darawsheh1, W. Tang1, S. Washington III3, F. W. Huang4, and J. C. Hong5; 1University of California, San Francisco, San Francisco, CA, 2Department of Radiation Oncology, University of California San Francisco, San Francisco, CA, San Francisco, CA, 3University of California, San Francisco, Department of Urology, San Francisco, CA, 4Division of Hematology and Oncology, Department of Medicine, University of California San Francisco, San Francisco, CA, 5University of California San Francisco, Department of Radiation Oncology, San Francisco, CA

Purpose/Objective(s): Prostate cancer (PC) is the fifth leading cause of death worldwide. Understanding the broader context of disease-accumulated comorbidities can enhance risk stratification, improve decision-making, and shift clinical priorities toward more effective, personalized management strategies. This study introduces a novel approach for using Non-negative Matrix Factorization (NMF) for chronological comorbidity accumulation (CCA) in PC patients. We hypothesize that applying NMF to CCA in PC patients will reveal multimorbidity clusters and disease trajectories that align with known PC risk factors and progression pathways.

Materials/Methods: We randomly sampled 2000 patients with PC from a single institutional dataset. We identified the first chronological instance of each patient’s inpatient and outpatient diagnoses, categorized by 3-digit ICD codes to more broadly capture comorbidities managed in the outpatient setting, after their PC diagnosis. Disease clustering (DC) and temporal relationship phenotyping between ICD diagnoses were characterized via NMF following Hassaine et al. Steps include (1) disease matrix generation for every patient by first incidence of disease at age (in years) with number of unique conditions in the patient dataset, (2) disease frequency and inverse patient frequency calculation to correct for bias towards more frequent diseases, (3) Gaussian smoothing to correct for the variance of diagnosis noise of recorded chronic diseases, and (4) matrix concatenation and decomposition via NMF to output 2 matrixes: diagnosis prevalence in a DC and temporal phenotype per DC. Optimal DCs were identified by measuring the cophenetic correlation coefficient to identify the most stable clustering derived from NMF results. DCs were analyzed by joint activation probabilities and chronicity of DC diagnoses by ascendancy analysis.

Results: 6 disease clusters were identified: DC1) cardiovascular – I25, I48; DC2) cardiometabolic & secondary malignancies – I10, C79, E78; DC3) skin & joint disorders- M79, M25, L82; DC4) aging – N40, H90, H25; DC5) post-treatment complications – N39, E78; and DC6) hematologic – D64, E87, D72. All clusters had a moderate strength of co-occurrence between 0.45 to 0.64. Unique temporal phenotypes were generated per cluster. DC6, compared to DC2, had the highest chronicity score of 0.31.

Conclusion: Our preliminary results demonstrate that NMF-based disease clustering captures temporally coherent CCA patterns in PC patients, including secondary malignancies, treatment-related effects, and routine care. DCs had general co-chronicity occurrence, except for DC6 and DC2. The identified DCs align with possible disease progression pathways, validating this approach as a tool for comorbidity assessment. Future work will investigate comorbidity trajectories based on temporal diagnosis profiles post-PC diagnosis and assess long-term morbidity burden alongside different comorbidity profiles.