3687 - Validation of Claims-Based Algorithms to Characterize Thoracic Radiation Therapy Treatment Courses: Are Claims Enough to Study Radiation Therapy?
Presenter(s)
S. S. Neibart1, N. Lin2, S. Moningi3, B. H. Kann4, R. H. Mak5, and M. Lam6; 1Harvard Radiation Oncology Program, Boston, MA, 2Brigham and Women's Hospital, Boston, MA, 3Department of Radiation Oncology, Brigham and Women’s Hospital/Dana-Farber Cancer Institute, Boston, MA, 4Department of Radiation Oncology, Dana-Farber Cancer Institute/Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, 5Department of Radiation Oncology, Brigham and Women's Hospital/Dana-Farber Cancer Institute, Boston, MA, 6Department of Health Policy and Management, Harvard T.H. Chan School of Public Health, Boston, MA
Purpose/Objective(s): Routinely collected administrative data offer insights into healthcare utilization and outcomes but lack detailed clinical information—such as the specific site of radiation therapy (RT)—which limits the study of RT toxicity and efficacy in claims databases. This study aims to develop and validate claims-based algorithms to accurately identify thoracic RT in administrative databases.
Materials/Methods: Patients at our institution with an International Classification of Diseases, Tenth Revision, Clinical Modification (ICD-10-CM) lung cancer diagnosis and any RT Current Procedural Terminology (CPT) code from 10/2015-1/2024 were analyzed. RT claims for each treatment episodes were extracted. RT details were manually abstracted from the treatment planning system and electronic health record to classify episodes as thoracic or non-thoracic RT. A priori algorithms were defined as the presence of respiratory motion management (RMM) codes, more than 14 treatment codes (except for stereotactic body radiation therapy courses), and exclusively thoracic ICD-10-CM codes during the episode (i.e., no claims for bone metastases). Decision tree models using Classification and Regression Tree algorithms were developed on 70% of the cohort and validated on the remaining 30%. Performance metrics (sensitivity (Sn), specificity (Sp), positive predictive value (PPV), and negative predictive value (NPV)) were calculated, with an acceptable algorithm defined by a lower-bound Clopper-Pearson 95% confidence interval for PPV exceeding 70%.
Results: Of 3491 patients and 4715 RT episodes initially identified, 131 episodes (3%) and 738 (16%) episodes were excluded for missing claims data and absence of external beam RT claims, respectively, leaving 3846 episodes for analysis. Performance metrics for the tested algorithms are shown.
Conclusion: Clinically-informed and decision tree–based models can accurately identify thoracic RT in claims data, achieving high PPVs for IMRT and SBRT courses, while limiting false negatives. These algorithms can be applied in claims databases to assess RT toxicity and effectiveness.
Abstract 3687 - Table 1: Algorithm performance
Modality | Sn (95% CI) | Sp (95% CI) | PPV (95% CI) | NPV (95% CI) |
RMM OR 3DCRT and IMRT treatment codes > 14 OR Exclusive thoracic codes | ||||
3DCRT | 46% (42-49%) | 87% (84-90%) | 84% (80-87%) | 53% (50-57%) |
IMRT | 82% (81-84%) | 92% (90-93%) | 96% (95-97%) | 70% (68-72%) |
SBRT | 91% (89-93%) | 83% (80-86%) | 89% (87-91%) | 87% (84-90%) |
RMM OR IMRT or 3DCRT treatment codes > 14 | ||||
3DCRT | 76% (70-80%) | 81% (78-83%) | 53% (48-58%) | 92% (90-94%) |
IMRT | 94% (93-95%) | 58% (50-66%) | 95% (93-96%) | 56% (48-63%) |
SBRT | 96% (94-97%) | 28% (24-32%) | 62% (59-65%) | 84% (78-90%) |
Decision Tree | ||||
3DCRT | 72% (63%-80%) | 88% (83-92%) | 73% (64-81%) | 87% (83-91%) |
IMRT | 98% (96%-99%) | 59% (43-74%) | 96% (93-98%) | 76% (59-89%) |
SBRT | 97% (95%-99%) | 70% (57-81%) | 94% (91-96%) | 84% (71-93%) |
3DCRT = Three-dimensional conformal radiation therapy; IMRT = Intensity-modulated radiation therapy; SBRT = Stereotactic body radiotherapy |