2697 - A Large Language Model-Augmented Multimodal Framework for Predicting Pain Relief Outcomes Following Stereotactic Body Radiotherapy (SBRT) in Spinal Metastases: Integrating Clinical Factors and Imaging Reports
Presenter(s)
K. Zhang1, X. Ye2, G. A. Szalkowski3, Z. Yang4, C. Chuang4, L. Wang5, L. Liu6, S. G. Soltys4, E. L. Pollom4, E. Rahimy4, J. Byun6, D. Park7, Y. Hori7, F. Lam7, D. Reesh7, S. D. Chang7, G. Li7, M. Hayden8, M. Kazemimoghadam9, Q. Wang9, M. Chen9, H. Jiang10, W. Lu9, and X. Gu11; 1Department of Radiation Oncology, Stanford University, Stanford, CA, 2Department of Radiation Oncology, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China, 3Georgia Institute of Technology, Atlanta, GA, 4Department of Radiation Oncology, Stanford University School of Medicine, Stanford, CA, 5Department of Radiation Oncology, Stanford University School of Medicine, Palo Alto, CA, 6Department of Radiation Oncology, Stanford University, Stanford, CA, 7Department of Neurosurgery, Stanford University School of Medicine, Stanford, CA, 8Department of Neurosurgery, Stanford University, Stanford, CA, 9Medical Artificial Intelligence and Automation (MAIA) Lab, Department of Radiation Oncology, UT Southwestern Medical Center, Dallas, TX, 10Department of Radiation Oncology, The University of Texas Southwestern Medical Center, Dallas, TX, 11Stanford University Department of Radiation Oncology, Palo Alto, CA
Purpose/Objective(s):
Accurate prediction of pain relief following spinal stereotactic body radiotherapy (SBRT) is essential for assessing treatment effectiveness and optimizing patient care; however, this remains challenging due to the complex and multifactorial nature of pain. We hypothesize that a multimodal deep-learning framework, which integrates augmented clinical factors and imaging features extracted from reports using large language models (LLMs), can accurately predict pain relief outcomes.
Materials/Methods:
We retrospectively collected 160 spinal metastasis cases treated with spine SBRT from our institutional frameless robotic radiosurgery system database, partitioned into 104 cases for training, 26 for validation, and 30 for testing. Each case included two modalities of data: clinical factors and imaging reports. The proposed framework comprises three key components: (1) a data augmentation strategy to encode clinical features such as tumor type, metastasis location, and pain severity; (2) an LLM-driven analyzer (e.g., ChatGPT-4o) to extract high-dimensional semantic embeddings directly from the narrative and impression sections of imaging reports; and (3) a cross-attention-based transformer classifier that dynamically fuses clinical data and imaging features to capture interdependencies and enhance predictive accuracy. By unifying these components into an end-to-end workflow, our framework takes clinical factors and imaging reports as input to predict pain relief outcomes.
Results:
The proposed method achieved an accuracy of 85.15% and an area under the curve (AUC) of 0.88 (as shown in Table 1). Additionally, we conducted an ablation study using only the clinical factors as a single modality. The preliminary results indicated that the multimodal model enhanced performance, surpassing the 80.95% accuracy achieved with the single modality.
Conclusion:
Our study introduces a novel LLM-augmented multimodal framework that integrates clinical factors and imaging reports to predict pain relief outcomes following SBRT for spinal metastases. By leveraging a cross-attention transformer to fuse structured clinical data with semantic embeddings extracted from imaging narratives, our multimodal framework outperformed the single-modal approach. These results underscore the value of combining advanced LLMs with multimodal data to enhance predictive precision in oncology workflows.
Abstract 2697 - Table 1: Evaluation of methods based on multi-modal inputs versus single-modal (clinical factors) inputs
| | Accuracy | Sensitivity | Specificity | AUC |
| Multi-Modal | 85.15% | 93.02% | 79.31% | 0.88 |
| Single-Modal | 80.95% | 85.00% | 73.91% | 0.81 |
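The cross-attention fusion described in Materials/Methods (component 3) can be illustrated with a minimal single-head sketch, in which clinical-factor tokens act as queries over LLM-derived report embeddings. This is an illustrative toy example only, not the authors' implementation: the token counts, embedding dimensions, and random projection weights below are all hypothetical.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax over the given axis
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(clinical, report, d_k=16, seed=0):
    """Single-head cross-attention: clinical tokens (queries) attend
    over report-embedding tokens (keys/values). Weights are random
    stand-ins for learned projections."""
    rng = np.random.default_rng(seed)
    Wq = rng.standard_normal((clinical.shape[-1], d_k)) / np.sqrt(clinical.shape[-1])
    Wk = rng.standard_normal((report.shape[-1], d_k)) / np.sqrt(report.shape[-1])
    Wv = rng.standard_normal((report.shape[-1], d_k)) / np.sqrt(report.shape[-1])
    Q = clinical @ Wq                                # (n_clin, d_k)
    K = report @ Wk                                  # (n_rep, d_k)
    V = report @ Wv                                  # (n_rep, d_k)
    attn = softmax(Q @ K.T / np.sqrt(d_k), axis=-1)  # (n_clin, n_rep)
    return attn @ V                                  # fused representation

# Toy inputs: 5 encoded clinical-factor tokens, 12 report-embedding tokens
clin = np.random.default_rng(1).standard_normal((5, 8))
rep = np.random.default_rng(2).standard_normal((12, 32))
fused = cross_attention(clin, rep)
print(fused.shape)  # (5, 16)
```

In the full framework, the fused representation would feed a classification head that outputs the pain relief prediction; a learned, multi-head version of this operation is what lets the model weight report evidence differently for each clinical feature.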