Main Session
Sep 30
PQA 09 - Hematologic Malignancies, Health Services Research, Digital Health Innovation and Informatics

3643 - Autoregressive Tracking Transformer for Whole-Body 3D Lymph Node Detection in CT Scans

04:00pm - 05:00pm PT
Hall F
Screen: 6
POSTER

Presenter(s)

Dakai Jin, PhD - Alibaba Group (US) Inc., New York, NY

Q. Yu1, Y. Wang2, D. Ai3, D. Zheng4, K. Yan5, D. Guo2, Z. Ji2, Y. Su6, Q. Wang7, Y. Bian8, N. Shen9, L. Lu2, K. Zhao3, D. Jin2, and X. Ye4; 1Shanghai Jiao Tong University, Shanghai, China, 2Alibaba Group (US) Inc., Washington, DC, 3Fudan University Shanghai Cancer Center, Shanghai, China, 4Department of Radiation Oncology, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China, 5Alibaba DAMO Academy, Beijing, China, 6Alibaba DAMO Academy, Hangzhou, China, 7Department of Radiation Oncology, Sichuan Cancer Hospital and Institution, Chengdu, China, 8Departments of Radiology, Changhai Hospital, Shanghai, China, 9Zhongshan Hospital, Fudan University, Shanghai, China

Purpose/Objective(s): Identifying scatteredly-distributed and low-contrast LNs in 3D CT scans is highly challenging, even for experienced clinicians. Previous lesion and LN detection methods demonstrate effectiveness of 2.5D approaches (i.e, using 2D network with multi-slice inputs) by leveraging pretrained 2D model weights, and show improved accuracy as compared to 2D or 3D detectors. However, slice-based 2.5D detectors do not explicitly model inter-slice consistency for LN as a 3D object, requiring heuristic post-merging steps to generate final 3D LN instances. In this study, we formulate 3D LN detection as a tracking task and propose LN-Tracker, a novel LN tracking transformer, that can effectively tackle the challenging LN detection task across major body sections.

Materials/Methods: We collected and curated data from 4 institutions including 1000 patients with 7000+ annotated LNs including various body parts (neck, chest, and abdomen) and different diseases (head & neck, esophageal, lung, and pancreatic cancers). Built upon a transformer-based detector (DETR), LN-Tracker decouples transformer decoder's query into the track and detection groups, where the track query autoregressively follows previously tracked LN instances along the z-axis of a CT scan. We design a new transformer decoder with masked attention and inter-slice similarity loss to align track query's content to the context of current slice, and meanwhile preserving detection query's high accuracy in current slice. We evaluate the LN detection performance on each LN dataset (70% training, 10% validation, and 20% testing), and report the average sensitivity (AS) at 1, 2, 4, 8 FPs per patient and the average precision (AP).

Results: The quantitative evaluation on four LN datasets is summarized in Table 1. As shown, our LN-Tracker surpasses all comparing methods across four datasets, outperforming the closest competitor, LN-DETR (the-state-of-the-art 2.5D LN detector), by 2.7% in mean AS (62.7% vs. 60.0%) and 2.4% in mean AP (52.7% vs. 50.3%). Notably, on the Pan-LN dataset that have thinner z-axis spacing, LN-Tracker achieves significant improvements over the strong 3D detector nnDetection, with markedly margin of 5.1% on AS.

Conclusion: We developed a new whole-body 3D LN tracking transformer, LN-Tracker, which significantly enhances lymph node detection performance with strong generalizability across different body parts and diseases. This model has the potential to play an important role in clinical LN assessment.

Abstract 3643 - Table 1: Quantitative results on four LN detection datasets

AS: average sensitivity over 1, 2, 4, and 8 FPs per patient. AP: average precision in 3D.

Lung-LN Head&Neck-LN Esophageal-LN Pancreas-LN Mean
AS AP AS AP AS AP AS AP AS AP
nnDetection 51.5 50.7 64.6 48.3 66.3 59.3 52.4 38.8 58.7 49.3
LN-DETR 54.0 50.1 64.9 50.8 69.9 62.5 51.0 37.9 60.0 50.3
LN-Tracker 57.7 54.3 67.0 53.2 70.0 63.1 56.1 40.1 62.7 52.7