Poster
RadGPT: Constructing 3D Image-Text Tumor Datasets
Pedro Bassi · Mehmet Yavuz · Ibrahim Ethem Hamamci · Sezgin Er · Xiaoxi Chen · Wenxuan Li · Bjoern Menze · Sergio Decherchi · Andrea Cavalli · Kang Wang · Yang Yang · Alan Yuille · Zongwei Zhou
With over 85 million CT scans performed annually in the United States, creating tumor-related reports is a challenging and time-consuming task for radiologists. To address this need, we present Rad-GPT, an Anatomy-Aware Vision-Language AI Agent for generating detailed reports from CT scans. Rad-GPT first segments tumors, including benign cysts and malignant tumors, and their surrounding anatomical structures, then transforms this information into both structured reports and narrative reports. These reports provide tumor size, shape, location, attenuation, volume, and interactions with surrounding blood vessels and organs. Extensive evaluation on unseen hospitals shows that RAD-GPT can produce accurate reports, with high sensitivity/specificity for small tumor (<2 cm) detection: 80/73% for liver tumors, 92/78% for kidney tumors, and 77/77% for pancreatic tumors. For large tumors, sensitivity ranges from 89% to 97%. The results significantly surpass the state-of-the-art in abdominal CT report generation.Rad-GPT generated reports for 17 public datasets. Through radiologist review and refinement, we have ensured the reports' accuracy, and created the first publicly available image-text 3D medical dataset, comprising over 1.8 million text tokens and 2.7 million images from 9,262 CT scans, including 2,947 tumor scans/reports of 2,562 tumor instances. Our reports can: (1) localize tumors in eight liver sub-segments and three pancreatic sub-segments annotated per-voxel; (2) determine pancreatic tumor stage (T1-T4) in \numofpancreatictumorstaging\ reports; and (3) report on multiple tumors individually while radiologists typically report only the largest or a few largest tumors. Importantly, 948 of the reports are for early-stage tumors.
Live content is unavailable. Log in and register to view live content