Roupen Odabashian, Hematology/Oncology Fellow at the Karmanos Cancer Institute, posted on LinkedIn:
“New Study at ASCO2025.
Can large language models like GPT-4 and Claude Opus reason like oncologists?
The way we’re currently evaluating large language models—with those shiny journal titles touting multiple-choice exam benchmarks for accuracy—is just horribly WRONG.
Would you trust a fresh-out-of-med-school doctor to treat your cancer based solely on passing a multiple-choice test, without any real-world experience handling complex cases with multiple, difficult treatment options?
In our study at ASCO2025, we assessed large language models using multiple-choice questions, but we focused on their clinical reasoning, not just their accuracy. And the results? Shocking.
We benchmarked the clinical reasoning of AI models using 273 breast oncology multiple-choice questions from the ASCO QBank.
Key findings: GPT-4 and Claude Opus both started with high accuracy (81.3% and 79.5%, respectively).
After applying chain-of-thought prompting to simulate stepwise reasoning: Claude’s performance improved to 86.4%. GPT-4’s accuracy slightly declined to 80.95%.
That’s where we looked at their clinical reasoning! Common AI errors included:
- Reliance on outdated guidelines
- Misinterpretation of clinical trial data
- Lack of individualized/multidisciplinary care reasoning
Conclusion: LLMs are promising tools, but still fall short in nuanced, real-world oncology decision-making. Human supervision remains essential.
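The evaluation the post describes — scoring each multiple-choice item once with a direct-answer prompt and once with a chain-of-thought prompt, then comparing accuracy — can be sketched in a few lines. The snippet below is an illustrative outline only, not the study’s actual pipeline: the prompt wording, the GPT-4 model choice, the question schema, the answer-letter parser, and the `load_qbank` loader are all assumptions.

```python
# Illustrative sketch of an MCQ benchmark run with and without chain-of-thought
# prompting. Prompts, question schema, and the naive answer parser are
# assumptions for demonstration; the ASCO QBank items are not redistributable.
import re
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

DIRECT = "Answer with the letter of the single best option only."
COT = ("Reason step by step about the patient scenario, current guidelines, "
       "and relevant trial data, then end with 'Final answer: <letter>'.")

def ask(stem: str, options: dict[str, str], instruction: str) -> str:
    """Query the model on one item and return the option letter it picked."""
    prompt = stem + "\n" + "\n".join(f"{k}. {v}" for k, v in options.items())
    resp = client.chat.completions.create(
        model="gpt-4",
        temperature=0,
        messages=[
            {"role": "system", "content": instruction},
            {"role": "user", "content": prompt},
        ],
    )
    letters = re.findall(r"\b([A-E])\b", resp.choices[0].message.content)
    return letters[-1] if letters else ""  # naive: take the last standalone letter

def accuracy(items: list[dict], instruction: str) -> float:
    """Fraction of items where the parsed letter matches the answer key."""
    hits = sum(ask(q["stem"], q["options"], instruction) == q["answer"] for q in items)
    return hits / len(items)

# items = load_qbank()  # hypothetical loader for the 273 breast oncology MCQs
# print(f"direct: {accuracy(items, DIRECT):.1%}  CoT: {accuracy(items, COT):.1%}")
```

Holding temperature at 0 and changing only the system instruction isolates the effect of the prompting strategy on measured accuracy.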