Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    AI reveals ocean currents we couldn’t see before

    April 22, 2026

    Myanmar’s ‘mysterious’ new snake appears to be multiple species at once

    April 22, 2026

    Ancient DNA reveals hidden Neanderthal group frozen in time

    April 22, 2026
    Facebook X (Twitter) Instagram
    Facebook X (Twitter) Instagram
    Health Magazine
    • Home
    • Environmental Health
    • Health Technology
    • Medical Research
    • Mental Health
    • Nutrition Science
    • Pharma
    • Public Health
    • Discover
      • Daily Health Tips
      • Financial Health & Stability
      • Holistic Health & Wellness
      • Mental Health
      • Nutrition & Dietary Trends
      • Professional & Personal Growth
    • Our Mission
    Health Magazine
    Home » News » LLMs still lack ‘clinical reasoning skills’: study
    Health Technology

    LLMs still lack ‘clinical reasoning skills’: study

    healthadminBy healthadminApril 17, 2026No Comments3 Mins Read
    LLMs still lack ‘clinical reasoning skills’: study
    Share
    Facebook Twitter Reddit Telegram Pinterest Email


    Despite the increasing use of artificial intelligence in healthcare for both patients and healthcare professionals, a new study from Commander Mass Brigham finds that publicly available generative AI models often fail to adequately navigate diagnostic situations.

    The study, published April 13 in JAMA Network Open, evaluated 21 different generic large-scale language models (LLMs) on 29 standardized clinical cases from January to December 2025. The model received a set of case records that “preserved clinical context and maintained continuity” throughout the clinical reasoning process.

    Medical student raters then scored the output of each stage against the MSD manual. The researchers also developed a new measure called the Proportional Index of Medical Evaluation of LLM (PrIME-LLM) to determine accuracy across five clinical reasoning domains.

    Among the LLMs tested by researchers at Mass General Brigham’s MESH Incubator were GPT-5, Gemini 3.0 Flash, and Grok 4.

    Although all LLMs achieved an accurate final diagnosis more than 90% of the time, the researchers found that the models “performed poorly in generating differential diagnoses and avoiding uncertainty compared to other inference stages.” All models failed to generate an appropriate differential diagnosis more than 80% of the time.

    “While these models are great for assigning a final diagnosis once the data is complete, they are difficult at the beginning of an open-ended case when there is less information,” lead author Alia Rao, a MESH researcher and MD student at Harvard Medical School, said in a statement.

    MESH Incubator Executive Director Marc Succi, MD, is one of the study’s corresponding authors. Suchi said in a statement that off-the-shelf LLMs are “not ready to be introduced to clinical grade without oversight” despite continued improvements.

    “Differential diagnosis is central to clinical reasoning and is the basis of ‘medical technology’ that currently cannot be replicated by AI,” Succi said.

    The new study builds on previous research by Succi and the MESH group. Researchers evaluated the clinical capabilities of ChatGPT 3.5 in August 2023 and found that the chatbot was approximately 72% accurate in overall clinical decision making.

    Researchers in the study said most models demonstrated improved accuracy when test results and images were provided in addition to text, and that recently released models performed better than older models.

    Limitations noted include that web search and inference are disabled, prior exposure to standardized cases cannot be completely excluded, and the evaluation does not incorporate model extensions.

    The study highlighted that LLM has the potential to “enhance, rather than replace, physician reasoning.”

    “The consistent gap between differential and final diagnoses highlights how differently these systems process information compared to physicians,” the researchers wrote. “Clinicians retain uncertainty and iteratively refine differential diagnoses, but LLM collapses prematurely into a single answer, and this limitation persists across generations of models.”



    Source link

    Visited 1 times, 1 visit(s) today
    Share. Facebook Twitter Pinterest LinkedIn Telegram Reddit Email
    Previous ArticleHealthy diet is associated with increased risk of lung cancer in young non-smokers
    Next Article New CDC candidate again
    healthadmin

    Related Posts

    Covera Health and Medmo build collaborative imaging platform

    April 21, 2026

    ECRI spins out supply chain intelligence division to Staritas

    April 21, 2026

    Inside Highmark and Spring’s mental health partnership

    April 20, 2026

    Hippocrates AI launches two new tools for patients and nurses

    April 20, 2026

    Department of Justice charges telemedicine company Zealthy with fraud

    April 20, 2026

    APA launches digital resource library for mental health apps

    April 16, 2026
    Add A Comment
    Leave A Reply Cancel Reply

    Categories

    • Daily Health Tips
    • Discover
    • Environmental Health
    • Exercise & Fitness
    • Featured
    • Featured Videos
    • Financial Health & Stability
    • Fitness
    • Fitness Updates
    • Health
    • Health Technology
    • Healthy Aging
    • Healthy Living
    • Holistic Healing
    • Holistic Health & Wellness
    • Medical Research
    • Medical Research & Insights
    • Mental Health
    • Mental Wellness
    • Natural Remedies
    • New Workouts
    • Nutrition
    • Nutrition & Dietary Trends
    • Nutrition & Superfoods
    • Nutrition Science
    • Pharma
    • Preventive Healthcare
    • Professional & Personal Growth
    • Public Health
    • Public Health & Awareness
    • Selected
    • Sleep & Recovery
    • Top Programs
    • Weight Management
    • Workouts
    Popular Posts
    • the-pros-and-cons-of-paleo-dietsThe Pros and Cons of Paleo Diets: What Science Really Says April 16, 2025
    • Improve Mental Health10 Science-Backed Practices to Improve Mental Health… March 11, 2025
    • How Healthy Living Is Transforming Modern Wellness TrendsHow Healthy Living Is Transforming Modern Wellness… December 3, 2025
    • Kankakee_expansion.jpgCSL releases details of $1.5 billion U.S.… March 10, 2026
    • urlhttps3A2F2Fcalifornia-times-brightspot.s3.amazonaws.com2Fc32Fcd2F988500d440f2a55515940909.jpegA ‘reckless’ scrapyard with a history of… October 24, 2025
    • Healthy Living: Expert Tips to Improve Your Health in 2026Healthy Living: Expert Tips to Improve Your Health in 2026 November 16, 2025

    Demo
    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo
    Don't Miss

    AI reveals ocean currents we couldn’t see before

    By healthadminApril 22, 2026

    Scientists have introduced a new method to track ocean surface currents over vast areas in…

    Myanmar’s ‘mysterious’ new snake appears to be multiple species at once

    April 22, 2026

    Ancient DNA reveals hidden Neanderthal group frozen in time

    April 22, 2026

    Increase in rotavirus infections highlights the importance of childhood vaccinations

    April 22, 2026

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    HealthxMagazine
    HealthxMagazine

    At HealthX Magazine, we are dedicated to empowering entrepreneurs, doctors, chiropractors, healthcare professionals, personal trainers, executives, thought leaders, and anyone striving for optimal health.

    Our Picks

    Increase in rotavirus infections highlights the importance of childhood vaccinations

    April 22, 2026

    Epigenomic proteins shape dynamic gene expression beyond simple on-off

    April 22, 2026

    Stem cell model recreates early human embryo with yolk sac

    April 22, 2026
    New Comments
      Facebook X (Twitter) Instagram Pinterest
      • Home
      • Privacy Policy
      • Our Mission
      © 2026 ThemeSphere. Designed by ThemeSphere.

      Type above and press Enter to search. Press Esc to cancel.