    Health Magazine
    LLMs still lack ‘clinical reasoning skills’: study

    By healthadmin, April 17, 2026
    Despite the increasing use of artificial intelligence in healthcare by both patients and healthcare professionals, a new study from Mass General Brigham finds that publicly available generative AI models often fail to adequately navigate diagnostic situations.

    The study, published April 13 in JAMA Network Open, evaluated 21 general-purpose large language models (LLMs) on 29 standardized clinical cases between January and December 2025. Each model received a set of case records that “preserved clinical context and maintained continuity” throughout the clinical reasoning process.

    Medical student raters then scored the output of each stage against the MSD Manual. The researchers also developed a new measure, the Proportional Index of Medical Evaluation of LLM (PrIME-LLM), to determine accuracy across five clinical reasoning domains.

    Among the LLMs tested by researchers at Mass General Brigham’s MESH Incubator were GPT-5, Gemini 3.0 Flash, and Grok 4.

    Although all LLMs reached an accurate final diagnosis more than 90% of the time, the researchers found that the models performed poorly at generating differential diagnoses and handling uncertainty compared with the other reasoning stages. All models failed to generate an appropriate differential diagnosis more than 80% of the time.

    “While these models are great at assigning a final diagnosis once the data is complete, they struggle at the beginning of an open-ended case when there is less information,” lead author Arya Rao, a MESH researcher and MD student at Harvard Medical School, said in a statement.

    MESH Incubator Executive Director Marc Succi, MD, is one of the study’s corresponding authors. Succi said in a statement that, despite continued improvements, off-the-shelf LLMs are not yet ready for clinical-grade deployment without oversight.

    “Differential diagnosis is central to clinical reasoning and underlies the ‘art of medicine’ that AI currently cannot replicate,” Succi said.

    The new study builds on previous research by Succi and the MESH group. Researchers evaluated the clinical capabilities of ChatGPT 3.5 in August 2023 and found that the chatbot was approximately 72% accurate in overall clinical decision making.

    The researchers said most models were more accurate when test results and images were provided in addition to text, and that recently released models outperformed older ones.

    The noted limitations include that web search and reasoning modes were disabled, that prior exposure to the standardized cases could not be completely ruled out, and that the evaluation did not incorporate model extensions.

    The study highlighted that LLMs have the potential to “enhance, rather than replace, physician reasoning.”

    “The consistent gap between differential and final diagnoses highlights how differently these systems process information compared to physicians,” the researchers wrote. “Clinicians retain uncertainty and iteratively refine differential diagnoses, but LLMs collapse prematurely into a single answer, and this limitation persists across generations of models.”


