Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Clarifying the 2025-2030 Dietary Guidelines Contradictions

    June 29, 2026

    Doctronic and Simple HealthKit partners to connect at-home screening with AI-powered clinical care

    June 29, 2026

    988 Hotline, Private ER, Pulmonary Hypertension: Morning rounds

    June 29, 2026
    Facebook X (Twitter) Instagram
    Facebook X (Twitter) Instagram
    Health Magazine
    • Home
    • Environmental Health
    • Health Technology
    • Medical Research
    • Mental Health
    • Nutrition Science
    • Pharma
    • Public Health
    • Discover
      • Daily Health Tips
      • Financial Health & Stability
      • Holistic Health & Wellness
      • Mental Health
      • Nutrition & Dietary Trends
      • Professional & Personal Growth
    • Our Mission
    Health Magazine
    Home » News » LLMs still lack ‘clinical reasoning skills’: study
    Health Technology

    LLMs still lack ‘clinical reasoning skills’: study

    healthadminBy healthadminApril 17, 2026No Comments3 Mins Read
    LLMs still lack ‘clinical reasoning skills’: study
    Share
    Facebook Twitter Reddit Telegram Pinterest Email


    Despite the increasing use of artificial intelligence in healthcare for both patients and healthcare professionals, a new study from Commander Mass Brigham finds that publicly available generative AI models often fail to adequately navigate diagnostic situations.

    The study, published April 13 in JAMA Network Open, evaluated 21 different generic large-scale language models (LLMs) on 29 standardized clinical cases from January to December 2025. The model received a set of case records that “preserved clinical context and maintained continuity” throughout the clinical reasoning process.

    Medical student raters then scored the output of each stage against the MSD manual. The researchers also developed a new measure called the Proportional Index of Medical Evaluation of LLM (PrIME-LLM) to determine accuracy across five clinical reasoning domains.

    Among the LLMs tested by researchers at Mass General Brigham’s MESH Incubator were GPT-5, Gemini 3.0 Flash, and Grok 4.

    Although all LLMs achieved an accurate final diagnosis more than 90% of the time, the researchers found that the models “performed poorly in generating differential diagnoses and avoiding uncertainty compared to other inference stages.” All models failed to generate an appropriate differential diagnosis more than 80% of the time.

    “While these models are great for assigning a final diagnosis once the data is complete, they are difficult at the beginning of an open-ended case when there is less information,” lead author Alia Rao, a MESH researcher and MD student at Harvard Medical School, said in a statement.

    MESH Incubator Executive Director Marc Succi, MD, is one of the study’s corresponding authors. Suchi said in a statement that off-the-shelf LLMs are “not ready to be introduced to clinical grade without oversight” despite continued improvements.

    “Differential diagnosis is central to clinical reasoning and is the basis of ‘medical technology’ that currently cannot be replicated by AI,” Succi said.

    The new study builds on previous research by Succi and the MESH group. Researchers evaluated the clinical capabilities of ChatGPT 3.5 in August 2023 and found that the chatbot was approximately 72% accurate in overall clinical decision making.

    Researchers in the study said most models demonstrated improved accuracy when test results and images were provided in addition to text, and that recently released models performed better than older models.

    Limitations noted include that web search and inference are disabled, prior exposure to standardized cases cannot be completely excluded, and the evaluation does not incorporate model extensions.

    The study highlighted that LLM has the potential to “enhance, rather than replace, physician reasoning.”

    “The consistent gap between differential and final diagnoses highlights how differently these systems process information compared to physicians,” the researchers wrote. “Clinicians retain uncertainty and iteratively refine differential diagnoses, but LLM collapses prematurely into a single answer, and this limitation persists across generations of models.”



    Source link

    Visited 5 times, 1 visit(s) today
    Share. Facebook Twitter Pinterest LinkedIn Telegram Reddit Email
    Previous ArticleHealthy diet is associated with increased risk of lung cancer in young non-smokers
    Next Article New CDC candidate again
    healthadmin

    Related Posts

    Doctronic and Simple HealthKit partners to connect at-home screening with AI-powered clinical care

    June 29, 2026

    Industry Voices — 3 healthcare takeaways from the Pope’s AI encyclical

    June 26, 2026

    OpenAI touts GPT-5.5 instant health feature

    June 26, 2026

    Oracle and Theator collaborate to integrate AI-powered OR analytics

    June 26, 2026

    Upside Raises $20M in Series A to Expand Housing Assistance Platform

    June 25, 2026

    First Stop Health expands weight management program

    June 25, 2026
    Add A Comment

    Comments are closed.

    Categories

    • Daily Health Tips
    • Discover
    • Environmental Health
    • Exercise & Fitness
    • Featured
    • Featured Videos
    • Financial Health & Stability
    • Fitness
    • Fitness Updates
    • Health
    • Health Technology
    • Healthy Aging
    • Healthy Living
    • Holistic Healing
    • Holistic Health & Wellness
    • Medical Research
    • Medical Research & Insights
    • Mental Health
    • Mental Wellness
    • Natural Remedies
    • New Workouts
    • Nutrition
    • Nutrition & Dietary Trends
    • Nutrition & Superfoods
    • Nutrition Science
    • Pharma
    • Preventive Healthcare
    • Professional & Personal Growth
    • Public Health
    • Public Health & Awareness
    • Selected
    • Sleep & Recovery
    • Top Programs
    • Weight Management
    • Workouts
    Popular Posts
    • 1773313737_bacteria_-_Sebastian_Kaulitzki_46826fb7971649bfaca04a9b4cef3309-620x480.jpgHow Sino Biological ProPure™ redefines ultra-low… March 12, 2026
    • pexels-david-bartus-442116The food industry needs to act now to cut greenhouse… January 2, 2022
    • 1773729862_TagImage-3347-458389964760995353448-620x480.jpgDespite safety concerns, parents underestimate the… March 17, 2026
    • 1773209206_futuristic_techno_design_on_background_of_supercomputer_data_center_-_Image_-_Timofeev_Vladimir_M1_4.jpegMulti-agent AI systems outperform single models… March 11, 2026
    • 1774403998_image_28620e4b6b0047f7ab9154b41d739db1-620x480.jpgGait pattern helps distinguish between Lewy body… March 24, 2026
    • Leukemia-620x480.jpgBiomimetic platform powers CAR T therapy for… March 9, 2026

    Demo
    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo
    Don't Miss

    Clarifying the 2025-2030 Dietary Guidelines Contradictions

    By healthadminJune 29, 2026

    Recent updates to the 2025-2030 Dietary Guidelines have sparked confusion among dietitians and researchers regarding saturated fat recommendations, protein intake levels, and guidance on processed foods.

    Doctronic and Simple HealthKit partners to connect at-home screening with AI-powered clinical care

    June 29, 2026

    988 Hotline, Private ER, Pulmonary Hypertension: Morning rounds

    June 29, 2026

    Study finds that authoritarianism acts as a psychological bridge for dark personalities

    June 29, 2026

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    HealthxMagazine
    HealthxMagazine

    At HealthX Magazine, we are dedicated to empowering entrepreneurs, doctors, chiropractors, healthcare professionals, personal trainers, executives, thought leaders, and anyone striving for optimal health.

    Our Picks

    Study finds that authoritarianism acts as a psychological bridge for dark personalities

    June 29, 2026

    Millions of people take omega-3 fish oil for brain health, but new study finds no benefit

    June 29, 2026

    These fat-filled brain cells may be worsening multiple sclerosis

    June 29, 2026
    New Comments
      Facebook X (Twitter) Instagram Pinterest
      • Home
      • Privacy Policy
      • Our Mission
      © 2026 ThemeSphere. Designed by ThemeSphere.

      Type above and press Enter to search. Press Esc to cancel.