    Study finds ChatGPT gets science wrong more often than you think

    By healthadmin, March 18, 2026


    Washington State University professor Mesut Cicek and his research team repeatedly tested ChatGPT on hypotheses drawn from scientific papers. The goal was to see whether the AI could correctly determine if each claim was supported by the research, that is, whether it was true or false.

    In total, the team evaluated more than 700 hypotheses and asked the same question 10 times for each hypothesis to measure consistency.

    Accuracy results and AI performance limits

    When first tested in 2024, ChatGPT answered correctly 76.5% of the time. A follow-up test in 2025 showed a slight increase in accuracy, to 80%. But when the researchers adjusted for random guessing, the results looked less impressive: the AI performed only about 60% better than chance, a level closer to a D grade than to strong reliability.
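One common way to adjust a true-or-false accuracy score for guessing is to rescale it so that chance maps to 0 and a perfect score maps to 1. The sketch below assumes that formula and a 50% chance baseline; the function name `chance_adjusted` is hypothetical, and the study's exact adjustment may differ.

```python
def chance_adjusted(accuracy: float, chance: float = 0.5) -> float:
    """Rescale accuracy so that chance-level maps to 0.0 and perfect maps to 1.0.

    Assumed formula (not confirmed by the article):
        (accuracy - chance) / (1 - chance)
    """
    if not 0.0 <= chance < 1.0:
        raise ValueError("chance must be in [0, 1)")
    return (accuracy - chance) / (1.0 - chance)


# Raw accuracy figures reported in the article.
reported = {2024: 0.765, 2025: 0.80}

for year, accuracy in reported.items():
    print(year, round(chance_adjusted(accuracy), 2))
```

Under this assumed formula, 80% raw accuracy corresponds to 0.60 above chance and 76.5% corresponds to 0.53, which shows how a score that sounds high on its own shrinks once coin-flip guessing is taken as the baseline.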

    The model had the hardest time identifying false statements, correctly labeling them only 16.4% of the time. It was also notably inconsistent: even when given the exact same prompt 10 times, ChatGPT produced a consistent answer only 73% of the time.

    Inconsistent answers raise concerns

    “We’re not just talking about accuracy, we’re talking about inconsistency, because if you ask the same question over and over again, you’re going to get different answers,” said Cicek, an associate professor in the Department of Marketing and International Business in WSU’s Carson College of Business and lead author of the new paper.

    “We used 10 prompts with the exact same question, all the same. The answer is true. Then it says false. True, false, false, true. There were some cases where there were five true and five false.”
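The article does not say how the consistency figure was computed. As a minimal sketch, assuming each hypothesis yields a list of 10 true/false answers, two simple measures are whether all answers agree and what share of answers match the most common one. The function names and the sample data below are made up for illustration and are not taken from the study.

```python
from collections import Counter


def majority_agreement(answers: list[bool]) -> float:
    """Fraction of repeated answers that match the most common answer."""
    if not answers:
        raise ValueError("answers must not be empty")
    most_common_count = Counter(answers).most_common(1)[0][1]
    return most_common_count / len(answers)


def is_fully_consistent(answers: list[bool]) -> bool:
    """True only if every repeated answer is identical."""
    return len(set(answers)) == 1


# Hypothetical data: 10 repeated answers per hypothesis (not from the study).
runs = {
    "H1": [True] * 10,                       # same answer every time
    "H2": [True, False, True, False, False,
           True, True, False, True, False],  # five true, five false
    "H3": [True] * 9 + [False],              # one dissenting answer
}

for name, answers in runs.items():
    print(name, is_fully_consistent(answers), majority_agreement(answers))
```

A five-true, five-false split like the one Cicek describes gives a majority agreement of 0.5, the lowest value possible for a yes-or-no question.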

    AI fluency and real understanding

    The findings, published in the Rutgers Business Review, highlight the need for caution when relying on AI for important decisions, especially those that require nuanced or complex reasoning. Although generative AI can produce smooth and convincing language, it has not yet demonstrated a matching level of conceptual understanding.

    According to Cicek, these results suggest that artificial general intelligence that can truly “think” may still be further away than many expect.

    “Current AI tools can’t understand the world the way we do. They don’t have a ‘brain’,” Cicek said. “They just memorize it and can give you some insight, but they don’t understand what they’re saying.”

    Research design and methods

    Cicek collaborated with co-authors Sevincgul Ulu of Southern Illinois University, Can Uslay of Rutgers University, and Kate Karniouchina of Northeastern University.

    The team used 719 hypotheses from scientific studies published in business journals since 2021. These types of questions are often nuanced, and multiple factors influence whether a hypothesis is supported. Reducing such complexity to a simple true-or-false judgment requires careful reasoning.

    The researchers tested the free version of ChatGPT-3.5 in 2024 and the updated ChatGPT-5 mini in 2025. Overall, performance was similar for both versions. After adjusting for the 50% probability of guessing correctly at random, the AI performed only about 60% better than chance in both years.

    Key weaknesses in AI reasoning

    The results point to a fundamental limitation of AI systems built on large language models. Although they can produce fluent and persuasive responses, they often struggle to reason through complex questions. This can lead to answers that seem convincing but are actually wrong, Cicek said.

    Why experts caution about AI

    Based on these findings, the researchers recommend that business leaders verify AI-generated information and approach it with a degree of skepticism. They also highlight the need for training so that people better understand what AI systems can and cannot do effectively.

    Although the study focused specifically on ChatGPT, Cicek noted that similar experiments using other AI tools have yielded comparable results. The study also builds on earlier research urging caution about AI hype. A 2024 national study found that consumers are less likely to purchase a product if it is marketed with an AI focus.

    “Always be skeptical,” he said. “I’m not against AI. I’m using AI. But we have to be very careful.”


