Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Estrogen levels may influence the brain’s response to psychedelics, new animal study shows

    May 15, 2026

    Study: PSA test likely reduces risk of death from prostate cancer

    May 15, 2026

    Musicians show a small but steady advantage in sustained attention from childhood to adulthood

    May 14, 2026
    Facebook X (Twitter) Instagram
    Facebook X (Twitter) Instagram
    Health Magazine
    • Home
    • Environmental Health
    • Health Technology
    • Medical Research
    • Mental Health
    • Nutrition Science
    • Pharma
    • Public Health
    • Discover
      • Daily Health Tips
      • Financial Health & Stability
      • Holistic Health & Wellness
      • Mental Health
      • Nutrition & Dietary Trends
      • Professional & Personal Growth
    • Our Mission
    Health Magazine
    Home » News » Study finds ChatGPT gets science wrong more often than you think
    Nutrition Science

    Study finds ChatGPT gets science wrong more often than you think

    healthadminBy healthadminMarch 18, 2026No Comments4 Mins Read
    Study finds ChatGPT gets science wrong more often than you think
    Share
    Facebook Twitter Reddit Telegram Pinterest Email


    Washington State University professor Mesut Cicek and his research team fed ChatGPT with hypotheses from scientific papers and repeated the tests. The goal was to see if the AI ​​could correctly determine whether each claim was supported by research, meaning it was true or false.

    In total, the team evaluated more than 700 hypotheses and asked the same question 10 times for each hypothesis to measure consistency.

    Accuracy results and AI performance limits

    When first tested in 2024, ChatGPT got it right 76.5% of the time. A follow-up test in 2025 saw a slight increase in accuracy to 80%. But when the researchers adjusted for random guesses, the results became less impressive. The AI ​​performed only about 60% better than chance, which was close to a D below strong reliability.

    This system had the hardest time identifying false statements, correctly labeling them only 16.4% of the time. It also showed notable inconsistencies. Even when given the exact same prompt 10 times, ChatGPT produced a consistent answer only 73% of the time.

    Inconsistent answers raise concerns

    “We’re not just talking about accuracy, we’re talking about inconsistency, because if you ask the same question over and over again, you’re going to get different answers,” said Cicek, associate professor in the Department of Marketing and International Business in the WSU Carson College of Business and lead author of the new book.

    “We used 10 prompts with the exact same question, all the same. The answer is true. Then it says false. True, false, false, true. There were some cases where there were five true and five false.”

    AI fluency and real understanding

    The survey results are rutgers business reviewhighlights the need for caution when relying on AI for important decisions, especially those that require nuanced or complex reasoning. Although generative AI can generate smooth and convincing language, it has not yet demonstrated the same level of conceptual understanding.

    According to Cicek, these results suggest that artificial general intelligence that can truly “think” may still be further away than many expect.

    “Current AI tools can’t understand the world the way we do. They don’t have a ‘brain’,” Cicek says. “They just memorize it and can give you some insight, but they don’t understand what they’re saying.”

    Research design and methods

    Cicek collaborated with co-authors Sevincgul Ulu of Southern Illinois University, Can Uslay of Rutgers University, and Kate Karniouchina of Northeastern University.

    The team used 719 hypotheses from scientific studies published in business journals since 2021. These types of questions are often nuanced, and multiple factors influence whether a hypothesis is supported. Reducing such complexity to simple truth-or-false judgments requires careful reasoning.

    Researchers tested the free version of ChatGPT-3.5 in 2024 and the updated ChatGPT-5 mini in 2025. Overall, performance remained similar for both versions. After adjusting for a random probability of 50% correct, AI effectiveness was only about 60% above chance in both years.

    Key weaknesses of AI inference

    This result points to a fundamental limitation of large-scale language model AI systems. Although they can produce fluent and persuasive responses, they often struggle to understand complex questions logically. This can lead to answers that seem convincing but are actually wrong, Cicek said.

    Why experts caution about AI

    Based on these findings, the researchers recommend that business leaders examine AI-generated information and approach it with a degree of skepticism. They also highlight the need for training to better understand what AI systems can and cannot do effectively.

    Although the study focused specifically on ChatGPT, Cicek noted that similar experiments using other AI tools have yielded comparable results. This study also builds on previous research that points to caution against AI hype. A 2024 national study found that consumers are less likely to purchase a product if it is marketed with an AI focus.

    “Always be skeptical,” he said. “I’m not against AI. I’m using AI. But we have to be very careful.”



    Source link

    Visited 15 times, 1 visit(s) today
    Share. Facebook Twitter Pinterest LinkedIn Telegram Reddit Email
    Previous ArticleSartorius unveils next-generation platform to increase production efficiency of cell therapy drugs
    Next Article Even JWST can’t see through the giant fog of this planet.
    healthadmin

    Related Posts

    After 100 years, scientists finally uncover the hidden laws behind cosmic rays

    May 14, 2026

    Huge ‘stealth’ magma surge triggers thousands of earthquakes beneath Atlantic island

    May 14, 2026

    Scientists say taking a daily multivitamin may slow aging

    May 14, 2026

    Giant squid discovery reveals hidden deep-sea world off the coast of Australia

    May 14, 2026

    Organic molecules discovered in 66-million-year-old dinosaur bones shake up paleontology

    May 14, 2026

    Scientists discover strange way CO2 cools parts of Earth’s atmosphere

    May 14, 2026
    Add A Comment

    Comments are closed.

    Categories

    • Daily Health Tips
    • Discover
    • Environmental Health
    • Exercise & Fitness
    • Featured
    • Featured Videos
    • Financial Health & Stability
    • Fitness
    • Fitness Updates
    • Health
    • Health Technology
    • Healthy Aging
    • Healthy Living
    • Holistic Healing
    • Holistic Health & Wellness
    • Medical Research
    • Medical Research & Insights
    • Mental Health
    • Mental Wellness
    • Natural Remedies
    • New Workouts
    • Nutrition
    • Nutrition & Dietary Trends
    • Nutrition & Superfoods
    • Nutrition Science
    • Pharma
    • Preventive Healthcare
    • Professional & Personal Growth
    • Public Health
    • Public Health & Awareness
    • Selected
    • Sleep & Recovery
    • Top Programs
    • Weight Management
    • Workouts
    Popular Posts
    • 1773313737_bacteria_-_Sebastian_Kaulitzki_46826fb7971649bfaca04a9b4cef3309-620x480.jpgHow Sino Biological ProPure™ redefines ultra-low… March 12, 2026
    • the-pros-and-cons-of-paleo-dietsThe Pros and Cons of Paleo Diets: What Science Really Says April 16, 2025
    • pexels-david-bartus-442116The food industry needs to act now to cut greenhouse… January 2, 2022
    • 1773729862_TagImage-3347-458389964760995353448-620x480.jpgDespite safety concerns, parents underestimate the… March 17, 2026
    • Improve Mental Health10 Science-Backed Practices to Improve Mental Health… March 11, 2025
    • 1773209206_futuristic_techno_design_on_background_of_supercomputer_data_center_-_Image_-_Timofeev_Vladimir_M1_4.jpegMulti-agent AI systems outperform single models… March 11, 2026

    Demo
    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo
    Don't Miss

    Estrogen levels may influence the brain’s response to psychedelics, new animal study shows

    By healthadminMay 15, 2026

    Psilocybin induces different behavioral responses depending on the age of the rat and the reproductive…

    Study: PSA test likely reduces risk of death from prostate cancer

    May 15, 2026

    Musicians show a small but steady advantage in sustained attention from childhood to adulthood

    May 14, 2026

    Supreme Court upholds access to mifepristone while litigation continues

    May 14, 2026

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    HealthxMagazine
    HealthxMagazine

    At HealthX Magazine, we are dedicated to empowering entrepreneurs, doctors, chiropractors, healthcare professionals, personal trainers, executives, thought leaders, and anyone striving for optimal health.

    Our Picks

    Supreme Court upholds access to mifepristone while litigation continues

    May 14, 2026

    Making instant judgments about dating apps can hurt your sense of worth as a partner.

    May 14, 2026

    Eli Lilly contributes $50 million to UNICEF’s childhood health initiatives

    May 14, 2026
    New Comments
      Facebook X (Twitter) Instagram Pinterest
      • Home
      • Privacy Policy
      • Our Mission
      © 2026 ThemeSphere. Designed by ThemeSphere.

      Type above and press Enter to search. Press Esc to cancel.