
    There are more AI health tools than ever—but how well do they work?

    By ProfitlyAI · March 30, 2026 · 3 min read


    Singhal, the OpenAI health lead, notes that the company's current GPT-5 series of models, which had not yet been released when the original HealthBench study was conducted, do a much better job of soliciting additional information than their predecessors. However, OpenAI has reported that GPT-5.4, the current flagship, is actually worse at seeking context than GPT-5.2, an earlier version.

    Ideally, Bean says, health chatbots would be subjected to controlled tests with human users, as they were in his study, before being released to the public. That might be a heavy lift, particularly given how fast the AI world moves and how long human studies can take. Bean's own study used GPT-4o, which came out almost a year ago and is now outdated.

    Earlier this month, Google released a study that meets Bean's standards. In the study, patients discussed medical concerns with the company's Articulate Medical Intelligence Explorer (AMIE), a medical LLM chatbot that isn't yet available to the public, before meeting with a human physician. Overall, AMIE's diagnoses were just as accurate as the physicians', and none of the conversations raised major safety concerns for researchers.

    Despite the encouraging results, Google isn't planning to release AMIE anytime soon. "While the research has advanced, there are important limitations that need to be addressed before real-world translation of systems for diagnosis and treatment, including further research into equity, fairness, and safety testing," wrote Alan Karthikesalingam, a research scientist at Google DeepMind, in an email. Google did recently reveal that Health100, a health platform it's building in partnership with CVS, will include an AI assistant powered by its flagship Gemini models, though that tool will presumably not be intended for diagnosis or treatment.

    Rodman, who led the AMIE study with Karthikesalingam, doesn't think such extensive, multiyear studies are necessarily the right approach for chatbots like ChatGPT Health and Copilot Health. "There's a lot of reasons that the clinical trial paradigm doesn't always work in generative AI," he says. "And that's where this benchmarking conversation comes in. Are there benchmarks [from] a trusted third party that we can agree are meaningful, that the labs can hold themselves to?"

    The key there is "third party." No matter how extensively companies evaluate their own products, it's tough to trust their conclusions completely. Not only does a third-party evaluation bring impartiality, but if there are many third parties involved, it also helps protect against blind spots.

    OpenAI's Singhal says he's strongly in favor of external evaluation. "We try our best to support the community," he says. "Part of why we put out HealthBench was actually to give the community and other model developers an example of what a good evaluation looks like."

    Given how expensive it is to produce a high-quality evaluation, he says, he's skeptical that any individual academic laboratory would be able to produce what he calls "the one evaluation to rule them all." But he does speak highly of efforts that academic groups have made to bring preexisting and novel evaluations together into comprehensive evaluation suites, such as Stanford's MedHELM framework, which tests models on a wide variety of medical tasks. Currently, OpenAI's GPT-5 holds the highest MedHELM score.


