Close Menu
    Trending
    • Machine Learning at Scale: Managing More Than One Model in Production
    • Improving AI models’ ability to explain their predictions | MIT News
    • Write C Code Without Learning C: The Magic of PythoC
    • LatentVLA: Latent Reasoning Models for Autonomous Driving
    • Understanding Context and Contextual Retrieval in RAG
    • The AI Bubble Has a Data Science Escape Hatch
    • Is the Pentagon allowed to surveil Americans with AI?
    • What Makes Quantum Machine Learning “Quantum”?
    ProfitlyAI
    • Home
    • Latest News
    • AI Technology
    • Latest AI Innovations
    • AI Tools & Technologies
    • Artificial Intelligence
    ProfitlyAI
    Home » Improving AI models’ ability to explain their predictions | MIT News
    Artificial Intelligence

    Improving AI models’ ability to explain their predictions | MIT News

    ProfitlyAIBy ProfitlyAIMarch 9, 2026No Comments6 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    Share
    Facebook Twitter LinkedIn Pinterest Email

    In high-stakes settings like medical diagnostics, customers usually need to know what led a pc imaginative and prescient mannequin to make a sure prediction, to allow them to decide whether or not to belief its output.

    Idea bottleneck modeling is one technique that permits synthetic intelligence methods to elucidate their decision-making course of. These strategies power a deep-learning mannequin to make use of a set of ideas, which will be understood by people, to make a prediction. In new analysis, MIT laptop scientists developed a technique that coaxes the mannequin to attain higher accuracy and clearer, extra concise explanations.

    The ideas the mannequin makes use of are often outlined prematurely by human specialists. As an illustration, a clinician might recommend using ideas like “clustered brown dots” and “variegated pigmentation” to foretell {that a} medical picture exhibits melanoma.

    However beforehand outlined ideas may very well be irrelevant or lack ample element for a particular activity, decreasing the mannequin’s accuracy. The brand new technique extracts ideas the mannequin has already discovered whereas it was educated to carry out that individual activity, and forces the mannequin to make use of these, producing higher explanations than customary idea bottleneck fashions.

    The strategy makes use of a pair of specialised machine-learning fashions that routinely extract information from a goal mannequin and translate it into plain-language ideas. In the long run, their approach can convert any pretrained laptop imaginative and prescient mannequin into one that may use ideas to elucidate its reasoning.

    “In a way, we wish to have the ability to learn the minds of those laptop imaginative and prescient fashions. An idea bottleneck mannequin is a technique for customers to inform what the mannequin is pondering and why it made a sure prediction. As a result of our technique makes use of higher ideas, it could possibly result in greater accuracy and finally enhance the accountability of black-box AI fashions,” says lead creator Antonio De Santis, a graduate pupil at Polytechnic College of Milan who accomplished this analysis whereas a visiting graduate pupil within the Laptop Science and Synthetic Intelligence Laboratory (CSAIL) at MIT.

    He’s joined on a paper about the work by Schrasing Tong SM ’20, PhD ’26; Marco Brambilla, professor of laptop science and engineering at Polytechnic College of Milan; and senior creator Lalana Kagal, a principal analysis scientist in CSAIL. The analysis can be introduced on the Worldwide Convention on Studying Representations.

    Constructing a greater bottleneck

    Idea bottleneck fashions (CBMs) are a preferred strategy for bettering AI explainability. These strategies add an intermediate step by forcing a pc imaginative and prescient mannequin to foretell the ideas current in a picture, then use these ideas to make a ultimate prediction.

    This intermediate step, or “bottleneck,” helps customers perceive the mannequin’s reasoning.

    For instance, a mannequin that identifies hen species might choose ideas like “yellow legs” and “blue wings” earlier than predicting a barn swallow.

    However as a result of these ideas are sometimes generated prematurely by people or giant language fashions (LLMs), they won’t match the precise activity. As well as, even when given a set of pre-defined ideas, the mannequin generally makes use of undesirable discovered data anyway, which is an issue generally known as data leakage.

    “These fashions are educated to maximise efficiency, so the mannequin may secretly use ideas we’re unaware of,” De Santis explains.

    The MIT researchers had a unique concept: For the reason that mannequin has been educated on an enormous quantity of knowledge, it might have discovered the ideas wanted to generate correct predictions for the actual activity at hand. They sought to construct a CBM by extracting this current information and changing it into textual content a human can perceive.

    In step one of their technique, a specialised deep-learning mannequin known as a sparse autoencoder selectively takes probably the most related options the mannequin discovered and reconstructs them right into a handful of ideas. Then, a multimodal LLM describes every idea in plain language.

    This multimodal LLM additionally annotates photos within the dataset by figuring out which ideas are current and absent in every picture. The researchers use this annotated dataset to coach an idea bottleneck module to acknowledge the ideas.

    They incorporate this module into the goal mannequin, forcing it to make predictions utilizing solely the set of discovered ideas the researchers extracted.

    Controlling the ideas

    They overcame many challenges as they developed this technique, from making certain the LLM annotated ideas appropriately to figuring out whether or not the sparse autoencoder had recognized human-understandable ideas.

    To stop the mannequin from utilizing unknown or undesirable ideas, they limit it to make use of solely 5 ideas for every prediction. This additionally forces the mannequin to decide on probably the most related ideas and makes the reasons extra comprehensible.

    After they in contrast their strategy to state-of-the-art CBMs on duties like predicting hen species and figuring out pores and skin lesions in medical photos, their technique achieved the very best accuracy whereas offering extra exact explanations.

    Their strategy additionally generated ideas that have been extra relevant to the photographs within the dataset. 

    “We’ve proven that extracting ideas from the unique mannequin can outperform different CBMs, however there’s nonetheless a tradeoff between interpretability and accuracy that must be addressed. Black-box fashions that aren’t interpretable nonetheless outperform ours,” De Santis says.

    Sooner or later, the researchers need to research potential options to the data leakage drawback, maybe by including further idea bottleneck modules so undesirable ideas can’t leak by. In addition they plan to scale up their technique through the use of a bigger multimodal LLM to annotate an even bigger coaching dataset, which might enhance efficiency.

    “I’m excited by this work as a result of it pushes interpretable AI in a really promising route and creates a pure bridge to symbolic AI and information graphs,” says Andreas Hotho, professor and head of the Knowledge Science Chair on the College of Würzburg, who was not concerned with this work. “By deriving idea bottlenecks from the mannequin’s personal inner mechanisms reasonably than solely from human-defined ideas, it gives a path towards explanations which might be extra trustworthy to the mannequin and opens many alternatives for follow-up work with structured information.”

    This analysis was supported by the Progetto Rocca Doctoral Fellowship, the Italian Ministry of College and Analysis underneath the Nationwide Restoration and Resilience Plan, Thales Alenia House, and the European Union underneath the NextGenerationEU undertaking.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleWrite C Code Without Learning C: The Magic of PythoC
    Next Article Machine Learning at Scale: Managing More Than One Model in Production
    ProfitlyAI
    • Website

    Related Posts

    Artificial Intelligence

    Machine Learning at Scale: Managing More Than One Model in Production

    March 9, 2026
    Artificial Intelligence

    Write C Code Without Learning C: The Magic of PythoC

    March 8, 2026
    Artificial Intelligence

    LatentVLA: Latent Reasoning Models for Autonomous Driving

    March 8, 2026
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    Chatbots are surprisingly effective at debunking conspiracy theories

    October 30, 2025

    5 Essential Questions to Ask Before Outsourcing Healthcare Data Labeling

    April 9, 2025

    Coding the Pong Game from Scratch in Python

    February 27, 2026

    A “scientific sandbox” lets researchers explore the evolution of vision systems | MIT News

    December 18, 2025

    OpenAI is launching a version of ChatGPT for college students

    July 29, 2025
    Categories
    • AI Technology
    • AI Tools & Technologies
    • Artificial Intelligence
    • Latest AI Innovations
    • Latest News
    Most Popular

    How to Reduce Your Power BI Model Size by 90%

    May 26, 2025

    Automated Testing: A Software Engineering Concept Data Scientists Must Know To Succeed

    July 30, 2025

    TDS Newsletter: Is It Time to Revisit RAG?

    January 17, 2026
    Our Picks

    Machine Learning at Scale: Managing More Than One Model in Production

    March 9, 2026

    Improving AI models’ ability to explain their predictions | MIT News

    March 9, 2026

    Write C Code Without Learning C: The Magic of PythoC

    March 8, 2026
    Categories
    • AI Technology
    • AI Tools & Technologies
    • Artificial Intelligence
    • Latest AI Innovations
    • Latest News
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us
    Copyright © 2025 ProfitlyAI All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.