Close Menu
    Trending
    • OpenAIs nya webbläsare ChatGPT Atlas
    • Creating AI that matters | MIT News
    • Scaling Recommender Transformers to a Billion Parameters
    • Hidden Gems in NumPy: 7 Functions Every Data Scientist Should Know
    • Is RAG Dead? The Rise of Context Engineering and Semantic Layers for Agentic AI
    • ChatGPT Gets More Personal. Is Society Ready for It?
    • Why the Future Is Human + Machine
    • Why AI Is Widening the Gap Between Top Talent and Everyone Else
    ProfitlyAI
    • Home
    • Latest News
    • AI Technology
    • Latest AI Innovations
    • AI Tools & Technologies
    • Artificial Intelligence
    ProfitlyAI
    Home » A new model predicts how molecules will dissolve in different solvents | MIT News
    Artificial Intelligence

    A new model predicts how molecules will dissolve in different solvents | MIT News

    ProfitlyAIBy ProfitlyAIAugust 19, 2025No Comments7 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Utilizing machine studying, MIT chemical engineers have created a computational mannequin that may predict how nicely any given molecule will dissolve in an natural solvent — a key step within the synthesis of practically any pharmaceutical. The sort of prediction may make it a lot simpler to develop new methods to provide medication and different helpful molecules.

    The brand new mannequin, which predicts how a lot of a solute will dissolve in a selected solvent, ought to assist chemists to decide on the suitable solvent for any given response of their synthesis, the researchers say. Widespread natural solvents embody ethanol and acetone, and there are a whole bunch of others that will also be utilized in chemical reactions.

    “Predicting solubility actually is a rate-limiting step in artificial planning and manufacturing of chemical substances, particularly medication, so there’s been a longstanding curiosity in with the ability to make higher predictions of solubility,” says Lucas Attia, an MIT graduate pupil and one of many lead authors of the brand new research.

    The researchers have made their model freely out there, and plenty of firms and labs have already began utilizing it. The mannequin may very well be notably helpful for figuring out solvents which are much less hazardous than a number of the mostly used industrial solvents, the researchers say.

    “There are some solvents that are identified to dissolve most issues. They’re actually helpful, however they’re damaging to the surroundings, and so they’re damaging to folks, so many firms require that it’s a must to reduce the quantity of these solvents that you just use,” says Jackson Burns, an MIT graduate pupil who can be a lead writer of the paper. “Our mannequin is extraordinarily helpful in with the ability to determine the next-best solvent, which is hopefully a lot much less damaging to the surroundings.”

    William Inexperienced, the Hoyt Hottel Professor of Chemical Engineering and director of the MIT Vitality Initiative, is the senior writer of the research, which seems at the moment in Nature Communications. Patrick Doyle, the Robert T. Haslam Professor of Chemical Engineering, can be an writer of the paper.

    Fixing solubility

    The brand new mannequin grew out of a venture that Attia and Burns labored on collectively in an MIT course on making use of machine studying to chemical engineering issues. Historically, chemists have predicted solubility with a software often known as the Abraham Solvation Mannequin, which can be utilized to estimate a molecule’s general solubility by including up the contributions of chemical constructions inside the molecule. Whereas these predictions are helpful, their accuracy is proscribed.

    Up to now few years, researchers have begun utilizing machine studying to attempt to make extra correct solubility predictions. Earlier than Burns and Attia started engaged on their new mannequin, the state-of-the-art mannequin for predicting solubility was a mannequin developed in Inexperienced’s lab in 2022.

    That mannequin, often known as SolProp, works by predicting a set of associated properties and mixing them, utilizing thermodynamics, to in the end predict the solubility. Nevertheless, the mannequin has issue predicting solubility for solutes that it hasn’t seen earlier than.

    “For drug and chemical discovery pipelines the place you’re growing a brand new molecule, you need to have the ability to predict forward of time what its solubility appears like,” Attia says.

    A part of the explanation that current solubility fashions haven’t labored nicely is as a result of there wasn’t a complete dataset to coach them on. Nevertheless, in 2023 a brand new dataset known as BigSolDB was launched, which compiled knowledge from practically 800 printed papers, together with info on solubility for about 800 molecules dissolved about greater than 100 natural solvents which are generally utilized in artificial chemistry.

    Attia and Burns determined to strive coaching two various kinds of fashions on this knowledge. Each of those fashions characterize the chemical constructions of molecules utilizing numerical representations often known as embeddings, which incorporate info such because the variety of atoms in a molecule and which atoms are sure to which different atoms. Fashions can then use these representations to foretell quite a lot of chemical properties.

    One of many fashions used on this research, often known as FastProp and developed by Burns and others in Inexperienced’s lab, incorporates “static embeddings.” Which means that the mannequin already is aware of the embedding for every molecule earlier than it begins doing any form of evaluation.

    The opposite mannequin, ChemProp, learns an embedding for every molecule throughout the coaching, on the identical time that it learns to affiliate the options of the embedding with a trait reminiscent of solubility. This mannequin, developed throughout a number of MIT labs, has already been used for duties reminiscent of antibiotic discovery, lipid nanoparticle design, and predicting chemical response charges.

    The researchers skilled each sorts of fashions on over 40,000 knowledge factors from BigSolDB, together with info on the results of temperature, which performs a big function in solubility. Then, they examined the fashions on about 1,000 solutes that had been withheld from the coaching knowledge. They discovered that the fashions’ predictions have been two to 3 instances extra correct than these of SolProp, the earlier finest mannequin, and the brand new fashions have been particularly correct at predicting variations in solubility on account of temperature.

    “With the ability to precisely reproduce these small variations in solubility on account of temperature, even when the overarching experimental noise may be very massive, was a very optimistic signal that the community had accurately discovered an underlying solubility prediction operate,” Burns says.

    Correct predictions

    The researchers had anticipated that the mannequin based mostly on ChemProp, which is ready to be taught new representations because it goes alongside, would be capable of make extra correct predictions. Nevertheless, to their shock, they discovered that the 2 fashions carried out primarily the identical. That means that the principle limitation on their efficiency is the standard of the information, and that the fashions are performing in addition to theoretically attainable based mostly on the information that they’re utilizing, the researchers say.

    “ChemProp ought to all the time outperform any static embedding when you might have adequate knowledge,” Burns says. “We have been blown away to see that the static and discovered embeddings have been statistically indistinguishable in efficiency throughout all of the completely different subsets, which signifies to us that that the information limitations which are current on this area dominated the mannequin efficiency.”

    The fashions may develop into extra correct, the researchers say, if higher coaching and testing knowledge have been out there — ideally, knowledge obtained by one individual or a gaggle of individuals all skilled to carry out the experiments the identical method.

    “One of many large limitations of utilizing these sorts of compiled datasets is that completely different labs use completely different strategies and experimental situations once they carry out solubility checks. That contributes to this variability between completely different datasets,” Attia says.

    As a result of the mannequin based mostly on FastProp makes its predictions sooner and has code that’s simpler for different customers to adapt, the researchers determined to make that one, often known as FastSolv, out there to the general public. A number of pharmaceutical firms have already begun utilizing it.

    “There are functions all through the drug discovery pipeline,” Burns says. “We’re additionally excited to see, outdoors of formulation and drug discovery, the place folks could use this mannequin.”

    The analysis was funded, partially, by the U.S. Division of Vitality.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleShould AI flatter us, fix us, or just inform us?
    Next Article Gamers Nexus avslöjar omfattande GPU-smugglingsimperium från Kina
    ProfitlyAI
    • Website

    Related Posts

    Artificial Intelligence

    Creating AI that matters | MIT News

    October 21, 2025
    Artificial Intelligence

    Scaling Recommender Transformers to a Billion Parameters

    October 21, 2025
    Artificial Intelligence

    Hidden Gems in NumPy: 7 Functions Every Data Scientist Should Know

    October 21, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    Do ChatGPT Prompts Aimed at Avoiding AI Detection Work?

    April 3, 2025

    Visa and Mastercard Just Gave AI the Power to Shop and Pay for You

    May 1, 2025

    What’s next for AI and math

    June 4, 2025

    Melding data, systems, and society | MIT News

    June 10, 2025

    GraphRAG in Action: A Simple Agent for Know-Your-Customer Investigations

    July 3, 2025
    Categories
    • AI Technology
    • AI Tools & Technologies
    • Artificial Intelligence
    • Latest AI Innovations
    • Latest News
    Most Popular

    Taking ResNet to the Next Level

    July 3, 2025

    What It Means and Where It’s Headed

    April 10, 2025

    The Westworld Blunder | Towards Data Science

    May 13, 2025
    Our Picks

    OpenAIs nya webbläsare ChatGPT Atlas

    October 22, 2025

    Creating AI that matters | MIT News

    October 21, 2025

    Scaling Recommender Transformers to a Billion Parameters

    October 21, 2025
    Categories
    • AI Technology
    • AI Tools & Technologies
    • Artificial Intelligence
    • Latest AI Innovations
    • Latest News
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us
    Copyright © 2025 ProfitlyAI All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.