Close Menu
    Trending
    • Building Robust Credit Scoring Models (Part 3)
    • How to Measure AI Value
    • What’s the right path for AI? | MIT News
    • MIT and Hasso Plattner Institute establish collaborative hub for AI and creativity | MIT News
    • OpenAI is throwing everything into building a fully automated researcher
    • Agentic RAG Failure Modes: Retrieval Thrash, Tool Storms, and Context Bloat (and How to Spot Them Early)
    • The Basics of Vibe Engineering
    • Beyond Prompt Caching: 5 More Things You Should Cache in RAG Pipelines
    ProfitlyAI
    • Home
    • Latest News
    • AI Technology
    • Latest AI Innovations
    • AI Tools & Technologies
    • Artificial Intelligence
    ProfitlyAI
    Home » OpenAI is throwing everything into building a fully automated researcher
    AI Technology

    OpenAI is throwing everything into building a fully automated researcher

    ProfitlyAIBy ProfitlyAIMarch 20, 2026No Comments3 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    Share
    Facebook Twitter LinkedIn Pinterest Email


    For Pachocki, that’s a transparent Sure. Actually, he thinks it’s only a matter of pushing forward on the trail we’re already on. A easy increase in all-round functionality additionally results in fashions working for longer with out assist, he says. He factors to the leap from 2020’s GPT-3 to 2023’s GPT-4, two of OpenAI’s earlier fashions. GPT-4 was capable of work on an issue for much longer than its predecessor, even with out specialised coaching, he says. 

    So-called reasoning fashions introduced one other bump. Coaching LLMs to work by means of issues step-by-step, backtracking after they make a mistake or hit a useless finish, has additionally made fashions higher at working for longer durations of time. And Pachocki is satisfied that OpenAI’s reasoning fashions will proceed to get higher.

    However OpenAI can be coaching its techniques to work by themselves for longer by feeding them particular samples of advanced duties, reminiscent of onerous puzzles taken from math and coding contests, which pressure fashions to discover ways to do issues like preserve monitor of very massive chunks of textual content and cut up issues up into (after which handle) a number of subtasks.

    The goal isn’t to construct fashions that simply win math competitions. “That allows you to show that the expertise works earlier than you join it to the true world,” says Pachocki. “If we actually wished to, we might construct a tremendous automated mathematician, we’ve got all of the instruments, and I believe it might be comparatively simple. However it’s not one thing we’ll prioritize now as a result of, you recognize, on the level the place you consider you are able to do it, there’s way more pressing issues to do.”

    “We’re way more centered now on analysis that’s related in the true world,” he provides.

    Proper now meaning taking what Codex (and instruments prefer it) can do with coding and making an attempt to use that to problem-solving on the whole. “There’s an enormous change occurring, particularly in programming,” he says. “Our jobs at the moment are completely completely different than they had been even a 12 months in the past. No one actually edits code on a regular basis anymore. As a substitute, you handle a gaggle of Codex brokers.” If Codex can resolve coding issues (the argument goes), it may well resolve any drawback.

    The road all the time goes up

    It’s true that OpenAI has had a handful of outstanding successes in the previous couple of months. Researchers have used GPT-5 (the LLM that powers Codex) to find new options to various unsolved math issues and punch by means of obvious useless ends in a handful of biology, chemistry and physics puzzles.   

    “Simply these fashions arising with concepts that might take most PhD weeks, at the least, makes me count on that we’ll see way more acceleration coming from this expertise within the close to future,” Pachocki says.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleAgentic RAG Failure Modes: Retrieval Thrash, Tool Storms, and Context Bloat (and How to Spot Them Early)
    Next Article MIT and Hasso Plattner Institute establish collaborative hub for AI and creativity | MIT News
    ProfitlyAI
    • Website

    Related Posts

    AI Technology

    DataRobot + Nebius: An enterprise-ready AI Factory optimized for agents

    March 18, 2026
    AI Technology

    The Pentagon is making plans for AI companies to train on classified data, defense official says

    March 17, 2026
    AI Technology

    Identity-first AI governance: Securing the agentic workforce

    March 16, 2026
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    “Robot, make me a chair” | MIT News

    December 16, 2025

    Building India’s Largest Open-Source Speech Dataset

    February 12, 2026

    Gemini är nu en universal translator

    December 15, 2025

    Gamers Nexus avslöjar omfattande GPU-smugglingsimperium från Kina

    August 19, 2025

    Learning Triton One Kernel at a Time: Softmax

    November 23, 2025
    Categories
    • AI Technology
    • AI Tools & Technologies
    • Artificial Intelligence
    • Latest AI Innovations
    • Latest News
    Most Popular

    The Absolute Beginner’s Guide to Pandas DataFrames

    November 17, 2025

    Robotic helper making mistakes? Just nudge it in the right direction | MIT News

    April 5, 2025

    Forecast demand with precision using advanced AI for SAP IBP

    April 30, 2025
    Our Picks

    Building Robust Credit Scoring Models (Part 3)

    March 20, 2026

    How to Measure AI Value

    March 20, 2026

    What’s the right path for AI? | MIT News

    March 20, 2026
    Categories
    • AI Technology
    • AI Tools & Technologies
    • Artificial Intelligence
    • Latest AI Innovations
    • Latest News
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us
    Copyright © 2025 ProfitlyAI All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.