Close Menu
    Trending
    • How to Automate Workflows with AI
    • TDS Newsletter: How Compelling Data Stories Lead to Better Business Decisions
    • I Measured Neural Network Training Every 5 Steps for 10,000 Iterations
    • “The success of an AI product depends on how intuitively users can interact with its capabilities”
    • How to Crack Machine Learning System-Design Interviews
    • Music, Lyrics, and Agentic AI: Building a Smart Song Explainer using Python and OpenAI
    • An Anthropic Merger, “Lying,” and a 52-Page Memo
    • Apple’s $1 Billion Bet on Google Gemini to Fix Siri
    ProfitlyAI
    • Home
    • Latest News
    • AI Technology
    • Latest AI Innovations
    • AI Tools & Technologies
    • Artificial Intelligence
    ProfitlyAI
    Home » DeepSeek may have found a new way to improve AI’s ability to remember
    AI Technology

    DeepSeek may have found a new way to improve AI’s ability to remember

    ProfitlyAIBy ProfitlyAIOctober 29, 2025No Comments2 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    Share
    Facebook Twitter LinkedIn Pinterest Email


    At present, most giant language fashions break textual content down into 1000’s of tiny items referred to as tokens. This turns the textual content into representations that fashions can perceive. Nevertheless, these tokens shortly grow to be costly to retailer and compute with as conversations with finish customers develop longer. When a consumer chats with an AI for prolonged durations, this problem may cause the AI to neglect issues the consumer has already instructed it and get data muddled, an issue some name “context rot.”

    The brand new strategies developed by DeepSeek (and printed in its latest paper) may assist to beat this problem. As an alternative of storing phrases as tokens, its system packs written data into picture type, virtually as if it’s taking an image of pages from a e-book. This enables the mannequin to retain almost the identical data whereas utilizing far fewer tokens, the researchers discovered. 

    Basically, the OCR mannequin is a testbed for these new strategies that let extra data to be packed into AI fashions extra effectively. 

    Moreover utilizing visible tokens as an alternative of simply textual content ones, the mannequin is constructed on a sort of tiered compression that’s not not like how human recollections fade: Older or much less essential content material is saved in a barely extra blurry type with a view to save area. Regardless of that, the paper’s authors argue that this compressed content material can nonetheless stay accessible within the background, whereas sustaining a excessive degree of system effectivity.

    Textual content tokens have lengthy been the default constructing block in AI techniques. Utilizing visible tokens as an alternative is unconventional, and because of this, DeepSeek’s mannequin is shortly capturing researchers’ consideration. Andrej Karpathy, the previous Tesla AI chief and a founding member of OpenAI, praised the paper on X, saying that pictures might in the end be higher than textual content as inputs for LLMs. Textual content tokens is likely to be “wasteful and simply horrible on the enter,” he wrote. 

    Manling Li, an assistant professor of pc science at Northwestern College, says the paper affords a brand new framework for addressing the present challenges in AI reminiscence. “Whereas the thought of utilizing image-based tokens for context storage isn’t fully new, that is the primary research I’ve seen that takes it this far and reveals it would really work,” Li says.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleDelivering the agent workforce in high-security environments
    Next Article The AI Hype Index: Data centers’ neighbors are pivoting to power blackouts
    ProfitlyAI
    • Website

    Related Posts

    AI Technology

    OpenAI’s new LLM exposes the secrets of how AI really works

    November 13, 2025
    AI Technology

    Google Deepmind is using Gemini to train agents inside Goat Simulator 3

    November 13, 2025
    AI Technology

    Improving VMware migration workflows with agentic AI

    November 12, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    Federated Learning and Custom Aggregation Schemes

    October 22, 2025

    AI tool generates high-quality images faster than state-of-the-art approaches | MIT News

    April 4, 2025

    How Computers “See” Molecules | Towards Data Science

    August 1, 2025

    Nvidia’s $5 Trillion Milestone

    November 6, 2025

    [The AI Show Episode 163]: AI Answers

    August 21, 2025
    Categories
    • AI Technology
    • AI Tools & Technologies
    • Artificial Intelligence
    • Latest AI Innovations
    • Latest News
    Most Popular

    Rethinking AI Vendor Trust: Why Ethical Partnerships Matter

    November 13, 2025

    5 Techniques to Prevent Hallucinations in Your RAG Question Answering

    September 23, 2025

    FramePack videodiffusion som kan köras på konsument-GPU:er

    April 18, 2025
    Our Picks

    How to Automate Workflows with AI

    November 15, 2025

    TDS Newsletter: How Compelling Data Stories Lead to Better Business Decisions

    November 15, 2025

    I Measured Neural Network Training Every 5 Steps for 10,000 Iterations

    November 15, 2025
    Categories
    • AI Technology
    • AI Tools & Technologies
    • Artificial Intelligence
    • Latest AI Innovations
    • Latest News
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us
    Copyright © 2025 ProfitlyAI All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.