Close Menu
    Trending
    • OpenAIs nya webbläsare ChatGPT Atlas
    • Creating AI that matters | MIT News
    • Scaling Recommender Transformers to a Billion Parameters
    • Hidden Gems in NumPy: 7 Functions Every Data Scientist Should Know
    • Is RAG Dead? The Rise of Context Engineering and Semantic Layers for Agentic AI
    • ChatGPT Gets More Personal. Is Society Ready for It?
    • Why the Future Is Human + Machine
    • Why AI Is Widening the Gap Between Top Talent and Everyone Else
    ProfitlyAI
    • Home
    • Latest News
    • AI Technology
    • Latest AI Innovations
    • AI Tools & Technologies
    • Artificial Intelligence
    ProfitlyAI
    Home » Generalists Can Also Dig Deep
    Artificial Intelligence

    Generalists Can Also Dig Deep

    ProfitlyAIBy ProfitlyAISeptember 12, 2025No Comments7 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    Share
    Facebook Twitter LinkedIn Pinterest Email


    Within the Creator Highlight collection, TDS Editors chat with members of our group about their profession path in knowledge science and AI, their writing, and their sources of inspiration. Immediately, we’re thrilled to share our dialog with Ida Silfverskiöld.

    Ida is a generalist, educated as an economist and self-taught in software program engineering. She has knowledgeable background in product and advertising administration, which suggests she has a uncommon mix of product, advertising and improvement expertise. Over the previous few years, she’s been instructing and constructing within the LLM, NLP, and pc imaginative and prescient area, digging into areas corresponding to agentic AI, chain‑of‑thought methods, and the economics of internet hosting fashions.


    You studied economics, then realized to code and moved by product, development, and now hands-on AI constructing. What perspective does that generalist path provide you with that specialists typically miss?

    I’m unsure. 

    Folks see generalists as having shallow data, however generalists may dig deep. 

    I see generalists as individuals with a number of pursuits and a drive to know the entire, not only one half. As a generalist you take a look at the tech, the shopper, the info, the market, the price of the structure, and so forth. It offers you an edge to maneuver throughout matters and nonetheless do good work. 

    I’m not saying specialists can’t do that, however generalists are likely to adapt sooner as a result of they’re used to selecting issues up shortly.

    You’ve been writing so much about agentic techniques currently. When do “brokers” truly outperform easier LLM + RAG patterns, and when are we overcomplicating issues?

    It relies on the use case, however usually we throw AI into numerous issues that most likely don’t want it. For those who can management the system programmatically, it’s best to. LLMs are nice for translating human language into one thing a pc can perceive, however additionally they introduce unpredictability.

    As for RAG, including an agent means including prices, so doing it only for the sake of getting an agent isn’t a fantastic concept. You’ll be able to work round it by utilizing smaller fashions as routers (however this provides work). I’ve added an agent to a RAG system as soon as as a result of I knew there could be questions on constructing it out to additionally “act.” So once more, it relies on the use case. 

    Whenever you say Agentic AI wants “evaluations” what’s your checklist of go-to metrics? And the way do you resolve which one to make use of?

    I wouldn’t say you all the time want evals, however firms will ask for them, so it’s good to know what groups measure for product high quality. If a product can be utilized by lots of people, be sure you have some in place. I did various analysis right here to know the frameworks and metrics which have been outlined. 

    Generic metrics are most likely not sufficient although. You want a number of customized ones in your use case. So the evals differ by software. 

    For a coding copilot, you might observe what p.c of completions a developer accepts (acceptance price) and whether or not the total chat reached the objective (completeness).

    For commerce brokers, you would possibly  measure whether or not the agent picked the precise merchandise and whether or not solutions are grounded within the retailer’s knowledge.

    Safety and security associated metrics are necessary too, corresponding to bias, toxicity, and the way straightforward it’s to interrupt the system (jailbreaks, knowledge leaks).

    For RAG, see my article the place I break down the standard metrics. Personally, I’ve solely arrange metrics for RAG up to now.

    It might be fascinating to map how totally different AI apps arrange evals in an article. For instance, Shopify Sidekick for commerce brokers and different instruments corresponding to authorized analysis assistants.

    In your Agentic RAG Applications article, you constructed a Slack agent that takes firm data under consideration (with LlamaIndex and Modal). What design alternative ended up mattering greater than anticipated? 

    The retrieval half is the place you’ll get caught, particularly chunking. Whenever you work with RAG purposes, you break up the method into two. The primary half is about fetching the proper info, and getting it proper is necessary as a result of you’ll be able to’t overload an agent with an excessive amount of irrelevant info. To make it exact the chunks must be fairly small and related to the search question.

    Nonetheless, in case you make the chunks too small, you threat giving the LLM too little context. With chunks which can be too massive, the search system might grow to be imprecise.

    I arrange a system that chunked based mostly on the kind of doc, however proper now I’ve an concept for utilizing context growth after retrieval. 

    One other design alternative you want to bear in mind is that though retrieval usually advantages from hybrid search, it is probably not sufficient. Semantic search can join issues that reply the query with out utilizing the precise wording, whereas sparse strategies can determine actual key phrases. However sparse strategies like BM25 are token-based by default, so plain BM25 received’t match substrings.

    So, in case you additionally wish to seek for substrings (a part of product IDs, that sort of factor), you might want to add a search layer that helps partial matches as effectively.

    There’s extra, however I threat this changing into a complete article if I maintain going.

    Throughout your consulting initiatives over the previous two years, what issues have come up most frequently in your shoppers, and the way do you handle them? 

    The problems I see are that the majority firms are in search of one thing customized, which is nice for consultants, however constructing in-house is riddled with complexities, particularly for individuals who haven’t carried out it earlier than. I noticed that 95% quantity from the MIT study about initiatives failing, and I’m not stunned. I feel consultants ought to get good at sure use instances the place they will shortly implement and tweak the product for shoppers, having already learnt do it. However we’ll see what occurs.

    You’ve written on TDS about so many alternative matters. The place do your article concepts come from? Shopper work, instruments you wish to attempt, or your individual experiments? And what matter or drawback is prime of thoughts for you proper now?

    A little bit of all the pieces, frankly. The articles additionally assist me floor my very own data, filling in lacking items I’ll not have researched myself but. Proper now I’m researching a bit on how smaller fashions (mid-sized, round 3B–7B) can be utilized in agent techniques, safety, and particularly enhance RAG. 

    Zooming out: what’s one non-obvious functionality groups ought to domesticate within the subsequent 12–18 months (technical or cultural) to grow to be genuinely AI-productive somewhat than simply AI-busy?

    In all probability study to construct within the area (particularly for enterprise individuals): simply getting an LLM to do one thing constantly is a method to perceive how unpredictable LLMs are. It makes you a bit extra humble. 

    To study extra about Ida‘s work and keep up-to-date together with her newest articles, you’ll be able to observe her on TDS or LinkedIn.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleHow do AI models generate videos?
    Next Article A Focused Approach to Learning SQL
    ProfitlyAI
    • Website

    Related Posts

    Artificial Intelligence

    Creating AI that matters | MIT News

    October 21, 2025
    Artificial Intelligence

    Scaling Recommender Transformers to a Billion Parameters

    October 21, 2025
    Artificial Intelligence

    Hidden Gems in NumPy: 7 Functions Every Data Scientist Should Know

    October 21, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    The End-to-End Data Scientist’s Prompt Playbook

    September 8, 2025

    Agentic AI 101: Starting Your Journey Building AI Agents

    May 2, 2025

    How to Transition From Data Analyst to Data Scientist

    June 9, 2025

    Model predicts long-term effects of nuclear waste on underground disposal systems | MIT News

    July 18, 2025

    What the Latest AI Meltdown Reveals About Alignment

    July 22, 2025
    Categories
    • AI Technology
    • AI Tools & Technologies
    • Artificial Intelligence
    • Latest AI Innovations
    • Latest News
    Most Popular

    Can we fix AI’s evaluation crisis?

    June 24, 2025

    Ethical AI Innovations for Empowering Linguistic Diversity and Economic Empowerment

    April 9, 2025

    When Physics Meets Finance: Using AI to Solve Black-Scholes

    April 18, 2025
    Our Picks

    OpenAIs nya webbläsare ChatGPT Atlas

    October 22, 2025

    Creating AI that matters | MIT News

    October 21, 2025

    Scaling Recommender Transformers to a Billion Parameters

    October 21, 2025
    Categories
    • AI Technology
    • AI Tools & Technologies
    • Artificial Intelligence
    • Latest AI Innovations
    • Latest News
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us
    Copyright © 2025 ProfitlyAI All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.