    Google DeepMind wants to know if chatbots are just virtue signaling

    By ProfitlyAI | February 18, 2026 | 3 min read

    “With coding and math, you have clear-cut, correct answers that you can check,” William Isaac, a research scientist at Google DeepMind, told me when I met him and Julia Haas, a fellow research scientist at the firm, for an exclusive preview of their work, which is published in Nature today. That’s not the case for moral questions, which typically have a range of acceptable answers: “Morality is an important capability but hard to evaluate,” says Isaac.

    “In the moral domain, there’s no right and wrong,” adds Haas. “But it’s not by any means a free-for-all. There are better answers and there are worse answers.”

    The researchers have identified several key challenges and suggested ways to address them. But it’s more a wish list than a set of ready-made solutions. “They do a nice job of bringing together different perspectives,” says Vera Demberg, who studies LLMs at Saarland University in Germany.

    A number of studies have shown that LLMs can display remarkable moral competence. One study published last year found that people in the US rated ethical advice from OpenAI’s GPT-4o as more moral, trustworthy, thoughtful, and correct than advice given by the (human) writer of “The Ethicist,” a popular New York Times advice column.

    The problem is that it’s hard to unpick whether such behaviors are a performance (mimicking a memorized response, say) or evidence that there is in fact some kind of moral reasoning taking place inside the model. In other words, is it virtue or virtue signaling?

    This question matters because multiple studies also show just how untrustworthy LLMs can be. For a start, models can be too eager to please. They have been found to flip their answer to a moral question and say the exact opposite when a person disagrees or pushes back on their first response. Worse, the answers an LLM gives to a question can change depending on how it is presented or formatted. For example, researchers have found that models quizzed about political values can give different, sometimes opposite, answers depending on whether the questions offer multiple-choice answers or instruct the model to respond in its own words.

    In an even more striking case, Demberg and her colleagues presented several LLMs, including versions of Meta’s Llama 3 and Mistral, with a series of moral dilemmas and asked them to pick which of two options was the better outcome. The researchers found that the models often reversed their choice when the labels for those two options were changed from “Case 1” and “Case 2” to “(A)” and “(B).”

    They also showed that models changed their answers in response to other tiny formatting tweaks, including swapping the order of the options and ending the question with a colon instead of a question mark.
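
To make the kind of formatting tweaks described here concrete, the following is a minimal Python sketch of how such a robustness check could be set up. It is not the researchers' code. The function names, the prompt wording, and the `ask` callback (a stand-in for whatever function queries a model and returns the label it chose) are all assumptions made for illustration.

```python
from itertools import product
from typing import Callable, Dict, List, Tuple

# Formatting choices mentioned in the article: option labels, option order,
# and the punctuation that ends the question.
LABEL_STYLES = [("Case 1", "Case 2"), ("(A)", "(B)")]
ENDINGS = ["?", ":"]


def build_variants(dilemma: str, option_x: str, option_y: str) -> List[Tuple[str, Dict[str, str]]]:
    """Return (prompt, label -> option) pairs, one per formatting variant."""
    variants = []
    for labels, swapped, ending in product(LABEL_STYLES, [False, True], ENDINGS):
        first, second = (option_y, option_x) if swapped else (option_x, option_y)
        # Hypothetical prompt template; the study's exact wording is not given.
        prompt = (
            f"{dilemma}\n"
            f"{labels[0]}: {first}\n"
            f"{labels[1]}: {second}\n"
            f"Which is the better outcome{ending}"
        )
        variants.append((prompt, {labels[0]: first, labels[1]: second}))
    return variants


def consistency(ask: Callable[[str, List[str]], str],
                dilemma: str, option_x: str, option_y: str) -> float:
    """Share of variants on which the model picks its most frequent underlying option.

    `ask(prompt, labels)` is a hypothetical callback that returns one of `labels`.
    """
    picks = []
    for prompt, label_map in build_variants(dilemma, option_x, option_y):
        chosen_label = ask(prompt, list(label_map))
        # Map the label back to the option text so reordering does not hide a flip.
        picks.append(label_map[chosen_label])
    most_common = max(set(picks), key=picks.count)
    return picks.count(most_common) / len(picks)


def position_biased(prompt: str, labels: List[str]) -> str:
    """Toy stand-in for a model that always picks whichever option is listed first."""
    return labels[0]


score = consistency(position_biased, "A runaway trolley ...", "save five", "save one")
```

A score of 1.0 means the same underlying option was chosen under every label style, ordering, and ending. The toy `position_biased` stand-in illustrates the failure mode: because it always picks the first-listed option, its choice flips whenever the order is swapped.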


