Close Menu
    Trending
    • Gemini introducerar funktionen schemalagda åtgärder i Gemini-appen
    • AIFF 2025 Runway’s tredje årliga AI Film Festival
    • AI-agenter kan nu hjälpa läkare fatta bättre beslut inom cancervård
    • Not Everything Needs Automation: 5 Practical AI Agents That Deliver Enterprise Value
    • Prescriptive Modeling Unpacked: A Complete Guide to Intervention With Bayesian Modeling.
    • 5 Crucial Tweaks That Will Make Your Charts Accessible to People with Visual Impairments
    • Why AI Projects Fail | Towards Data Science
    • The Role of Luck in Sports: Can We Measure It?
    ProfitlyAI
    • Home
    • Latest News
    • AI Technology
    • Latest AI Innovations
    • AI Tools & Technologies
    • Artificial Intelligence
    ProfitlyAI
    Home » ASR (Automatic Speech Recognition) – Definition, Use Cases, Example
    Latest News

    ASR (Automatic Speech Recognition) – Definition, Use Cases, Example

    ProfitlyAIBy ProfitlyAIApril 5, 2025No Comments4 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    Share
    Facebook Twitter LinkedIn Pinterest Email


    Automated Speech Recognition know-how has been there for a protracted haul however lately gained prominence after its use turned prevalent in varied smartphone purposes like Siri and Alexa. These AI-based smartphone purposes have illustrated the facility of ASR in simplifying on a regular basis duties for all of us.

    Moreover, as totally different trade verticals additional transfer towards automation, the underlying want for ASR is subjected to surge. Therefore, allow us to perceive this terrific speech recognition know-how in-depth and why it’s thought of one of the essential applied sciences for the long run.

    A Temporary Historical past of ASR Expertise

    Earlier than continuing forward and exploring the potential of Automated Speech Recognition, allow us to first check out its evolution.

    Decade Evolution of ASR
    Nineteen Fifties Speech Recognition know-how was first launched by Bell Laboratories within the Nineteen Fifties. The Bell Labs created a digital speech recognizer generally known as ‘Audrey’ that would establish the numbers between 1-9 when spoken by a single voice.
    Sixties In 1952, IBM launched its first voice recognition system, ‘Shoebox.’ Shoebox may perceive and differentiate between sixteen spoken English phrases.
    Nineteen Seventies Carnegie Mellon College within the 12 months 1976 developed a ‘Harpy’ system that would acknowledge over 1000 phrases.
    Nineteen Nineties After a protracted wait of just about 40 years, Bell Applied sciences once more breakthrough the trade with its dial-in interactive voice recognition methods that would dictate human speech.
    2000s This was a transformative interval for ASR know-how as the large know-how big Google began engaged on speech recognition know-how. They created superior speech software program with an accuracy price of roughly 80%, making it widespread worldwide.
    2010s The final decade turned a golden interval for ASR, with Amazon and Apple launching their first-ever AI-based speech software program, Alexa and Siri.

    Shifting forward of 2010, ASR is tremendously evolving and changing into increasingly prevalent and correct. Immediately, Amazon, Google, and Apple are essentially the most outstanding leaders in ASR know-how.

    [ Also Read: The Complete Guide to Conversational AI ]

    How Does Voice Recognition Work?

    Automated Speech Recognition is a reasonably superior know-how that’s extraordinarily laborious to design and develop. There are literally thousands of languages worldwide with varied dialects and accents, so it’s laborious to develop software program that may perceive all of it.

    ASR makes use of ideas of pure language processing and machine studying for its improvement. By incorporating quite a few language-learning mechanisms within the software program, builders make sure the precision and effectivity of speech recognition software program.

    Automated Speech Recognition (ASR) is a fancy know-how that depends on a number of key processes to transform spoken language into textual content. At a excessive stage, the primary steps concerned are:

    1. Audio Seize: A microphone captures the person’s speech and converts the acoustic waves into {an electrical} sign.
    2. Audio Pre-processing: {The electrical} sign is then digitized and undergoes varied pre-processing steps, similar to noise discount, to reinforce the standard of the audio enter.
    3. Function Extraction: The digital audio is analyzed to extract acoustic options, similar to pitch, vitality, and spectral coefficients, which can be attribute of various speech sounds.
    4. Acoustic Modeling: The extracted options are in contrast in opposition to pre-trained acoustic fashions, which map the audio options to particular person speech sounds or phonemes.
    5. Language Modeling: The acknowledged phonemes are then assembled into phrases & phrases utilizing statistical language fashions that predict the almost definitely phrase sequences primarily based on context.
    6. Decoding: The ultimate step includes decoding essentially the most possible phrase sequence that matches the enter audio, taking into consideration each the acoustic and language fashions.

    These core parts work collectively seamlessly to allow extremely correct speech-to-text conversion, even within the presence of background noise, accents, and numerous vocabularies.

    [ Also Read: What is Speech-to-Text Technology and How it works]



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticlePuzzling out climate change | MIT News
    Next Article Can deep learning transform heart failure prevention? | MIT News
    ProfitlyAI
    • Website

    Related Posts

    Latest News

    Benefits an End to End Training Data Service Provider Can Offer Your AI Project

    June 4, 2025
    Latest News

    AI Will Destroy 50% of Entry-Level Jobs, Veo 3’s Scary Lifelike Videos, Meta Aims to Fully Automate Ads & Perplexity’s Burning Cash

    June 3, 2025
    Latest News

    Hyper-Realistic AI Video Is Outpacing Our Ability to Label It

    June 3, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    AI is pushing the limits of the physical world

    April 21, 2025

    Medical Image Annotation: Definition, Application, Use Cases & Types

    April 9, 2025

    Gemini Live-funktionen rullas ut till Android användare

    April 18, 2025

    Retrieval Augmented Generation (RAG) — An Introduction

    April 22, 2025

    Multiple Linear Regression Analysis | Towards Data Science

    May 23, 2025
    Categories
    • AI Technology
    • AI Tools & Technologies
    • Artificial Intelligence
    • Latest AI Innovations
    • Latest News
    Most Popular

    The AI Cheating Crisis in Higher Education Is Worse Than Anyone Expected

    May 13, 2025

    Talking to Kids About AI

    May 2, 2025

    Want Better Clusters? Try DeepType | Towards Data Science

    May 3, 2025
    Our Picks

    Gemini introducerar funktionen schemalagda åtgärder i Gemini-appen

    June 7, 2025

    AIFF 2025 Runway’s tredje årliga AI Film Festival

    June 7, 2025

    AI-agenter kan nu hjälpa läkare fatta bättre beslut inom cancervård

    June 7, 2025
    Categories
    • AI Technology
    • AI Tools & Technologies
    • Artificial Intelligence
    • Latest AI Innovations
    • Latest News
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us
    Copyright © 2025 ProfitlyAI All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.