Close Menu
    Trending
    • Dispatch: Partying at one of Africa’s largest AI gatherings
    • Topp 10 AI-filmer genom tiderna
    • OpenAIs nya webbläsare ChatGPT Atlas
    • Creating AI that matters | MIT News
    • Scaling Recommender Transformers to a Billion Parameters
    • Hidden Gems in NumPy: 7 Functions Every Data Scientist Should Know
    • Is RAG Dead? The Rise of Context Engineering and Semantic Layers for Agentic AI
    • ChatGPT Gets More Personal. Is Society Ready for It?
    ProfitlyAI
    • Home
    • Latest News
    • AI Technology
    • Latest AI Innovations
    • AI Tools & Technologies
    • Artificial Intelligence
    ProfitlyAI
    Home » ASR (Automatic Speech Recognition) – Definition, Use Cases, Example
    Latest News

    ASR (Automatic Speech Recognition) – Definition, Use Cases, Example

    ProfitlyAIBy ProfitlyAIApril 5, 2025No Comments4 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    Share
    Facebook Twitter LinkedIn Pinterest Email


    Automated Speech Recognition know-how has been there for a protracted haul however lately gained prominence after its use turned prevalent in varied smartphone purposes like Siri and Alexa. These AI-based smartphone purposes have illustrated the facility of ASR in simplifying on a regular basis duties for all of us.

    Moreover, as totally different trade verticals additional transfer towards automation, the underlying want for ASR is subjected to surge. Therefore, allow us to perceive this terrific speech recognition know-how in-depth and why it’s thought of one of the essential applied sciences for the long run.

    A Temporary Historical past of ASR Expertise

    Earlier than continuing forward and exploring the potential of Automated Speech Recognition, allow us to first check out its evolution.

    Decade Evolution of ASR
    Nineteen Fifties Speech Recognition know-how was first launched by Bell Laboratories within the Nineteen Fifties. The Bell Labs created a digital speech recognizer generally known as ‘Audrey’ that would establish the numbers between 1-9 when spoken by a single voice.
    Sixties In 1952, IBM launched its first voice recognition system, ‘Shoebox.’ Shoebox may perceive and differentiate between sixteen spoken English phrases.
    Nineteen Seventies Carnegie Mellon College within the 12 months 1976 developed a ‘Harpy’ system that would acknowledge over 1000 phrases.
    Nineteen Nineties After a protracted wait of just about 40 years, Bell Applied sciences once more breakthrough the trade with its dial-in interactive voice recognition methods that would dictate human speech.
    2000s This was a transformative interval for ASR know-how as the large know-how big Google began engaged on speech recognition know-how. They created superior speech software program with an accuracy price of roughly 80%, making it widespread worldwide.
    2010s The final decade turned a golden interval for ASR, with Amazon and Apple launching their first-ever AI-based speech software program, Alexa and Siri.

    Shifting forward of 2010, ASR is tremendously evolving and changing into increasingly prevalent and correct. Immediately, Amazon, Google, and Apple are essentially the most outstanding leaders in ASR know-how.

    [ Also Read: The Complete Guide to Conversational AI ]

    How Does Voice Recognition Work?

    Automated Speech Recognition is a reasonably superior know-how that’s extraordinarily laborious to design and develop. There are literally thousands of languages worldwide with varied dialects and accents, so it’s laborious to develop software program that may perceive all of it.

    ASR makes use of ideas of pure language processing and machine studying for its improvement. By incorporating quite a few language-learning mechanisms within the software program, builders make sure the precision and effectivity of speech recognition software program.

    Automated Speech Recognition (ASR) is a fancy know-how that depends on a number of key processes to transform spoken language into textual content. At a excessive stage, the primary steps concerned are:

    1. Audio Seize: A microphone captures the person’s speech and converts the acoustic waves into {an electrical} sign.
    2. Audio Pre-processing: {The electrical} sign is then digitized and undergoes varied pre-processing steps, similar to noise discount, to reinforce the standard of the audio enter.
    3. Function Extraction: The digital audio is analyzed to extract acoustic options, similar to pitch, vitality, and spectral coefficients, which can be attribute of various speech sounds.
    4. Acoustic Modeling: The extracted options are in contrast in opposition to pre-trained acoustic fashions, which map the audio options to particular person speech sounds or phonemes.
    5. Language Modeling: The acknowledged phonemes are then assembled into phrases & phrases utilizing statistical language fashions that predict the almost definitely phrase sequences primarily based on context.
    6. Decoding: The ultimate step includes decoding essentially the most possible phrase sequence that matches the enter audio, taking into consideration each the acoustic and language fashions.

    These core parts work collectively seamlessly to allow extremely correct speech-to-text conversion, even within the presence of background noise, accents, and numerous vocabularies.

    [ Also Read: What is Speech-to-Text Technology and How it works]



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticlePuzzling out climate change | MIT News
    Next Article Can deep learning transform heart failure prevention? | MIT News
    ProfitlyAI
    • Website

    Related Posts

    Latest News

    ChatGPT Gets More Personal. Is Society Ready for It?

    October 21, 2025
    Latest News

    Why the Future Is Human + Machine

    October 21, 2025
    Latest News

    Why AI Is Widening the Gap Between Top Talent and Everyone Else

    October 21, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    Computer Vision’s Annotation Bottleneck Is Finally Breaking

    June 18, 2025

    The Future of Data with Intelligent Character Recognition (ICR)

    April 9, 2025

    Transformer Lab: Öppen källkods-plattform förenklar arbetet med AI-språkmodeller

    April 13, 2025

    Responding to the climate impact of generative AI | MIT News

    September 30, 2025

    AI and machine learning for engineering design | MIT News

    September 7, 2025
    Categories
    • AI Technology
    • AI Tools & Technologies
    • Artificial Intelligence
    • Latest AI Innovations
    • Latest News
    Most Popular

    Learn Your Way: Googles AI skapar personliga läroböcker

    October 4, 2025

    Chinese universities want students to use more AI, not less

    July 28, 2025

    ChatGPT’s New Memory, Shopify CEO’s Leaked “AI First” Memo, Google Cloud Next Releases, o3 and o4-mini Coming Soon & Llama 4’s Rocky Launch

    April 16, 2025
    Our Picks

    Dispatch: Partying at one of Africa’s largest AI gatherings

    October 22, 2025

    Topp 10 AI-filmer genom tiderna

    October 22, 2025

    OpenAIs nya webbläsare ChatGPT Atlas

    October 22, 2025
    Categories
    • AI Technology
    • AI Tools & Technologies
    • Artificial Intelligence
    • Latest AI Innovations
    • Latest News
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us
    Copyright © 2025 ProfitlyAI All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.