Home Uncategorized Improving Open Source Speech Recognition

Uncategorized

Improving Open Source Speech Recognition

October 11, 2006

“VoxForge collects free GPL Transcribed Speech Audio that can be used in the creation of Acoustic Models for use with Open Source Speech Recognition Engines. We are essentially creating a user-submitted repository of the ‘source’ speech audio for the creation of Acoustic Models to be used by Speech Recognition Engines. The Speech Audio files will then be ‘compiled’ into Acoustic Models for use with Open Source Speech Recognition engines such as Sphinx, HTK, CAVS and Julius.”

Why free GPL Speech Audio?
Speech Recognition Engines require two types of files to recognize speech. The first is an Acoustic Model, which is created by taking a very large number of audio recordings of speech and their transcriptions (called Speech Corpus or Corpora) and ‘compiling’ them into statistical representations of the sounds that make up each word. The second is a Language Model or Grammar file. A Language Model is a very large file containing the probabilities of certain sequences of words. A Grammar is a much smaller file containing sets of predefined combinations of words.
Most Acoustic Models used by ‘Open Source’ Speech Recognition engines are ‘closed source’. They do not give you access to the speech audio (the ‘source’) used to create the acoustic model, or if they do, there are licensing restrictions on the distribution of the ‘source’ (i.e. you can only use it for personal or research purposes). The reason for this is because there is no free Speech Corpora in a form that can readily be used to create Acoustic Models for Speech Recognition Engines. Open Source projects are required to purchase Speech Copora which has restrictive licensing

Which Boring but Cheap Web Development Stack to Use in 2023?

Introducing Pest 2.0: The Next Generation of PHP Testing

PHP Design Patterns Game : The Singleton Pattern

Introducing the PHP Design Patterns Game Series

A Beginner’s Guide to Business Management for Freelancers

5 Common Mistakes Freelancers Make and How to Avoid Them

Navigating Freelance Contracts: What to Look for and What to Avoid

The Art of Saying ‘No’ to Clients: Setting Boundaries for Freelancers

Getting Started with Bref: Deploying Serverless PHP Applications on AWS Lambda

Unlock the Power of Real-time Intelligence with Redis Enterprise Cloud on…

Centreon, powerful IT and Application monitoring software

MySQL 8.0.26 and 5.7.35 Released

PeachPie, Do We Really Need a .NET Development Platform for PHP…

React Introduce Zero-Bundle-Size React Server Components

Bootstrap 4.6.0 Released with all new backend !

Visx, low-level visualization primitives for React by Airbnb

The Power of Design Sprints: A Modern Approach to Software Development

Optimizing your SQL queries: Understanding the execution order

Introducing MRSK: Your Ticket to Deploying Web Apps Anywhere

The Balancing Act of Web Security and Performance: How to Keep…

Improving Open Source Speech Recognition

Like this:

Related

Social Media

Latest articles

Get Started with Laravel Volt: A Free Full Stack Laravel App...

Unleashing the Power of PHP Fibers: Boost Web Development with Efficient...

Which Boring but Cheap Web Development Stack to Use in 2023?

Initializer for Laravel – A Visual Approach to Setting Up a...

PHP Coding Puzzle 10 : Sudoku Game

Open Source Dress for Success University Opens