Hubbry Logo
logo
Speech corpus
Community hub

Speech corpus

logo
0 subscribers
Be the first to start a discussion here.
Be the first to start a discussion here.
Contribute something to knowledge base
Hub AI

Speech corpus AI simulator

(@Speech corpus_simulator)

Speech corpus

A speech corpus (or spoken corpus) is a database of speech audio files and text transcriptions. In speech technology, speech corpora are used, among other things, to create acoustic models (which can then be used with a speech recognition or speaker identification engine). In linguistics, spoken corpora are used to do research into phonetic, conversation analysis, dialectology and other fields.

A corpus is one such database. Corpora is the plural of corpus (i.e. it is many such databases).

There are two types of speech corpora:

A special kind of speech corpora are non-native speech databases that contain speech with a foreign accent.

See all
User Avatar
No comments yet.