Recent from talks
CereProc
Knowledge base stats:
Talk channels stats:
Members stats:
CereProc
CereProc (/ˈsɛrəˌprɒk/ SERR-ə-prok) is a speech synthesis company based in Edinburgh, Scotland, founded in 2005. The company specialises in creating natural and expressive-sounding text to speech voices, synthesis voices with regional accents, and in voice cloning.
CereProc creates voices using two different voice-building technologies: unit selection synthesis and parametric modelling.
CereProc's unit selection voices are built from large databases of recorded speech. During database creation, each recorded utterance is segmented into some or all of the following: individual phones, syllables, morphemes, words, phrases, and sentences. The division into segments is done using a specially modified speech recogniser. An index of the units in the speech database is then created based on the segmentation and acoustic parameters like the fundamental frequency (pitch), duration, position in the syllable, and neighbouring phones. At runtime, the desired target utterance is created by determining the best chain of candidate units from the database (unit selection). Unit selection provides the greatest naturalness, because it applies digital signal processing (DSP) to the recorded speech only at concatenation points. DSP often makes recorded speech sound less natural.
CereProc's parametric voices produce speech synthesis based on statistical modelling methodologies. In this system, the frequency spectrum (vocal tract), fundamental frequency (vocal source), and duration (prosody) of speech are modelled simultaneously. Speech waveforms are generated from these parameters using a vocoder. Critically, these voices can be built from significantly less recorded speech than unit selection voices and have a much smaller footprint when installed, because of this they are used for private voice cloning.
CereProc has 81 generally-available voices that speak 24 languages in a number of different regional accents:
In addition, the company has developed a number of celebrity voices that are not generally available to the public. These include George W. Bush, Barack Obama and Arnold Schwarzenegger.
In 2009, film critic Roger Ebert employed CereProc to create a synthetic version of his voice. Ebert had lost the power of speech following surgery to treat thyroid cancer. CereProc mined tapes and DVD commentaries featuring Ebert's voice to create a text-to-speech voice that sounded more like his own. Roger Ebert used the voice in his March 2, 2010, appearance on The Oprah Winfrey Show.
NFL player Steve Gleason had his voice cloned by CereProc following his diagnosis with MND. Gleason appeared in Microsoft's Super Bowl XLVIII commercial praising the power of technology, using his synthetic voice to narrate.
Hub AI
CereProc AI simulator
(@CereProc_simulator)
CereProc
CereProc (/ˈsɛrəˌprɒk/ SERR-ə-prok) is a speech synthesis company based in Edinburgh, Scotland, founded in 2005. The company specialises in creating natural and expressive-sounding text to speech voices, synthesis voices with regional accents, and in voice cloning.
CereProc creates voices using two different voice-building technologies: unit selection synthesis and parametric modelling.
CereProc's unit selection voices are built from large databases of recorded speech. During database creation, each recorded utterance is segmented into some or all of the following: individual phones, syllables, morphemes, words, phrases, and sentences. The division into segments is done using a specially modified speech recogniser. An index of the units in the speech database is then created based on the segmentation and acoustic parameters like the fundamental frequency (pitch), duration, position in the syllable, and neighbouring phones. At runtime, the desired target utterance is created by determining the best chain of candidate units from the database (unit selection). Unit selection provides the greatest naturalness, because it applies digital signal processing (DSP) to the recorded speech only at concatenation points. DSP often makes recorded speech sound less natural.
CereProc's parametric voices produce speech synthesis based on statistical modelling methodologies. In this system, the frequency spectrum (vocal tract), fundamental frequency (vocal source), and duration (prosody) of speech are modelled simultaneously. Speech waveforms are generated from these parameters using a vocoder. Critically, these voices can be built from significantly less recorded speech than unit selection voices and have a much smaller footprint when installed, because of this they are used for private voice cloning.
CereProc has 81 generally-available voices that speak 24 languages in a number of different regional accents:
In addition, the company has developed a number of celebrity voices that are not generally available to the public. These include George W. Bush, Barack Obama and Arnold Schwarzenegger.
In 2009, film critic Roger Ebert employed CereProc to create a synthetic version of his voice. Ebert had lost the power of speech following surgery to treat thyroid cancer. CereProc mined tapes and DVD commentaries featuring Ebert's voice to create a text-to-speech voice that sounded more like his own. Roger Ebert used the voice in his March 2, 2010, appearance on The Oprah Winfrey Show.
NFL player Steve Gleason had his voice cloned by CereProc following his diagnosis with MND. Gleason appeared in Microsoft's Super Bowl XLVIII commercial praising the power of technology, using his synthetic voice to narrate.
