Multilingual Phoneme Models for Rapid Speech Processing System Development

Hansen, Eric G

ISBN 10: 1286862507 ISBN 13: 9781286862506
Editorial: Biblioscholar, 2012
Nuevos Encuadernación de tapa blanda

Librería: Ria Christie Collections, Uxbridge, Reino Unido Calificación del vendedor: 5 de 5 estrellas Valoración 5 estrellas, Más información sobre las valoraciones de los vendedores

Vendedor de AbeBooks desde 25 de marzo de 2015

Este artículo en concreto ya no está disponible.

Descripción

Descripción:

In. N° de ref. del artículo ria9781286862506_new

Denunciar este artículo

Sinopsis:

Current speech recognition systems tend to be developed only for commercially viable languages. The resources needed for a typical speech recognition system include hundreds of hours of transcribed speech for acoustic models and 10 to 100 million words of text for language models; both of these requirements can be costly in time and money. The goal of this research is to facilitate rapid development of speech systems to new languages by using multilingual phoneme models to alleviate requirements for large amounts of transcribed speech. The GlobalPhone database, which contains transcribed speech from 15 languages, is used as source data to derive multilingual phoneme models. Various bootstrapping processes are used to develop an Arabic speech recognition system starting from monolingual English models, International Phonetic Association (IPA) based multilingual models, and data-driven multilingual models. The Kullback-Leibler distortion measure is used to derive datadriven phoneme clusters. It was found that multilingual bootstrapping methods outperform monolingual English bootstrapping methods on the Arabic evaluation data initially, and after three iterations of bootstrapping all systems show similar performance levels. Applications of this research are in speech recognition, word spotting, information retrieval, and speech-to-speech translation.

Reseña del editor: Current speech recognition systems tend to be developed only for commercially viable languages. The resources needed for a typical speech recognition system include hundreds of hours of transcribed speech for acoustic models and 10 to 100 million words of text for language models; both of these requirements can be costly in time and money. The goal of this research is to facilitate rapid development of speech systems to new languages by using multilingual phoneme models to alleviate requirements for large amounts of transcribed speech. The GlobalPhone database, which contains transcribed speech from 15 languages, is used as source data to derive multilingual phoneme models. Various bootstrapping processes are used to develop an Arabic speech recognition system starting from monolingual English models, International Phonetic Association (IPA) based multilingual models, and data-driven multilingual models. The Kullback-Leibler distortion measure is used to derive datadriven phoneme clusters. It was found that multilingual bootstrapping methods outperform monolingual English bootstrapping methods on the Arabic evaluation data initially, and after three iterations of bootstrapping all systems show similar performance levels. Applications of this research are in speech recognition, word spotting, information retrieval, and speech-to-speech translation.

"Sobre este título" puede pertenecer a otra edición de este libro.

Detalles bibliográficos

Título: Multilingual Phoneme Models for Rapid Speech...
Editorial: Biblioscholar
Año de publicación: 2012
Encuadernación: Encuadernación de tapa blanda
Condición: New

Los mejores resultados en AbeBooks