While human listeners have little problems in dealing with the strong variation in spoken language, the same cannot be said about automatic speech recognition (ASR). This work compares recognition performance of man and machine with the aim of learning from the distinct errors between these two. Based on the differences, the signal processing mechanisms are analyzed that are suitable to increase the robustness of ASR. The comparison focuses on the influence of intrinsic variation of speech, i.e., changes in speaking rate, effort and style, as well as dialect and accent. The outcome of the experiments suggests that the processing of temporal cues in ASR bears room for improvement. Therefore, spectro-temporal features are employed as input to ASR systems, which results in an increase of recognition performance for varying speaking effort and speaking style compared to standard features. This documents the usefulness of spectro-temporal and temporal information for automatic recognizers.
"Sinopsis" puede pertenecer a otra edición de este libro.
While human listeners have little problems in dealing with the strong variation in spoken language, the same cannot be said about automatic speech recognition (ASR). This work compares recognition performance of man and machine with the aim of learning from the distinct errors between these two. Based on the differences, the signal processing mechanisms are analyzed that are suitable to increase the robustness of ASR. The comparison focuses on the influence of intrinsic variation of speech, i.e., changes in speaking rate, effort and style, as well as dialect and accent. The outcome of the experiments suggests that the processing of temporal cues in ASR bears room for improvement. Therefore, spectro-temporal features are employed as input to ASR systems, which results in an increase of recognition performance for varying speaking effort and speaking style compared to standard features. This documents the usefulness of spectro-temporal and temporal information for automatic recognizers.
Bernd T. Meyer studied physics at the University in Oldenburg, and received his diploma and Ph.D. in 2004 and 2009, respectively. He has been working on the improvement of automatic speech recognizers and modeling human speech perception both in Oldenburg and the International Computer Science Institute in Berkeley, CA.
"Sobre este título" puede pertenecer a otra edición de este libro.
EUR 7,67 gastos de envío en Estados Unidos de America
Destinos, gastos y plazos de envíoLibrería: Best Price, Torrance, CA, Estados Unidos de America
Condición: New. SUPER FAST SHIPPING. Nº de ref. del artículo: 9783838121550
Cantidad disponible: 2 disponibles
Librería: Lucky's Textbooks, Dallas, TX, Estados Unidos de America
Condición: New. Nº de ref. del artículo: ABLING22Oct2817100592155
Cantidad disponible: Más de 20 disponibles
Librería: PBShop.store US, Wood Dale, IL, Estados Unidos de America
PAP. Condición: New. New Book. Shipped from UK. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. Nº de ref. del artículo: L0-9783838121550
Cantidad disponible: Más de 20 disponibles
Librería: California Books, Miami, FL, Estados Unidos de America
Condición: New. Nº de ref. del artículo: I-9783838121550
Cantidad disponible: Más de 20 disponibles
Librería: PBShop.store UK, Fairford, GLOS, Reino Unido
PAP. Condición: New. New Book. Delivered from our UK warehouse in 4 to 14 business days. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. Nº de ref. del artículo: L0-9783838121550
Cantidad disponible: Más de 20 disponibles
Librería: Ria Christie Collections, Uxbridge, Reino Unido
Condición: New. In. Nº de ref. del artículo: ria9783838121550_new
Cantidad disponible: Más de 20 disponibles
Librería: Chiron Media, Wallingford, Reino Unido
Paperback. Condición: New. Nº de ref. del artículo: 6666-IUK-9783838121550
Cantidad disponible: 10 disponibles
Librería: moluna, Greven, Alemania
Kartoniert / Broschiert. Condición: New. Dieser Artikel ist ein Print on Demand Artikel und wird nach Ihrer Bestellung fuer Sie gedruckt. While human listeners have little problems in dealing with the strong variation in spoken language, the same cannot be said about automatic speech recognition (ASR). This work compares recognition performance of man and machine with the aim of learning from. Nº de ref. del artículo: 5406503
Cantidad disponible: Más de 20 disponibles
Librería: AHA-BUCH GmbH, Einbeck, Alemania
Taschenbuch. Condición: Neu. nach der Bestellung gedruckt Neuware - Printed after ordering - While human listeners have little problems in dealing with the strong variation in spoken language, the same cannot be said about automatic speech recognition (ASR). This work compares recognition performance of man and machine with the aim of learning from the distinct errors between these two. Based on the differences, the signal processing mechanisms are analyzed that are suitable to increase the robustness of ASR. The comparison focuses on the influence of intrinsic variation of speech, i.e., changes in speaking rate, effort and style, as well as dialect and accent. The outcome of the experiments suggests that the processing of temporal cues in ASR bears room for improvement. Therefore, spectro-temporal features are employed as input to ASR systems, which results in an increase of recognition performance for varying speaking effort and speaking style compared to standard features. This documents the usefulness of spectro-temporal and temporal information for automatic recognizers. Nº de ref. del artículo: 9783838121550
Cantidad disponible: 2 disponibles
Librería: Books Puddle, New York, NY, Estados Unidos de America
Condición: New. pp. 140. Nº de ref. del artículo: 26128843777
Cantidad disponible: 4 disponibles