Hidden Conditional Random Fields for Speech Recognition

Author : Yun-Hsuan Sung
Publisher : Stanford University
Total Pages : 161
Release : 2010
ISBN-10 : STANFORD:zn927hy7753
ISBN-13 :

Book Synopsis Hidden Conditional Random Fields for Speech Recognition by : Yun-Hsuan Sung

Hidden Conditional Random Fields for Speech Recognition was written by Yun-Hsuan Sung and published by Stanford University in 2010; it runs to 161 pages. Book excerpt: This thesis investigates using a new graphical model, hidden conditional random fields (HCRFs), for speech recognition. Conditional random fields (CRFs) are discriminative sequence models that have been successfully applied to several tasks in text processing, such as named entity recognition. Recently, there has been increasing interest in applying CRFs to speech recognition due to the similarity between speech and text processing. HCRFs are CRFs augmented with hidden variables that are capable of representing the dynamic changes and variations in speech signals. HCRFs can also incorporate correlated features from both speech signals and text without making strong independence assumptions among them. This thesis presents my current research on applying HCRFs to speech recognition and on HCRFs' potential to replace the current hidden Markov model (HMM) for acoustic modeling. Experimental results of phone classification, phone recognition, and speaker adaptation are presented and discussed. Our monophone HCRFs outperform both maximum mutual information estimation (MMIE) and minimum phone error (MPE) trained HMMs and achieve state-of-the-art performance in TIMIT phone classification and recognition tasks. We also show how to jointly train acoustic models and language models in HCRFs, which improves the results. For HCRF speaker adaptation, maximum a posteriori (MAP) and maximum conditional likelihood linear regression (MCLLR) successfully adapt speaker-independent models to speaker-dependent models with a small amount of adaptation data. Finally, we explore adding gender and dialect features for phone recognition, and experimental results are presented.
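
For readers unfamiliar with the model, the conditional probability that an HCRF assigns to a label can be sketched as follows. This is the form commonly given in the HCRF literature, not a formula quoted from the thesis; the symbols (w for a label such as a phone, o for the observation sequence, s for a hidden state sequence, f for the feature vector, and \lambda for the parameters) are notation assumed here for illustration.

```latex
% Sketch of the usual HCRF conditional distribution (assumed notation):
%   w       : label (e.g. a phone)
%   o       : observation sequence (acoustic frames)
%   s       : hidden state sequence, summed out
%   f       : feature vector over (w, s, o)
%   \lambda : model parameters
\[
  p(w \mid o; \lambda)
    = \frac{1}{Z(o; \lambda)} \sum_{s} \exp\!\bigl( \lambda^{\top} f(w, s, o) \bigr),
  \qquad
  Z(o; \lambda)
    = \sum_{w'} \sum_{s} \exp\!\bigl( \lambda^{\top} f(w', s, o) \bigr).
\]
```

Summing over the hidden sequence s is what distinguishes an HCRF from a plain CRF. Because the features f may depend jointly on w, s, and o, correlated features can be combined without an independence assumption among them.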


Hidden Conditional Random Fields for Speech Recognition Related Books

Automatic Speech Recognition
Language: en
Pages: 329
Authors: Dong Yu
Categories: Technology & Engineering
Type: BOOK - Published: 2014-11-11 - Publisher: Springer

This book provides a comprehensive overview of the recent advancement in the field of automatic speech recognition with a focus on deep learning models includin
Discriminative Learning for Speech Recognition
Language: en
Pages: 112
Authors: Xiaodong He
Categories: Technology & Engineering
Type: BOOK - Published: 2022-06-01 - Publisher: Springer Nature

In this book, we introduce the background and mainstream methods of probabilistic modeling and discriminative parameter optimization for speech recognition. The
Handbook of Pattern Recognition and Computer Vision (5th Edition)
Language: en
Pages: 582
Authors: Chi-hau Chen
Categories: Computers
Type: BOOK - Published: 2015-12-15 - Publisher: World Scientific

The book provides an up-to-date and authoritative treatment of pattern recognition and computer vision, with chapters written by leaders in the field. On the ba
Semantics in Action
Language: en
Pages: 281
Authors: Muhammad Tanvir Afzal
Categories: Computers
Type: BOOK - Published: 2012-04-25 - Publisher: BoD – Books on Demand

The current book is a combination of a number of great ideas, applications, case studies, and practical systems in the domain of Semantics. The book has been divi