Hidden Conditional Random Fields for Speech Recognition

Author	: Yun-Hsuan Sung
Publisher	: Stanford University
Total Pages	: 161
Release	: 2010
ISBN-10	: STANFORD:zn927hy7753

Book Synopsis: Hidden Conditional Random Fields for Speech Recognition, by Yun-Hsuan Sung

Hidden Conditional Random Fields for Speech Recognition was written by Yun-Hsuan Sung and published by Stanford University in 2010 (161 pages).

Book excerpt: This thesis investigates using a new graphical model, hidden conditional random fields (HCRFs), for speech recognition. Conditional random fields (CRFs) are discriminative sequence models that have been successfully applied to several tasks in text processing, such as named entity recognition. Recently, there has been increasing interest in applying CRFs to speech recognition due to the similarity between speech and text processing. HCRFs are CRFs augmented with hidden variables that are capable of representing the dynamic changes and variations in speech signals. HCRFs can also incorporate correlated features from both speech signals and text without making strong independence assumptions among them. This thesis presents my current research on applying HCRFs to speech recognition and HCRFs' potential to replace the current hidden Markov model (HMM) for acoustic modeling. Experimental results of phone classification, phone recognition, and speaker adaptation are presented and discussed. Our monophone HCRFs outperform both maximum mutual information estimation (MMIE) and minimum phone error (MPE) trained HMMs and achieve state-of-the-art performance in TIMIT phone classification and recognition tasks. We also show how to jointly train acoustic models and language models in HCRFs, which improves the results. Maximum a posteriori (MAP) and maximum conditional likelihood linear regression (MCLLR) successfully adapt speaker-independent models to speaker-dependent models with a small amount of adaptation data for HCRF speaker adaptation. Finally, we explore adding gender and dialect features for phone recognition, and experimental results are presented.

Hidden Conditional Random Fields for Speech Recognition Related Books

Automatic Speech Recognition

Discriminative Learning for Speech Recognition

Handbook of Pattern Recognition and Computer Vision (5th Edition)

Semantics in Action