Jump to content
Main menu
Main menu
move to sidebar
hide
Navigation
Main page
Recent changes
Random page
Help about MediaWiki
Special pages
Niidae Wiki
Search
Search
Appearance
Create account
Log in
Personal tools
Create account
Log in
Pages for logged out editors
learn more
Contributions
Talk
Editing
Speech recognition
(section)
Page
Discussion
English
Read
Edit
View history
Tools
Tools
move to sidebar
hide
Actions
Read
Edit
View history
General
What links here
Related changes
Page information
Appearance
move to sidebar
hide
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
=== Books === Books like "Fundamentals of Speech Recognition" by [[Lawrence Rabiner]] can be useful to acquire basic knowledge but may not be fully up to date (1993). Another good source can be "Statistical Methods for Speech Recognition" by [[Frederick Jelinek]] and "Spoken Language Processing (2001)" by [[Xuedong Huang]] etc., "Computer Speech", by [[Manfred R. Schroeder]], second edition published in 2004, and "Speech Processing: A Dynamic and Optimization-Oriented Approach" published in 2003 by Li Deng and Doug O'Shaughnessey. The updated textbook ''Speech and Language Processing'' (2008) by [[Daniel Jurafsky|Jurafsky]] and Martin presents the basics and the state of the art for ASR. [[Speaker recognition]] also uses the same features, most of the same front-end processing, and classification techniques as is done in speech recognition. A comprehensive textbook, "Fundamentals of Speaker Recognition" is an in depth source for up to date details on the theory and practice.<ref name="auto">{{Cite book |last=Beigi |first=Homayoon |url=http://www.fundamentalsofspeakerrecognition.org |title=Fundamentals of Speaker Recognition |publisher=Springer |year=2011 |isbn=978-0-387-77591-3 |location=New York |archive-url=https://web.archive.org/web/20180131140911/http://www.fundamentalsofspeakerrecognition.org/ |archive-date=31 January 2018 |url-status=live |df=dmy-all}}</ref> A good insight into the techniques used in the best modern systems can be gained by paying attention to government sponsored evaluations such as those organised by [[DARPA]] (the largest speech recognition-related project ongoing as of 2007 is the GALE project, which involves both speech recognition and translation components). A good and accessible introduction to speech recognition technology and its history is provided by the general audience book "The Voice in the Machine. Building Computers That Understand Speech" by [[Roberto Pieraccini]] (2012). The most recent book on speech recognition is ''Automatic Speech Recognition: A Deep Learning Approach'' (Publisher: Springer) written by Microsoft researchers D. Yu and L. Deng and published near the end of 2014, with highly mathematically oriented technical detail on how deep learning methods are derived and implemented in modern speech recognition systems based on DNNs and related deep learning methods.<ref name="ReferenceA">{{Cite journal |last1=Yu |first1=D. |last2=Deng |first2=L. |date=2014 |title=Automatic Speech Recognition: A Deep Learning Approach (Publisher: Springer)}}</ref> A related book, published earlier in 2014, "Deep Learning: Methods and Applications" by L. Deng and D. Yu provides a less technical but more methodology-focused overview of DNN-based speech recognition during 2009β2014, placed within the more general context of deep learning applications including not only speech recognition but also image recognition, natural language processing, information retrieval, multimodal processing, and multitask learning.<ref name="BOOK2014">{{Cite journal |last1=Deng |first1=Li |last2=Yu |first2=Dong |year=2014 |title=Deep Learning: Methods and Applications |url=http://research.microsoft.com/pubs/209355/DeepLearning-NowPublishing-Vol7-SIG-039.pdf |url-status=live |journal=Foundations and Trends in Signal Processing |volume=7 |issue=3β4 |pages=197β387 |citeseerx=10.1.1.691.3679 |doi=10.1561/2000000039 |archive-url=https://web.archive.org/web/20141022161017/http://research.microsoft.com/pubs/209355/DeepLearning-NowPublishing-Vol7-SIG-039.pdf |archive-date=22 October 2014 |df=dmy-all}}</ref>
Summary:
Please note that all contributions to Niidae Wiki may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see
Encyclopedia:Copyrights
for details).
Do not submit copyrighted work without permission!
Cancel
Editing help
(opens in new window)
Search
Search
Editing
Speech recognition
(section)
Add topic