Isolated Digit Recognition with LSTM Recurrent Networks. Graves, A., Eck, D., Beringer, N., & Schmidhuber, J. In First International Workshop on Biologically Inspired Approaches to Advanced Information Technology, Lausanne, 2004.
abstract   bibtex   
Long Short-Term Memory (LSTM) recurrent neural networks (RNNs) are local in space and time and closely related to a biological model of memory in the prefrontal cortex. Not only are they more biologically plausible than previous artificial RNNs, they also outperformed them on many artificially generated sequential processing tasks. This encouraged us to apply LSTM to more realistic problems, such as the recognition of spoken digits. Without any modification of the underlying algorithm, we achieved results comparable to state-of-the-art Hidden Markov Model (HMM) based recognisers on both the TIDIGITS and TI46 speech corpora. We conclude that LSTM should be further investigated as a biologically plausible basis for a bottom-up, neural net-based approach to speech recognition.
@INPROCEEDINGS{graves+eck+beringer+schmidhuber:2004,
  AUTHOR = {A. Graves and D. Eck and N. Beringer and J. Schmidhuber},
  TITLE = {Isolated Digit Recognition with {LSTM} Recurrent Networks},
  BOOKTITLE = {First International Workshop on Biologically Inspired Approaches to Advanced Information Technology},
  ADDRESS = {Lausanne},
  YEAR = {2004},
  SOURCE = {OwnPublication},
  ABSTRACT = {Long Short-Term Memory (LSTM) recurrent neural networks 
		  (RNNs) are local in space and time and closely related to a biological model of memory in the 
		  prefrontal cortex. Not only are they more biologically plausible than previous artificial RNNs, 
		  they also outperformed them on many artificially generated sequential processing tasks. 
		  This encouraged us to apply LSTM to more realistic problems, such as the recognition of spoken digits. 
		  Without any modification of the underlying algorithm, we achieved results comparable to state-of-the-art Hidden Markov Model 
		  (HMM) based recognisers on both the TIDIGITS and TI46 speech corpora. 
		  We conclude that LSTM should be further investigated as a biologically plausible basis for a bottom-up, 
		  neural net-based approach to speech recognition.}
}

Downloads: 0