Baseline Acoustic Models for Brazilian Portuguese Using CMU Sphinx Tools. Oliveira, R., Batista, P., Neto, N., & Klautau, A. In Computational Processing of the Portuguese Language, of Lecture Notes in Computer Science, pages 375–380, April, 2012. Springer, Berlin, Heidelberg. 00000
Baseline Acoustic Models for Brazilian Portuguese Using CMU Sphinx Tools [link]Paper  doi  abstract   bibtex   
Advances in speech processing research rely on the availability of public resources such as corpora, statistical models and baseline systems. In contrast to languages such as English, there are few specific resources for Brazilian Portuguese. This work describes efforts aiming to decrease such gap. Baseline acoustic models for Brazilian Portuguese were built using the CMU Sphinx toolkit and public domain resources: speech corpora, phonetic dictionary and language model. Experiments were carried on for dictation and grammar tasks and the obtained results can be used to support further researches. Part of the trained acoustic models and a reference speech corpus were made publicly available.
@inproceedings{oliveira_baseline_2012,
	series = {Lecture {Notes} in {Computer} {Science}},
	title = {Baseline {Acoustic} {Models} for {Brazilian} {Portuguese} {Using} {CMU} {Sphinx} {Tools}},
	isbn = {978-3-642-28884-5 978-3-642-28885-2},
	url = {https://link.springer.com/chapter/10.1007/978-3-642-28885-2_42},
	doi = {10.1007/978-3-642-28885-2_42},
	abstract = {Advances in speech processing research rely on the availability of public resources such as corpora, statistical models and baseline systems. In contrast to languages such as English, there are few specific resources for Brazilian Portuguese. This work describes efforts aiming to decrease such gap. Baseline acoustic models for Brazilian Portuguese were built using the CMU Sphinx toolkit and public domain resources: speech corpora, phonetic dictionary and language model. Experiments were carried on for dictation and grammar tasks and the obtained results can be used to support further researches. Part of the trained acoustic models and a reference speech corpus were made publicly available.},
	language = {en},
	booktitle = {Computational {Processing} of the {Portuguese} {Language}},
	publisher = {Springer, Berlin, Heidelberg},
	author = {Oliveira, Rafael and Batista, Pedro and Neto, Nelson and Klautau, Aldebaro},
	month = apr,
	year = {2012},
	note = {00000},
	pages = {375--380}
}

Downloads: 0