textless-lib: a Library for Textless Spoken Language Processing. Kharitonov, E., Copet, J., Lakhotia, K., Nguyen, T. A., Tomasello, P., Lee, A., Elkahky, A., Hsu, W., Mohamed, A., Dupoux, E., & Adi, Y. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: System Demonstrations, pages 1–9, Hybrid: Seattle, Washington + Online, July, 2022. Association for Computational Linguistics.
Textless spoken language processing is an exciting area of research that promises to extend the applicability of the standard NLP toolset to spoken language and to languages with few or no textual resources. Here, we introduce textless-lib, a PyTorch-based library that aims to facilitate research in the area. We describe the building blocks that the library provides and demonstrate its usability by discussing three different use-case examples: (i) speaker probing, (ii) speech resynthesis and compression, and (iii) speech continuation. We believe that textless-lib substantially simplifies research in the textless setting and will be handy not only for speech researchers but also for the NLP community at large.
@inproceedings{kharitonov_textless-lib_2022,
	address = {Hybrid: Seattle, Washington + Online},
	title = {textless-lib: a {Library} for {Textless} {Spoken} {Language} {Processing}},
	shorttitle = {textless-lib},
	url = {https://aclanthology.org/2022.naacl-demo.1},
	doi = {10.18653/v1/2022.naacl-demo.1},
	abstract = {Textless spoken language processing is an exciting area of research that promises to extend the applicability of the standard NLP toolset to spoken language and to languages with few or no textual resources. Here, we introduce textless-lib, a PyTorch-based library that aims to facilitate research in the area. We describe the building blocks that the library provides and demonstrate its usability by discussing three different use-case examples: (i) speaker probing, (ii) speech resynthesis and compression, and (iii) speech continuation. We believe that textless-lib substantially simplifies research in the textless setting and will be handy not only for speech researchers but also for the NLP community at large.},
	urldate = {2023-02-06},
	booktitle = {Proceedings of the 2022 {Conference} of the {North} {American} {Chapter} of the {Association} for {Computational} {Linguistics}: {Human} {Language} {Technologies}: {System} {Demonstrations}},
	publisher = {Association for Computational Linguistics},
	author = {Kharitonov, Eugene and Copet, Jade and Lakhotia, Kushal and Nguyen, Tu Anh and Tomasello, Paden and Lee, Ann and Elkahky, Ali and Hsu, Wei-Ning and Mohamed, Abdelrahman and Dupoux, Emmanuel and Adi, Yossi},
	month = jul,
	year = {2022},
	pages = {1--9},
}
