Building and Comparing Lemma Embeddings for Latin. Classical Latin versus Thomas Aquinas. Sprugnoli, R., Moretti, G., & Passarotti, M. IJCoL. Italian Journal of Computational Linguistics, 6(1):29–45, June 2020. Publisher: Accademia University Press.
This paper presents a new set of lemma embeddings for the Latin language. Embeddings are trained on a manually annotated corpus of texts belonging to the Classical era: different models, architectures and dimensions are tested and evaluated using a novel benchmark for the synonym selection task. In addition, we release vectors pre-trained on the “Opera Maiora” by Thomas Aquinas, thus providing a resource to analyze Latin in a diachronic perspective. The embeddings built upon the two training corpora are compared to each other to support diachronic lexical studies. The words showing the highest usage change between the two corpora are reported and a selection of them is discussed.
@article{sprugnoli_building_2020,
	title = {Building and {Comparing} {Lemma} {Embeddings} for {Latin}. {Classical} {Latin} versus {Thomas} {Aquinas}},
	volume = {6},
	copyright = {https://creativecommons.org/licenses/by-nc-nd/4.0/},
	issn = {2499-4553},
	url = {https://journals.openedition.org/ijcol/624},
	doi = {10.4000/ijcol.624},
	abstract = {This paper presents a new set of lemma embeddings for the Latin language. Embeddings are trained on a manually annotated corpus of texts belonging to the Classical era: different models, architectures and dimensions are tested and evaluated using a novel benchmark for the synonym selection task. In addition, we release vectors pre-trained on the “Opera Maiora” by Thomas Aquinas, thus providing a resource to analyze Latin in a diachronic perspective. The embeddings built upon the two training corpora are compared to each other to support diachronic lexical studies. The words showing the highest usage change between the two corpora are reported and a selection of them is discussed.},
	language = {en},
	number = {1},
	urldate = {2023-08-26},
	journal = {IJCoL. Italian Journal of Computational Linguistics},
	author = {Sprugnoli, Rachele and Moretti, Giovanni and Passarotti, Marco},
	month = jun,
	year = {2020},
	publisher = {Accademia University Press},
	pages = {29--45},
}
