NLP for Literary Latin and Ancient Greek Texts. Beyer, A. & Schulz, K. September, 2026.
Paper doi abstract bibtex This presentation introduces the Daidalos Project, an interdisciplinary initiative at Humboldt-Universität zu Berlin that bridges classical philology, corpus linguistics, and digital humanities through the application of natural language processing (NLP) to Latin and Ancient Greek literary texts. The project supports digital literary research by evaluating and adapting existing NLP models for ancient languages, with a focus on methodological triangulation, participatory research, and user-driven tool development. Key research questions addressed include the semantic and stylistic differences in epitaphs across Thucydides, Lysias, and Plato; the distinction between authentic invectives and declamationes in Ps.-Sallustius; and the identification of information gaps in ancient historiography. The project employs a range of NLP techniques—including word embeddings, named entity recognition (NER), morphological analysis, part-of-speech tagging, and sentiment analysis—using both rule-based and neural network approaches, including state-of-the-art models like XLM-Roberta and LatinCy. A central innovation lies in the use of research tandems, which bring together domain experts and non-specialists in a co-creation process to develop and refine NLP pipelines. The workflow emphasizes transparency, reproducibility, and accessibility, with results visualized through network graphs, tables, and annotated texts. Findings demonstrate that combining multiple methods enhances interpretive power, while LLM-based approaches enable rapid prototyping and exploration, particularly in low-resource settings. The project contributes to the development of digital literacies in classical scholarship and advances a new research paradigm - augmented reading - by integrating computational methods into traditional philological practice.
@misc{beyer_nlp_2026,
address = {Paris},
type = {Talk},
title = {{NLP} for {Literary} {Latin} and {Ancient} {Greek} {Texts}},
url = {https://zenodo.org/records/19422180},
doi = {https://doi.org/10.5281/zenodo.19422180},
abstract = {This presentation introduces the Daidalos Project, an interdisciplinary initiative at Humboldt-Universität zu Berlin that bridges classical philology, corpus linguistics, and digital humanities through the application of natural language processing (NLP) to Latin and Ancient Greek literary texts. The project supports digital literary research by evaluating and adapting existing NLP models for ancient languages, with a focus on methodological triangulation, participatory research, and user-driven tool development. Key research questions addressed include the semantic and stylistic differences in epitaphs across Thucydides, Lysias, and Plato; the distinction between authentic invectives and declamationes in Ps.-Sallustius; and the identification of information gaps in ancient historiography. The project employs a range of NLP techniques—including word embeddings, named entity recognition (NER), morphological analysis, part-of-speech tagging, and sentiment analysis—using both rule-based and neural network approaches, including state-of-the-art models like XLM-Roberta and LatinCy. A central innovation lies in the use of research tandems, which bring together domain experts and non-specialists in a co-creation process to develop and refine NLP pipelines. The workflow emphasizes transparency, reproducibility, and accessibility, with results visualized through network graphs, tables, and annotated texts. Findings demonstrate that combining multiple methods enhances interpretive power, while LLM-based approaches enable rapid prototyping and exploration, particularly in low-resource settings. The project contributes to the development of digital literacies in classical scholarship and advances a new research paradigm - augmented reading - by integrating computational methods into traditional philological practice.},
language = {en},
author = {Beyer, Andrea and Schulz, Konstantin},
month = sep,
year = {2026},
keywords = {Computational Literary Studies, Daidalos, NLP},
}
Downloads: 0
{"_id":"o8aJLubPDa76rHv9a","bibbaseid":"beyer-schulz-nlpforliterarylatinandancientgreektexts-2026","author_short":["Beyer, A.","Schulz, K."],"bibdata":{"bibtype":"misc","type":"Talk","address":"Paris","title":"NLP for Literary Latin and Ancient Greek Texts","url":"https://zenodo.org/records/19422180","doi":"https://doi.org/10.5281/zenodo.19422180","abstract":"This presentation introduces the Daidalos Project, an interdisciplinary initiative at Humboldt-Universität zu Berlin that bridges classical philology, corpus linguistics, and digital humanities through the application of natural language processing (NLP) to Latin and Ancient Greek literary texts. The project supports digital literary research by evaluating and adapting existing NLP models for ancient languages, with a focus on methodological triangulation, participatory research, and user-driven tool development. Key research questions addressed include the semantic and stylistic differences in epitaphs across Thucydides, Lysias, and Plato; the distinction between authentic invectives and declamationes in Ps.-Sallustius; and the identification of information gaps in ancient historiography. The project employs a range of NLP techniques—including word embeddings, named entity recognition (NER), morphological analysis, part-of-speech tagging, and sentiment analysis—using both rule-based and neural network approaches, including state-of-the-art models like XLM-Roberta and LatinCy. A central innovation lies in the use of research tandems, which bring together domain experts and non-specialists in a co-creation process to develop and refine NLP pipelines. The workflow emphasizes transparency, reproducibility, and accessibility, with results visualized through network graphs, tables, and annotated texts. Findings demonstrate that combining multiple methods enhances interpretive power, while LLM-based approaches enable rapid prototyping and exploration, particularly in low-resource settings. The project contributes to the development of digital literacies in classical scholarship and advances a new research paradigm - augmented reading - by integrating computational methods into traditional philological practice.","language":"en","author":[{"propositions":[],"lastnames":["Beyer"],"firstnames":["Andrea"],"suffixes":[]},{"propositions":[],"lastnames":["Schulz"],"firstnames":["Konstantin"],"suffixes":[]}],"month":"September","year":"2026","keywords":"Computational Literary Studies, Daidalos, NLP","bibtex":"@misc{beyer_nlp_2026,\n\taddress = {Paris},\n\ttype = {Talk},\n\ttitle = {{NLP} for {Literary} {Latin} and {Ancient} {Greek} {Texts}},\n\turl = {https://zenodo.org/records/19422180},\n\tdoi = {https://doi.org/10.5281/zenodo.19422180},\n\tabstract = {This presentation introduces the Daidalos Project, an interdisciplinary initiative at Humboldt-Universität zu Berlin that bridges classical philology, corpus linguistics, and digital humanities through the application of natural language processing (NLP) to Latin and Ancient Greek literary texts. The project supports digital literary research by evaluating and adapting existing NLP models for ancient languages, with a focus on methodological triangulation, participatory research, and user-driven tool development. Key research questions addressed include the semantic and stylistic differences in epitaphs across Thucydides, Lysias, and Plato; the distinction between authentic invectives and declamationes in Ps.-Sallustius; and the identification of information gaps in ancient historiography. The project employs a range of NLP techniques—including word embeddings, named entity recognition (NER), morphological analysis, part-of-speech tagging, and sentiment analysis—using both rule-based and neural network approaches, including state-of-the-art models like XLM-Roberta and LatinCy. A central innovation lies in the use of research tandems, which bring together domain experts and non-specialists in a co-creation process to develop and refine NLP pipelines. The workflow emphasizes transparency, reproducibility, and accessibility, with results visualized through network graphs, tables, and annotated texts. Findings demonstrate that combining multiple methods enhances interpretive power, while LLM-based approaches enable rapid prototyping and exploration, particularly in low-resource settings. The project contributes to the development of digital literacies in classical scholarship and advances a new research paradigm - augmented reading - by integrating computational methods into traditional philological practice.},\n\tlanguage = {en},\n\tauthor = {Beyer, Andrea and Schulz, Konstantin},\n\tmonth = sep,\n\tyear = {2026},\n\tkeywords = {Computational Literary Studies, Daidalos, NLP},\n}\n\n","author_short":["Beyer, A.","Schulz, K."],"key":"beyer_nlp_2026","id":"beyer_nlp_2026","bibbaseid":"beyer-schulz-nlpforliterarylatinandancientgreektexts-2026","role":"author","urls":{"Paper":"https://zenodo.org/records/19422180"},"keyword":["Computational Literary Studies","Daidalos","NLP"],"metadata":{"authorlinks":{}}},"bibtype":"misc","biburl":"https://api.zotero.org/groups/2611296/items?key=ICuNoiZzSvo1XKdM2ToTCKMK&format=bibtex&limit=100","dataSources":["YTuauZBRxw9mBtqs3","JFDnASMkoQCjjGL8E","nsAxma8Gv9c6fai5n"],"keywords":["computational literary studies","daidalos","nlp"],"search_terms":["nlp","literary","latin","ancient","greek","texts","beyer","schulz"],"title":"NLP for Literary Latin and Ancient Greek Texts","year":2026}