Bioinformatics and Classical Literary Study. Chaudhuri, P. & Dexter, J. P. Journal of Data Mining & Digital Humanities, August, 2017. Publisher: Episciences.org
Bioinformatics and Classical Literary Study [link]Paper  doi  abstract   bibtex   
This paper describes the Quantitative Criticism Lab, a collaborative initiative between classicists, quantitative biologists, and computer scientists to apply ideas and methods drawn from the sciences to the study of literature. A core goal of the project is the use of computational biology, natural language processing, and machine learning techniques to investigate authorial style, intertextuality, and related phenomena of literary significance. As a case study in our approach, here we review the use of sequence alignment, a common technique in genomics and computational linguistics, to detect intertextuality in Latin literature. Sequence alignment is distinguished by its ability to find inexact verbal similarities, which makes it ideal for identifying phonetic echoes in large corpora of Latin texts. Although especially suited to Latin, sequence alignment in principle can be extended to many other languages.
@article{chaudhuri_bioinformatics_2017,
	title = {Bioinformatics and {Classical} {Literary} {Study}},
	volume = {Numéro spécial sur le traitement assisté par ordinateur de l‘intertextualité dans les langues anciennes},
	issn = {2416-5999},
	url = {https://jdmdh.episciences.org/3807},
	doi = {10.46298/jdmdh.1386},
	abstract = {This paper describes the Quantitative Criticism Lab, a collaborative initiative between classicists, quantitative biologists, and computer scientists to apply ideas and methods drawn from the sciences to the study of literature. A core goal of the project is the use of computational biology, natural language processing, and machine learning techniques to investigate authorial style, intertextuality, and related phenomena of literary significance. As a case study in our approach, here we review the use of sequence alignment, a common technique in genomics and computational linguistics, to detect intertextuality in Latin literature. Sequence alignment is distinguished by its ability to find inexact verbal similarities, which makes it ideal for identifying phonetic echoes in large corpora of Latin texts. Although especially suited to Latin, sequence alignment in principle can be extended to many other languages.},
	number = {Project presentations},
	urldate = {2023-08-26},
	journal = {Journal of Data Mining \& Digital Humanities},
	author = {Chaudhuri, Pramit and Dexter, Joseph P.},
	month = aug,
	year = {2017},
	note = {Publisher: Episciences.org},
}

Downloads: 0