The Ancient Greek and Latin Dependency Treebanks. Bamman, D. & Crane, G. In Sporleder, C., van den Bosch, A., & Zervanou, K., editors, Language Technology for Cultural Heritage, of Theory and Applications of Natural Language Processing, pages 79–98, Berlin, Heidelberg, 2011. Springer.
doi  abstract   bibtex   
This paper describes the development, composition, and several uses of the Ancient Greek and Latin Dependency Treebanks, large collections of Classical texts in which the syntactic, morphological and lexical information for eachword is made explicit. To date, over 200 individuals from around the world have collaborated to annotate over 350,000 words, including the entirety of Homer’s Iliad and Odyssey, Sophocles’ Ajax, all of the extant works of Hesiod and Aeschylus, and selections from Caesar, Cicero, Jerome, Ovid, Petronius, Propertius, Sallust and Vergil. While perhaps the most straightforward value of such an annotated corpus for Classical philology is the morphosyntactic searching it makes possible, it also enables a large number of downstream tasks as well, such as inducing the syntactic behavior of lexemes and automatically identifying similar passages between texts.
@inproceedings{bamman_ancient_2011,
	address = {Berlin, Heidelberg},
	series = {Theory and {Applications} of {Natural} {Language} {Processing}},
	title = {The {Ancient} {Greek} and {Latin} {Dependency} {Treebanks}},
	isbn = {978-3-642-20227-8},
	doi = {10.1007/978-3-642-20227-8_5},
	abstract = {This paper describes the development, composition, and several uses of the Ancient Greek and Latin Dependency Treebanks, large collections of Classical texts in which the syntactic, morphological and lexical information for eachword is made explicit. To date, over 200 individuals from around the world have collaborated to annotate over 350,000 words, including the entirety of Homer’s Iliad and Odyssey, Sophocles’ Ajax, all of the extant works of Hesiod and Aeschylus, and selections from Caesar, Cicero, Jerome, Ovid, Petronius, Propertius, Sallust and Vergil. While perhaps the most straightforward value of such an annotated corpus for Classical philology is the morphosyntactic searching it makes possible, it also enables a large number of downstream tasks as well, such as inducing the syntactic behavior of lexemes and automatically identifying similar passages between texts.},
	language = {en},
	booktitle = {Language {Technology} for {Cultural} {Heritage}},
	publisher = {Springer},
	author = {Bamman, David and Crane, Gregory},
	editor = {Sporleder, Caroline and van den Bosch, Antal and Zervanou, Kalliopi},
	year = {2011},
	keywords = {Ancient Greek, Latin, dependency grammar, digital libraries, treebanks},
	pages = {79--98},
}

Downloads: 0