Standoff Annotation for the Ancient Greek and Latin Dependency Treebank. Celano, G. G. A. In Proceedings of the 3rd International Conference on Digital Access to Textual Cultural Heritage, of DATeCH2019, pages 149–153, New York, NY, USA, 2019. Association for Computing Machinery. Paper doi abstract bibtex This contribution presents the work in progress to convert the Ancient Greek and Latin Dependency Treebank (AGLDT) into standoff annotation using PAULA XML. With an increasing number of annotations of any kind, it becomes more and more urgent that annotations related to the same texts be added standoff. Standoff annotation consists in adding any kind of annotation in separate documents, which are ultimately linked to a main text, the so-called "base text," which is meant to be unchangeable. References occur via a graph-based system of IDs, which allows an annotation layer (contained in a separate file) to be linked to another annotation layer (contained in another separate file). All the annotations/files create a labeled directed acyclic graph, whose root is represented by the base text. Standoff annotation enables easy interoperability and extension, in that single annotation layers can reference other layers of annotation independently, thus overcoming the problem of conflicting hierarchies. Moreover, standoff annotation also allows addition of different annotations of the same kind to the same text (e.g., two different interpretations of the POS tag for a given token). In the present contribution, I show how the annotations of the AGLDT can become standoff using PAULA XML, which is an open access format following the LAF principles. More precisely, I show the case study of Caesar's De Bello Civili. I detail the PAULA XML files created for its tokenization and sentence split, which are preliminary required to add morphosyntactic annotation.
@inproceedings{celano_standoff_2019,
address = {New York, NY, USA},
series = {{DATeCH2019}},
title = {Standoff {Annotation} for the {Ancient} {Greek} and {Latin} {Dependency} {Treebank}},
isbn = {978-1-4503-7194-0},
url = {https://doi.org/10.1145/3322905.3322919},
doi = {10.1145/3322905.3322919},
abstract = {This contribution presents the work in progress to convert the Ancient Greek and Latin Dependency Treebank (AGLDT) into standoff annotation using PAULA XML. With an increasing number of annotations of any kind, it becomes more and more urgent that annotations related to the same texts be added standoff. Standoff annotation consists in adding any kind of annotation in separate documents, which are ultimately linked to a main text, the so-called "base text," which is meant to be unchangeable. References occur via a graph-based system of IDs, which allows an annotation layer (contained in a separate file) to be linked to another annotation layer (contained in another separate file). All the annotations/files create a labeled directed acyclic graph, whose root is represented by the base text. Standoff annotation enables easy interoperability and extension, in that single annotation layers can reference other layers of annotation independently, thus overcoming the problem of conflicting hierarchies. Moreover, standoff annotation also allows addition of different annotations of the same kind to the same text (e.g., two different interpretations of the POS tag for a given token). In the present contribution, I show how the annotations of the AGLDT can become standoff using PAULA XML, which is an open access format following the LAF principles. More precisely, I show the case study of Caesar's De Bello Civili. I detail the PAULA XML files created for its tokenization and sentence split, which are preliminary required to add morphosyntactic annotation.},
urldate = {2023-04-26},
booktitle = {Proceedings of the 3rd {International} {Conference} on {Digital} {Access} to {Textual} {Cultural} {Heritage}},
publisher = {Association for Computing Machinery},
author = {Celano, Giuseppe G. A.},
year = {2019},
keywords = {Ancient Greek, Latin, PAULA XML, dependency treebank, standoff annotation},
pages = {149--153},
}
Downloads: 0
{"_id":"6Jfdn9ZfrMsYB5Wa4","bibbaseid":"celano-standoffannotationfortheancientgreekandlatindependencytreebank-2019","author_short":["Celano, G. G. A."],"bibdata":{"bibtype":"inproceedings","type":"inproceedings","address":"New York, NY, USA","series":"DATeCH2019","title":"Standoff Annotation for the Ancient Greek and Latin Dependency Treebank","isbn":"978-1-4503-7194-0","url":"https://doi.org/10.1145/3322905.3322919","doi":"10.1145/3322905.3322919","abstract":"This contribution presents the work in progress to convert the Ancient Greek and Latin Dependency Treebank (AGLDT) into standoff annotation using PAULA XML. With an increasing number of annotations of any kind, it becomes more and more urgent that annotations related to the same texts be added standoff. Standoff annotation consists in adding any kind of annotation in separate documents, which are ultimately linked to a main text, the so-called \"base text,\" which is meant to be unchangeable. References occur via a graph-based system of IDs, which allows an annotation layer (contained in a separate file) to be linked to another annotation layer (contained in another separate file). All the annotations/files create a labeled directed acyclic graph, whose root is represented by the base text. Standoff annotation enables easy interoperability and extension, in that single annotation layers can reference other layers of annotation independently, thus overcoming the problem of conflicting hierarchies. Moreover, standoff annotation also allows addition of different annotations of the same kind to the same text (e.g., two different interpretations of the POS tag for a given token). In the present contribution, I show how the annotations of the AGLDT can become standoff using PAULA XML, which is an open access format following the LAF principles. More precisely, I show the case study of Caesar's De Bello Civili. I detail the PAULA XML files created for its tokenization and sentence split, which are preliminary required to add morphosyntactic annotation.","urldate":"2023-04-26","booktitle":"Proceedings of the 3rd International Conference on Digital Access to Textual Cultural Heritage","publisher":"Association for Computing Machinery","author":[{"propositions":[],"lastnames":["Celano"],"firstnames":["Giuseppe","G.","A."],"suffixes":[]}],"year":"2019","keywords":"Ancient Greek, Latin, PAULA XML, dependency treebank, standoff annotation","pages":"149–153","bibtex":"@inproceedings{celano_standoff_2019,\n\taddress = {New York, NY, USA},\n\tseries = {{DATeCH2019}},\n\ttitle = {Standoff {Annotation} for the {Ancient} {Greek} and {Latin} {Dependency} {Treebank}},\n\tisbn = {978-1-4503-7194-0},\n\turl = {https://doi.org/10.1145/3322905.3322919},\n\tdoi = {10.1145/3322905.3322919},\n\tabstract = {This contribution presents the work in progress to convert the Ancient Greek and Latin Dependency Treebank (AGLDT) into standoff annotation using PAULA XML. With an increasing number of annotations of any kind, it becomes more and more urgent that annotations related to the same texts be added standoff. Standoff annotation consists in adding any kind of annotation in separate documents, which are ultimately linked to a main text, the so-called \"base text,\" which is meant to be unchangeable. References occur via a graph-based system of IDs, which allows an annotation layer (contained in a separate file) to be linked to another annotation layer (contained in another separate file). All the annotations/files create a labeled directed acyclic graph, whose root is represented by the base text. Standoff annotation enables easy interoperability and extension, in that single annotation layers can reference other layers of annotation independently, thus overcoming the problem of conflicting hierarchies. Moreover, standoff annotation also allows addition of different annotations of the same kind to the same text (e.g., two different interpretations of the POS tag for a given token). In the present contribution, I show how the annotations of the AGLDT can become standoff using PAULA XML, which is an open access format following the LAF principles. More precisely, I show the case study of Caesar's De Bello Civili. I detail the PAULA XML files created for its tokenization and sentence split, which are preliminary required to add morphosyntactic annotation.},\n\turldate = {2023-04-26},\n\tbooktitle = {Proceedings of the 3rd {International} {Conference} on {Digital} {Access} to {Textual} {Cultural} {Heritage}},\n\tpublisher = {Association for Computing Machinery},\n\tauthor = {Celano, Giuseppe G. A.},\n\tyear = {2019},\n\tkeywords = {Ancient Greek, Latin, PAULA XML, dependency treebank, standoff annotation},\n\tpages = {149--153},\n}\n\n\n\n","author_short":["Celano, G. G. A."],"key":"celano_standoff_2019","id":"celano_standoff_2019","bibbaseid":"celano-standoffannotationfortheancientgreekandlatindependencytreebank-2019","role":"author","urls":{"Paper":"https://doi.org/10.1145/3322905.3322919"},"keyword":["Ancient Greek","Latin","PAULA XML","dependency treebank","standoff annotation"],"metadata":{"authorlinks":{}}},"bibtype":"inproceedings","biburl":"https://bibbase.org/zotero-group/schulzkx/5158478","dataSources":["JFDnASMkoQCjjGL8E"],"keywords":["ancient greek","latin","paula xml","dependency treebank","standoff annotation"],"search_terms":["standoff","annotation","ancient","greek","latin","dependency","treebank","celano"],"title":"Standoff Annotation for the Ancient Greek and Latin Dependency Treebank","year":2019}