Spanish Abstract Meaning Representation: Annotation of a General Corpus

Spanish Abstract Meaning Representation: Annotation of a General Corpus. Wein, S., Donatelli, L., Ricker, E., Engstrom, C., Nelson, A., Harter, L., & Schneider, N. In Northern European Journal of Language Technology, Volume 8, Copenhagen, Denmark, 2022. Northern European Association of Language Technology.

Paper doi abstract bibtex 7 downloads

Abstract Meaning Representation (AMR), originally designed for English, has been adapted to a number of languages to facilitate cross-lingual semantic representation and analysis. We build on previous work and present the first sizable, general annotation project for Spanish AMR. We release a detailed set of annotation guidelines and a corpus of 486 gold-annotated sentences spanning multiple genres from an existing, cross-lingual AMR corpus. Our work constitutes the second largest non-English gold AMR corpus to date. Fine-tuning an AMR to-Spanish generation model with our annotations results in a BERTScore improvement of 8.8%, demonstrating initial utility of our work.

@inproceedings{wein-2022-spanish,
    title = "{S}panish {A}bstract {M}eaning {R}epresentation: Annotation of a General Corpus",
    author = "Wein, Shira  and
      Donatelli, Lucia  and
      Ricker, Ethan  and
      Engstrom, Calvin  and
      Nelson, Alex  and
      Harter, Leonie  and
      Schneider, Nathan",
    booktitle = "Northern European Journal of Language Technology, Volume 8",
    year = "2022",
    address = "Copenhagen, Denmark",
    publisher = "Northern European Association of Language Technology",
    url = "https://aclanthology.org/2022.nejlt-1.6",
    doi = "https://doi.org/10.3384/nejlt.2000-1533.2022.4462",
    abstract = "Abstract Meaning Representation (AMR), originally designed for English, has been adapted to a number of languages to facilitate cross-lingual semantic representation and analysis. We build on previous work and present the first sizable, general annotation project for Spanish AMR. We release a detailed set of annotation guidelines and a corpus of 486 gold-annotated sentences spanning multiple genres from an existing, cross-lingual AMR corpus. Our work constitutes the second largest non-English gold AMR corpus to date. Fine-tuning an AMR to-Spanish generation model with our annotations results in a BERTScore improvement of 8.8{\%}, demonstrating initial utility of our work.",
}

Downloads: 7

{"_id":"aXaxtGWu7WEv9BYMS","bibbaseid":"wein-donatelli-ricker-engstrom-nelson-harter-schneider-spanishabstractmeaningrepresentationannotationofageneralcorpus-2022","author_short":["Wein, S.","Donatelli, L.","Ricker, E.","Engstrom, C.","Nelson, A.","Harter, L.","Schneider, N."],"bibdata":{"bibtype":"inproceedings","type":"inproceedings","title":"Spanish Abstract Meaning Representation: Annotation of a General Corpus","author":[{"propositions":[],"lastnames":["Wein"],"firstnames":["Shira"],"suffixes":[]},{"propositions":[],"lastnames":["Donatelli"],"firstnames":["Lucia"],"suffixes":[]},{"propositions":[],"lastnames":["Ricker"],"firstnames":["Ethan"],"suffixes":[]},{"propositions":[],"lastnames":["Engstrom"],"firstnames":["Calvin"],"suffixes":[]},{"propositions":[],"lastnames":["Nelson"],"firstnames":["Alex"],"suffixes":[]},{"propositions":[],"lastnames":["Harter"],"firstnames":["Leonie"],"suffixes":[]},{"propositions":[],"lastnames":["Schneider"],"firstnames":["Nathan"],"suffixes":[]}],"booktitle":"Northern European Journal of Language Technology, Volume 8","year":"2022","address":"Copenhagen, Denmark","publisher":"Northern European Association of Language Technology","url":"https://aclanthology.org/2022.nejlt-1.6","doi":"https://doi.org/10.3384/nejlt.2000-1533.2022.4462","abstract":"Abstract Meaning Representation (AMR), originally designed for English, has been adapted to a number of languages to facilitate cross-lingual semantic representation and analysis. We build on previous work and present the first sizable, general annotation project for Spanish AMR. We release a detailed set of annotation guidelines and a corpus of 486 gold-annotated sentences spanning multiple genres from an existing, cross-lingual AMR corpus. Our work constitutes the second largest non-English gold AMR corpus to date. Fine-tuning an AMR to-Spanish generation model with our annotations results in a BERTScore improvement of 8.8%, demonstrating initial utility of our work.","bibtex":"@inproceedings{wein-2022-spanish,\n title = \"{S}panish {A}bstract {M}eaning {R}epresentation: Annotation of a General Corpus\",\n author = \"Wein, Shira and\n Donatelli, Lucia and\n Ricker, Ethan and\n Engstrom, Calvin and\n Nelson, Alex and\n Harter, Leonie and\n Schneider, Nathan\",\n booktitle = \"Northern European Journal of Language Technology, Volume 8\",\n year = \"2022\",\n address = \"Copenhagen, Denmark\",\n publisher = \"Northern European Association of Language Technology\",\n url = \"https://aclanthology.org/2022.nejlt-1.6\",\n doi = \"https://doi.org/10.3384/nejlt.2000-1533.2022.4462\",\n abstract = \"Abstract Meaning Representation (AMR), originally designed for English, has been adapted to a number of languages to facilitate cross-lingual semantic representation and analysis. We build on previous work and present the first sizable, general annotation project for Spanish AMR. We release a detailed set of annotation guidelines and a corpus of 486 gold-annotated sentences spanning multiple genres from an existing, cross-lingual AMR corpus. Our work constitutes the second largest non-English gold AMR corpus to date. Fine-tuning an AMR to-Spanish generation model with our annotations results in a BERTScore improvement of 8.8{\\%}, demonstrating initial utility of our work.\",\n}\n\n","author_short":["Wein, S.","Donatelli, L.","Ricker, E.","Engstrom, C.","Nelson, A.","Harter, L.","Schneider, N."],"key":"wein-2022-spanish","id":"wein-2022-spanish","bibbaseid":"wein-donatelli-ricker-engstrom-nelson-harter-schneider-spanishabstractmeaningrepresentationannotationofageneralcorpus-2022","role":"author","urls":{"Paper":"https://aclanthology.org/2022.nejlt-1.6"},"metadata":{"authorlinks":{}},"downloads":7},"bibtype":"inproceedings","biburl":"https://shirawein.github.io/pubs.bib","dataSources":["wbK3th2Wubb6zoLMk"],"keywords":[],"search_terms":["spanish","abstract","meaning","representation","annotation","general","corpus","wein","donatelli","ricker","engstrom","nelson","harter","schneider"],"title":"Spanish Abstract Meaning Representation: Annotation of a General Corpus","year":2022,"downloads":7}