Effect of Source Language on AMR Structure

Effect of Source Language on AMR Structure. Wein, S., Leung, W. C., Mu, Y., & Schneider, N. In Proceedings of the 16th Linguistic Annotation Workshop (LAW-XVI) within LREC2022, pages 97–102, Marseille, France, June, 2022. European Language Resources Association.


The Abstract Meaning Representation (AMR) annotation schema was originally designed for English. But the formalism has since been adapted for annotation in a variety of languages. Meanwhile, cross-lingual parsers have been developed to derive English AMR representations for sentences from other languages—implicitly assuming that English AMR can approximate an interlingua. In this work, we investigate the similarity of AMR annotations in parallel data and how much the language matters in terms of the graph structure. We set out to quantify the effect of sentence language on the structure of the parsed AMR. As a case study, we take parallel AMR annotations from Mandarin Chinese and English AMRs, and replace all Chinese concepts with equivalent English tokens. We then compare the two graphs via the Smatch metric as a measure of structural similarity. We find that source language has a dramatic impact on AMR structure, with Smatch scores below 50% between English and Chinese graphs in our sample—an important reference point for interpreting Smatch scores in cross-lingual AMR parsing.
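The comparison described in the abstract (relabel the Chinese concepts with English tokens, then score the overlap between the two graphs) can be illustrated with a minimal sketch. Everything below is an assumption for illustration only: the toy graphs, the `lexicon` mapping, and the helper names `relabel` and `triple_f1` are hypothetical and not taken from the paper. The sketch also assumes the variable names in the two graphs are already aligned, whereas the actual Smatch metric searches for the variable mapping that maximizes the number of matching triples.

```python
from typing import Dict, Set, Tuple

# A triple is (relation, source, target), e.g. ("instance", "b", "boy").
Triple = Tuple[str, str, str]


def relabel(triples: Set[Triple], lexicon: Dict[str, str]) -> Set[Triple]:
    """Replace concept labels on instance triples using a translation lexicon.

    Concepts missing from the lexicon are left unchanged.
    """
    return {
        (rel, src, lexicon.get(tgt, tgt)) if rel == "instance" else (rel, src, tgt)
        for rel, src, tgt in triples
    }


def triple_f1(candidate: Set[Triple], reference: Set[Triple]) -> float:
    """F1 over exactly matching triples, with variables assumed pre-aligned."""
    matched = len(candidate & reference)
    if matched == 0:
        return 0.0
    precision = matched / len(candidate)
    recall = matched / len(reference)
    return 2 * precision * recall / (precision + recall)


# Toy English graph (illustrative, not from the paper's data).
english: Set[Triple] = {
    ("instance", "w", "want-01"),
    ("instance", "b", "boy"),
    ("instance", "g", "go-02"),
    ("ARG0", "w", "b"),
    ("ARG1", "w", "g"),
    ("ARG0", "g", "b"),
}

# Toy Chinese graph with one structural difference: no ("ARG0", "g", "b") edge.
chinese: Set[Triple] = {
    ("instance", "w", "想"),
    ("instance", "b", "男孩"),
    ("instance", "g", "去"),
    ("ARG0", "w", "b"),
    ("ARG1", "w", "g"),
}

# Hypothetical concept translations.
lexicon: Dict[str, str] = {"想": "want-01", "男孩": "boy", "去": "go-02"}

score = triple_f1(relabel(chinese, lexicon), english)
print(f"Triple-overlap F1: {score:.3f}")
```

Because the concepts are relabeled before scoring, any remaining gap between the two graphs in this toy case comes from structure (here, the missing reentrant edge) rather than from vocabulary.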

@inproceedings{wein-etal-2022-effect,
    title = "Effect of Source Language on {AMR} Structure",
    author = "Wein, Shira  and
      Leung, Wai Ching  and
      Mu, Yifu  and
      Schneider, Nathan",
    booktitle = "Proceedings of the 16th Linguistic Annotation Workshop (LAW-XVI) within LREC2022",
    month = jun,
    year = "2022",
    address = "Marseille, France",
    publisher = "European Language Resources Association",
    url = "https://aclanthology.org/2022.law-1.12",
    pages = "97--102",
    abstract = "The Abstract Meaning Representation (AMR) annotation schema was originally designed for English. But the formalism has since been adapted for annotation in a variety of languages. Meanwhile, cross-lingual parsers have been developed to derive English AMR representations for sentences from other languages{---}implicitly assuming that English AMR can approximate an interlingua. In this work, we investigate the similarity of AMR annotations in parallel data and how much the language matters in terms of the graph structure. We set out to quantify the effect of sentence language on the structure of the parsed AMR. As a case study, we take parallel AMR annotations from Mandarin Chinese and English AMRs, and replace all Chinese concepts with equivalent English tokens. We then compare the two graphs via the Smatch metric as a measure of structural similarity. We find that source language has a dramatic impact on AMR structure, with Smatch scores below 50{\%} between English and Chinese graphs in our sample{---}an important reference point for interpreting Smatch scores in cross-lingual AMR parsing.",
}

