Meta-Learning to Compositionally Generalize. Conklin, H., Wang, B., Smith, K., & Titov, I. In Zong, C., Xia, F., Li, W., & Navigli, R., editors, Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 3322–3335, Online, August 2021. Association for Computational Linguistics.
Natural language is compositional; the meaning of a sentence is a function of the meaning of its parts. This property allows humans to create and interpret novel sentences, generalizing robustly outside their prior experience. Neural networks have been shown to struggle with this kind of generalization, in particular performing poorly on tasks designed to assess compositional generalization (i.e. where training and testing distributions differ in ways that would be trivial for a compositional strategy to resolve). Their poor performance on these tasks may in part be due to the nature of supervised learning which assumes training and testing data to be drawn from the same distribution. We implement a meta-learning augmented version of supervised learning whose objective directly optimizes for out-of-distribution generalization. We construct pairs of tasks for meta-learning by sub-sampling existing training data. Each pair of tasks is constructed to contain relevant examples, as determined by a similarity metric, in an effort to inhibit models from memorizing their input. Experimental results on the COGS and SCAN datasets show that our similarity-driven meta-learning can improve generalization performance.
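To make the described setup concrete, below is a minimal, hypothetical sketch of the idea in PyTorch: support tasks are sub-sampled from the existing training data, query tasks are built from examples selected by a similarity metric, and a first-order MAML-style meta objective updates the model on the query loss after adapting to the support task. The names (similarity_sample, meta_step, sim_fn, loss_fn) and the first-order approximation are assumptions for illustration, not the paper's released code.

# A minimal, hypothetical sketch of similarity-driven meta-learning over
# sub-sampled tasks, using a first-order MAML-style update in PyTorch.
# Names like `similarity_sample` and `meta_step` are illustrative only.
import copy
import torch


def similarity_sample(pool, support, k, sim_fn):
    # Score each candidate by its maximum similarity to any support example,
    # then keep the top-k; this builds a query task of "relevant" examples
    # so that memorizing the support set does not solve the query task.
    scored = [(max(sim_fn(x, s) for s in support), x) for x in pool]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [x for _, x in scored[:k]]


def meta_step(model, loss_fn, support, query, inner_lr, outer_opt):
    # Inner loop: adapt a copy of the model to the support task.
    adapted = copy.deepcopy(model)
    inner_opt = torch.optim.SGD(adapted.parameters(), lr=inner_lr)
    inner_opt.zero_grad()
    loss_fn(adapted, support).backward()
    inner_opt.step()

    # Outer loop: the adapted model's loss on the held-out query task drives
    # the update of the original parameters (first-order approximation: the
    # query-loss gradients are copied back rather than backpropagated through
    # the inner update).
    outer_opt.zero_grad()
    query_loss = loss_fn(adapted, query)
    query_loss.backward()
    for p, p_adapted in zip(model.parameters(), adapted.parameters()):
        p.grad = p_adapted.grad.detach().clone()
    outer_opt.step()
    return query_loss.item()


# Usage sketch: the support task is a random sub-sample of the training data;
# the query task contains examples chosen for similarity to the support set.
# support = random.sample(train_data, 32)
# query = similarity_sample(train_data, support, k=32, sim_fn=my_similarity)
# meta_step(model, my_loss_fn, support, query, inner_lr=1e-3, outer_opt=opt)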
@inproceedings{conklinMetaLearningCompositionallyGeneralize2021,
  title = {Meta-{{Learning}} to {{Compositionally Generalize}}},
  booktitle = {Proceedings of the 59th {{Annual Meeting}} of the {{Association}} for {{Computational Linguistics}} and the 11th {{International Joint Conference}} on {{Natural Language Processing}} ({{Volume}} 1: {{Long Papers}})},
  author = {Conklin, Henry and Wang, Bailin and Smith, Kenny and Titov, Ivan},
  editor = {Zong, Chengqing and Xia, Fei and Li, Wenjie and Navigli, Roberto},
  year = {2021},
  month = aug,
  pages = {3322--3335},
  publisher = {Association for Computational Linguistics},
  address = {Online},
  doi = {10.18653/v1/2021.acl-long.258},
  url = {https://aclanthology.org/2021.acl-long.258},
  urldate = {2024-03-19},
  abstract = {Natural language is compositional; the meaning of a sentence is a function of the meaning of its parts. This property allows humans to create and interpret novel sentences, generalizing robustly outside their prior experience. Neural networks have been shown to struggle with this kind of generalization, in particular performing poorly on tasks designed to assess compositional generalization (i.e. where training and testing distributions differ in ways that would be trivial for a compositional strategy to resolve). Their poor performance on these tasks may in part be due to the nature of supervised learning which assumes training and testing data to be drawn from the same distribution. We implement a meta-learning augmented version of supervised learning whose objective directly optimizes for out-of-distribution generalization. We construct pairs of tasks for meta-learning by sub-sampling existing training data. Each pair of tasks is constructed to contain relevant examples, as determined by a similarity metric, in an effort to inhibit models from memorizing their input. Experimental results on the COGS and SCAN datasets show that our similarity-driven meta-learning can improve generalization performance.},
}
