Linguistic generalization and compositionality in modern artificial neural networks. Baroni, M. Philosophical Transactions of the Royal Society B: Biological Sciences, 2020.
Linguistic generalization and compositionality in modern artificial neural networks [link]Paper  doi  abstract   bibtex   2 downloads  
In the last decade, deep artificial neural networks have achieved astounding performance in many natural language-processing tasks. Given the high productivity of language, these models must possess effective generalization abilities. It is widely assumed that humans handle linguistic productivity by means of algebraic compositional rules: are deep networks similarly compositional? After reviewing the main innovations characterizing current deep language-processing networks, I discuss a set of studies suggesting that deep networks are capable of subtle grammar-dependent generalizations, but also that they do not rely on systematic compositional rules. I argue that the intriguing behaviour of these devices (still awaiting a full understanding) should be of interest to linguists and cognitive scientists, as it offers a new perspective on possible computational strategies to deal with linguistic productivity beyond rule-based compositionality, and it might lead to new insights into the less systematic generalization patterns that also appear in natural language.
@article{Baroni2020,
abstract = {In the last decade, deep artificial neural networks have achieved astounding performance in many natural language-processing tasks. Given the high productivity of language, these models must possess effective generalization abilities. It is widely assumed that humans handle linguistic productivity by means of algebraic compositional rules: are deep networks similarly compositional? After reviewing the main innovations characterizing current deep language-processing networks, I discuss a set of studies suggesting that deep networks are capable of subtle grammar-dependent generalizations, but also that they do not rely on systematic compositional rules. I argue that the intriguing behaviour of these devices (still awaiting a full understanding) should be of interest to linguists and cognitive scientists, as it offers a new perspective on possible computational strategies to deal with linguistic productivity beyond rule-based compositionality, and it might lead to new insights into the less systematic generalization patterns that also appear in natural language.},
archivePrefix = {arXiv},
arxivId = {1904.00157},
author = {Baroni, Marco},
doi = {10.1098/rstb.2019.0307},
eprint = {1904.00157},
file = {:Users/shanest/Documents/Library/Baroni/Philosophical Transactions of the Royal Society B Biological Sciences/Baroni - 2020 - Linguistic generalization and compositionality in modern artificial neural networks.pdf:pdf},
issn = {0962-8436},
journal = {Philosophical Transactions of the Royal Society B: Biological Sciences},
keywords = {phenomenon: compositionality,survey},
number = {1791},
title = {{Linguistic generalization and compositionality in modern artificial neural networks}},
url = {https://royalsocietypublishing.org/doi/10.1098/rstb.2019.0307},
volume = {375},
year = {2020}
}

Downloads: 2