A Structural Probe for Finding Syntax in Word Representations. Hewitt, J. & Manning, C. D. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 4129–4138, Stroudsburg, PA, USA, 2019. Association for Computational Linguistics.
Recent work has improved our ability to detect linguistic knowledge in word representations. However, current methods for detecting syntactic knowledge do not test whether syntax trees are represented in their entirety. In this work, we propose a structural probe, which evaluates whether syntax trees are embedded in a linear transformation of a neural network's word representation space. The probe identifies a linear transformation under which squared L2 distance encodes the distance between words in the parse tree, and one in which squared L2 norm encodes depth in the parse tree. Using our probe, we show that such transformations exist for both ELMo and BERT but not in baselines, providing evidence that entire syntax trees are embedded implicitly in deep models' vector geometry.
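The two quantities named in the abstract, squared L2 distance and squared L2 norm under a linear transformation, can be sketched as follows. This is a minimal illustration, not the authors' implementation: the names `H`, `B`, `squared_probe_distances`, and `squared_probe_norms` are hypothetical, the word vectors are random, and a random matrix stands in for a learned transformation. Training the transformation against parse-tree distances and depths is omitted.

```python
import numpy as np

def squared_probe_distances(H, B):
    """Pairwise squared L2 distances between the rows of H after applying B."""
    T = H @ B.T                               # transformed vectors, shape (n_words, rank)
    diff = T[:, None, :] - T[None, :, :]      # all pairwise differences via broadcasting
    return (diff ** 2).sum(axis=-1)           # shape (n_words, n_words)

def squared_probe_norms(H, B):
    """Squared L2 norm of each row of H after applying B."""
    T = H @ B.T
    return (T ** 2).sum(axis=-1)              # shape (n_words,)

# Hypothetical data: 5 word vectors of dimension 8, and a rank-3 transformation.
rng = np.random.default_rng(0)
H = rng.normal(size=(5, 8))
B = rng.normal(size=(3, 8))                   # assumption: random, not learned

D = squared_probe_distances(H, B)             # would be compared with tree distances
N = squared_probe_norms(H, B)                 # would be compared with tree depths
```

With a trained transformation, entry `D[i, j]` would be compared against the number of edges between words `i` and `j` in the parse tree, and `N[i]` against the depth of word `i`.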
@inproceedings{Hewitt2019,
abstract = {Recent work has improved our ability to detect linguistic knowledge in word representations. However, current methods for detecting syntactic knowledge do not test whether syntax trees are represented in their entirety. In this work, we propose a structural probe, which evaluates whether syntax trees are embedded in a linear transformation of a neural network's word representation space. The probe identifies a linear transformation under which squared L2 distance encodes the distance between words in the parse tree, and one in which squared L2 norm encodes depth in the parse tree. Using our probe, we show that such transformations exist for both ELMo and BERT but not in baselines, providing evidence that entire syntax trees are embedded implicitly in deep models' vector geometry.},
address = {Stroudsburg, PA, USA},
author = {Hewitt, John and Manning, Christopher D.},
booktitle = {Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)},
doi = {10.18653/v1/N19-1419},
keywords = {method: geometry,phenomenon: dependency parsing},
pages = {4129--4138},
publisher = {Association for Computational Linguistics},
title = {{A Structural Probe for Finding Syntax in Word Representations}},
url = {http://aclweb.org/anthology/N19-1419},
year = {2019}
}
