HamleDT: Harmonized multi-language dependency treebank. Zeman, D., Dušek, O., Mareček, D., Popel, M., Ramasamy, L., Štěpánek, J., Žabokrtský, Z., & Hajič, J. Language Resources and Evaluation, 48(4):601–637, December, 2014.
HamleDT: Harmonized multi-language dependency treebank [link]Paper  doi  abstract   bibtex   
We present HamleDT—a HArmonized Multi-LanguagE Dependency Treebank. HamleDT is a compilation of existing dependency treebanks (or dependency conversions of other treebanks), transformed so that they all conform to the same annotation style. In the present article, we provide a thorough investigation and discussion of a number of phenomena that are comparable across languages, though their annotation in treebanks often differs. We claim that transformation procedures can be designed to automatically identify most such phenomena and convert them to a unified annotation style. This unification is beneficial both to comparative corpus linguistics and to machine learning of syntactic parsing.
@article{zeman_hamledt_2014,
	title = {{HamleDT}: {Harmonized} multi-language dependency treebank},
	volume = {48},
	issn = {1574-0218},
	url = {https://doi.org/10.1007/s10579-014-9275-2},
	doi = {10.1007/s10579-014-9275-2},
	abstract = {We present HamleDT—a HArmonized Multi-LanguagE Dependency Treebank. HamleDT is a compilation of existing dependency treebanks (or dependency conversions of other treebanks), transformed so that they all conform to the same annotation style. In the present article, we provide a thorough investigation and discussion of a number of phenomena that are comparable across languages, though their annotation in treebanks often differs. We claim that transformation procedures can be designed to automatically identify most such phenomena and convert them to a unified annotation style. This unification is beneficial both to comparative corpus linguistics and to machine learning of syntactic parsing.},
	number = {4},
	journal = {Language Resources and Evaluation},
	author = {Zeman, Daniel and Dušek, Ondřej and Mareček, David and Popel, Martin and Ramasamy, Loganathan and Štěpánek, Jan and Žabokrtský, Zdeněk and Hajič, Jan},
	month = dec,
	year = {2014},
	keywords = {Natural language processing, POS, dependency treebank},
	pages = {601--637},
}

Downloads: 0