Analysis of Wikipedia-based Corpora for Question Answering. Jurczyk, T., Deshmane, A., & Choi, J. D. Technical Report 1801.02073, ArXiv, 2018.
Analysis of Wikipedia-based Corpora for Question Answering [link]Paper  abstract   bibtex   
This paper gives comprehensive analyses of corpora based on Wikipedia for several tasks in question answering. Four recent corpora are collected, WIKIQA, SELQA, SQUAD, and INFOBOXQA, and first analyzed intrinsically by contextual similarities, question types, and answer categories. These corpora are then analyzed extrinsically by three question answering tasks, answer retrieval, selection, and triggering. An indexing-based method for the creation of a silver-standard dataset for answer retrieval using the entire Wikipedia is also presented. Our analysis shows the uniqueness of these corpora and suggests a better use of them for statistical question answering learning.
@techreport{jurczyk:18a,
	abstract = {This paper gives comprehensive analyses of corpora based on Wikipedia for several tasks in question answering. Four recent corpora are collected, WIKIQA, SELQA, SQUAD, and INFOBOXQA, and first analyzed intrinsically by contextual similarities, question types, and answer categories. These corpora are then analyzed extrinsically by three question answering tasks, answer retrieval, selection, and triggering. An indexing-based method for the creation of a silver-standard dataset for answer retrieval using the entire Wikipedia is also presented. Our analysis shows the uniqueness of these corpora and suggests a better use of them for statistical question answering learning.},
	author = {Jurczyk, Tomasz and Deshmane, Amit and Choi, Jinho D.},
	date-added = {2018-02-23 18:11:43 +0000},
	date-modified = {2018-05-04 14:14:53 +0000},
	institution = {ArXiv},
	keywords = {emorynlp,selected},
	number = {1801.02073},
	title = {Analysis of Wikipedia-based Corpora for Question Answering},
	url = {https://www.researchgate.net/publication/324941689_Analysis_of_Wikipedia-based_Corpora_for_Question_Answering},
	year = {2018},
	Bdsk-Url-1 = {https://arxiv.org/abs/1801.02073}}

Downloads: 0