A Diverse Corpus for Evaluating and Developing English Math Word Problem Solvers. Miao, S.; Liang, C.; and Su, K. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 975–984, 2020. Association for Computational Linguistics.
abstract   bibtex   
We present ASDiv (Academia Sinica Di-verse MWP Dataset), a diverse (in terms of both language patterns and problem types) English math word problem (MWP) corpus for evaluating the capability of various MWP solvers. Existing MWP corpora for studying AI progress remain limited either in language usage patterns or in problem types. We thus present a new English MWP corpus with 2,305 MWPs that cover more text patterns and most problem types taught in elementary school. Each MWP is annotated with its problem type and grade level(for indicating the level of difficulty). Fur/-thermore, we propose a metric to measure the lexicon usage diversity of a given MWP corpus, and demonstrate that ASDiv is more diverse than existing corpora. Experiments show that our proposed corpus reflects the true capability of MWP solvers more faith-fully.
@inproceedings{miao_diverse_2020,
	title = {A {Diverse} {Corpus} for {Evaluating} and {Developing} {English} {Math} {Word} {Problem} {Solvers}},
	abstract = {We  present ASDiv (Academia Sinica Di-verse MWP Dataset), a diverse (in terms of both language patterns and problem types) English math word problem (MWP) corpus for  evaluating  the  capability  of  various MWP solvers. Existing MWP corpora for studying AI progress remain limited either in language usage patterns or in problem types. We thus present a new English MWP corpus with 2,305 MWPs that cover more text patterns and most problem types taught in elementary school. Each MWP is annotated with its problem type and grade level(for indicating the level of difficulty). Fur/-thermore, we propose a metric to measure the lexicon usage diversity of a given MWP corpus, and demonstrate that ASDiv is more diverse than existing corpora. Experiments show that our proposed corpus reflects the true capability of MWP solvers more faith-fully.},
	booktitle = {Proceedings of the 58th {Annual} {Meeting} of the {Association} for {Computational} {Linguistics}},
	publisher = {Association for Computational Linguistics},
	author = {Miao, Shen-Yun and Liang, Chao-Chun and Su, Keh-Yih},
	year = {2020},
	keywords = {computational linguistics, corpus, dataset, uses sympy},
	pages = {975--984},
}
Downloads: 0