Do the Math: Making Mathematics in Wikipedia Computable. Greiner-Petter, A., Schubotz, M., Breitinger, C., Scharpf, P., Aizawa, A., & Gipp, B. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(4):4384–4395, 2023. Journal Rank Q1; IF: 24.314
Wikipedia combines the power of AI solutions and human reviewers to safeguard article quality. Quality control objectives include detecting malicious edits, fixing typos, and spotting inconsistent formatting. However, no automated quality control mechanisms currently exist for mathematical formulae. Spell checkers are widely used to highlight textual errors, yet no equivalent tool exists to detect algebraically incorrect formulae. Our paper addresses this shortcoming by making mathematical formulae computable. We present a method that (1) gathers the semantic information surrounding the context of each mathematical formula, (2) provides access to the information in a graph-structured dependency hierarchy, and (3) performs automatic plausibility checks on equations. We evaluate the performance of our approach on 6,337 mathematical expressions contained in 104 Wikipedia articles on the topic of orthogonal polynomials and special functions. Our system, LaCASt, verified 358 out of 1,516 equations as error-free. LaCASt successfully translated 27% of the mathematical expressions and outperformed existing translation approaches by 16%. Additionally, LaCASt achieved an F1 score of .495 for annotating mathematical expressions with relevant textual descriptions, which is a significant step towards advancing searchability, readability, and accessibility of mathematical formulae in Wikipedia. A prototype of LaCASt and the semantically enhanced Wikipedia articles are available at: https://tpami.wmflabs.org.
@article{BibbaseGreinerPetter23b,
	title = {Do the {Math}: {Making} {Mathematics} in {Wikipedia} {Computable}},
	volume = {45},
	issn = {0162-8828, 1939-3539},
	shorttitle = {Do the {Math}},
	url = {https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9847017},
	doi = {10.1109/TPAMI.2022.3195261},
	abstract = {Wikipedia combines the power of AI solutions and human reviewers to safeguard article quality. Quality control objectives include detecting malicious edits, fixing typos, and spotting inconsistent formatting. However, no automated quality control mechanisms currently exist for mathematical formulae. Spell checkers are widely used to highlight textual errors, yet no equivalent tool exists to detect algebraically incorrect formulae. Our paper addresses this shortcoming by making mathematical formulae computable.

We present a method that (1) gathers the semantic information surrounding the context of each mathematical formula, (2) provides access to the information in a graph-structured dependency hierarchy, and (3) performs automatic plausibility checks on equations. We evaluate the performance of our approach on 6,337 mathematical expressions contained in 104 Wikipedia articles on the topic of orthogonal polynomials and special functions. Our system, LaCASt, verified 358 out of 1,516 equations as error-free. LaCASt successfully translated 27\% of the mathematical expressions and outperformed existing translation approaches by 16\%. Additionally, LaCASt achieved an F1 score of .495 for annotating mathematical expressions with relevant textual descriptions, which is a significant step towards advancing searchability, readability, and accessibility of mathematical formulae in Wikipedia.

A prototype of LaCASt and the semantically enhanced Wikipedia articles are available at: https://tpami.wmflabs.org.},
	number = {4},
	urldate = {2022-10-03},
	journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence},
	author = {Greiner-Petter, Andre and Schubotz, Moritz and Breitinger, Corinna and Scharpf, Philipp and Aizawa, Akiko and Gipp, Bela},
	year = {2023},
	note = {Journal Rank Q1; IF: 24.314},
	pages = {4384--4395},
}
