Discovering Mathematical Objects of Interest — A Study of Mathematical Notations

Discovering Mathematical Objects of Interest — A Study of Mathematical Notations. Greiner-Petter, A., Schubotz, M., Müller, F., Breitinger, C., Cohl, H., Aizawa, A., & Gipp, B. In Proceedings of The Web Conference (WWW), pages 1445–1456, Taipei, Taiwan, April, 2020. ACM. Core Rank A*

Mathematical notation, i.e., the writing system used to communicate concepts in mathematics, encodes valuable information for a variety of information search and retrieval systems. Yet, mathematical notations remain mostly unutilized by today's systems. In this paper, we present the first in-depth study on the distributions of mathematical notation in two large scientific corpora: the open access arXiv (2.5B mathematical objects) and the mathematical reviewing service for pure and applied mathematics zbMATH (61M mathematical objects). Our study lays a foundation for future research projects on mathematical information retrieval for large scientific corpora. Further, we demonstrate the relevance of our results to a variety of use-cases. For example, to assist semantic extraction systems, to improve scientific search engines, and to facilitate specialized math recommendation systems. The contributions of our presented research are as follows: (1) we present the first distributional analysis of mathematical formulae on arXiv and zbMATH; (2) we retrieve relevant mathematical objects for given textual search queries (e.g., linking $P_{n}^{(\alpha, \beta)} \left(x\right)$ with `Jacobi polynomial'); (3) we extend zbMATH's search engine by providing relevant mathematical formulae; and (4) we exemplify the applicability of the results by presenting auto-completion for math inputs as the first contribution to math recommendation systems. To expedite future research projects, we have made available our source code and data.

@inproceedings{BibbaseGreinerPetterSMB20,
	address = {Taipei, Taiwan},
	title = {Discovering {Mathematical} {Objects} of {Interest} — {A} {Study} of {Mathematical} {Notations}},
	isbn = {978-1-4503-7023-3},
	url = {https://arxiv.org/abs/2002.02712},
	doi = {10.1145/3366423.3380218},
	abstract = {Mathematical notation, i.e., the writing system used to communicate concepts in mathematics, encodes valuable information for a variety of information search and retrieval systems. Yet, mathematical notations remain mostly unutilized by today's systems. In this paper, we present the first in-depth study on the distributions of mathematical notation in two large scientific corpora: the open access arXiv (2.5B mathematical objects) and the mathematical reviewing service for pure and applied mathematics zbMATH (61M mathematical objects). Our study lays a foundation for future research projects on mathematical information retrieval for large scientific corpora. Further, we demonstrate the relevance of our results to a variety of use-cases. For example, to assist semantic extraction systems, to improve scientific search engines, and to facilitate specialized math recommendation systems.

The contributions of our presented research are as follows: (1) we present the first distributional analysis of mathematical formulae on arXiv and zbMATH; (2) we retrieve relevant mathematical objects for given textual search queries (e.g., linking $P_{n}^{(\alpha, \beta)} \left(x\right)$ with `Jacobi polynomial'); (3) we extend zbMATH's search engine by providing relevant mathematical formulae; and (4) we exemplify the applicability of the results by presenting auto-completion for math inputs as the first contribution to math recommendation systems. To expedite future research projects, we have made available our source code and data.},
	language = {en},
	urldate = {2021-07-30},
	booktitle = {Proceedings of {The} {Web} {Conference} ({WWW})},
	publisher = {ACM},
	author = {Greiner-Petter, Andre and Schubotz, Moritz and Müller, Fabian and Breitinger, Corinna and Cohl, Howard and Aizawa, Akiko and Gipp, Bela},
	month = apr,
	year = {2020},
	note = {Core Rank A*},
	pages = {1445--1456},
}
