ARQMath Lab: An Incubator for Semantic Formula Search in zbMATH Open?. Scharpf, P., Schubotz, M., Greiner-Petter, A., Ostendorff, M., Teschke, O., & Gipp, B. In Working Notes of (CLEF) 2020 - Conference and Labs of the Evaluation Forum, volume 2696, Thessaloniki, Greece, 2020. CEUR-WS.org.
ARQMath Lab: An Incubator for Semantic Formula Search in zbMATH Open? [pdf]Paper  abstract   bibtex   
The zbMATH database contains more than 4 million bibliographic entries. We aim to provide easy access to these entries. Therefore, we maintain dif-ferent index structures including a formula index. To optimize the findability of the entries in our database, we constantly investigate new approaches to satisfy the information needs of our users. We believe that the findings from the ARQMath evaluation will generate new insights into which index struc-tures are most suitable to satisfy mathematical information needs. Search en-gines, recommender systems, plagiarism checking software, and many other added-value services acting on databases such as the arXiv and zbMATH need to combine natural and formula language. One initial approach to ad-dress this challenge is to enrich the mostly unstructured document data via Entity Linking. The ARQMath Task at CLEF 2020 aims to tackle the problem of linking newly posted questions from Math Stack Exchange (MSE) to exist-ing ones that were already answered by the community. To deeply under-stand MSE information needs, answer-, and formula types, we performed manual runs for tasks 1 and 2. Furthermore, we explored several formula re-trieval methods for task 2, such as fuzzy string search, k-nearest neighbors, and our recently introduced approach to retrieve Mathematical Objects of In-terest (MOI) with textual search queries. The task results show that neither our automated methods nor our manual runs archived good scores in the competition. However, the perceived quality of the hits returned by the MOI search particularly motivates us to conduct further research about MOI.

Downloads: 0