Final Report for the DFG-Project Methods and Tools to Advance the Retrieval of Mathematical Knowledge from Digital Libraries for Search-, Recommendation- and Assistance-Systems. Gipp, B., Greiner-Petter, A., Schubotz, M., & Meuschke, N. Technical Report University of Goettingen, March, 2023. Paper Demo 1 Demo 2 doi abstract bibtex 2 downloads This project investigated new approaches and technologies to enhance the accessibility of mathematical content and its semantic information for a broad range of information retrieval applications. To achieve this goal, the project addressed three main research challenges: (1) syntactic analysis of mathematical expressions, (2) semantic enrichment of mathematical expressions, and (3) evaluation using quality metrics and demonstrators. To make our research useful for the research community, we published tools that enable researchers to process mathematical expressions more effectively and efficiently. The project has made significant research contributions to various Mathematical Information Retrieval (MathIR) tasks and systems, including plagiarism detection and recommendation systems, search engines, the first mathematical type assistance system, math question answering and tutoring systems, automatic plausibility checks for mathematical expressions on Wikipedia, automatic computability of mathematical content via Computer Algebra Systems (CAS), and others. Although our project focused on MathIR tasks, its impact on other natural language research was significant, leading to a more extensive range of demonstrators than originally expected. Many of these demonstrators introduced novel applications, such as the tutoring system PhysWikiQuiz or LaCASt, which automatically verifies the correctness of math formulae on Wikipedia or the Digital Library of Mathematical Functions (DLMF) via commercial CAS. During the project, we published 29 peer-reviewed articles in international venues, including prestigious conferences like the Joint Conference on Digital Libraries (JCDL) and The Web Conference (WWW) (CORE rank A*), as well as journals such as IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) (IF: 24.314) and Scientometrics (IF: 3.801). Our Wikipedia demonstrator was also featured in public media. Furthermore, we actively presented our contributions, especially demonstrators, to the research community in multiple workshops. This project has strengthened our international collaborations, particularly with colleagues at the National Institute of Standards and Technology (NIST) in the US and the National Institute of Informatics (NII) in Japan. Several subprojects were partially developed in course projects and theses at the Universities of Konstanz, Wuppertal, and Göttingen, exposing junior researchers to cutting-edge technologies and sensitizing students and researchers to the outstanding issues in MathIR technologies. We firmly believe that this project will have a lasting effect on following MathIR technologies. Several of the subprojects initiated as part of this grant are ongoing and motivating follow-up DFG projects, such as Analyzing Mathematics to Detect Disguised Academic Plagiarism (project no. 437179652).
@techreport{GippGSM23,
title = {Final {Report} for the {DFG}-{Project} {Methods} and {Tools} to {Advance} the {Retrieval} of {Mathematical} {Knowledge} from {Digital} {Libraries} for {Search}-, {Recommendation}- and {Assistance}-{Systems}},
copyright = {Creative Commons Attribution 4.0 International, Open Access},
url = {paper=https://zenodo.org/record/7924634/files/Gipp2023_DFG_Report_MathIR.pdf demo_1=https://lacast.wmflabs.org/ demo_2=https://physwikiquiz.wmflabs.org},
abstract = {This project investigated new approaches and technologies to enhance the accessibility of mathematical content and its semantic information for a broad range of information retrieval applications. To achieve this goal, the project addressed three main research challenges: (1) syntactic analysis of mathematical expressions, (2) semantic enrichment of mathematical expressions, and (3) evaluation using quality metrics and demonstrators. To make our research useful for the research community, we published tools that enable researchers to process mathematical expressions more effectively and efficiently. The project has made significant research contributions to various Mathematical Information Retrieval (MathIR) tasks and systems, including plagiarism detection and recommendation systems, search engines, the first mathematical type assistance system, math question answering and tutoring systems, automatic plausibility checks for mathematical expressions on Wikipedia, automatic computability of mathematical content via Computer Algebra Systems (CAS), and others. Although our project focused on MathIR tasks, its impact on other natural language research was significant, leading to a more extensive range of demonstrators than originally expected. Many of these demonstrators introduced novel applications, such as the tutoring system PhysWikiQuiz or LaCASt, which automatically verifies the correctness of math formulae on Wikipedia or the Digital Library of Mathematical Functions (DLMF) via commercial CAS. During the project, we published 29 peer-reviewed articles in international venues, including prestigious conferences like the Joint Conference on Digital Libraries (JCDL) and The Web Conference (WWW) (CORE rank A*), as well as journals such as IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) (IF: 24.314) and Scientometrics (IF: 3.801). Our Wikipedia demonstrator was also featured in public media. Furthermore, we actively presented our contributions, especially demonstrators, to the research community in multiple workshops. This project has strengthened our international collaborations, particularly with colleagues at the National Institute of Standards and Technology (NIST) in the US and the National Institute of Informatics (NII) in Japan. Several subprojects were partially developed in course projects and theses at the Universities of Konstanz, Wuppertal, and Göttingen, exposing junior researchers to cutting-edge technologies and sensitizing students and researchers to the outstanding issues in MathIR technologies. We firmly believe that this project will have a lasting effect on following MathIR technologies. Several of the subprojects initiated as part of this grant are ongoing and motivating follow-up DFG projects, such as Analyzing Mathematics to Detect Disguised Academic Plagiarism (project no. 437179652).},
language = {en},
urldate = {2023-05-15},
institution = {University of Goettingen},
author = {Gipp, Bela and Greiner-Petter, André and Schubotz, Moritz and Meuschke, Norman},
month = mar,
year = {2023},
doi = {10.5281/ZENODO.7924634},
}
Downloads: 2
{"_id":"JZMq3mBNHL8aMWT6z","bibbaseid":"gipp-greinerpetter-schubotz-meuschke-finalreportforthedfgprojectmethodsandtoolstoadvancetheretrievalofmathematicalknowledgefromdigitallibrariesforsearchrecommendationandassistancesystems-2023","author_short":["Gipp, B.","Greiner-Petter, A.","Schubotz, M.","Meuschke, N."],"bibdata":{"bibtype":"techreport","type":"techreport","title":"Final Report for the DFG-Project Methods and Tools to Advance the Retrieval of Mathematical Knowledge from Digital Libraries for Search-, Recommendation- and Assistance-Systems","copyright":"Creative Commons Attribution 4.0 International, Open Access","abstract":"This project investigated new approaches and technologies to enhance the accessibility of mathematical content and its semantic information for a broad range of information retrieval applications. To achieve this goal, the project addressed three main research challenges: (1) syntactic analysis of mathematical expressions, (2) semantic enrichment of mathematical expressions, and (3) evaluation using quality metrics and demonstrators. To make our research useful for the research community, we published tools that enable researchers to process mathematical expressions more effectively and efficiently. The project has made significant research contributions to various Mathematical Information Retrieval (MathIR) tasks and systems, including plagiarism detection and recommendation systems, search engines, the first mathematical type assistance system, math question answering and tutoring systems, automatic plausibility checks for mathematical expressions on Wikipedia, automatic computability of mathematical content via Computer Algebra Systems (CAS), and others. Although our project focused on MathIR tasks, its impact on other natural language research was significant, leading to a more extensive range of demonstrators than originally expected. Many of these demonstrators introduced novel applications, such as the tutoring system PhysWikiQuiz or LaCASt, which automatically verifies the correctness of math formulae on Wikipedia or the Digital Library of Mathematical Functions (DLMF) via commercial CAS. During the project, we published 29 peer-reviewed articles in international venues, including prestigious conferences like the Joint Conference on Digital Libraries (JCDL) and The Web Conference (WWW) (CORE rank A*), as well as journals such as IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) (IF: 24.314) and Scientometrics (IF: 3.801). Our Wikipedia demonstrator was also featured in public media. Furthermore, we actively presented our contributions, especially demonstrators, to the research community in multiple workshops. This project has strengthened our international collaborations, particularly with colleagues at the National Institute of Standards and Technology (NIST) in the US and the National Institute of Informatics (NII) in Japan. Several subprojects were partially developed in course projects and theses at the Universities of Konstanz, Wuppertal, and Göttingen, exposing junior researchers to cutting-edge technologies and sensitizing students and researchers to the outstanding issues in MathIR technologies. We firmly believe that this project will have a lasting effect on following MathIR technologies. Several of the subprojects initiated as part of this grant are ongoing and motivating follow-up DFG projects, such as Analyzing Mathematics to Detect Disguised Academic Plagiarism (project no. 437179652).","language":"en","urldate":"2023-05-15","institution":"University of Goettingen","author":[{"propositions":[],"lastnames":["Gipp"],"firstnames":["Bela"],"suffixes":[]},{"propositions":[],"lastnames":["Greiner-Petter"],"firstnames":["André"],"suffixes":[]},{"propositions":[],"lastnames":["Schubotz"],"firstnames":["Moritz"],"suffixes":[]},{"propositions":[],"lastnames":["Meuschke"],"firstnames":["Norman"],"suffixes":[]}],"month":"March","year":"2023","doi":"10.5281/ZENODO.7924634","bibtex":"@techreport{GippGSM23,\n\ttitle = {Final {Report} for the {DFG}-{Project} {Methods} and {Tools} to {Advance} the {Retrieval} of {Mathematical} {Knowledge} from {Digital} {Libraries} for {Search}-, {Recommendation}- and {Assistance}-{Systems}},\n\tcopyright = {Creative Commons Attribution 4.0 International, Open Access},\n\turl = {paper=https://zenodo.org/record/7924634/files/Gipp2023_DFG_Report_MathIR.pdf demo_1=https://lacast.wmflabs.org/ demo_2=https://physwikiquiz.wmflabs.org},\n\tabstract = {This project investigated new approaches and technologies to enhance the accessibility of mathematical content and its semantic information for a broad range of information retrieval applications. To achieve this goal, the project addressed three main research challenges: (1) syntactic analysis of mathematical expressions, (2) semantic enrichment of mathematical expressions, and (3) evaluation using quality metrics and demonstrators. To make our research useful for the research community, we published tools that enable researchers to process mathematical expressions more effectively and efficiently. The project has made significant research contributions to various Mathematical Information Retrieval (MathIR) tasks and systems, including plagiarism detection and recommendation systems, search engines, the first mathematical type assistance system, math question answering and tutoring systems, automatic plausibility checks for mathematical expressions on Wikipedia, automatic computability of mathematical content via Computer Algebra Systems (CAS), and others. Although our project focused on MathIR tasks, its impact on other natural language research was significant, leading to a more extensive range of demonstrators than originally expected. Many of these demonstrators introduced novel applications, such as the tutoring system PhysWikiQuiz or LaCASt, which automatically verifies the correctness of math formulae on Wikipedia or the Digital Library of Mathematical Functions (DLMF) via commercial CAS. During the project, we published 29 peer-reviewed articles in international venues, including prestigious conferences like the Joint Conference on Digital Libraries (JCDL) and The Web Conference (WWW) (CORE rank A*), as well as journals such as IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) (IF: 24.314) and Scientometrics (IF: 3.801). Our Wikipedia demonstrator was also featured in public media. Furthermore, we actively presented our contributions, especially demonstrators, to the research community in multiple workshops. This project has strengthened our international collaborations, particularly with colleagues at the National Institute of Standards and Technology (NIST) in the US and the National Institute of Informatics (NII) in Japan. Several subprojects were partially developed in course projects and theses at the Universities of Konstanz, Wuppertal, and Göttingen, exposing junior researchers to cutting-edge technologies and sensitizing students and researchers to the outstanding issues in MathIR technologies. We firmly believe that this project will have a lasting effect on following MathIR technologies. Several of the subprojects initiated as part of this grant are ongoing and motivating follow-up DFG projects, such as Analyzing Mathematics to Detect Disguised Academic Plagiarism (project no. 437179652).},\n\tlanguage = {en},\n\turldate = {2023-05-15},\n\tinstitution = {University of Goettingen},\n\tauthor = {Gipp, Bela and Greiner-Petter, André and Schubotz, Moritz and Meuschke, Norman},\n\tmonth = mar,\n\tyear = {2023},\n\tdoi = {10.5281/ZENODO.7924634},\n}\n\n","author_short":["Gipp, B.","Greiner-Petter, A.","Schubotz, M.","Meuschke, N."],"urlpaper":"https://zenodo.org/record/7924634/files/Gipp2023_DFG_Report_MathIR.pdf","urldemo_1":"https://lacast.wmflabs.org/","urldemo_2":"https://physwikiquiz.wmflabs.org","key":"GippGSM23","id":"GippGSM23","bibbaseid":"gipp-greinerpetter-schubotz-meuschke-finalreportforthedfgprojectmethodsandtoolstoadvancetheretrievalofmathematicalknowledgefromdigitallibrariesforsearchrecommendationandassistancesystems-2023","role":"author","urls":{"Paper":"https://zenodo.org/record/7924634/files/Gipp2023_DFG_Report_MathIR.pdf","Demo 1":"https://lacast.wmflabs.org/","Demo 2":"https://physwikiquiz.wmflabs.org"},"metadata":{"authorlinks":{}},"downloads":2},"bibtype":"techreport","biburl":"https://api.zotero.org/groups/2532143/items?key=DOjJ33bOgISaFjBIBr7jCV5S&format=bibtex&limit=100","dataSources":["6KJgnNtYZiwwFkcGq","dHLtmS5G7GmooD755","EvZZTzAZvA3EsuMjm"],"keywords":[],"search_terms":["final","report","dfg","project","methods","tools","advance","retrieval","mathematical","knowledge","digital","libraries","search","recommendation","assistance","systems","gipp","greiner-petter","schubotz","meuschke"],"title":"Final Report for the DFG-Project Methods and Tools to Advance the Retrieval of Mathematical Knowledge from Digital Libraries for Search-, Recommendation- and Assistance-Systems","year":2023,"downloads":2}