A Machine Learning Based Approach to Detect Machine Learning Design Patterns. Pan, W., Washizaki, H., Yoshioka, N., Fukazawa, Y., Khomh, F., & Gu�h�neuc, Y. In Yi, J. & Leavens, G. T., editors, Proceedings of the 30<sup>th</sup> Asia-Pacific Software Engineering Conference (APSEC), pages 574–578, December, 2023. IEEE CS Press. 5 pages. Early Research Achievements Track.
A Machine Learning Based Approach to Detect Machine Learning Design Patterns [pdf]Paper  abstract   bibtex   
As machine learning expands to various domains, the demand for reusable solutions to similar problems increases. Machine learning design patterns are reusable solutions to design problems of machine learning applications. They can significantly enhance programmers' productivity in programming that requires machine learning algorithms. Given the critical role of machine learning design patterns, the automated detection of them becomes equally vital. However, identifying design patterns can be time-consuming and error-prone. We propose an approach to detect their occurrences in Python files. Our approach uses an Abstract Syntax Tree (AST) of Python files to build a corpus of data and train a refined Text-CNN model to automatically identify machine learning design patterns. We empirically validate our approach by conducting an exploratory study to detect four common machine learning design patterns: Embedding, Multilabel, Feature Cross, and Hashed Feature. We manually label 450 Python code files containing these design patterns from repositories of projects in GitHub. Our approach achieves accuracy values ranging from 80% to 92% for each of the four patterns.

Downloads: 0