Automatic library categorization. Velazquez Rodriguez, C. & De Roover, C. 2020. 3rd International Workshop on Software Health ; Conference date: 05-10-2020 Through 11-10-2020
Software ecosystems contain several types of artefacts such as libraries, documentation and source code files. Recent studies show that the Maven software ecosystem alone already contains over 2.8 million artefacts and over 70,000 libraries. Given the size of the ecosystem, selecting a library represents a challenge to its users. The MVNRepository website offers a category-based search functionality as a solution. However, not all of the libraries have been categorised, which leads to incomplete search results. This work proposes an approach to the automatic categorisation of libraries through machine learning classifiers trained on class and method names. Our preliminary results show that the approach is accurate, suggesting that large-scale applications may be feasible.
@conference{28f171f75218419eb759b590aac0123c,
  title    = "Automatic library categorization",
  abstract = "Software ecosystems contain several types of artefacts such as libraries, documentation and source code files. Recent studies show that the Maven software ecosystem alone already contains over 2.8 million artefacts and over 70,000 libraries. Given the size of the ecosystem, selecting a library represents a challenge to its users. The MVNRepository website offers a category-based search functionality as a solution. However, not all of the libraries have been categorised, which leads to incomplete search results. This work proposes an approach to the automatic categorisation of libraries through machine learning classifiers trained on class and method names. Our preliminary results show that the approach is accurate, suggesting that large-scale applications may be feasible.",
  keywords = "Software Ecosystems, API Category, Text Classification",
  author   = "{Velazquez Rodriguez}, {Camilo Ernesto} and {De Roover}, Coen",
  year     = "2020",
  doi      = "10.1145/3387940.3392186",
  language = "English",
  pages    = "733--734",
  note     = "3rd International Workshop on Software Health ; Conference date: 05-10-2020 Through 11-10-2020",
}
