An Empirical Study on How the Developers Discussed About Pandas Topics. Joy, S. K. S., Ahmed, F., Mahamud, A. H., & Mandal, N. C. In Machine Intelligence and Emerging Technologies, pages 242–255, Cham, 2023. Springer Nature Switzerland.
Abstract: Pandas is a fast, easy-to-use open-source software library for data analysis in the Python programming language. It is widely used in projects spanning software development, machine learning, computer vision, natural language processing, robotics, and others. Software developers show great interest in pandas, and discussions of it are becoming prominent in online developer forums such as Stack Overflow (SO). Such discussions can help in understanding the importance, prevalence, and difficulties of pandas topics. The aim of this work is to determine the popularity and difficulty of pandas topics. To this end, SO posts related to pandas were collected, and topic modeling was applied to the textual content of the posts. We found 26 topics, which we further grouped into 5 broad categories. We observed that developers discuss a variety of pandas topics on SO related to error and exception handling, visualization, external support, dataframes, and optimization. A trend chart was also generated from the discussion of topics over a predefined time series. The findings of this paper offer guidance for developers, educators, and learners. For example, beginner developers can learn the most important topics in pandas, and educators can identify the topics that seem hard for learners and create tutorials that make these topics understandable. This empirical study shows that it is possible to understand developers' preferences among pandas topics by processing their SO posts.
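The abstract mentions a trend chart of topic discussion over time but does not say how it was computed. As a rough illustration only, the sketch below assumes each post has already been assigned one dominant topic and computes each topic's share of posts per year. The sample rows, the variable `labelled_posts`, and the function `topic_share_by_year` are hypothetical and are not taken from the paper or its code.

```python
from collections import Counter, defaultdict

# Hypothetical sample: (year, topic label) pairs, as if each Stack Overflow
# post had already been assigned its dominant topic by a topic model.
# These rows are invented for illustration and are not the paper's data.
labelled_posts = [
    (2019, "dataframe"),
    (2019, "visualization"),
    (2019, "dataframe"),
    (2020, "optimization"),
    (2020, "dataframe"),
    (2020, "error and exception handling"),
    (2020, "dataframe"),
]


def topic_share_by_year(posts):
    """Return {year: {topic: fraction of that year's posts}}."""
    # Count how many posts fall under each topic within each year.
    counts = defaultdict(Counter)
    for year, topic in posts:
        counts[year][topic] += 1

    # Normalize the counts so that the shares within a year sum to 1.
    shares = {}
    for year, topic_counts in counts.items():
        total = sum(topic_counts.values())
        shares[year] = {t: n / total for t, n in topic_counts.items()}
    return shares


trend = topic_share_by_year(labelled_posts)
for year in sorted(trend):
    for topic, share in sorted(trend[year].items()):
        print(year, topic, round(share, 2))
```

Using a per-year share rather than a raw count is one possible design choice: it keeps years with different posting volumes comparable. The study itself may have used a different measure.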
@InProceedings{10.1007/978-3-031-34622-4_19,
author="Joy, Sajib Kumar Saha
and Ahmed, Farzad
and Mahamud, Al Hasib
and Mandal, Nibir Chandra",
title="An Empirical Study on How the Developers Discussed About Pandas Topics",
booktitle="Machine Intelligence and Emerging Technologies",
year="2023",
publisher="Springer Nature Switzerland",
address="Cham",
pages="242--255",
abstract="Pandas is a fast, easy-to-use open-source software library for data analysis in the Python programming language. It is widely used in projects spanning software development, machine learning, computer vision, natural language processing, robotics, and others. Software developers show great interest in pandas, and discussions of it are becoming prominent in online developer forums such as Stack Overflow (SO). Such discussions can help in understanding the importance, prevalence, and difficulties of pandas topics. The aim of this work is to determine the popularity and difficulty of pandas topics. To this end, SO posts related to pandas were collected, and topic modeling was applied to the textual content of the posts. We found 26 topics, which we further grouped into 5 broad categories. We observed that developers discuss a variety of pandas topics on SO related to error and exception handling, visualization, external support, dataframes, and optimization. A trend chart was also generated from the discussion of topics over a predefined time series. The findings of this paper offer guidance for developers, educators, and learners. For example, beginner developers can learn the most important topics in pandas, and educators can identify the topics that seem hard for learners and create tutorials that make these topics understandable. This empirical study shows that it is possible to understand developers' preferences among pandas topics by processing their SO posts.",
isbn="978-3-031-34622-4",
url = {Paper=https://bibbase.org/network/files/RJq4MjqFf3qyfxvwT Link=https://link.springer.com/chapter/10.1007/978-3-031-34622-4_19 Code=https://github.com/Farzad-1996}
}
Downloads: 49