Application of Nepali Large Language Models to Improve Sentiment Analysis

Application of Nepali Large Language Models to Improve Sentiment Analysis. Pudasaini, S., Ghimire, S., Ale, P., Shakya, A., Paudel, P., & Joshi, B. In Proceedings of the 2024 7th International Conference on Computers in Management and Business, of ACM International Conference Proceedings Series (ICPS), pages 144–150, New York, NY, USA, January, 2024. Association for Computing Machinery.

Paper doi abstract bibtex 4 downloads

With the rise in internet usage, Nepali individuals have left a flood of opinionated comments in their language on YouTube and other social media sites. Such remarks can be subjected to sentiment analysis, which can be useful for both research and business purposes. Such sentiment analysis models can be extremely useful in understanding the user's expectations towards the product which can uplift the business of any organization. Similarly, with the rise of Large Language models in the NLP space, there are several large language models pre-trained on the BERT architecture upon the Nepali text corpus. This research focuses on developing a benchmarking dataset for sentiment analysis in the Nepali language and demonstrating how large Nepali language models can be used to improve the results on downstream NLP tasks like sentiment analysis on such benchmark datasets. This paper describes an approach to how proper embeddings for a Nepali sentence can be extracted from the pre-trained Nepali language models. The comparison of transfer learning applied to the dataset on different machine learning and deep learning algorithms has been done in this study. From this experimentation, a state-of-the-art sentiment analysis model in the Nepali language with an F-score of 0.88 has been developed.

@inproceedings{10.1145/3647782.3647804,
  abstract = {With the rise in internet usage, Nepali individuals have left a flood of opinionated comments in their language on YouTube and other social media sites. Such remarks can be subjected to sentiment analysis, which can be useful for both research and business purposes. Such sentiment analysis models can be extremely useful in understanding the user's expectations towards the product which can uplift the business of any organization. Similarly, with the rise of Large Language models in the NLP space, there are several large language models pre-trained on the BERT architecture upon the Nepali text corpus. This research focuses on developing a benchmarking dataset for sentiment analysis in the Nepali language and demonstrating how large Nepali language models can be used to improve the results on downstream NLP tasks like sentiment analysis on such benchmark datasets. This paper describes an approach to how proper embeddings for a Nepali sentence can be extracted from the pre-trained Nepali language models. The comparison of transfer learning applied to the dataset on different machine learning and deep learning algorithms has been done in this study. From this experimentation, a state-of-the-art sentiment analysis model in the Nepali language with an F-score of 0.88 has been developed.},
  added-at = {2024-05-08T08:03:49.000+0200},
  address = {New York, NY, USA},
  author = {Pudasaini, Shushanta and Ghimire, Sunil and Ale, Prabhat and Shakya, Aman and Paudel, Prakriti and Joshi, Basanta},
  biburl = {https://www.bibsonomy.org/bibtex/261cfd56d265b78d3a417c984488d5f9c/amanshakya},
  booktitle = {Proceedings of the 2024 7th International Conference on Computers in Management and Business},
  doi = {10.1145/3647782.3647804},
  eventdate = {12-14 January, 2024},
  eventtitle = {7th International Conference on Computers in Management and Business (ICCMB 2024)},
  interhash = {5ba869b4c70f2027833ceb035ebf3785},
  intrahash = {61cfd56d265b78d3a417c984488d5f9c},
  isbn = {9798400716652},
  keywords = {Encoders LLM Nepali Sentiment myown},
  location = {<conf-loc>, <city>Singapore</city>, <country>Singapore</country>, </conf-loc>},
  month = {January},
  numpages = {7},
  pages = {144–150},
  publisher = {Association for Computing Machinery},
  series = {ACM International Conference Proceedings Series (ICPS)},
  timestamp = {2024-05-08T08:03:49.000+0200},
  title = {Application of Nepali Large Language Models to Improve Sentiment Analysis},
  url = {https://doi.org/10.1145/3647782.3647804},
  venue = {Singapore},
  year = 2024
}

Downloads: 4

{"_id":"Ah4nze6Fmq7Zpf4s5","bibbaseid":"pudasaini-ghimire-ale-shakya-paudel-joshi-applicationofnepalilargelanguagemodelstoimprovesentimentanalysis-2024","author_short":["Pudasaini, S.","Ghimire, S.","Ale, P.","Shakya, A.","Paudel, P.","Joshi, B."],"bibdata":{"bibtype":"inproceedings","type":"inproceedings","abstract":"With the rise in internet usage, Nepali individuals have left a flood of opinionated comments in their language on YouTube and other social media sites. Such remarks can be subjected to sentiment analysis, which can be useful for both research and business purposes. Such sentiment analysis models can be extremely useful in understanding the user's expectations towards the product which can uplift the business of any organization. Similarly, with the rise of Large Language models in the NLP space, there are several large language models pre-trained on the BERT architecture upon the Nepali text corpus. This research focuses on developing a benchmarking dataset for sentiment analysis in the Nepali language and demonstrating how large Nepali language models can be used to improve the results on downstream NLP tasks like sentiment analysis on such benchmark datasets. This paper describes an approach to how proper embeddings for a Nepali sentence can be extracted from the pre-trained Nepali language models. The comparison of transfer learning applied to the dataset on different machine learning and deep learning algorithms has been done in this study. From this experimentation, a state-of-the-art sentiment analysis model in the Nepali language with an F-score of 0.88 has been developed.","added-at":"2024-05-08T08:03:49.000+0200","address":"New York, NY, USA","author":[{"propositions":[],"lastnames":["Pudasaini"],"firstnames":["Shushanta"],"suffixes":[]},{"propositions":[],"lastnames":["Ghimire"],"firstnames":["Sunil"],"suffixes":[]},{"propositions":[],"lastnames":["Ale"],"firstnames":["Prabhat"],"suffixes":[]},{"propositions":[],"lastnames":["Shakya"],"firstnames":["Aman"],"suffixes":[]},{"propositions":[],"lastnames":["Paudel"],"firstnames":["Prakriti"],"suffixes":[]},{"propositions":[],"lastnames":["Joshi"],"firstnames":["Basanta"],"suffixes":[]}],"biburl":"https://www.bibsonomy.org/bibtex/261cfd56d265b78d3a417c984488d5f9c/amanshakya","booktitle":"Proceedings of the 2024 7th International Conference on Computers in Management and Business","doi":"10.1145/3647782.3647804","eventdate":"12-14 January, 2024","eventtitle":"7th International Conference on Computers in Management and Business (ICCMB 2024)","interhash":"5ba869b4c70f2027833ceb035ebf3785","intrahash":"61cfd56d265b78d3a417c984488d5f9c","isbn":"9798400716652","keywords":"Encoders LLM Nepali Sentiment myown","location":"<conf-loc>, <city>Singapore</city>, <country>Singapore</country>, </conf-loc>","month":"January","numpages":"7","pages":"144–150","publisher":"Association for Computing Machinery","series":"ACM International Conference Proceedings Series (ICPS)","timestamp":"2024-05-08T08:03:49.000+0200","title":"Application of Nepali Large Language Models to Improve Sentiment Analysis","url":"https://doi.org/10.1145/3647782.3647804","venue":"Singapore","year":"2024","bibtex":"@inproceedings{10.1145/3647782.3647804,\n abstract = {With the rise in internet usage, Nepali individuals have left a flood of opinionated comments in their language on YouTube and other social media sites. Such remarks can be subjected to sentiment analysis, which can be useful for both research and business purposes. Such sentiment analysis models can be extremely useful in understanding the user's expectations towards the product which can uplift the business of any organization. Similarly, with the rise of Large Language models in the NLP space, there are several large language models pre-trained on the BERT architecture upon the Nepali text corpus. This research focuses on developing a benchmarking dataset for sentiment analysis in the Nepali language and demonstrating how large Nepali language models can be used to improve the results on downstream NLP tasks like sentiment analysis on such benchmark datasets. This paper describes an approach to how proper embeddings for a Nepali sentence can be extracted from the pre-trained Nepali language models. The comparison of transfer learning applied to the dataset on different machine learning and deep learning algorithms has been done in this study. From this experimentation, a state-of-the-art sentiment analysis model in the Nepali language with an F-score of 0.88 has been developed.},\n added-at = {2024-05-08T08:03:49.000+0200},\n address = {New York, NY, USA},\n author = {Pudasaini, Shushanta and Ghimire, Sunil and Ale, Prabhat and Shakya, Aman and Paudel, Prakriti and Joshi, Basanta},\n biburl = {https://www.bibsonomy.org/bibtex/261cfd56d265b78d3a417c984488d5f9c/amanshakya},\n booktitle = {Proceedings of the 2024 7th International Conference on Computers in Management and Business},\n doi = {10.1145/3647782.3647804},\n eventdate = {12-14 January, 2024},\n eventtitle = {7th International Conference on Computers in Management and Business (ICCMB 2024)},\n interhash = {5ba869b4c70f2027833ceb035ebf3785},\n intrahash = {61cfd56d265b78d3a417c984488d5f9c},\n isbn = {9798400716652},\n keywords = {Encoders LLM Nepali Sentiment myown},\n location = {<conf-loc>, <city>Singapore</city>, <country>Singapore</country>, </conf-loc>},\n month = {January},\n numpages = {7},\n pages = {144–150},\n publisher = {Association for Computing Machinery},\n series = {ACM International Conference Proceedings Series (ICPS)},\n timestamp = {2024-05-08T08:03:49.000+0200},\n title = {Application of Nepali Large Language Models to Improve Sentiment Analysis},\n url = {https://doi.org/10.1145/3647782.3647804},\n venue = {Singapore},\n year = 2024\n}\n\n","author_short":["Pudasaini, S.","Ghimire, S.","Ale, P.","Shakya, A.","Paudel, P.","Joshi, B."],"key":"10.1145/3647782.3647804","id":"10.1145/3647782.3647804","bibbaseid":"pudasaini-ghimire-ale-shakya-paudel-joshi-applicationofnepalilargelanguagemodelstoimprovesentimentanalysis-2024","role":"author","urls":{"Paper":"https://doi.org/10.1145/3647782.3647804"},"keyword":["Encoders LLM Nepali Sentiment myown"],"metadata":{"authorlinks":{}},"downloads":4},"bibtype":"inproceedings","biburl":"https://bibbase.org/f/EjJTJxsvFZCsHSath/amanshakya.bib","dataSources":["Z3HkGSt6ayrcPfAbv","E9rme5R4RWByvYMbk","vyo52rPYZvkJkq6M7","6LDAAmS4Dr8bKCsQv","u6EhZEKhRK5yk6jEQ","bhqr7NYGzPtbC6XAj"],"keywords":["encoders llm nepali sentiment myown"],"search_terms":["application","nepali","large","language","models","improve","sentiment","analysis","pudasaini","ghimire","ale","shakya","paudel","joshi"],"title":"Application of Nepali Large Language Models to Improve Sentiment Analysis","year":2024,"downloads":4}