Distinguishing Human-Written and ChatGPT-Generated Text Using Machine Learning

Distinguishing Human-Written and ChatGPT-Generated Text Using Machine Learning. Alamleh, H., Alqahtani, A., & Elsaid, A. In pages 154–158, 2023.

Paper doi abstract bibtex

The use of sophisticated Artificial Intelligence (AI) language models, including ChatGPT, has led to growing concerns regarding the ability to distinguish between human-written and AI-generated text in academic and scholarly settings. This study seeks to evaluate the effectiveness of machine learning algorithms in differentiating between human-written and AI-generated text. To accomplish this, we collected responses from Computer Science students for both essay and programming assignments. We then trained and evaluated several machine learning models, including Logistic Regression (LR), Decision Trees (DT), Support Vector Machines (SVM), Neural Networks (NN), and Random Forests (RF), based on accuracy, computational efficiency, and confusion matrices. By comparing the performance of these models, we identified the most suitable one for the task at hand. The use of machine learning algorithms for detecting text generated by AI has significant potential for applications in content moderation, plagiarism detection, and quality control for text generation systems, thereby contributing to the preservation of academic integrity in the face of rapidly advancing AI-driven content generation. © 2023 IEEE.

@inproceedings{alamleh_distinguishing_2023,
	title = {Distinguishing {Human}-{Written} and {ChatGPT}-{Generated} {Text} {Using} {Machine} {Learning}},
	url = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85161905543&doi=10.1109%2fSIEDS58326.2023.10137767&partnerID=40&md5=cf2c79bb7679470ba3e43823f468e077},
	doi = {10.1109/SIEDS58326.2023.10137767},
	abstract = {The use of sophisticated Artificial Intelligence (AI) language models, including ChatGPT, has led to growing concerns regarding the ability to distinguish between human-written and AI-generated text in academic and scholarly settings. This study seeks to evaluate the effectiveness of machine learning algorithms in differentiating between human-written and AI-generated text. To accomplish this, we collected responses from Computer Science students for both essay and programming assignments. We then trained and evaluated several machine learning models, including Logistic Regression (LR), Decision Trees (DT), Support Vector Machines (SVM), Neural Networks (NN), and Random Forests (RF), based on accuracy, computational efficiency, and confusion matrices. By comparing the performance of these models, we identified the most suitable one for the task at hand. The use of machine learning algorithms for detecting text generated by AI has significant potential for applications in content moderation, plagiarism detection, and quality control for text generation systems, thereby contributing to the preservation of academic integrity in the face of rapidly advancing AI-driven content generation. © 2023 IEEE.},
	author = {Alamleh, H. and Alqahtani, A.A.S. and Elsaid, A.},
	year = {2023},
	keywords = {AI, AI-generated text, ChatGPT, NLP, TF-IDF, TextOriginClassifier, academic integrity, content detection, human-written text, machine learning},
	pages = {154--158},
}

Downloads: 0

{"_id":"4JTy6n6tXdMwsojKX","bibbaseid":"alamleh-alqahtani-elsaid-distinguishinghumanwrittenandchatgptgeneratedtextusingmachinelearning-2023","author_short":["Alamleh, H.","Alqahtani, A.","Elsaid, A."],"bibdata":{"bibtype":"inproceedings","type":"inproceedings","title":"Distinguishing Human-Written and ChatGPT-Generated Text Using Machine Learning","url":"https://www.scopus.com/inward/record.uri?eid=2-s2.0-85161905543&doi=10.1109%2fSIEDS58326.2023.10137767&partnerID=40&md5=cf2c79bb7679470ba3e43823f468e077","doi":"10.1109/SIEDS58326.2023.10137767","abstract":"The use of sophisticated Artificial Intelligence (AI) language models, including ChatGPT, has led to growing concerns regarding the ability to distinguish between human-written and AI-generated text in academic and scholarly settings. This study seeks to evaluate the effectiveness of machine learning algorithms in differentiating between human-written and AI-generated text. To accomplish this, we collected responses from Computer Science students for both essay and programming assignments. We then trained and evaluated several machine learning models, including Logistic Regression (LR), Decision Trees (DT), Support Vector Machines (SVM), Neural Networks (NN), and Random Forests (RF), based on accuracy, computational efficiency, and confusion matrices. By comparing the performance of these models, we identified the most suitable one for the task at hand. The use of machine learning algorithms for detecting text generated by AI has significant potential for applications in content moderation, plagiarism detection, and quality control for text generation systems, thereby contributing to the preservation of academic integrity in the face of rapidly advancing AI-driven content generation. © 2023 IEEE.","author":[{"propositions":[],"lastnames":["Alamleh"],"firstnames":["H."],"suffixes":[]},{"propositions":[],"lastnames":["Alqahtani"],"firstnames":["A.A.S."],"suffixes":[]},{"propositions":[],"lastnames":["Elsaid"],"firstnames":["A."],"suffixes":[]}],"year":"2023","keywords":"AI, AI-generated text, ChatGPT, NLP, TF-IDF, TextOriginClassifier, academic integrity, content detection, human-written text, machine learning","pages":"154–158","bibtex":"@inproceedings{alamleh_distinguishing_2023,\n\ttitle = {Distinguishing {Human}-{Written} and {ChatGPT}-{Generated} {Text} {Using} {Machine} {Learning}},\n\turl = {https://www.scopus.com/inward/record.uri?eid=2-s2.0-85161905543&doi=10.1109%2fSIEDS58326.2023.10137767&partnerID=40&md5=cf2c79bb7679470ba3e43823f468e077},\n\tdoi = {10.1109/SIEDS58326.2023.10137767},\n\tabstract = {The use of sophisticated Artificial Intelligence (AI) language models, including ChatGPT, has led to growing concerns regarding the ability to distinguish between human-written and AI-generated text in academic and scholarly settings. This study seeks to evaluate the effectiveness of machine learning algorithms in differentiating between human-written and AI-generated text. To accomplish this, we collected responses from Computer Science students for both essay and programming assignments. We then trained and evaluated several machine learning models, including Logistic Regression (LR), Decision Trees (DT), Support Vector Machines (SVM), Neural Networks (NN), and Random Forests (RF), based on accuracy, computational efficiency, and confusion matrices. By comparing the performance of these models, we identified the most suitable one for the task at hand. The use of machine learning algorithms for detecting text generated by AI has significant potential for applications in content moderation, plagiarism detection, and quality control for text generation systems, thereby contributing to the preservation of academic integrity in the face of rapidly advancing AI-driven content generation. © 2023 IEEE.},\n\tauthor = {Alamleh, H. and Alqahtani, A.A.S. and Elsaid, A.},\n\tyear = {2023},\n\tkeywords = {AI, AI-generated text, ChatGPT, NLP, TF-IDF, TextOriginClassifier, academic integrity, content detection, human-written text, machine learning},\n\tpages = {154--158},\n}\n\n","author_short":["Alamleh, H.","Alqahtani, A.","Elsaid, A."],"key":"alamleh_distinguishing_2023","id":"alamleh_distinguishing_2023","bibbaseid":"alamleh-alqahtani-elsaid-distinguishinghumanwrittenandchatgptgeneratedtextusingmachinelearning-2023","role":"author","urls":{"Paper":"https://www.scopus.com/inward/record.uri?eid=2-s2.0-85161905543&doi=10.1109%2fSIEDS58326.2023.10137767&partnerID=40&md5=cf2c79bb7679470ba3e43823f468e077"},"keyword":["AI","AI-generated text","ChatGPT","NLP","TF-IDF","TextOriginClassifier","academic integrity","content detection","human-written text","machine learning"],"metadata":{"authorlinks":{}},"html":""},"bibtype":"inproceedings","biburl":"https://bibbase.org/zotero/dgopinath","dataSources":["vRECxNMDMX9qKhX8Q"],"keywords":["ai","ai-generated text","chatgpt","nlp","tf-idf","textoriginclassifier","academic integrity","content detection","human-written text","machine learning"],"search_terms":["distinguishing","human","written","chatgpt","generated","text","using","machine","learning","alamleh","alqahtani","elsaid"],"title":"Distinguishing Human-Written and ChatGPT-Generated Text Using Machine Learning","year":2023}