Author detection by using different term weighting schemes. Tufekci, P. & Uzun, E. In 2013 21st Signal Processing and Communications Applications Conference (SIU), pages 1-4, 4, 2013. IEEE.
Author detection by using different term weighting schemes [link]Website  doi  abstract   bibtex   
In this study, the impact of term weighting on author detection as a type of text classification is investigated. The feature vector being used to represent texts, consists of stem words as features and their weight values, which are obtained by applying 14 different term weighting schemes. The performances of these feature vectors for 3 different datasets in the author detection are tested with some classification methods such as Naïve Bayes Multinominal (NBM), and Support Vector Machine (SVM), Decision Tree (C4.5), and Random Forrest (RF), and are compared with each other. As a result of that, the most successful classifier, which can predict the author of an article, is found as SVM classifier with 98.75% mean accuracy; the most successful term weighting scheme is found as ACTF.IDF.(ICF+1) with 91.54% general mean accuracy.
@inproceedings{
 title = {Author detection by using different term weighting schemes},
 type = {inproceedings},
 year = {2013},
 keywords = {Author detection,NLP,Term weighting schemes,Text classification},
 pages = {1-4},
 websites = {http://ieeexplore.ieee.org/document/6531190/},
 month = {4},
 publisher = {IEEE},
 id = {a7f56d5f-beae-387f-ac49-be768f2e410d},
 created = {2018-06-05T12:53:51.540Z},
 file_attached = {false},
 profile_id = {37fa15c3-e5d0-3212-8e18-e4c72814fd47},
 last_modified = {2018-07-04T12:59:46.632Z},
 read = {false},
 starred = {false},
 authored = {true},
 confirmed = {true},
 hidden = {false},
 citation_key = {Tufekci2013},
 private_publication = {false},
 abstract = {In this study, the impact of term weighting on author detection as a type of text classification is investigated. The feature vector being used to represent texts, consists of stem words as features and their weight values, which are obtained by applying 14 different term weighting schemes. The performances of these feature vectors for 3 different datasets in the author detection are tested with some classification methods such as Naïve Bayes Multinominal (NBM), and Support Vector Machine (SVM), Decision Tree (C4.5), and Random Forrest (RF), and are compared with each other. As a result of that, the most successful classifier, which can predict the author of an article, is found as SVM classifier with 98.75% mean accuracy; the most successful term weighting scheme is found as ACTF.IDF.(ICF+1) with 91.54% general mean accuracy.},
 bibtype = {inproceedings},
 author = {Tufekci, Pınar and Uzun, Erdinç},
 doi = {10.1109/SIU.2013.6531190},
 booktitle = {2013 21st Signal Processing and Communications Applications Conference (SIU)}
}

Downloads: 0