Clustering Twitter Feeds using Word Co-occurrence CS769 Project Report. Khot, T.
For a very large number of documents, normal clustering methods would take O(n^2) time, where n is the number of documents. When the documents are very numerous but short, such as tweets, it may make sense to cluster the words instead. We present a method that clusters the words using word co-occurrence as a similarity measure. We use spectral clustering to create the word clusters and do a "search" to retrieve the actual documents. The resulting word clusters and tweets make sense most of the time.
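
The pipeline described in the abstract might be sketched roughly as follows. This is an illustration under stated assumptions, not the report's implementation: the toy tweets in `DOCS`, the function names, the restriction to a two-way split (the sign of the second eigenvector of the normalized co-occurrence matrix), and the `min_hits` retrieval rule are all hypothetical choices made here for brevity.

```python
import math
from itertools import combinations

# Toy tweets (hypothetical data, already lower-cased and tokenizable by split()).
DOCS = [
    "coffee morning espresso",
    "coffee espresso latte",
    "morning coffee latte",
    "football goal match",
    "goal match referee",
    "football referee match",
    "coffee match",
]

def cooccurrence(docs):
    """Return (vocab, W) where W[i][j] counts tweets containing both words."""
    vocab = sorted({w for d in docs for w in d.split()})
    index = {w: i for i, w in enumerate(vocab)}
    W = [[0.0] * len(vocab) for _ in vocab]
    for d in docs:
        for a, b in combinations(sorted(set(d.split())), 2):
            W[index[a]][index[b]] += 1.0
            W[index[b]][index[a]] += 1.0
    return vocab, W

def spectral_bipartition(W, iters=300):
    """Two-way spectral split: sign of the second eigenvector of D^-1/2 W D^-1/2.

    Assumes at least two words and that every word co-occurs with some other word.
    """
    n = len(W)
    deg = [sum(row) for row in W]
    if min(deg) <= 0:
        raise ValueError("every word must co-occur with at least one other word")
    s = [math.sqrt(d) for d in deg]
    # Shifted normalized affinity (M + I) / 2, so all eigenvalues lie in [0, 1].
    M = [[(W[i][j] / (s[i] * s[j]) + (i == j)) / 2 for j in range(n)]
         for i in range(n)]
    # The leading eigenvector is known in closed form: proportional to sqrt(deg).
    norm = math.sqrt(sum(x * x for x in s))
    top = [x / norm for x in s]
    v = [math.sin(i + 1) for i in range(n)]  # fixed, deterministic start vector
    for _ in range(iters):
        # Project out the leading eigenvector, then apply one power-iteration step.
        dot = sum(a * b for a, b in zip(v, top))
        v = [a - dot * b for a, b in zip(v, top)]
        v = [sum(M[i][j] * v[j] for j in range(n)) for i in range(n)]
        nv = math.sqrt(sum(x * x for x in v))
        v = [x / nv for x in v]
    # Undo the D^1/2 scaling and split the words by sign.
    return [1 if v[i] / s[i] >= 0 else 0 for i in range(n)]

def search(docs, cluster_words, min_hits=2):
    """Retrieve tweets that contain at least `min_hits` words of a cluster."""
    return [d for d in docs if len(set(d.split()) & cluster_words) >= min_hits]

vocab, W = cooccurrence(DOCS)
labels = spectral_bipartition(W)
clusters = [{w for w, l in zip(vocab, labels) if l == k} for k in (0, 1)]
for words in clusters:
    print(sorted(words), search(DOCS, words))
```

The matrix here has one row per distinct word rather than one per tweet, which is the point of clustering words when tweets are numerous but short. For more than two clusters, one would typically keep several eigenvectors and run k-means on them; that step is omitted in this sketch.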
