Community Discovery in Twitter Based on User Interests

Community Discovery in Twitter Based on User Interests. Zhang, Y., Wu, Y., & Yang, Q. Journal of Computational Information Systems, 8(3):991–1000, 2012.

Twitter has recently emerged as a popular social microblogging service. Although Twitter now has over 100 million users, little is yet known about it at the user level. In this paper, we investigate the problem of identifying communities in Twitter based on users' interests. To address this problem, we first compute user similarity by leveraging both textual content and social structure, in accordance with Twitter's dual role as both a news medium and a social network. The features include tweet text, URLs, hashtags, following relationships, and retweeting relationships, all of which are closely correlated with users' interests. We then use user similarity together with classical clustering algorithms to discover communities. To assess the effectiveness of our method, we propose a Twitter-specific metric: the "average number of mutual following links per user per community". Experimental results show that our method can successfully discover communities in Twitter and performs much better than random selection. As a side result, our experiment also shows that the users in our Twitter dataset can be approximately categorized into 400 communities.
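The evaluation metric named in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formula, so the reading below is an assumption: for each community, count the in-community mutual-follow partners of every member, divide by the community size, and then average over communities. The function name `mutual_links_per_user`, the `follows` mapping, and the toy data are all hypothetical and are not taken from the paper.

```python
from typing import Dict, Iterable, List, Set


def mutual_links_per_user(
    communities: Iterable[Set[str]],
    follows: Dict[str, Set[str]],
) -> float:
    """Average, over communities, of the mean number of mutual-follow
    partners each member has inside their own community.

    This is one assumed interpretation of the metric, not the paper's
    definition.
    """
    per_community: List[float] = []
    for members in communities:
        if not members:
            continue  # skip empty communities to avoid dividing by zero
        total = 0
        for user in members:
            for other in follows.get(user, set()):
                # A mutual link needs both directions, with both users
                # inside the same community.
                if (
                    other != user
                    and other in members
                    and user in follows.get(other, set())
                ):
                    total += 1
        per_community.append(total / len(members))
    if not per_community:
        return 0.0
    return sum(per_community) / len(per_community)


# Hypothetical toy graph: follows[u] is the set of accounts u follows.
follows = {
    "a": {"b", "c"},
    "b": {"a"},
    "c": {"d"},
    "d": {"c"},
}
communities = [{"a", "b"}, {"c", "d"}]
score = mutual_links_per_user(communities, follows)
print(score)
```

Under this reading, a grouping that keeps mutual followers together scores higher than one that splits them, which is the kind of comparison against random selection that the abstract describes.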

@article{zhang_community_2012,
	title = {Community {Discovery} in {Twitter} {Based} on {User} {Interests}},
	volume = {8},
	issn = {15539105},
	url = {http://www.jofcis.com},
	abstract = {Twitter has recently emerged as a popular social microblogging service. Although Twitter now has over 100 million users, little is yet known about it at the user level. In this paper, we investigate the problem of identifying communities in Twitter based on users' interests. To address this problem, we first compute user similarity by leveraging both textual content and social structure, in accordance with Twitter's dual role as both a news medium and a social network. The features include tweet text, URLs, hashtags, following relationships, and retweeting relationships, all of which are closely correlated with users' interests. We then use user similarity together with classical clustering algorithms to discover communities. To assess the effectiveness of our method, we propose a Twitter-specific metric: the "average number of mutual following links per user per community". Experimental results show that our method can successfully discover communities in Twitter and performs much better than random selection. As a side result, our experiment also shows that the users in our Twitter dataset can be approximately categorized into 400 communities.},
	number = {3},
	journal = {Journal of Computational Information Systems},
	author = {Zhang, Yang and Wu, Yao and Yang, Qing},
	year = {2012},
	keywords = {Community Discovery, Social Structure, Textual Contents, Twitter, User Similarity, haiyanref, how to discover what content users spread; items with ideas are articles, community detection in social media},
	pages = {991--1000},
}
