Short and Sparse Text Topic Modeling via Self-Aggregation. Quan, X., Kit, C., Ge, Y., & Pan, S., J.
Paper
Website abstract bibtex The overwhelming amount of short text data on social media and elsewhere has posed great chal-lenges to topic modeling due to the sparsity prob-lem. Most existing attempts to alleviate this prob-lem resort to heuristic strategies to aggregate short texts into pseudo-documents before the application of standard topic modeling. Although such strate-gies cannot be well generalized to more general genres of short texts, the success has shed light on how to develop a generalized solution. In this pa-per, we present a novel model towards this goal by integrating topic modeling with short text aggre-gation during topic inference. The aggregation is founded on general topical affinity of texts rather than particular heuristics, making the model read-ily applicable to various short texts. Experimental results on real-world datasets validate the effective-ness of this new model, suggesting that it can distill more meaningful topics from short texts.
@article{
title = {Short and Sparse Text Topic Modeling via Self-Aggregation},
type = {article},
keywords = {Technical Papers — Web and Knowledge-Based Information Systems},
websites = {http://www.ntu.edu.sg/home/sinnopan/publications/[IJCAI15]Short%20and%20Sparse%20Text%20Topic%20Modeling%20via%20Self-Aggregation.pdf},
id = {25622032-7e08-3452-8748-d62a736c6c34},
created = {2018-02-05T18:52:06.015Z},
accessed = {2018-02-05},
file_attached = {true},
profile_id = {371589bb-c770-37ff-8193-93c6f25ffeb1},
group_id = {f982cd63-7ceb-3aa2-ac7e-a953963d6716},
last_modified = {2018-02-05T18:52:09.145Z},
read = {false},
starred = {false},
authored = {false},
confirmed = {false},
hidden = {false},
private_publication = {false},
abstract = {The overwhelming amount of short text data on social media and elsewhere has posed great chal-lenges to topic modeling due to the sparsity prob-lem. Most existing attempts to alleviate this prob-lem resort to heuristic strategies to aggregate short texts into pseudo-documents before the application of standard topic modeling. Although such strate-gies cannot be well generalized to more general genres of short texts, the success has shed light on how to develop a generalized solution. In this pa-per, we present a novel model towards this goal by integrating topic modeling with short text aggre-gation during topic inference. The aggregation is founded on general topical affinity of texts rather than particular heuristics, making the model read-ily applicable to various short texts. Experimental results on real-world datasets validate the effective-ness of this new model, suggesting that it can distill more meaningful topics from short texts.},
bibtype = {article},
author = {Quan, Xiaojun and Kit, Chunyu and Ge, Yong and Pan, Sinno Jialin}
}
Downloads: 0
{"_id":"ErJd4LXr29LLtgkcX","bibbaseid":"quan-kit-ge-pan-shortandsparsetexttopicmodelingviaselfaggregation","downloads":0,"creationDate":"2018-02-07T16:22:57.323Z","title":"Short and Sparse Text Topic Modeling via Self-Aggregation","author_short":["Quan, X.","Kit, C.","Ge, Y.","Pan, S., J."],"year":null,"bibtype":"article","biburl":null,"bibdata":{"title":"Short and Sparse Text Topic Modeling via Self-Aggregation","type":"article","keywords":"Technical Papers — Web and Knowledge-Based Information Systems","websites":"http://www.ntu.edu.sg/home/sinnopan/publications/[IJCAI15]Short%20and%20Sparse%20Text%20Topic%20Modeling%20via%20Self-Aggregation.pdf","id":"25622032-7e08-3452-8748-d62a736c6c34","created":"2018-02-05T18:52:06.015Z","accessed":"2018-02-05","file_attached":"true","profile_id":"371589bb-c770-37ff-8193-93c6f25ffeb1","group_id":"f982cd63-7ceb-3aa2-ac7e-a953963d6716","last_modified":"2018-02-05T18:52:09.145Z","read":false,"starred":false,"authored":false,"confirmed":false,"hidden":false,"private_publication":false,"abstract":"The overwhelming amount of short text data on social media and elsewhere has posed great chal-lenges to topic modeling due to the sparsity prob-lem. Most existing attempts to alleviate this prob-lem resort to heuristic strategies to aggregate short texts into pseudo-documents before the application of standard topic modeling. Although such strate-gies cannot be well generalized to more general genres of short texts, the success has shed light on how to develop a generalized solution. In this pa-per, we present a novel model towards this goal by integrating topic modeling with short text aggre-gation during topic inference. The aggregation is founded on general topical affinity of texts rather than particular heuristics, making the model read-ily applicable to various short texts. Experimental results on real-world datasets validate the effective-ness of this new model, suggesting that it can distill more meaningful topics from short texts.","bibtype":"article","author":"Quan, Xiaojun and Kit, Chunyu and Ge, Yong and Pan, Sinno Jialin","bibtex":"@article{\n title = {Short and Sparse Text Topic Modeling via Self-Aggregation},\n type = {article},\n keywords = {Technical Papers — Web and Knowledge-Based Information Systems},\n websites = {http://www.ntu.edu.sg/home/sinnopan/publications/[IJCAI15]Short%20and%20Sparse%20Text%20Topic%20Modeling%20via%20Self-Aggregation.pdf},\n id = {25622032-7e08-3452-8748-d62a736c6c34},\n created = {2018-02-05T18:52:06.015Z},\n accessed = {2018-02-05},\n file_attached = {true},\n profile_id = {371589bb-c770-37ff-8193-93c6f25ffeb1},\n group_id = {f982cd63-7ceb-3aa2-ac7e-a953963d6716},\n last_modified = {2018-02-05T18:52:09.145Z},\n read = {false},\n starred = {false},\n authored = {false},\n confirmed = {false},\n hidden = {false},\n private_publication = {false},\n abstract = {The overwhelming amount of short text data on social media and elsewhere has posed great chal-lenges to topic modeling due to the sparsity prob-lem. Most existing attempts to alleviate this prob-lem resort to heuristic strategies to aggregate short texts into pseudo-documents before the application of standard topic modeling. Although such strate-gies cannot be well generalized to more general genres of short texts, the success has shed light on how to develop a generalized solution. In this pa-per, we present a novel model towards this goal by integrating topic modeling with short text aggre-gation during topic inference. The aggregation is founded on general topical affinity of texts rather than particular heuristics, making the model read-ily applicable to various short texts. Experimental results on real-world datasets validate the effective-ness of this new model, suggesting that it can distill more meaningful topics from short texts.},\n bibtype = {article},\n author = {Quan, Xiaojun and Kit, Chunyu and Ge, Yong and Pan, Sinno Jialin}\n}","author_short":["Quan, X.","Kit, C.","Ge, Y.","Pan, S., J."],"urls":{"Paper":"http://bibbase.org/service/mendeley/371589bb-c770-37ff-8193-93c6f25ffeb1/file/043a0160-5e36-4ca3-3997-64152e76b8bf/Short_and_Sparse_Text_Topic_Modeling_via_Self-Aggregation.pdf.pdf","Website":"http://www.ntu.edu.sg/home/sinnopan/publications/[IJCAI15]Short%20and%20Sparse%20Text%20Topic%20Modeling%20via%20Self-Aggregation.pdf"},"bibbaseid":"quan-kit-ge-pan-shortandsparsetexttopicmodelingviaselfaggregation","role":"author","keyword":["Technical Papers — Web and Knowledge-Based Information Systems"],"downloads":0},"search_terms":["short","sparse","text","topic","modeling","via","self","aggregation","quan","kit","ge","pan"],"keywords":["technical papers — web and knowledge-based information systems"],"authorIDs":[]}