Text classification and named entities for new event detection. Kumaran, G. & Allan, J. Proceedings of the 27th annual international conference on Research and development in information retrieval SIGIR 04, ACM Press, 2004.
Text classification and named entities for new event detection [pdf]Paper  Text classification and named entities for new event detection [link]Website  abstract   bibtex   
New Event Detection is a challenging task that still offers scope for great improvement after years of effort. In this paper we show how performance on New Event Detection (NED) can be improved by the use of text classification techniques as well as by using named entities in a new way. We explore modifications to the document representation in a vector space-based NED system. We also show that addressing named entities preferentially is useful only in certain situations. A combination of all the above results in a multi-stage NED system that performs much better than baseline single-stage NED systems.

Downloads: 0