Cascaded Data Mining Methods for Text Understanding, with medical case study. Romano, R., Rokach, L., & Maimon, O. In Workshops Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 18-22 December 2006, Hong Kong, China, pages 458-462, 2006.
Cascaded Data Mining Methods for Text Understanding, with medical case study [link]Link  doi  abstract   bibtex   
Substantial electronically stored textual data such as clinical narratives reports often need to be retrieved to find relevant information for clinical and research purposes. The context of negation, a negative finding, is of special importance, since many of the most frequently described findings are such. Hence, when searching free-text narratives for patients with a certain medical condition, if negation is not taken into account, many of the documents retrieved were irrelevant. We present a new cascaded pattern learning method for automatic identification of negative context in clinical narratives reports. Studying the training corpuses, the classification errors and patterns selected by the classifier, we noticed that it is possible to create a more powerful ensemble structure than the structure obtained from general-purpose ensemble method (such as Adaboost). We compare the new algorithm to previous methods proposed for the same task of similar medical narratives, and show its advantages: accuracy improvement compared to other machine learning methods, and much faster than manual knowledge engineering techniques with matching accuracy
@InProceedings{DBLP:conf/icdm/RomanoRM06,
  author        = {Roni Romano and
               Lior Rokach and
               Oded Maimon},
  title         = {Cascaded Data Mining Methods for Text Understanding, with
               medical case study},
  ee            = {http://doi.ieeecomputersociety.org/10.1109/ICDMW.2006.38},
  bibsource     = {DBLP, http://dblp.uni-trier.de},
  pages         = {458-462},
  booktitle     = {Workshops Proceedings of the 6th IEEE International Conference
               on Data Mining (ICDM 2006), 18-22 December 2006, Hong Kong,
               China},
  year          = {2006},
  abstract={Substantial electronically stored textual data such as clinical narratives reports often need to be retrieved to find relevant information for clinical and research purposes. The context of negation, a negative finding, is of special importance, since many of the most frequently described findings are such. Hence, when searching free-text narratives for patients with a certain medical condition, if negation is not taken into account, many of the documents retrieved were irrelevant. We present a new cascaded pattern learning method for automatic identification of negative context in clinical narratives reports. Studying the training corpuses, the classification errors and patterns selected by the classifier, we noticed that it is possible to create a more powerful ensemble structure than the structure obtained from general-purpose ensemble method (such as Adaboost). We compare the new algorithm to previous methods proposed for the same task of similar medical narratives, and show its advantages: accuracy improvement compared to other machine learning methods, and much faster than manual knowledge engineering techniques with matching accuracy},
  doi={10.1109/ICDMW.2006.38}, 
  keywords	= {Information retrieval, Medical informatics, Ensemble learning, Text mining, Sequence mining}
}

Downloads: 0