Information extraction and text summarization using linguistic knowledge acquisition. Rau, L., F., Jacobs, P., S., & Zernik, U. Information Processing & Management, 25(4):419-428, 1989.
Information extraction and text summarization using linguistic knowledge acquisition [link]Website  abstract   bibtex   
Storing and accessing texts in a conceptual format has a number of advantages over traditional document retrieval methods. A conceptual format facilitates natural language access to text information. It can support imprecise and inexact queries, conceptual information summarization, and, ultimately, document translation. The lack of extensive linguistic coverage is the major barrier to extracting useful information from large bodies of text. Current natural language processing (NLP) systems do not have rich enough lexicons to cover all the important words and phrases in extended texts. Two methods of overcoming this limitation are (1) to apply a text processing strategy that is tolerant of unknown words and gaps in linguistics knowledge, and (2) to acquire lexical information automatically from the texts. These two methods have been implemented in a prototype intelligent information retrieval system called SCISOR (System for Conceptual Information Summarization, Organization and Retrieval). This article describes the text processing, language acquisition, and summarization components of SCISOR.
@article{
 title = {Information extraction and text summarization using linguistic knowledge acquisition},
 type = {article},
 year = {1989},
 identifiers = {[object Object]},
 pages = {419-428},
 volume = {25},
 websites = {http://dx.doi.org/10.1016/0306-4573(89)90069-1},
 id = {56c50683-1b20-3444-978c-aa72f66fd8c0},
 created = {2012-04-01T16:32:49.000Z},
 file_attached = {false},
 profile_id = {5284e6aa-156c-3ce5-bc0e-b80cf09f3ef6},
 group_id = {066b42c8-f712-3fc3-abb2-225c158d2704},
 last_modified = {2017-03-14T14:36:19.698Z},
 read = {false},
 starred = {false},
 authored = {false},
 confirmed = {true},
 hidden = {false},
 citation_key = {Rau1989},
 private_publication = {false},
 abstract = {Storing and accessing texts in a conceptual format has a number of advantages over traditional document retrieval methods. A conceptual format facilitates natural language access to text information. It can support imprecise and inexact queries, conceptual information summarization, and, ultimately, document translation. The lack of extensive linguistic coverage is the major barrier to extracting useful information from large bodies of text. Current natural language processing (NLP) systems do not have rich enough lexicons to cover all the important words and phrases in extended texts. Two methods of overcoming this limitation are (1) to apply a text processing strategy that is tolerant of unknown words and gaps in linguistics knowledge, and (2) to acquire lexical information automatically from the texts. These two methods have been implemented in a prototype intelligent information retrieval system called SCISOR (System for Conceptual Information Summarization, Organization and Retrieval). This article describes the text processing, language acquisition, and summarization components of SCISOR.},
 bibtype = {article},
 author = {Rau, L F and Jacobs, P S and Zernik, U},
 journal = {Information Processing & Management},
 number = {4}
}

Downloads: 0