{"_id":"DPAMuBMdLmDsWkrjT","bibbaseid":"kushmerick-johnston-mcguinness-informationextractionbytextclassification-2004","authorIDs":[],"author_short":["Kushmerick, N.","Johnston, E.","Mcguinness, S."],"bibdata":{"title":"Information extraction by text classification","type":"article","year":"2004","pages":"1-7","websites":"http://citeseer.ist.psu.edu/652372.html;","id":"bce0a6f6-42b5-3adf-bb00-d57398ca6db0","created":"2011-12-28T07:04:55.000Z","file_attached":"true","profile_id":"5284e6aa-156c-3ce5-bc0e-b80cf09f3ef6","group_id":"066b42c8-f712-3fc3-abb2-225c158d2704","last_modified":"2017-03-14T14:36:19.698Z","tags":"named entities","read":false,"starred":false,"authored":false,"confirmed":"true","hidden":false,"citation_key":"Kushmerick2004","private_publication":false,"abstract":"We investigate the application of classification techniques to the problem of information extraction (IE). In particular we use support vector machines and several different feature-sets to build a set of classifiers for information extraction. We show that this approach is competitive with current state-ofthe -art information extraction algorithms based on specialized learning algorithms. We also introduce a new technique for improving the recall of IE systems called convergent boundary classification. We show that this can give significant improvement in the performance of our IE system and gives a system with both high precision and high recall.","bibtype":"article","author":"Kushmerick, Nicholas and Johnston, Edward and Mcguinness, Stephen","journal":"Science","bibtex":"@article{\n title = {Information extraction by text classification},\n type = {article},\n year = {2004},\n pages = {1-7},\n websites = {http://citeseer.ist.psu.edu/652372.html;},\n id = {bce0a6f6-42b5-3adf-bb00-d57398ca6db0},\n created = {2011-12-28T07:04:55.000Z},\n file_attached = {true},\n profile_id = {5284e6aa-156c-3ce5-bc0e-b80cf09f3ef6},\n group_id = {066b42c8-f712-3fc3-abb2-225c158d2704},\n last_modified = {2017-03-14T14:36:19.698Z},\n tags = {named entities},\n read = {false},\n starred = {false},\n authored = {false},\n confirmed = {true},\n hidden = {false},\n citation_key = {Kushmerick2004},\n private_publication = {false},\n abstract = {We investigate the application of classification techniques to the problem of information extraction (IE). In particular we use support vector machines and several different feature-sets to build a set of classifiers for information extraction. We show that this approach is competitive with current state-ofthe -art information extraction algorithms based on specialized learning algorithms. We also introduce a new technique for improving the recall of IE systems called convergent boundary classification. We show that this can give significant improvement in the performance of our IE system and gives a system with both high precision and high recall.},\n bibtype = {article},\n author = {Kushmerick, Nicholas and Johnston, Edward and Mcguinness, Stephen},\n journal = {Science}\n}","author_short":["Kushmerick, N.","Johnston, E.","Mcguinness, S."],"urls":{"Paper":"https://bibbase.org/service/mendeley/bfdabac2-d7f2-3c5b-aa7a-06431c0ae35e/file/1827eb56-0690-3435-4c19-f34e8b5fd523/2004-Information_extraction_by_text_classification.pdf.pdf","Website":"http://citeseer.ist.psu.edu/652372.html;"},"bibbaseid":"kushmerick-johnston-mcguinness-informationextractionbytextclassification-2004","role":"author","downloads":0,"html":""},"bibtype":"article","creationDate":"2020-02-06T23:48:11.834Z","downloads":0,"keywords":[],"search_terms":["information","extraction","text","classification","kushmerick","johnston","mcguinness"],"title":"Information extraction by text classification","year":2004}