{"_id":"vJeyMmxLL6jC9HAm5","bibbaseid":"fan-jernite-perez-grangier-weston-auli-eli5longformquestionanswering-2019","authorIDs":[],"author_short":["Fan, A.","Jernite, Y.","Perez, E.","Grangier, D.","Weston, J.","Auli, M."],"bibdata":{"bibtype":"article","type":"article","author":[{"propositions":[],"lastnames":["Fan"],"firstnames":["Angela"],"suffixes":[]},{"propositions":[],"lastnames":["Jernite"],"firstnames":["Yacine"],"suffixes":[]},{"propositions":[],"lastnames":["Perez"],"firstnames":["Ethan"],"suffixes":[]},{"propositions":[],"lastnames":["Grangier"],"firstnames":["David"],"suffixes":[]},{"propositions":[],"lastnames":["Weston"],"firstnames":["Jason"],"suffixes":[]},{"propositions":[],"lastnames":["Auli"],"firstnames":["Michael"],"suffixes":[]}],"title":"ELI5: Long Form Question Answering","journal":"arXiv","volume":"","number":"","pages":"1907.09190v1","year":"2019","abstract":"We introduce the first large-scale corpus for long-form question answering, a task requiring elaborate and in-depth answers to open-ended questions. The dataset comprises 270K threads from the Reddit forum ``Explain Like I’m Five’’ (ELI5) where an online community provides answers to questions which are comprehensible by five year olds. Compared to existing datasets, ELI5 comprises diverse questions requiring multi-sentence answers. We provide a large set of web documents to help answer the question. Automatic and human evaluations show that an abstractive model trained with a multi-task objective outperforms conventional Seq2Seq, language modeling, as well as a strong extractive baseline. However, our best model is still far from human performance since raters prefer gold responses in over 86% of cases, leaving ample opportunity for future improvement.","location":"","keywords":"","bibtex":"@Article{Fan2019,\nauthor = {Fan, Angela and Jernite, Yacine and Perez, Ethan and Grangier, David and Weston, Jason and Auli, Michael}, \ntitle = {ELI5: Long Form Question Answering}, \njournal = {arXiv}, \nvolume = {}, \nnumber = {}, \npages = {1907.09190v1}, \nyear = {2019}, \nabstract = {We introduce the first large-scale corpus for long-form question answering, a task requiring elaborate and in-depth answers to open-ended questions. The dataset comprises 270K threads from the Reddit forum ``Explain Like I’m Five’’ (ELI5) where an online community provides answers to questions which are comprehensible by five year olds. Compared to existing datasets, ELI5 comprises diverse questions requiring multi-sentence answers. We provide a large set of web documents to help answer the question. Automatic and human evaluations show that an abstractive model trained with a multi-task objective outperforms conventional Seq2Seq, language modeling, as well as a strong extractive baseline. However, our best model is still far from human performance since raters prefer gold responses in over 86\\% of cases, leaving ample opportunity for future improvement.}, \nlocation = {}, \nkeywords = {}}\n\n\n","author_short":["Fan, A.","Jernite, Y.","Perez, E.","Grangier, D.","Weston, J.","Auli, M."],"key":"Fan2019","id":"Fan2019","bibbaseid":"fan-jernite-perez-grangier-weston-auli-eli5longformquestionanswering-2019","role":"author","urls":{},"downloads":0},"bibtype":"article","biburl":"https://gist.githubusercontent.com/stuhlmueller/a37ef2ef4f378ebcb73d249fe0f8377a/raw/6f96f6f779501bd9482896af3e4db4de88c35079/references.bib","creationDate":"2020-01-27T02:13:33.743Z","downloads":0,"keywords":[],"search_terms":["eli5","long","form","question","answering","fan","jernite","perez","grangier","weston","auli"],"title":"ELI5: Long Form Question Answering","year":2019,"dataSources":["hEoKh4ygEAWbAZ5iy"]}