Universal Sentence Encoder. Cer, D., Yang, Y., Kong, S., Hua, N., Limtiaco, N., St. John, R., Constant, N., Guajardo-Cespedes, M., Yuan, S., Tar, C., Sung, Y.-H., Strope, B., & Kurzweil, R.
We present models for encoding sentences into embedding vectors that specifically target transfer learning to other NLP tasks. The models are efficient and result in accurate performance on diverse transfer tasks. Two variants of the encoding models allow for trade-offs between accuracy and compute resources. For both variants, we investigate and report the relationship between model complexity, resource consumption, the availability of transfer task training data, and task performance. Comparisons are made with baselines that use word level transfer learning via pretrained word embeddings as well as baselines that do not use any transfer learning. We find that transfer learning using sentence embeddings tends to outperform word level transfer. With transfer learning via sentence embeddings, we observe surprisingly good performance with minimal amounts of supervised training data for a transfer task. We obtain encouraging results on Word Embedding Association Tests (WEAT) targeted at detecting model bias. Our pre-trained sentence encoding models are made freely available for download and on TF Hub.
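
The pre-trained encoders are distributed through TF Hub. As a rough illustration (not taken from the paper itself; the module handle, version, and 512-dimensional output are assumptions based on the publicly released TF Hub module), a Python sketch of loading the encoder and embedding a few sentences might look like:

import tensorflow_hub as hub

# Assumed TF Hub handle and version for the encoder; adjust the URL if a
# different variant (Transformer vs. DAN) or release is wanted.
embed = hub.load("https://tfhub.dev/google/universal-sentence-encoder/4")

sentences = [
    "The quick brown fox jumps over the lazy dog.",
    "Sentence embeddings transfer well to downstream NLP tasks.",
]

# Each sentence is mapped to a fixed-length embedding vector
# (512 dimensions for this module).
embeddings = embed(sentences)
print(embeddings.shape)  # -> (2, 512)

The resulting vectors can then be fed to a lightweight task-specific classifier, which is the sentence-level transfer-learning setup the paper evaluates.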
@article{cerUniversalSentenceEncoder2018,
  archivePrefix = {arXiv},
  eprinttype = {arxiv},
  eprint = {1803.11175},
  primaryClass = {cs},
  title = {Universal {{Sentence Encoder}}},
  url = {http://arxiv.org/abs/1803.11175},
  abstract = {We present models for encoding sentences into embedding vectors that specifically target transfer learning to other NLP tasks. The models are efficient and result in accurate performance on diverse transfer tasks. Two variants of the encoding models allow for trade-offs between accuracy and compute resources. For both variants, we investigate and report the relationship between model complexity, resource consumption, the availability of transfer task training data, and task performance. Comparisons are made with baselines that use word level transfer learning via pretrained word embeddings as well as baselines that do not use any transfer learning. We find that transfer learning using sentence embeddings tends to outperform word level transfer. With transfer learning via sentence embeddings, we observe surprisingly good performance with minimal amounts of supervised training data for a transfer task. We obtain encouraging results on Word Embedding Association Tests (WEAT) targeted at detecting model bias. Our pre-trained sentence encoding models are made freely available for download and on TF Hub.},
  urldate = {2019-02-24},
  date = {2018-03-29},
  keywords = {Computer Science - Computation and Language},
  author = {Cer, Daniel and Yang, Yinfei and Kong, Sheng-yi and Hua, Nan and Limtiaco, Nicole and {St. John}, Rhomni and Constant, Noah and Guajardo-Cespedes, Mario and Yuan, Steve and Tar, Chris and Sung, Yun-Hsuan and Strope, Brian and Kurzweil, Ray},
}
