SUBTLEX ESP: Spanish word frequencies based on film subtitles

SUBTLEX ESP: Spanish word frequencies based on film subtitles. Cuetos, F., González-Nosti, M., Barbón, A., & Brysbaert, M. Psicológica, 211(32):133–143.

Paper abstract bibtex

Recent studies have shown that word frequency estimates obtained from films and television subtitles are better to predict performance in word recognition experiments than the traditional word frequency estimates based on books and newspapers. In this study, we present a subtitle-based word frequency list for Spanish, one of the most widely spoken languages. The subtitle frequencies are based on a corpus of 41M words taken from contemporary movies and TV series (screened between 1990 and 2009). In addition, the frequencies have been validated by correlating them with the RTs from two megastudies involving 2,764 words each (lexical decision and word naming tasks). The subtitle frequencies explained 6% more of the variance than the existing written frequencies in lexical decision, and 2% extra in word naming.

@article{cuetos_subtlex_nodate,
	title = {{SUBTLEX}	{ESP}: {Spanish} word frequencies based on film subtitles},
	volume = {211},
	url = {http://www.uv.es/psicologica/articulos2.11/1CUETOS.pdf},
	abstract = {Recent studies have shown that word frequency estimates obtained from films and television subtitles are better to predict performance in word recognition experiments than the traditional word frequency estimates based on books and newspapers. In this study, we present a subtitle-based word frequency list for Spanish, one of the most widely spoken languages. The subtitle frequencies are based on a corpus of 41M words taken from contemporary movies and TV series (screened between 1990 and 2009). In addition, the frequencies have been validated by correlating them with the RTs from two megastudies involving 2,764 words each (lexical decision and word naming tasks). The subtitle frequencies explained 6\% more of the variance than the existing written frequencies in lexical decision, and 2\% extra in word naming.},
	number = {32},
	urldate = {2015-07-20},
	journal = {Psicológica},
	author = {Cuetos, Fernando and González-Nosti, María and Barbón, Analia and Brysbaert, Marc},
	pages = {133--143},
}

Downloads: 0

{"_id":"v9oRDBWNN84e8dBEG","bibbaseid":"cuetos-gonzleznosti-barbn-brysbaert-subtlexespspanishwordfrequenciesbasedonfilmsubtitles","author_short":["Cuetos, F.","González-Nosti, M.","Barbón, A.","Brysbaert, M."],"bibdata":{"bibtype":"article","type":"article","title":"SUBTLEX ESP: Spanish word frequencies based on film subtitles","volume":"211","url":"http://www.uv.es/psicologica/articulos2.11/1CUETOS.pdf","abstract":"Recent studies have shown that word frequency estimates obtained from films and television subtitles are better to predict performance in word recognition experiments than the traditional word frequency estimates based on books and newspapers. In this study, we present a subtitle-based word frequency list for Spanish, one of the most widely spoken languages. The subtitle frequencies are based on a corpus of 41M words taken from contemporary movies and TV series (screened between 1990 and 2009). In addition, the frequencies have been validated by correlating them with the RTs from two megastudies involving 2,764 words each (lexical decision and word naming tasks). The subtitle frequencies explained 6% more of the variance than the existing written frequencies in lexical decision, and 2% extra in word naming.","number":"32","urldate":"2015-07-20","journal":"Psicológica","author":[{"propositions":[],"lastnames":["Cuetos"],"firstnames":["Fernando"],"suffixes":[]},{"propositions":[],"lastnames":["González-Nosti"],"firstnames":["María"],"suffixes":[]},{"propositions":[],"lastnames":["Barbón"],"firstnames":["Analia"],"suffixes":[]},{"propositions":[],"lastnames":["Brysbaert"],"firstnames":["Marc"],"suffixes":[]}],"pages":"133–143","bibtex":"@article{cuetos_subtlex_nodate,\n\ttitle = {{SUBTLEX}\t{ESP}: {Spanish} word frequencies based on film subtitles},\n\tvolume = {211},\n\turl = {http://www.uv.es/psicologica/articulos2.11/1CUETOS.pdf},\n\tabstract = {Recent studies have shown that word frequency estimates obtained from films and television subtitles are better to predict performance in word recognition experiments than the traditional word frequency estimates based on books and newspapers. In this study, we present a subtitle-based word frequency list for Spanish, one of the most widely spoken languages. The subtitle frequencies are based on a corpus of 41M words taken from contemporary movies and TV series (screened between 1990 and 2009). In addition, the frequencies have been validated by correlating them with the RTs from two megastudies involving 2,764 words each (lexical decision and word naming tasks). The subtitle frequencies explained 6\\% more of the variance than the existing written frequencies in lexical decision, and 2\\% extra in word naming.},\n\tnumber = {32},\n\turldate = {2015-07-20},\n\tjournal = {Psicológica},\n\tauthor = {Cuetos, Fernando and González-Nosti, María and Barbón, Analia and Brysbaert, Marc},\n\tpages = {133--143},\n}\n\n","author_short":["Cuetos, F.","González-Nosti, M.","Barbón, A.","Brysbaert, M."],"key":"cuetos_subtlex_nodate","id":"cuetos_subtlex_nodate","bibbaseid":"cuetos-gonzleznosti-barbn-brysbaert-subtlexespspanishwordfrequenciesbasedonfilmsubtitles","role":"author","urls":{"Paper":"http://www.uv.es/psicologica/articulos2.11/1CUETOS.pdf"},"metadata":{"authorlinks":{}}},"bibtype":"article","biburl":"https://bibbase.org/zotero/juliob","dataSources":["P49CRQ3roC5tkkHG3"],"keywords":[],"search_terms":["subtlex","esp","spanish","word","frequencies","based","film","subtitles","cuetos","gonzález-nosti","barbón","brysbaert"],"title":"SUBTLEX ESP: Spanish word frequencies based on film subtitles","year":null}