SUBLEX-us ========= SubtlexUS is database containing word frequencies based on English-US movies and TV series subtitles (51 million words in total). Two measures are provided 1. The frequency per million words, called SUBTLEXWF (Subtitle frequency: word form frequency) 2. The percentage of films in which a word occurs, called SUBTLEXCD (Subtitle frequency: contextual diversity; see Adelman, Brown, & Quesada (2006) for the qualities of this measure). **Table**: [SUBTLEXus74286wordstextversion.tsv](http://www.lexique.org/databases/SUBTLEX-US/SUBTLEXus74286wordstextversion.tsv) **Website:** **Publication:** Brysbaert, Marc, and Boris New. 2009. "Moving beyond Kučera and Francis: A Critical Evaluation of Current Word Frequency Norms and the Introduction of a New and Improved Word Frequency Measure for American English." Behavior Research Methods 41 (4): 977--990. ([pdf](Brysbaert_New_2009_Moving%20beyond%20Kučera%20and%20Francis.pdf)) Brysbaert, Marc, Boris New, and Emmanuel Keuleers (2012). "Adding Part-of-Speech Information to the SUBTLEX-US Word Frequencies." *Behavior Research Methods* 44 (4): 991--997. ([pdf](Brysbaert2012_Article_AddingPart-of-speechInformatio.pdf))