Here's a short list of English corpora that are commonly used as sources for word frequency lists. Whether you are a linguist, a data scientist, or just someone curious about language, these resources can be very useful.
Common Word Frequency Lists
The Brown Corpus
- This is one of the most widely used English corpora for word frequency studies.
- It contains about 1 million words, drawn from 500 text samples across 15 genres.
The COCA Corpus
- The Corpus of Contemporary American English is a large, balanced corpus of American English.
- It contains more than one billion words (earlier releases had about 450 million) and is organized into eight genres, including spoken, fiction, and newspaper texts.
The BNC Corpus
- The British National Corpus is a collection of samples of written and spoken English from a wide range of sources.
- It contains about 100 million words, roughly 90% written and 10% spoken.
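How a Frequency List Is Built
As a rough illustration of how a word frequency list can be derived from any of the corpora above, here is a minimal Python sketch. It is not the method used by any of these projects: the function name frequency_list, the simple regex tokenizer, and the sample sentence are all assumptions made for the example. Real corpora use more careful tokenization and often lemmatization. The per-million column shows the normalization that lets you compare counts across corpora of different sizes.

```python
import re
from collections import Counter


def frequency_list(text, top_n=None):
    """Return (word, count, per_million) tuples, most frequent first.

    Hypothetical helper for illustration only.
    """
    # Naive tokenizer (an assumption): lowercase alphabetic words,
    # optionally with one internal apostrophe, e.g. "don't".
    tokens = re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())
    counts = Counter(tokens)
    total = sum(counts.values())
    if total == 0:
        return []
    # Normalize to occurrences per million tokens.
    return [
        (word, count, count * 1_000_000 / total)
        for word, count in counts.most_common(top_n)
    ]


# Tiny stand-in for a corpus; replace with text loaded from a real one.
sample = "The cat sat on the mat. The dog sat on the log."

for word, count, per_million in frequency_list(sample, top_n=3):
    print(f"{word:<5} {count:>3} {per_million:>12.1f}")
```

To use it on a real corpus, read the corpus text into a string (or process it file by file and add the Counter objects together) and pass it to the same function.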