Across
- 5. Contains spoken and written American English from TV shows, blogs, and magazines, providing a modern perspective on word usage and trends.
- 6. Sequences of n words that occur together in a text.
- 10. a corpus is designed for a particular domain, field, or variety of language,
- 11. Assigning grammatical tags to each word
- 12. A collection of authentic written texts from various genres, like fiction, news, and academic prose, used to study English language patterns.
- 13. adding metadata to a text corpus to enable deeper analysis
- 15. Offers detailed annotations on voice quality, intonation, and pronunciation aspects.
- 16. finding words in context and studying their usage
Down
- 1. focuses on theoretical ideas like Universal Grammar to explain the deeper, universal structures of language.
- 2. Measuring the variety of words in a corpus
- 3. words that frequently appear together
- 4. a leader in corpus linguistics who developed innovative tools for analyzing real-world language data, such as concordance programs.
- 7. Removing suffixes to obtain the root form of a word.
- 8. Calculating the number of times a word appears in a corpus
- 9. Recurring sequences of words that function as urits of meaning
- 14. A corpora is that contains aligned translations of the same text in multiple languages
