Crossword
Across
- 5. Set of all tokens
- 6. Start, end, etc.
- 7. Splits text into tokens
- 10. Commas, periods, etc.
- 11. Numerical representations of text
- 13. Spaces and tabs
Down
- 1. Tunable parameter
- 2. Compressing long prompts
- 3. Reducing sequence length
- 4. How often pairs appear
- 8. Byte Pair Encoding
- 9. Handling various data types
- 12. Large Language Models