Across
- 1. Popular optimizer with weight decay
- 4. Collection of training examples
- 6. Method that updates model weights
- 8. Raw scores before softmax
- 9. Adapt a pretrained model to a task
Down
- 2. Mechanism that weighs input relevance
- 3. Algorithm for training neural networks
- 4. Regularization by randomly disabling neurons
- 5. Vector representation of words or items
- 7. Basic text units used by language models
