ChatGPT generated puzzle

1234567891011121314151617181920
Across
  1. 4. A neural network architecture that relies on self-attention mechanisms.
  2. 6. Part of a transformer architecture that processes the input data.
  3. 7. Algorithm used to adjust weights in neural networks.
  4. 9. A function applied to the output of a neural network node to introduce non-linearity.
  5. 11. A process where a function calls itself, often used in algorithmic problem-solving.
  6. 13. The process of converting text into smaller units such as words or subwords.
  7. 14. A type of RNN designed to better handle long-term dependencies in sequential data.
  8. 17. Part of a transformer that generates the output from encoded data.
  9. 18. Transformer-based model that excels at natural language understanding.
  10. 19. A type of machine learning model, often used in image recognition.
  11. 20. A multi-dimensional array of data, fundamental to deep learning computations.
Down
  1. 1. When a model learns to perform too well on training data, but fails on new data.
  2. 2. A learning paradigm where the model is trained on labeled data.
  3. 3. A regularization technique where random units are ignored during training to prevent overfitting.
  4. 5. A step-by-step procedure used for solving problems or performing computations.
  5. 8. A basic unit in a neural network, introduced in the 1950s.
  6. 10. A framework involving a generator and a discriminator, often used in image generation.
  7. 12. A supervised learning algorithm used for classification tasks.
  8. 15. An individual measurable property or characteristic of a phenomenon being observed.
  9. 16. Type of machine learning where agents learn by interacting with an environment.