NNDL

123456789101112131415161718192021
Across
  1. 2. What type of neural network is used for unsupervised learning and aims to encode input data into a lower-dimensional representation?
  2. 6. What optimization algorithm combines the benefits of both momentum and RMSprop?
  3. 11. What approach involves using a pre-trained model as the starting point for a new model?
  4. 13. What is the technique for representing categorical variables as binary vectors?
  5. 15. What technique is used to assess a model's performance by splitting the dataset into multiple subsets?
  6. 17. What is the representation of categorical variables as continuous vectors in neural networks?
  7. 19. What is the technique for initializing neural network weights to prevent vanishing or exploding gradients?
  8. 20. What type of neural network is designed for processing structured grid data, such as images?
  9. 21. What metric combines precision and recall for evaluating the performance of a classification model?
Down
  1. 1. The activation function commonly used in the output layer for multiclass classification.
  2. 3. What downsampling technique in convolutional neural networks selects the maximum value from a set of values?
  3. 4. What is the phenomenon where the gradient becomes extremely small during training?
  4. 5. What is the process of artificially increasing the size of a dataset by applying transformations to the existing data?
  5. 7. What technique is used to prevent overfitting by adding a penalty term to the loss function based on the magnitude of weights?
  6. 8. What is the iterative optimization algorithm used to minimize the loss function?
  7. 9. What is the activation function that replaces all negative values in the input with zero?
  8. 10. What technique involves normalizing inputs in a neural network to improve training stability?
  9. 12. What is the process of finding the best hyperparameter values for a machine learning model?
  10. 14. What is the technique of gradually reducing the learning rate during training to converge more effectively?
  11. 16. What activation function is commonly used in the output layer for multiclass classification?
  12. 18. What technique involves randomly dropping out some neurons during training to prevent overfitting?