Across
- 4. A simple model or method used as a reference point for comparing the performance of more complex models
- 7. A non-parametric measure of statistical dependence between two variables, calculated over the ranked values
- 9. A measure of how frequently the itemset appears in the dataset
- 11. A kind of correlation which quantifies the degree to which two variables are linearly related or associated
- 14. An open-source software library developed by Yandex
- 15. A phenomenon occurs when a model has not learned the patterns in the training data well and is unable to generalize well on the new data
- 16. Converting categorical data (text or labels) into a numerical format that can be used as input for machine learning algorithms
- 19. A numerical value that an agent receives as feedback from the environment after taking an action
- 20. A concept that measures the amount of uncertainty or disorder in a set of data
- 21. An ensemble machine learning algorithm that can be used in a wide variety of classification and regression tasks
Down
- 1. A kind of analysis which is also known as the 80/20 rule or the principle of factor sparsity
- 2. A technique in machine learning and data analysis that involves grouping similar data points together based on certain criteria
- 3. A kind of function which is an S-shaped curve that maps any real-valued number to a value between 0 and 1
- 5. A phenomenon in probability and statistics in which a trend appears in several groups of data but disappears or reverses when the groups are combined
- 6. A kind of matrix also known as cost matrix or misclassification cost matrix
- 8. A metric used to evaluate the performance of a classification model
- 9. A kind of diagram showing the flow of resources or information from one set of entities to another
- 10. A one-dimensional form of data
- 12. A kind of regression modeling technique that assumes a linear relationship between the independent variables and the dependent variable
- 13. An ensemble learning technique in machine learning where multiple weak learners (models that perform slightly better than random chance) are combined to create a strong learner
- 17. A phenomenon in machine learning where a model learns the training data too well, capturing noise and random fluctuations rather than the underlying patterns
- 18. A kind of sampling where every member of the population has an equal chance of being included in the sample
