ML3 - Crossword Labs

4. A simple model or method used as a reference point for comparing the performance of more complex models
7. A non-parametric measure of statistical dependence between two variables, calculated over the ranked values
9. A measure of how frequently the itemset appears in the dataset
11. A kind of correlation which quantifies the degree to which two variables are linearly related or associated
14. An open-source software library developed by Yandex
15. A phenomenon occurs when a model has not learned the patterns in the training data well and is unable to generalize well on the new data
16. Converting categorical data (text or labels) into a numerical format that can be used as input for machine learning algorithms
19. A numerical value that an agent receives as feedback from the environment after taking an action
20. A concept that measures the amount of uncertainty or disorder in a set of data
21. An ensemble machine learning algorithm that can be used in a wide variety of classification and regression tasks

1. A kind of analysis which is also known as the 80/20 rule or the principle of factor sparsity
2. A technique in machine learning and data analysis that involves grouping similar data points together based on certain criteria
3. A kind of function which is an S-shaped curve that maps any real-valued number to a value between 0 and 1
5. A phenomenon in probability and statistics in which a trend appears in several groups of data but disappears or reverses when the groups are combined
6. A kind of matrix also known as cost matrix or misclassification cost matrix
8. A metric used to evaluate the performance of a classification model
9. A kind of diagram showing the flow of resources or information from one set of entities to another
10. A one-dimensional form of data
12. A kind of regression modeling technique that assumes a linear relationship between the independent variables and the dependent variable
13. An ensemble learning technique in machine learning where multiple weak learners (models that perform slightly better than random chance) are combined to create a strong learner
17. A phenomenon in machine learning where a model learns the training data too well, capturing noise and random fluctuations rather than the underlying patterns
18. A kind of sampling where every member of the population has an equal chance of being included in the sample