Across
- 2. The process of interpreting and examining data to extract meaningful insights
- 5. Detection Identifying unusual patterns or outliers in data
- 7. data variables that represent categories or groups and can take on a limited, fixed number of distinct values
- 8. Categorizing data points into predefined classes or groups
- 10. An ensemble learning algorithm that combines weak classifiers to create a strong classifier
- 12. An error in a model that causes it to consistently predict values away from the true values
- 14. Technologies, processes, and tools that help organizations make informed business decisions
- 15. Sampling A resampling technique where random samples are drawn with replacement from a dataset
- 16. Test A statistical test used to determine if there is a significant association between two categorical variables
- 17. Grouping similar data points together based on certain criteria
Down
- 1. Data Large and complex datasets that cannot be easily processed using traditional data processing methods
- 2. A set of rules that allows one software application to interact with another
- 3. A metric that tells us how well a classification model is doing overall, considering different ways of deciding what counts as a positive or negative prediction
- 4. Gradient Descent An optimization algorithm that updates model parameters using the entire training dataset (different from mini-batch gradient descent)
- 5. A step-by-step set of instructions or rules followed by a computer to solve a problem or perform a task
- 6. A statistical method used to analyze the differences among group means in a sample
- 9. Testing A statistical method used to compare two versions of a product, webpage, or model to determine which performs better
- 11. The measure of how often a classification model correctly predicts outcomes among all instances it evaluates
- 12. Classification Categorizing data into two groups, such as spam or not spam
- 13. Tradeoff The balance between the error introduced by bias and variance in a model
- 15. Statistics A statistical approach that combines prior knowledge with observed data
