Across
- 2. A measure of spread derived from the squared difference between each value in a set and the mean and the number of observations in the set. [8]
- 7. A type of machine learning model that groups data observations together without specifying a response variable. [12]
- 9. The Exponential is a member of this two parameter continuous probability distribution family. [5]
- 10. [blank]Forest. [6]
- 11. If Input corresponds to “Feature” then Output corresponds to [blank]. [5]
- 12. Developing a model that accurately predicts on a training set, but fails when trying to predict on a new set of data. [11]
- 13. Repeatable sets of instructions which people or machines can use to process data. [9]
Down
- 1. Hierarchical, Density-based, or Distribution-based [blank]. [10]
- 3. A [blank] “Matrix” that data science engineers love, not sure about Neo 😊. [9]
- 4. When the expected value of a statistic equals the parameter, it is considered [blank]. [8]
- 5. Amazon’s 'People who bought this also bought…' is a [blank] algorithm. [14]
- 6. The information that describes the properties of a data object. [8]
- 8. The selection of a subset of observations from within a statistical population. [8]
