Across
- 2. A _______ describes the contents, structure, and layout of a dataset collection.
- 6. Data _______ is the idea that digital information should be accessible and understandable to the average end user as a basis for decision-making.
- 7. Twitter data is a great resource for performing the tasks like opinion mining, sentiment analysis, and can be accessed using the Twitter ____.
- 8. Solution to replace Unicorn data scientist is have a data science _______.
- 9. ______ is the idea that the output of an algorithm, or any computer function for that matter, is only as good as the quality of the input that it receives.
- 12. Web _______ is the process of pulling data from a website’s source code.
- 13. You may need to make ____ requests when you want to get more detail on the projects that government money is funding.
Down
- 1. ________ is a tool that regulate the storage of massive datasets.
- 2. Data ______ is closely related to data quality.
- 3. An ______ is a data point that is considered extremely far from other points.
- 4. Missing data can add _____ to a model which result in overestimate or underestimate value.
- 5. Data scientists spend 80% of their time cleaning and manipulating data and only 20% of their time actually _______ it.
- 10. ______ data is data which is acquired through various computer network operations but not used in any manner to derive insights or for decision making.
- 11. Data ______ is the process of structuring your data in a way that makes it easy to analyze and use.
