Across
- 2. Delivery method that is ideal for subscribers needing close to real time performance
- 4. Pub/sub connects applications and services through a ______ infrastructure
- 7. Dataflow models find out when in processing time results are materialized via ____, triggers, and allowed lateness
- 12. In a streaming system on GCP this allows you to query data as it arrives from the streaming pipelines
- 13. In Dataproc, they cost less but may not always be available
- 14. A machine learning problem where the outcome to be predicted is a continuous number
- 17. Is a dataflow set of data in your pipeline
- 19. Is a class that will do logistic regression
- 23. BigQuery supports nested and ______ fields
- 24. Dataflow models find out how refinements of results relate via _______ modes
- 26. ML API can ___ scanned receipts
- 29. Determines the Google data centre where compute nodes will be
- 30. A _____ combines its inputs to map part of a decision surface in a neural network
- 31. Bigtable learns access patterns and attempts to distribute reads and storage across _______ evenly
- 32. Is a dataflow processing operation or step in your pipeline
- 33. Provides the ability for spark programs to seperate compute and storage
Down
- 1. Performance of this can be improved by changing schemas to minimize data skew
- 3. Is a dataflow endpoint for your pipeline
- 5. Installing software libraries on the master in dataproc typically uses an ______ script
- 6. Dataflow models find what results are calculated via ________
- 8. Enhancing ML data by extracting features from raw data is feature______
- 9. In a neural network the number of layers is a _______
- 10. One complete pass through the training dataset in Machine Learning
- 11. An efficient way to read data into TensorFlow
- 15. Dataflow models find where results are calculated via event-time _______
- 16. A small set of examples on which gradient is computed in Machine Learning
- 18. Bigtable is great for __________ data
- 20. Lets you train TensorFlow machine learning models at scale
- 21. This cluster mode in Dataproc provides 1 master and N workers
- 22. The best practice for querying a table, then querying the results of that query is to use a _____
- 25. A software framework for writing portable ML code
- 27. Makes it easy to create resilient streaming pipelines
- 28. Dataproc helps you create job-specific clusters without ______
