Across
- 3. Which data structure are Spark DataFrames built on top of?
- 4. What is a group of tasks that can be executed in parallel to compute the same set of operations, potentially on multiple machines?
- 7. A common file format for storing semi-structured data in a columnar layout
- 8. How many permission levels are there for a compute resource in Databricks?
- 10. What is the distributed stream-processing engine developed by Apache for high-throughput, low-latency processing?
Down
- 1. Which SQL keyword can be used to append new rows to an existing Delta table?
- 2. The name of a query engine designed for interactive queries over large datasets, commonly used with cloud data warehouses
- 5. A system used to manage and orchestrate data workflows
- 6. What is a combination of a block of data and a set of transformations that will run on a single executor?
- 9. What system allows users to interact with big data using SQL-like queries and is often integrated with Hadoop?
