dec_mixed

12345678910
Across
  1. 3. Which of the following data structures are Spark DataFrames built on top of?
  2. 4. Which is a group of tasks that can be executed in parallel to compute the same set of operations on potentially multiple machines
  3. 7. A common format for storing semi-structured data in columnar format
  4. 8. How many level of Permissions for a compute in databricks?
  5. 10. What is the distributed stream-processing engine developed by Apache for high-throughput, low-latency processing?
Down
  1. 1. Which SQL keyword can be used to append new rows to an existing Delta table?
  2. 2. A name of the query engine designed for interactive queries over large datasets, commonly used with cloud data warehouses?
  3. 5. A system is used to manage and orchestrate data workflows.
  4. 6. Which is a combination of a block of data and a set of transformers that will run on a single executor
  5. 9. What system allows users to interact with big data using SQL-like queries and is often integrated with Hadoop?