mixed

123456789
Across
  1. 1. : In Spark, what is the distributed collection of data called that can be processed in parallel?
  2. 3. In data lake, what format is often preferred for its flexibility and schema-on-read capability.
  3. 5. which mechanism in Kafka is used to maintain the order of messages within a partition.
  4. 7. It processes large datasets by breaking them into smaller tasks, improving efficiency.
  5. 8. Which is the of summarizing data at different levels of granularity in OLAP.
  6. 9. which is been used to helps you to schedule and manage tasks, keeping your workflows in line.
Down
  1. 1. what type of integrity constraint in OLTP systems, ensures that data remains accurate and reliable
  2. 2. What is the basic storage unit of Kafka.
  3. 4. What schema type is commonly used in OLAP systems to enhance query performance through pre-aggregated data.
  4. 6. Which type of database is generally the most scalable and high performing.