mixed
Across
- 1. In Spark, what is the distributed collection of data that can be processed in parallel called?
- 3. In a data lake, which format is often preferred for its flexibility and schema-on-read capability?
- 5. Which mechanism in Kafka is used to maintain the order of messages within a partition?
- 7. It processes large datasets by breaking them into smaller tasks, improving efficiency.
- 8. What is the process of summarizing data at different levels of granularity in OLAP called?
- 9. What helps you schedule and manage tasks, keeping your workflows in line?
Down
- 1. Which type of integrity constraint in OLTP systems ensures that data remains accurate and reliable?
- 2. What is the basic storage unit of Kafka?
- 4. Which schema type is commonly used in OLAP systems to enhance query performance through pre-aggregated data?
- 6. Which type of database is generally the most scalable and highest performing?