Intelligence studio Fun

1234567891011121314151617
Across
  1. 2. Python-based framework for orchestrating workflows
  2. 4. Programming language used in PySpark
  3. 5. Component in Spark that optimizes query execution plans
  4. 8. Default cluster manager for Spark
  5. 10. Newer Airflow feature that allows event-driven DAGs
  6. 12. Spark engine that processes structured data
  7. 13. Process of executing a task in Airflow
  8. 14. Directed structure that defines tasks and dependencies in Airflow
  9. 15. File format optimized for big data processing in Spark
  10. 17. Component responsible for scheduling tasks in Airflow
Down
  1. 1. Distributed computing framework that Spark is often compared to
  2. 3. Process of breaking jobs into smaller execution tasks in Spark
  3. 5. Fast and general-purpose computation engine in Spark
  4. 6. Default executor in Airflow for running tasks sequentially
  5. 7. User interface used to monitor DAGs in Airflow
  6. 9. Database backend that stores Airflow metadata
  7. 11. Spark API for running SQL queries
  8. 16. Fundamental data structure in Spark