Crossword Hadoop Ecosystem

12345678910
Across
  1. 3. A scripting language designed for processing and analyzing large datasets
  2. 4. A NoSQL database built on top of HDFS, used for real-time read/write access
  3. 5. Namenode A backup system that periodically copies metadata from NameNode to prevent data loss.
  4. 7. The intermediate process that sorts and groups key-value pairs before reducing.
  5. 10. A component that stores actual data blocks in HDFS.
Down
  1. 1. A tool used for querying large datasets using SQL-like commands
  2. 2. A framework for processing large datasets using the Map and Reduce programming model
  3. 6. A distributed file system that stores large amounts of data across multiple nodes
  4. 7. A tool used for moving data between Hadoop and relational databases
  5. 8. A resource management layer that schedules and allocates cluster resources
  6. 9. A machine learning library built on top of Hadoop for scalable data analysis