Crossword Hadoop Ecosystem
Across
- 3. A scripting language designed for processing and analyzing large datasets
- 4. A NoSQL database built on top of HDFS, used for real-time read/write access
- 5. Namenode A backup system that periodically copies metadata from NameNode to prevent data loss.
- 7. The intermediate process that sorts and groups key-value pairs before reducing.
- 10. A component that stores actual data blocks in HDFS.
Down
- 1. A tool used for querying large datasets using SQL-like commands
- 2. A framework for processing large datasets using the Map and Reduce programming model
- 6. A distributed file system that stores large amounts of data across multiple nodes
- 7. A tool used for moving data between Hadoop and relational databases
- 8. A resource management layer that schedules and allocates cluster resources
- 9. A machine learning library built on top of Hadoop for scalable data analysis