Hadoop Ecosystem

1234567891011
Across
  1. 2. Read RDBMS data into Hadoop SQ(L to Had)oop
  2. 6. Read log files into Hadoop ("Apache" Log Ride)
  3. 7. Buzzword for Download, Modify, and Save or optionally Import, Clean, and Export
  4. 9. Strategy for distributing parallel jobs by grouping data into sets for aggregation
  5. 10. Big Data infrastructure for distributing parallel processing jobs and managing job completion
  6. 11. Stores schema with data to pass to various programming languages
Down
  1. 1. manages jobs across large clusters and HBase configurations
  2. 3. Popular Hadoop query tool focused on parallel processing of large data sets
  3. 4. Stores redundant copies of data across clusters of commodity servers
  4. 5. Schedules and prioritizes Hadoop Batch Jobs
  5. 8. Machine Learning to classify data
  6. 10. Query tool for accessing HDFS data using SQL like language