Across
- 2. Read RDBMS data into Hadoop SQ(L to Had)oop
- 6. Read log files into Hadoop ("Apache" Log Ride)
- 7. Buzzword for Download, Modify, and Save or optionally Import, Clean, and Export
- 9. Strategy for distributing parallel jobs by grouping data into sets for aggregation
- 10. Big Data infrastructure for distributing parallel processing jobs and managing job completion
- 11. Stores schema with data to pass to various programming languages
Down
- 1. manages jobs across large clusters and HBase configurations
- 3. Popular Hadoop query tool focused on parallel processing of large data sets
- 4. Stores redundant copies of data across clusters of commodity servers
- 5. Schedules and prioritizes Hadoop Batch Jobs
- 8. Machine Learning to classify data
- 10. Query tool for accessing HDFS data using SQL like language
