Bigdata Compendium

Across
  1. Data warehouse infrastructure built on top of Hadoop that provides data summarization, query, and analysis
  4. Command-line interface application for transferring data between relational databases and Hadoop
  7. Distributed database modeled on Google's Bigtable
  9. Distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data
  10. Software for searching, monitoring, and analyzing machine-generated big data via a web-style interface
  11. NoSQL database that stores data in BSON format
  13. Serialization system that bundles the data together with a schema
  14. Workflow scheduler system for managing Apache Hadoop jobs
Down
  1. Open-source software developed from Google's MapReduce concept
  2. Naming registry for large distributed systems
  3. Distributed real-time computation system
  5. Library of machine learning algorithms
  6. Database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure
  8. Tool for indexing large blocks of unstructured text; a natural partner for Hadoop
  12. Software application for computer system, network, and infrastructure monitoring