Bigdata compendium

1234567891011121314
Across
  1. 3. is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data
  2. 5. computer system monitoring, network monitoring and infrastructure monitoring software application
  3. 7. workflow scheduler system to manage Apache Hadoop jobs
  4. 8. Naming registry for large distributed system
  5. 10. database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure
  6. 12. software for searching, monitoring, and analyzing machine-generated big data, via a web-style interface
  7. 13. tool for indexing large blocks of unstructured text, and it's a natural partner for Hadoop
  8. 14. distributed realtime computation system
Down
  1. 1. is a serialization system that bundles the data together with a schema
  2. 2. open-source software was developed from Google’s MapReduce concept
  3. 4. NO SQL database which store data in BSON format
  4. 6. Distribute database modeledbsed on google bigtable
  5. 9. library of machine learning algorithms
  6. 11. is a data warehouse infrastructure built on top of Hadoop for providing data summarization, query, and analysis
  7. 12. command-line interface application for transferring data between relational databases and Hadoop