BigData Family

Across
  4. Data warehouse infrastructure built on top of Hadoop for data summarization, query, and analysis
  5. Workflow scheduler system to manage Apache Hadoop jobs
  7. Software for searching, monitoring, and analyzing machine-generated big data via a web-style interface
  8. NoSQL database that stores data in BSON format
  9. Software application for computer system, network, and infrastructure monitoring
  11. Tool for indexing large blocks of unstructured text; a natural partner for Hadoop
  13. Distributed database modeled on Google Bigtable
  14. Command-line interface application for transferring data between relational databases and Hadoop
Down
  1. Library of machine learning algorithms
  2. Serialization system that bundles the data together with a schema
  3. Database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure
  6. Naming registry for large distributed systems
  10. Distributed real-time computation system
  12. Distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data
  13. Open-source software developed from Google’s MapReduce concept