bigdata-compendium

1234567891011121314
Across
  1. 2. is a serialization system that bundles the data together with a schema
  2. 4. open-source software was developed from Google’s MapReduce concept
  3. 6. is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data
  4. 7. software for searching, monitoring, and analyzing machine-generated big data, via a web-style interface
  5. 10. computer system monitoring, network monitoring and infrastructure monitoring software application
  6. 11. Distribute database modeledbsed on google bigtable
  7. 12. distributed realtime computation system
  8. 14. NO SQL database which store data in BSON format
Down
  1. 1. workflow scheduler system to manage Apache Hadoop jobs
  2. 3. Naming registry for large distributed system
  3. 5. tool for indexing large blocks of unstructured text, and it's a natural partner for Hadoop
  4. 8. is a data warehouse infrastructure built on top of Hadoop for providing data summarization, query, and analysis
  5. 9. database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure
  6. 13. command-line interface application for transferring data between relational databases and Hadoop
  7. 14. library of machine learning algorithms