Across
- 3. is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data
- 5. computer system monitoring, network monitoring and infrastructure monitoring software application
- 7. workflow scheduler system to manage Apache Hadoop jobs
- 8. naming registry for large distributed systems
- 10. database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure
- 12. software for searching, monitoring, and analyzing machine-generated big data, via a web-style interface
- 13. tool for indexing large blocks of unstructured text; a natural partner for Hadoop
- 14. distributed realtime computation system
Down
- 1. is a serialization system that bundles the data together with a schema
- 2. open-source software developed from Google’s MapReduce concept
- 4. NoSQL database that stores data in BSON format
- 6. distributed database modeled on Google Bigtable
- 9. library of machine learning algorithms
- 11. is a data warehouse infrastructure built on top of Hadoop for providing data summarization, query, and analysis
- 12. command-line interface application for transferring data between relational databases and Hadoop
