Across
- 4. is a data warehouse infrastructure built on top of Hadoop for providing data summarization, query, and analysis
- 5. workflow scheduler system to manage Apache Hadoop jobs
- 7. software for searching, monitoring, and analyzing machine-generated big data, via a web-style interface
- 8. NO SQL database which store data in BSON format
- 9. computer system monitoring, network monitoring and infrastructure monitoring software application
- 11. tool for indexing large blocks of unstructured text, and it's a natural partner for Hadoop
- 13. Distribute database modeledbsed on google bigtable
- 14. command-line interface application for transferring data between relational databases and Hadoop
Down
- 1. library of machine learning algorithms
- 2. is a serialization system that bundles the data together with a schema
- 3. database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure
- 6. Naming registry for large distributed system
- 10. distributed realtime computation system
- 12. is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data
- 13. open-source software was developed from Google’s MapReduce concept
