Bigdata Compendium
Across
- 1. is a data warehouse infrastructure built on top of Hadoop for providing data summarization, query, and analysis
- 4. command-line interface application for transferring data between relational databases and Hadoop
- 7. Distribute database modeledbsed on google bigtable
- 9. is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data
- 10. software for searching, monitoring, and analyzing machine-generated big data, via a web-style interface
- 11. NO SQL database which store data in BSON format
- 13. is a serialization system that bundles the data together with a schema
- 14. workflow scheduler system to manage Apache Hadoop jobs
Down
- 1. open-source software was developed from Google’s MapReduce concept
- 2. Naming registry for large distributed system
- 3. distributed realtime computation system
- 5. library of machine learning algorithms
- 6. database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failur
- 8. tool for indexing large blocks of unstructured text, and it's a natural partner for Hadoop
- 12. computer system monitoring, network monitoring and infrastructure monitoring software application