Across
- 2. _________________ is a coordination and synchronization service that helps a distributed set of computers make decisions by consensus, handle failures, etc.
- 5. _____________________ tactics include applications that can display real-time changes and more illustrative graphics, thus going beyond pie, bar and other charts.
- 8. MongoDB, Cassandra, HBase, Neo4j are _______ DBs
- 9. _______ is a programming model and an associated implementation for processing and generating large data sets with a parallel, distributed algorithm on a cluster.
- 11. ______________ is an ETL tool
- 12. A subset of big data, _______ includes information that companies collect, process and store but don't properly use, analyze or monetize.
- 15. _____________________ states that every two years, the number of transistors we’re able to fit on a given silicon chip doubles.
- 16. VOLUME, VELOCITY, VARIETY & ________
Down
- 1. It is impossible for a distributed computer system to simultaneously provide all three of Consistency, Availability & Partition tolerance. What is this theorem called?
- 3. _____________ is a high-throughput, distributed messaging system originally developed at LinkedIn to manage the service's activity stream (data about a website's usage) and operational data processing pipeline (about the performance of server components).
- 4. The Apache _______ sorted, distributed key/value store is a robust, scalable, high-performance data storage and retrieval system.
- 6. Information that either does not have a pre-defined data model or is not organized in a pre-defined manner
- 7. Implementation of the associative memory paradigm for parallel/distributed computing
- 10. ________________ is a library of scalable machine-learning algorithms, implemented on top of Apache Hadoop® and using the MapReduce paradigm.
- 13. What is the name for 1000^8 (one septillion) bytes?
- 14. _______________ is an open-source platform for consolidating, combining and understanding large-scale data in order to make better business decisions.
