BIG DATA CROSSWORD

12345678910111213141516
Across
  1. 2. _________________is a coordination and synchronization service that a distributed set of computer make decisions by consensus, handles failure, etc.
  2. 5. _____________________tactics include applications that can display real-time changes and more illustrative graphics, thus going beyond pie, bar and other charts.
  3. 8. MongoDB, Cassandra, HBase, Neo4 are _______ DBs
  4. 9. _______ is a programming model and an associated implementation for processing and generating large data sets with a parallel, distributed algorithm on a cluster.
  5. 11. ______________ is an ETL tool
  6. 12. A subset of big data,_______ includes information that companies collect, process and store but don't properly utilize, analyze or take advantage of to monetize.
  7. 15. _____________________ states that every two years, the number of transistors we’re able to fit on a given silicon chip doubles.
  8. 16. VOLUME,VELOCITY,VARIETY & ________
Down
  1. 1. It is impossible for a distributed computer system to simultaneously provide Consistency, Availibility & Partition tolerance. What is this theorem called?
  2. 3. _____________is a high-throughput, distributed messaging system originally developed at LinkedIn to manage the service's activity stream (data about a Website's usage) and operational data processing pipeline (about the performance of server components).
  3. 4. The Apache _______ sorted, distributed key/value store is a robust, scalable, high performance data storage and retrieval system.
  4. 6. Information that either does not have a pre-defined data model or is not organized in a pre-defined manner
  5. 7. Implementation of the associative memory paradigm for parallel/distributed computing
  6. 10. ________________ is a library of scalable machine-learning algorithms, implemented on top of Apache Hadoop® and using the MapReduce paradigm.
  7. 13. 1000^8(Septillions)is how many bytes?
  8. 14. _______________ is an open source platform for consolidating, combining and understanding large-scale data in order to make better business decisions.