Across
- 2. pattern that segregates the operations that read data (queries) from the operations and update data (commands) by using separate interfaces
- 4. data structure in SparkSQL which is strongly typed and is a map to a relational schema
- 6. Amazon's platform which gives hive and spark as a service deployment
- 8. Type of streaming which is based on Dataframes as logical foundation unit
- 11. Cross platform BSD licensed distributed monitoring tool for Big data clusters
- 12. AWS managed message queuing service
- 13. A mobile and web application development platform which also acts as a realtime database and backend as a service. Owned by google cloud from 2014
- 14. A row major file format whose primary design was schema evolution
Down
- 1. Type of Map side join when huge dataset is joined with a tiny dataset
- 3. Which V's in Big Data deals with "Uncertainty of Data"
- 5. Interactive browser-based notebook which brings data ingestion, data exploration, visualization, sharing and collaboration features to Hadoop and Spark
- 7. Project/Component for Spark SQL that provides more efficient Spark operations by working directly at the byte level. Defaulted from Spark 1.5
- 9. Type of spot instance request which spans across multiple AZs to increase the likelihood of getting your instance requirement fulfilled
- 10. Google Cloud __________ is a cloud-based data processing service for both batch and real-time data streaming applications
