Across
- 2. Centralized repository designed to store, process, and secure large amounts of structured, semistructured, and unstructured data
- 4. An open-source distributed event streaming platform used by thousands of companies for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications.
- 6. Data visualization and business intelligence tool used for reporting and analyzing vast volumes of data
- 7. Big data processing framework comprising of MapReduce, YARN and HDFS
- 9. Distributed NoSQL database popular for columnar storage capabilities
- 10. Software engineering process that simplifies the diagram of a software system by applying certain formal techniques and provides the blueprint for building a new database or reengineering legacy applications
Down
- 1. The process of moving data from a data warehouse into third party systems to make data operational
- 3. An unified programming model that can implement both batch and streaming data processing jobs that run on any execution engine
- 5. Relational database table like construct in programming languages
- 8. Process of uncovering patterns and other valuable information from large data sets
