Data Engineering Glossary

12345678910
Across
  1. 2. Centralized repository designed to store, process, and secure large amounts of structured, semistructured, and unstructured data
  2. 4. An open-source distributed event streaming platform used by thousands of companies for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications.
  3. 6. Data visualization and business intelligence tool used for reporting and analyzing vast volumes of data
  4. 7. Big data processing framework comprising of MapReduce, YARN and HDFS
  5. 9. Distributed NoSQL database popular for columnar storage capabilities
  6. 10. Software engineering process that simplifies the diagram of a software system by applying certain formal techniques and provides the blueprint for building a new database or reengineering legacy applications
Down
  1. 1. The process of moving data from a data warehouse into third party systems to make data operational
  2. 3. An unified programming model that can implement both batch and streaming data processing jobs that run on any execution engine
  3. 5. Relational database table like construct in programming languages
  4. 8. Process of uncovering patterns and other valuable information from large data sets