Across
- 3. is the process of integrating multiple data sources to produce more consistent, accurate, and useful information
- 5. is an open-source framework for storing and processing big data in a distributed environment
- 8. is a processing technique and a programming model for distributed computing based on Java.
- 10. framework operates on <key,value> pairs
- 11. Applications implement the Map and Reduce functions, which form the core of the job.
- 12. programming is a style of computer programming in which algorithms are written in terms of types to-be-specified-later
Down
- 1. job is to process the input data.
- 2. A program is an execution of a Mapper and Reducer across a dataset.
- 4. Big Data refers to structured, unstructured, and ------------- data
- 6. has been used in the industry to provide customer insights for transparent and simpler products
- 7. learning is a method of data analysis that automates analytical model building.
- 9. job is to process the data that comes from the mapper.
- 13. is a fully managed, cloud-native, enterprise data integration service
