Bigdata

1234567891011
Across
  1. 2. open source software framework used to develop data processing applications which are executed in a distributed computing environment.
  2. 4. collection of data that is huge in volume, yet growing exponentially with time
  3. 7. helps you to manage the state of an HDFS node and allows you to interacts with the blocks
  4. 8. world’s largest Hadoop cluster
  5. 9. data represented in an XML file
  6. 11. The data is increasing at a very fast rate. It is estimated that the volume of data will double in every 2 years.
Down
  1. 1. The amount of data which we deal with is of very large size of Peta bytes.
  2. 3. represented every files and directory which is used in the namespace
  3. 5. refers to heterogeneous sources and the nature of data
  4. 6. data is distributed over several machines and replicated to ensure their durability to failure and high availability to parallel application.
  5. 10. an example of unstructured data