Big Data Concepts and Tools - Big Data Technologies
5 important questions on Big Data Concepts and Tools - Big Data Technologies
What are the common characteristics of emerging Big Data technologies?
- enable scale-out and parallel-processing techniques
- employ non-relational data storage capabilities to process unstructured and semistructured data
- apply advanced analytics and data visualization technology to Big Data o convey insights to end users
What is MapReduce? What does it do? How does it do it?
Is a technique to distribute the processing of very large multistructured data files across a large cluster of machines.
high performance is achieved by breaking the processing into small units of work that can be run in parallel across the hundreds, potentially thousands of nodes in the cluster.
What is Hadoop? How does it work?
It breaks the data up in parts, which are then loaded into a file system made up of multiple nodes running on commodity hardware.
Hadoop Distributed File System(HDFS)
- Higher grades + faster learning
- Never study anything twice
- 100% sure, 100% understanding
What are the main Hadoop components? What functions do they perform?
Young technology, still immature
What is NoSQL? How does it fit into Big Data analytics picture?
Still immature
The question on the page originate from the summary of the following study material:
- A unique study and practice tool
- Never study anything twice again
- Get the grades you hope for
- 100% sure, 100% understanding