hadoop
Hadoop is a framework of distributed storage & computing.
* distributed storage: hadoop use HDFS to save large amount of data in cluster.
* distributed computing: hadoop use map-reduce framework to conduct fast data analysis (query & writing) over data in HDFS.
* resource manager & job schedular: hadoop use yarn to manage/allocate cluster resources (memory, cpu, etc.) and to schedule and moniter job executing.
Architecture
cluster architecture
request processing
Fault Tolerance
Use rack aware so that your replicas will be saved into different racks, which can solve the rack failure