Lec1-Introduction

1. Distributed System

Infrastracture of application
1. storage:focus on, useeful adstraction
2. communication: tool need to be used to create distributed system
3. computation: mapreduce
for storage and computation, out goal is to discover abstraction where use of simplyfing the interface of system(hide the distributed system, make syste look like non-distributed system)
topic
1. Implemenatation, Tool: RPC, Threads, Concurrency Control
2. Performance, Scalability: increasing number of people, increasing number of computer
3. Fault Tolerance
  1. Availability：huge numbers of computer makes failure constant thing.
  2. Recoverability：for example, save disk date for recovery.
  3. Tool: non-volatile storage, replication
4. Consistency：for key-value, there are two operation: put(k, v), get(k). but in distributed system, the replica makes many version.
  1. level of consistency
    1. Strong consistency: make gurantee
    2. Weak consistency: do not make gurantee
  2. Don’t put replica(copy) in the same place

Purpose: Looking for a framework that would make it easy for non specialists to be able to write and run giant distributed computations.
Map function
1. run on each of input file(or chunk)
2. output a list of key value pair
3. for example, input: a b, output: <a,1> <b,1>
Reduce function
1. collect all output of maps
2. for example: each reduce deal with a word, result in<a, 2>, <b, 2>.
Output of MapReduce can be used as input of MapReduce, like PageRank
More in paper.

MIT-6.824

#MIT-6.824 #分布式 #Map Reduce

MIT-6.824-Lecture1-Introduction

https://spricoder.github.io/2022/02/18/MIT-6.824/MIT-6-824-Lecture1-Introduction/

作者

SpriCoder

发布于

2022年2月18日

许可协议