The Google File System is a scalable distributed file system for large distributed data-intensive applications. It provides fault tolerance while running on inexpensive commodity hardware, and it delivers high aggregate performance to a large number of clients.

It likes other distributed file system, it has master nodes to store meta data, and chunk server to store data.

It's interesting to read their design choices for google data-intensive applications, for example, the size of each chunk is 64M bytes, compared to usual 4K block size in unix file system.

Comments


Log in to comment or register here.