Jags_FL
August 4th, 2009, 02:51 PM
Tahoe is a secure cloud filesystem that is licensed under the GPL. Its distributed storage model, which resembles peer-to-peer networking, makes it possible to build a shared storage pool using excess drive capacity from multiple computers across the Internet.
When a file is deployed to Tahoe, it is encrypted and split into pieces that are spread out across ten separate nodes. Using a variation of Reed-Solomon error correction, it can reconstruct a file using only three of the original ten nodes. This helps to ensure data integrity when some nodes are unavailable. This is a bit similar to how RAID storage works. Tahoe uses a library called zfec that provides an efficient implementation of the error correction code and exposes it through a Python API.
There is a simple interactive mockup (http://bigasterisk.com/tahoe-playground/) that illustrates visually how Tahoe's distributed storage works.
More at ARS Technica (http://arstechnica.com/open-source/news/2009/08/p2p-like-tahoe-filesystem-offers-secure-storage-in-the-cloud.ars)
When a file is deployed to Tahoe, it is encrypted and split into pieces that are spread out across ten separate nodes. Using a variation of Reed-Solomon error correction, it can reconstruct a file using only three of the original ten nodes. This helps to ensure data integrity when some nodes are unavailable. This is a bit similar to how RAID storage works. Tahoe uses a library called zfec that provides an efficient implementation of the error correction code and exposes it through a Python API.
There is a simple interactive mockup (http://bigasterisk.com/tahoe-playground/) that illustrates visually how Tahoe's distributed storage works.
More at ARS Technica (http://arstechnica.com/open-source/news/2009/08/p2p-like-tahoe-filesystem-offers-secure-storage-in-the-cloud.ars)