Samiksha Jaiswal (Editor)

Cooperative storage cloud

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit

A cooperative storage cloud is a decentralized model of networked online storage where data is stored on multiple computers (nodes), hosted by the participants cooperating in the cloud. For the cooperative scheme to be viable, the total storage contributed in aggregate must be at least equal to the amount of storage needed by end users. However, some nodes may contribute less storage and some may contribute more. There may be reward models to compensate the nodes contributing more.

Contents

Unlike a traditional storage cloud, a cooperative does not directly employ dedicated servers for the actual storage of the data, thereby eliminating the need for a significant dedicated hardware investment. Each node in the cooperative runs specialized software which communicates with a centralized control and orchestration server, thereby allowing the node to both consume and contribute storage space to the cloud. The centralized control and orchestration server requires several orders of magnitude less resources (storage, computing power, and bandwidth) to operate, relative to the overall capacity of the cooperative.

Data security

Files hosted in the cloud are fragmented and encrypted before leaving the local machine. They are then distributed randomly using a load balancing and geo-distribution algorithm to other nodes in the cooperative. Users can add an additional layer of security and reduce storage space by compressing and encrypting files before they are copied to the cloud.

Data redundancy

In order to maintain data integrity and high availability across a relatively unreliable set of computers over a wide area network like the Internet, the source node will add some level of redundancy to each data block. This allows the system to recreate the entire block even if some nodes are temporarily unavailable (due to loss of network connectivity, the machine being powered off or a hardware failure). The most storage and bandwidth efficient forms of redundancy use erasure coding techniques like Reed-Solomon. A simple, less CPU intensive but more expensive form of redundancy is duplicate copies.

Flexible contribution

Due to bandwidth or hardware constraints some nodes may not be able to contribute as much space as they consume in the cloud. On the other hand, nodes with large storage space and limited or no bandwidth constraints may contribute more than they consume, thereby the cooperative can always stay in balance.

Examples

The University of California's OceanStore project, MIT's Chord, is a non-commercial example.

Sia is a fully distributed system using a blockchain where a cryptocurrency is exchanged for storage, and a daemon that can act both as a file host and as a "renter" of storage, using Merkle trees to provide strong guarantees that hosts will only get paid if they successfully hold data for the duration of a "storage contract". Data is encrypted and spread among hosts in a redundant manner, using erasure coding, to maximize availability. According to one of its creators, Sia was the first platform of its kind to launch an end-user product.

On June 1, 2016 Minebox GMBH announced that their forthcoming Minebox Networked Attached Storage device (NAS) will utilize the Sia network to persist its backups. Users of the Minebox will also be able to rent their free disk space via Sia.

Storj is another example: based on the Bitcoin blockchain technology and a peer-to-peer architecture, it intends to provide cloud storage to people. It is currently developing two applications to achieve this goal: MetaDisk, which lets the user upload files to the network, and DriveShare, allowing users to rent out their storage space to MetaDisk users.

A partly centralized system was operated by Symform, Inc., a startup company based in Seattle. Symform generated and kept the keys used to encrypt and decrypt, and since it also decided which server will host which parts of a file, users have to trust Symform not to share those with any other party or misuse the information. Symform discontinued its service on Jul 31, 2016.

References

Cooperative storage cloud Wikipedia