We currently have a single node GlusterFS solution, which has network bandwidth cap of 1Gbps and cannot scale up unless we add a few more nodes or find a better a better storage solution.
A solution which can be considered for future storage needs is Amazon EFS. Amazon EFS is still in preview state when this wiki was created.
GlusterFS seems to be a good overall solution, because of the following reasons;
What GlusterFS lacks?
For More FAQ's please visit: http://www.gluster.org/community/documentation/index.php/GlusterFS_Technical_FAQ
BeeGFS/FhGFS seems to be a good solution for performance, because of the following reasons;
What BeeGFS lacks?
MAPR-FS seems to be a good overall for performance and storage, because of the following reasons;
What MapR-FS lacks?
We have tested the S3 performance on AWS and we found it to be efficient and highly available.
Here are some statistics;
1073741824 bytes (1.1 GB) copied, 9.97808 s, 108 MB/s 4294967296 bytes (4.3 GB) copied, 37.887 s, 113 MB/s
We have reached the maximum of Gigabit speed on S3 similar to HDFS (though not limited to Bandwidth of our Storage Server)
Note : The performance is dependent on the type of Instance used, the above test was done on c1.xlarge machine.