I'm building an index of data, which will entail storing lots of triplets in the form (document, term, weight). I will be storing up to a few million such rows. Currently ...
I want to develop a multimedia system that needs to store millions of videos and images, so I want to choose a distributed storage subsystem. Can anyone give me some suggestions ...
Neither Cassandra nor HBase efficiently supports storing blobs such as images. Storing files directly on HDFS stresses the NameNode. Facebook's Haystack is not open source. So is Lustre ...