I m planning a Hadoop cluster, and I expect to be receiving just under 1TB of data per week which will be stored on the cluster, using Hadoop's default replication. So, what I decide is that all the slave nodes will be configured with 4 x 1TB disks.
But is there any way to calculate how many slave nodes I need to deploy at a minimum to store one year's worth of data.
Thanks in advance!