Are there recommended hardware configurations or guidelines for Hadoop data nodes for Hail 0.2?
- How many drives per data node
- Drive type (SATA
- Number of cores per data node
- Memory per core or memory per node
Given a fixed budget, are we better off with fewer data nodes with the fastest CPUs, or with more data nodes with slower CPUs?
This Hadoop cluster will be used to analyze VCFs ranging from 5K to 500K.