Skip to main content


Fig. 1 | GigaScience

Fig. 1

From: Erratum to: A quantitative assessment of the Hadoop framework for analyzing massively parallel DNA sequencing data

Fig. 1

The ratio of the F Hadoop /F HPC as a function of the reciprocal dataset size in Gb. The pipelines were run on the Hadoop I and II clusters, as well as a 16 core HPC node. The analytical curve f(x) = (a1x + b1)/(a2x + b2) was used to fit the data for the stretches of linear scaling of calculation time on the HPC platform. The outliers are marked with crossed symbols

Back to article page