From: Handling the data management needs of high-throughput sequencing data: SpeedGene, a compression algorithm for the efficient storage of genetic data
Dataset    Size       PLINK      Gzip       SpeedGene  Avg MAF
FHS        8.822 GB   564.6 MB   1.400 GB   460 MB     0.238637
COPDgene   161 MB     10.1 MB    20.5 MB    3.6 MB     0.057327
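
The sizes in the table can be turned into compression ratios (original size divided by compressed size) to compare the three methods directly. The following is a minimal sketch, not part of the original article: the helper names `to_mb` and `compression_ratios` are hypothetical, and the conversion 1 GB = 1000 MB is an assumption, since the table does not state which convention it uses.

```python
# Sizes copied verbatim from the table above.
# ASSUMPTION: 1 GB = 1000 MB; the source does not state the convention.
UNIT_TO_MB = {"MB": 1.0, "GB": 1000.0}

TABLE = {
    "FHS": {"Size": "8.822 GB", "PLINK": "564.6 MB",
            "Gzip": "1.400 GB", "SpeedGene": "460 MB"},
    "COPDgene": {"Size": "161 MB", "PLINK": "10.1 MB",
                 "Gzip": "20.5 MB", "SpeedGene": "3.6 MB"},
}


def to_mb(size: str) -> float:
    """Convert a string such as '8.822 GB' to megabytes (hypothetical helper)."""
    value, unit = size.split()
    return float(value) * UNIT_TO_MB[unit]


def compression_ratios(row: dict) -> dict:
    """Return original size / compressed size for each method in one table row."""
    original = to_mb(row["Size"])
    return {
        method: original / to_mb(size)
        for method, size in row.items()
        if method != "Size"
    }


for dataset, row in TABLE.items():
    for method, ratio in compression_ratios(row).items():
        print(f"{dataset:9s} {method:10s} {ratio:6.1f}x")
```

A higher ratio means a smaller file relative to the original. For COPDgene every size is given in MB, so its ratios do not depend on the assumed GB conversion.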