Fastest way to parse a large VCF file in Python for GWAS analysis?
I have a VCF file with ~15 million SNPs and 5000 samples (~40 GB). I need to extract allele frequencies and filter by MAF >…
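To make the task concrete, here is a minimal sketch of a streaming pass that computes the minor allele frequency (MAF) per site from the genotype columns and keeps sites above a cutoff. It is a baseline for checking correctness, not a claim about the fastest approach. The function names (`open_vcf`, `iter_maf`, `filter_by_maf`) and the `min_maf` parameter are hypothetical, and the cutoff is left to the caller because the threshold in the question is cut off. It assumes biallelic sites, skips multiallelic records, and assumes `GT` is the first FORMAT key.

```python
import gzip


def open_vcf(path):
    """Open a plain-text or gzip-compressed VCF as a text stream."""
    if str(path).endswith(".gz"):
        return gzip.open(path, "rt")
    return open(path, "rt")


def iter_maf(lines):
    """Yield (chrom, pos, variant_id, maf) for each biallelic record.

    `lines` is any iterable of VCF text lines, so the file is never
    loaded into memory as a whole.
    """
    for line in lines:
        if line.startswith("#"):
            continue  # meta-information and header lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 10:
            continue  # no sample columns
        if "," in fields[4]:
            continue  # multiallelic site: skipped in this sketch
        if fields[8].split(":", 1)[0] != "GT":
            continue  # assumption: GT is the first FORMAT key
        alt = 0
        called = 0
        for sample in fields[9:]:
            gt = sample.split(":", 1)[0]
            for allele in gt.replace("|", "/").split("/"):
                if allele == "0":
                    called += 1
                elif allele == "1":
                    alt += 1
                    called += 1
                # "." (missing) alleles are not counted
        if called == 0:
            continue  # every genotype missing at this site
        af = alt / called
        yield fields[0], int(fields[1]), fields[2], min(af, 1.0 - af)


def filter_by_maf(lines, min_maf):
    """Return the records whose MAF is strictly greater than min_maf."""
    return [rec for rec in iter_maf(lines) if rec[3] > min_maf]
```

In practice `lines` would be the handle returned by `open_vcf(path)`. With roughly 15 million sites it is probably better to consume `iter_maf` lazily and write results out as you go instead of building the list that `filter_by_maf` returns.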