Hello everyone
Currently i face one problem. I am truing to Import VCF with standard FORMAT set like call_fields=[‘GT’,‘AD’,‘DP’,‘GQ’,‘PL’]
I am doing in in Jupiter notebook, on my local machine. I am trying download 1K genomes… For this i am using following command
vcfs = [f"data/1kg/ALL.chr{contig}.phase3_shapeit2_mvncall_integrated_v5b.20130502.genotypes.vcf.gz" for contig in range(1, 23)]
hl.import_vcf(vcfs,force_bgz=True,call_fields=[‘DP’]).write(‘Haill_mt/1kg_Full.mt’, overwrite=True)
Then it gave me an error:
Hail version: 0.2.97-937922d7f46c
Error summary: HailException: Can only convert a header line with type ‘String’ to a call type. Found ‘Integer’.
Obviously problem occur only with ‘DP’ (coverage ). When i truing other FORMAT settings like this call_fields=[‘AO’,‘RO’,‘GT’,‘AD’,‘GQ’,‘PL’,‘PGT’], the script is running.
vcfs = [f"data/1kg/ALL.chr{contig}.phase3_shapeit2_mvncall_integrated_v5b.20130502.genotypes.vcf.gz" for contig in range(1, 23)]
hl.import_vcf(vcfs,force_bgz=True,call_fields=[‘AO’,‘RO’,‘GT’,‘AD’,‘GQ’,‘PL’,‘PGT’]).write(‘Haill_mt/1kg_Full.mt’, overwrite=True)
Of course, i can calculate DP with equitation DP=AO+RO. Then i can add DP to MatrixTable afterwards
But i wounder, is it possible to add DP to Matrix Table at import_vcf step?