Are there additional resources to better understand how the sample_qc()
function computes the resulting statistics? I have looked at https://hail.is/docs/stable/hail.VariantDataset.html#hail.VariantDataset.sample_qc but there is not a lot there.
I imported a single chromosomes vcf from the 1000 genomes project as a vds and called sample_qc. The function worked and I received some statistics, but the results for dpMean, dpStDev, gqMean, and gqStDev were None for all samples.
vds = (hc.import_vcf('gs://genomics-public-data/1000-genomes/vcf/ALL.chr16.integrated_phase1_v3.20101123.snps_indels_svs.genotypes.vcf')
.split_multi()
.sample_qc()
.variant_qc())
df = vds.samples_table().to_pandas()
df.head()
Thanks for your help!