Hi,
I am laoding a multisaple VCF as:
dta = hl.import_vcf(filename, force_bgz = True, array_elements_required = True, min_partitions = 2)
mt = dta.key_rows_by('locus') \
.distinct_by_row() \
.key_rows_by('locus', 'alleles')
I have multiple variants annotated at the same loci, so I am spliting the file with:
x1 = hl.split_multi_hts(mt)
Then I am creating some annotations with transmute_entries and annotate_rows and I want to save the file.
But:
x1.write('~/annotated', overwrite = True)
[...]
FatalError: VCFParseError: missing value in FORMAT array. Import with argument 'array_elements_required=False'
Java stack trace:
org.apache.spark.SparkException: Job aborted due to stage failure: Task 3 in stage 53.0 failed 1 times, most recent failure: Lost task 3.0 in stage 53.0 (TID 559, localhost, executor driver): is.hail.utils.HailException: medpercan_20191011.1.red.vcf:column 12174: missing value in FORMAT array. Import with argument 'array_elements_required=False'
... 3:.:.:0,73,892,73,892,892 0/1:12,14,.:26:99:0|1:3789194_G_GCGGCCC:638,0, ...
^
offending line: 1 3789195 . T TGTCCCTGCCGC,C 289664 . BaseQRankSum=0.374;Cli...
[...]
Hail version: 0.2.22-597b3bd86135
Error summary: VCFParseError: missing value in FORMAT array. Import with argument 'array_elements_required=False'
Any help understanding what is happening or solving the issue is welcome.