Hi @danking Thanks again for the help. It looks like export_vcf
is having a hard time writing this information to a VCF file. I tried
vcf = vcf.annotate_rows(gencode=gtf_file[vcf.locus])
which gives the warning
Hail: WARN: export_vcf: ignored the following fields: 'gencode' (row)
and
vcf = vcf.annotate_rows(info = vcf.info.annotate(gencode = gtf_file[vcf.locus]))
,
which shows
hail.utils.java.FatalError: HailException: VCF does not support SSubsetStruct(SSubsetStruct(SBaseStructPointer(PCStruct{source:PCString,feature:PCString,score:PFloat64,strand:PCString,frame:PInt32,tag:PCString,level:PCString,gene_id:PCString,gene_type:PCString,ccdsid:PCString,exon_id:PCString,exon_number:PCString,havana_gene:PCString,transcript_type:PCString,protein_id:PCString,gene_name:PCString,transcript_name:PCString,transcript_id:PCString,transcript_support_level:PCString,hgnc_id:PCString,ont:PCString,havana_transcript:PCString,interval:PCInterval[PCLocus(GRCh38)],__partNum:+PInt32}),Vector(source, feature, score, strand, frame, tag, level, gene_id, gene_type, ccdsid, exon_id, exon_number, havana_gene, transcript_type, protein_id, gene_name, transcript_name, transcript_id, transcript_support_level, hgnc_id, ont, havana_transcript, interval)),Vector(source, feature, score, strand, frame, tag, level, gene_id, gene_type, ccdsid, exon_id, exon_number, havana_gene, transcript_type, protein_id, gene_name, transcript_name, transcript_id, transcript_support_level, hgnc_id, ont, havana_transcript))
.
Do you have any suggestions for how to circumvent this issue? Thanks again for all of your help.