Hi,
I have an ANN entry field within info, that I want to display a count of the specific gene symbol or name, using the aggregator.
my code is as follows where anngen is the MT:
anngen.aggregate_entries(hl.agg.explode(lambda element: hl.agg.counter(element), anngen.info.ANN))
this returns:
2021-11-12 12:53:32 Hail: INFO: Coerced prefix-sorted dataset
frozendict({’|3_prime_UTR_variant|MODIFIER|LACE1|ENSG00000135537|transcript|ENST00000368977|protein_coding|13/13|c.|||||1556|INFO_REALIGN_3_PRIME’: 2504, ‘|3_prime_UTR_variant|MODIFIER|MIER1|ENSG00000198160|transcript|ENST00000355356|protein_coding|13/13|c.|||||904|INFO_REALIGN_3_PRIME’: 2504,
I believe the gene names are in the 4th position, array element 3. The above was great for exploding and aggregating counts of the entire ANN , but I am specifically wanting to enumerate the gene name list.