Dear Hail team,
Due to the GCP log4j vulnerability (CVE-2021-44228) issue, some Dataproc Image versions are deprecated. I use hail 0.1 with image 1.1-debian9 (w/ spark2.0.2), it works well. however, now 1.1-debian9 is deprecated, and I cannot find a new suitable dataproc image version for my hail 0.1 jar. Does any one know how to solve this? which image could be used now? All helps are welcome and I would really appreciate!
I also considered move to Hail 0.2, but my input is not a VCF but a merged VDS containing all autosomal chrs which already went through all QC steps, all I need now is split it into each chrs and export to VCF. I cannot find a suitable function in Hail0.2 to do this task. (I find: hail.vds.to_dense_mt
, but according this description: “Hail 0.1 also had a Variant Dataset class. Although pieces of the interfaces are similar, they should not be considered interchangeable and do not represent the same data.” I am not sure I can use it or not.
Thanks a lot, Shuang