Hi,
We noticed a “strange behavior” when we try to load a VDS (~7400 1GB partitions) to Hail when Yarn always allocate many executors to one particular worker node before allocating 1 or no executors to the remaining node. I’m not sure if this is data locality problem or not. Is there a remedy you can suggest like reshuffling partitions, changing parquet block size?
Thanks!