It would be beneficial for our use case to support hashlib functionality in hail. Currently the only way to hash a column of strings is to export to pandas, then do the hashing and bring it back into hail after which isn’t totally ideal given the amount of data it needs to be done for.
This is what I am trying to do in hail with no luck:
hashed = hta2.annotate(ALT_HASH = hashlib.md5(str.encode(str(hta2.ALT))).hexdigest())
Tim mentioned it would fit nicely into the expr module.