Hail implementation of RUTH

jjfarrell · October 8, 2019, 2:27pm

There is a robust implementation of HWE Test that takes into account population substructure:
RUTH - Robust Unified Hardy-Weinberg Equilibrium Test. It is used for the TopMed pipeline.

The GitHub repository is here:

It would be great to have an implementation in Hail to run on matrix tables containing a large number of samples with mixed ethnicity…

tpoterba · October 8, 2019, 6:12pm

it looks like all we need is a way to do logistic regression on an arbitrary array/ndarray. Thanks for the submission, this should be feasible to implement in not too long, I think.

jjfarrell · July 1, 2020, 4:08pm

Has there been any progress on implementing a version of RUTH in hail?

tpoterba · July 2, 2020, 6:59pm

We’re still building out the ndarray infrastructure that will let us write hl.logistic_regression_rows in Python. I’m not sure exactly the timeline, but I’d hope we have the set of features necessary to implement RUTH in Hail in Python done by the end of 2020.

Topic		Replies	Views
Logistic regression on entries Hail Query & hailctl	10	1286	December 6, 2021
Normalized Likelihood w/ Respect to Percentile Grid Hail Query & hailctl	3	371	June 16, 2020
Announcing Hail 0.2! Updates	2	4890	October 22, 2018
Hail stuck when running array data Hail Query & hailctl	4	305	October 9, 2023
Added ability to compute HWE on subsets of samples Updates	0	986	November 7, 2016

Hail implementation of RUTH

Related topics