Hail 0.2 export gwas results to tsv

hhx037 · October 16, 2018, 4:06pm

I’m trying to export the gwas results to a tsv, via a table, but I’m not able to shake the structure format. This is how I do it:

results = gwas.rows()
results.export('file:////root/logreg_wald.tsv.bgz')

This works, but the resulting file requires a lot of parsing:

locus alleles rsid cm_position logreg
1:768448 [“G”,“A”] rs12562034 0.0000e+00 {“beta”:0.24564585404573575,“standard_error”:0.09766742410866683,“z_stat”:2.515125757513836,“p_value”:0.011898993159370147,“fit”:{“n_iterations”:4,“converged”:true,“exploded”:false}}

The explode() method does not apply here (and does not apply to structures). Is there a quick way to save the results as different columns? (logreg.beta etc would be just fine)

tpoterba · October 16, 2018, 5:15pm

Table.flatten is what you’re looking for here!

tpoterba · October 16, 2018, 5:15pm

you can also do something like

results = results.select(**results.logreg)

in between the two lines you have there

hhx037 · October 16, 2018, 7:28pm

That makes perfect sense, thank you. I was looking for a flatten option in rows and export, but didn’t think to look if a table.flatten existed.

hhx037 · October 16, 2018, 7:30pm

One more thing though, I don’t understand the two stars in results.select(**results.logreg), what do they stand for? I saw this for ds.annotate() also.

jbloom · October 16, 2018, 7:34pm

It’s unpacking the struct results.logreg into a top-level list of keyword pairs (field name and expression):
http://treyhunner.com/2018/10/asterisks-in-python-what-they-are-and-how-to-use-them/

tpoterba · October 16, 2018, 7:59pm

it’s usually a good idea to avoid the ** in annotate unless you’re sure you want to be doing that, as it can do things like overwrite existing fields. Generally annotate_cols(phenos = pheno_table[mt.s]) is safer than annotate_cols(**pheno_table[mt.s]), but will require you to access fields with mt.phenos.pheno1 rather than me.pheno1

hhx037 · October 18, 2018, 9:43am

Great, thank you both, very helpful

Topic		Replies	Views
Export GWAS summary statistics to a .txt file Hail Query & hailctl	8	1185	February 22, 2022
Long time to export UK Biobank GWAS result to tsv file Hail Query & hailctl	2	925	April 27, 2020
Export the results of agg.linreg Hail Query & hailctl	6	547	February 27, 2019
Unable to flatten sample/gene counts for table export Hail Query & hailctl	4	509	March 21, 2024
Export variants to a tsv file Hail Query & hailctl	6	410	June 18, 2022

Hail 0.2 export gwas results to tsv

Related topics