Clumping variants to calculate polygenic risk scores

armartin · January 9, 2018, 9:54pm

It would be helpful to calculate polygenic risk scores if a clump variants function existed. This is a greedy algorithm described in plink here: https://www.cog-genomics.org/plink/1.9/postproc#clump

Basically: 1) compute r^2 (LD) between pairs of variants, 2) rank order summary stats by p-values from most to least significant, 3) descend down variant list storing variants along the way: if no stored variant is in LD with current variant, store it. Otherwise chuck out any variants in LD with a more significant stored variant. This function comes in handy for some other tasks as well. Alternatively (even better but probably much harder): LDPred

kkarbasi · January 20, 2021, 9:58pm

I second this!

Topic		Replies	Views
Applying externally generated polygenic risk scores to a VDS Hail Query & hailctl	1	686	November 24, 2018
Running polygenic risk scoring with VariantDataset Science	3	600	May 15, 2023
Polygenic risk score calculation Hail Query & hailctl	2	537	November 2, 2020
Calculate PGS score from UKB data using PGS Catalog Science	2	464	November 13, 2023
Linear regression burden tests, collapsing genotypes by variant key Updates	1	1656	October 4, 2018

Clumping variants to calculate polygenic risk scores

Related topics