Performing SampleQC using Hail on ~500k WES samples


I am wondering if you have any suggestions for how to proceed with using Hail to perform sample QC on roughly 500k WES samples from the UK Biobank database.

The database contains WES data in PLINK format (fractioned by chr), BGEN format (fractioned by chr), and VCFs (fractioned both by individual participants, and fractioned by chr region).

My colleague has suggested generating individual MTs for each of the individual participant VCFs, and then performing a union of all of the individual MTs, but I am unclear if there would be a more efficient method of aggregating the data for sample QC and then variant QC.

Do you have any suggestions for how to utilize hail efficiently at this scale? Any help would be appreciated. Thank you!

Hey @mgarcia !

A couple things:

  • No matter what, this will be a very expensive analysis. 500k samples is a huge data set. Quality control is an iterative and exploratory process, and Hail is complex software; you’ll probably make mistakes as you learn to use it.
  • Where/how did you get a VCF partitioned by sample? How that VCF was produced will matter a lot to how best to work with it.
  • We use the term “partition(ed)” to refer to splitting a dataset into multiple files.
  • You have to use a Spark cluster with lots of cores to do this effectively. Unless your institution can provide such a thing, you’ll need to use the Cloud. We recommend Google Cloud and Google Cloud Dataproc (Google’s Spark product).

Frankly, single-sample VCFs sound like a nightmare to me. Unless those are single-sample “GVCFs” (these are produced by variant-calling software like GATK or DRAGEN and contain things called “reference blocks”), you should probably try to use some format that hasn’t been split by sample.

Are you sure you have WES in BGEN format? BGEN generally doesn’t have a way to represent sequencing information like depth, allele depth, and phred-scaled likelihoods.

Hi @danking,

Thank you for your response. To answer your first question, I have attached this excerpt from the UK Biobank that references the first 200k exomes they released (same protocol for the full 500k):

Primary and secondary analysis for the UKB 200k release was performed with an updated Functional Equivalence (FE) protocol that retains original quality scores in the CRAM files (referred to as the OQFE protocol). The OQFE protocol aligns and duplicate-marks all raw sequencing data (FASTQs) to the full GRCh38 reference in an alt-aware manner as described in the original FE manuscript (Functional equivalence of genome sequencing analysis pipelines enables harmonized variant calling across human genetics projects - PubMed). The OQFE CRAMs were then called for small variants with DeepVariant to generate per-sample gVCFs. These gVCFs were aggregated and joint-genotyped with GLnexus to create a single multi-sample VCF (pVCF) for all UKB 200k samples. PLINK files were derived directly from this pVCF. Please note: to ensure that the UKB 200k data supports a broad range of analyses, no variant- or sample-level filters were pre-applied to the pVCF or PLINK files. The publicly released pVCF is the direct output of GLnexus, from which the PLINK files are generated. The pVCF contains allele-read depths and genotype qualities for all genotypes, from which variant- and sample-level QC metrics can be calculated and to which analysis-specific filters can be applied. Examples of such filtering are described in the UKB 200K preprint.

This might be a novice question, but why exactly is it preferable to use a GVCF in these large-scale analyses over a VCF? I understand that a GVCF keeps records of all sites, whether or not there exists a variant call, but how exactly is that advantageous?

The UKBB uses AWS with an integrated Spark cluster in their JupyterLab tool.

According to the UKBB’s documentation, the BGEN files were derived from the PLINK files (derived from the pVCF):
“plink2 --bfile pvcf.norm --export bgen-1.2 bits=8 ref-first --out pvcf.norm_zlib”

I greatly appreciate your help. I am very new to bioinformatics as a whole, and this is incredibly useful.

OK, I understand your situation. The answer will take a bit of writing to properly describe, and I’m a bit busy right now. Hopefully I’ll have time tomorrow to type it all out.

A post was split to a new topic: How do I use hl.import_vcf to import a VCF that has been partitioned into multiple files?

OK, I’ve been meaning to write most of this out anyway, so this is a good chance to do that. I start with some background and definitions before giving you practical advice.

We view sequencing as a series of transformations of data:

DNA ==sequencer=> FASTQ ==alignment=> BAM/CRAM ==variant caller=> GVCF ==joint caller=> PVCF

The FASTQ, BAM/CRAM, and GVCF datatypes all contain one sample per file. A PVCF contains zero or more samples per file.

The conversion from BAM/CRAM to GVCF is lossy. The reads are replaced by a set of statistics at each locus. For example, every locus will have a depth (DP) representing the number of reads that overlapped that locus. Every locus will also have an allele depth (AD) representing, per allele, the number of reads that were evidence for that allele at that locus. For example, at a heterozygous locus, you might have a depth of 30, a reference allele depth of 14, and an alternate allele depth of 15. Notice that there was one so-called “uninformative” read.
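To make the DP/AD arithmetic concrete, here is a small sketch in plain Python (not Hail) that checks the total depth against the per-allele depths and computes the allele balance, using the numbers from the example above:

```python
# Per-sample statistics at one heterozygous locus (values from the example above).
DP = 30           # total read depth at the locus
AD = [14, 15]     # reads supporting the reference and alternate allele

# Reads counted in DP but not assigned to any allele are "uninformative".
uninformative = DP - sum(AD)

# Allele balance: fraction of informative reads supporting the alternate allele.
# A heterozygous call is usually expected to have an allele balance near 0.5.
allele_balance = AD[1] / sum(AD)

print(uninformative)             # 1
print(round(allele_balance, 3))  # 0.517
```

Metrics like allele balance, derived from exactly these fields, are a common ingredient in genotype-level QC.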

Moreover, the GVCF format is sparse. Instead of representing every locus with one line of the file, intervals of homozygous reference loci are represented as a single line. These intervals are called “reference blocks”. Reference blocks have no alternate alleles, so there is no allele depth (AD) or phred-scaled likelihood (PL). Instead there is usually an overall depth (DP) and a genotype quality (GQ). The latter indicates our confidence in the homozygous reference calls in this interval. For example, if we only see five overlapping reads, we might not be confident that these are reference calls.

Homozygous reference quality information is critical. If we entirely elided those reference blocks, we would be unable to confidently say that the sample is homozygous reference at that locus. The sample could be either homozygous reference or “no call”. The latter is a form of missing data: we did not reliably observe the genotype at that locus for this sample.

A PVCF is always created from two or more GVCFs. The conversion from GVCF to PVCF is also lossy because the reference blocks are eliminated. This is not a problem for GWAS because the phenotype, which varies per sample, cannot correlate with a locus lacking variation.

The loss of reference blocks prevents adding new samples to an existing PVCF or combining two PVCFs. In either case, there may exist a variant at which one dataset has a variant record and the other dataset has no record at all. In that case, for samples without a record, we know only that they have either a homozygous reference call or “no call” (aka unknown genotype aka missing data).

In terms of bits-and-bytes, GVCF and PVCF are both valid VCFs though each has a different set of conventions. Most tools cannot process every valid VCF. Instead, they make assumptions about the VCF structure and only work on VCFs that satisfy those assumptions. There is a detailed specification of VCF version 4.3. There are also a number of descriptions of GVCF: Broad 1, Broad 2, and Illumina.

There is more to say about PVCFs, but let me first address PLINK and BGEN. There are actually two PLINK formats: PLINK1 (bed) and PLINK2 (pgen). I am not too familiar with either format, but I understand that they are meant to store “hard calls” meaning just the genotypes or the dosages. In particular, they cannot store sequencing metadata such as DP, AD, and PL. I believe PLINK2 supports multi-allelic sites. There are several versions of BGEN, but they are all essentially designed to store the genotype probabilities. BGEN 1.2 and later supports multi-allelic variants. Again, as far as I know, there is no support for arbitrary sequencing metadata fields.

We already noted that two PVCFs cannot be combined, but there is yet another problem: the typical representation of sequencing data in PVCFs grows super-linearly in the number of samples. In particular, at multi-allelic sites, each sample often adds another alternate allele. As a result, the AD array grows roughly linearly in the number of samples. Moreover, the PL array, which has an entry for every possible pair of alleles, grows roughly quadratically in the number of samples. The true relationship is a bit complicated, but empirically, we have observed super-linear scaling of PVCFs in samples.
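The quadratic growth of PL follows directly from the VCF genotype ordering: for n alleles (reference plus alternates) there are n(n+1)/2 possible unphased diploid genotypes, so the PL array has that many entries per sample. A quick sketch:

```python
def pl_length(n_alleles: int) -> int:
    """Number of PL entries for an unphased diploid genotype:
    one entry per unordered pair of alleles, i.e. n(n+1)/2."""
    return n_alleles * (n_alleles + 1) // 2

# A biallelic site has 3 PL entries per sample; a multi-allelic site
# that has accumulated 100 alternate alleles (101 total) has 5151.
for n in [2, 11, 101]:
    print(n, pl_length(n))  # 2 3, 11 66, 101 5151
```

So if the number of alternate alleles at a site grows roughly with the number of samples, the per-sample PL storage at that site grows quadratically.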

GLnexus mitigates the size problem by eliding the PL array for homozygous reference calls. I’m not sure what other joint-callers do. In our experience, a PVCF with real PLs of 100,000 samples is prohibitively large. So large that, as far as we know, no tool can produce such a PVCF. There is a stalled discussion about modifying the VCF spec to encode AD and PL in O(1) space in the samples.

Hail has three fundamental data types: the Table, the MatrixTable, and the BlockMatrix. A table is what it sounds like. A BlockMatrix is for linear algebra. The MatrixTable is, essentially, a totally generic version of a VCF. When you run import_vcf you get a MatrixTable. MatrixTables are intended for use with “dense” data, such as normal PVCFs.
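For example, importing a block-gzipped PVCF produces a MatrixTable. This is only a sketch; the path is a placeholder for wherever your pVCF lives:

```python
import hail as hl

hl.init()

# Import a multi-sample pVCF as a MatrixTable. force_bgz tells Hail to
# treat .gz files as block-gzipped, which is required for parallel reads.
mt = hl.import_vcf(
    'path/to/ukb.pvcf.chr21.vcf.gz',  # placeholder path
    reference_genome='GRCh38',
    force_bgz=True,
)

# Rows are variants, columns are samples, entries are genotypes.
mt.describe()
```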

On top of these fundamental data types, we have built the Variant Dataset or VDS. A VDS is combinable: one or more VDSes may be combined with one or more GVCFs to cheaply produce a new VDS. A VDS succinctly represents AD, PL, and other sequencing metadata. The size of a VDS grows linearly in the samples and variants.
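As a sketch, building a VDS from GVCFs uses the Hail VDS combiner; all paths below are placeholders, and for real data you would pass the full list of per-sample GVCFs:

```python
import hail as hl

hl.init()

# Combine per-sample GVCFs into a single Variant Dataset (VDS).
# All paths here are placeholders.
combiner = hl.vds.new_combiner(
    output_path='gs://my-bucket/dataset.vds',
    temp_path='gs://my-bucket/tmp/',
    gvcf_paths=[
        'gs://my-bucket/gvcfs/sample1.g.vcf.gz',
        'gs://my-bucket/gvcfs/sample2.g.vcf.gz',
    ],
    reference_genome='GRCh38',
    use_exome_default_intervals=True,
)
combiner.run()

vds = hl.vds.read_vds('gs://my-bucket/dataset.vds')
```

Because a VDS is combinable, the same combiner can later take this VDS plus new GVCFs to cheaply produce an extended VDS.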

Generating a VDS from exomes costs 0.01 USD per exome on Google Dataproc, using autoscaling policies. Generating a VDS from genomes costs 0.10 USD per genome.

If you do not intend to add new samples to a dataset and your dataset contains fewer than 10,000 samples, you should generate a PVCF. This representation is dense and therefore easier to use.

If someone has already produced a PVCF for you, you should use that. But be aware! Because of the issues stated above, some tools or methods (including those in Hail!) might not work correctly because fields are missing.

If your goal is a GWAS and someone you trust has already produced a set of a high-quality variants and samples, I recommend filtering the BGEN or PLINK file to those samples and using that for GWAS.

If you want to do your own QC or do something different than GWAS, then Hail might be a good tool. You should use the PVCF that already exists.
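In that case, a first pass at sample QC might look like the following sketch. The path is a placeholder and the thresholds are illustrative, not recommendations:

```python
import hail as hl

hl.init()

# Import the existing pVCF (placeholder path).
mt = hl.import_vcf('path/to/ukb.pvcf.chr21.vcf.gz',
                   reference_genome='GRCh38',
                   force_bgz=True)

# hl.sample_qc annotates each sample with call rate, mean DP/GQ,
# het/hom counts, Ti/Tv ratio, and other per-sample metrics.
mt = hl.sample_qc(mt)

# Example filter: drop samples with low call rate or low mean depth.
# These thresholds are illustrative only.
mt = mt.filter_cols((mt.sample_qc.call_rate >= 0.97) &
                    (mt.sample_qc.dp_stats.mean >= 20))

# Export the per-sample metrics for inspection.
mt.cols().select('sample_qc').export('sample_qc_metrics.tsv')
```

Start on a single chromosome (or an even smaller interval) until the pipeline behaves as expected, then scale out.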

Working in the cloud is really challenging. I have never used AWS for analysis, so my suggestions are necessarily vague. Make sure to use an auto-scaling cluster. Start small with just a handful of variants until you feel confident in your analysis. Do not convert a BGEN file to a Hail MatrixTable. BGEN is a more succinct format than Hail MatrixTable format. Hail has great support for BGEN. I wrote more details about using Hail on the cloud here.
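For BGEN specifically, Hail reads the format natively after a one-time indexing step, so there is no need to convert it to MatrixTable format on disk. A sketch, with placeholder paths:

```python
import hail as hl

hl.init()

# One-time step: build a Hail index for the BGEN file (placeholder paths).
hl.index_bgen('path/to/ukb.chr21.bgen', reference_genome='GRCh38')

# Read the BGEN directly; rows, columns, and entries behave like any
# other MatrixTable, but the data stays in BGEN format on disk.
mt = hl.import_bgen('path/to/ukb.chr21.bgen',
                    entry_fields=['GT', 'dosage'],
                    sample_file='path/to/ukb.chr21.sample')
```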

Just to be crystal clear: If you have a PVCF not partitioned by sample but partitioned by variant, that is a fine choice. I am apprehensive about a PVCF that is partitioned by both sample and variant. I have never heard of that and I suspect it will create headaches downstream.

Thank you so much for your incredibly detailed response!
Yes, there is a PVCF partitioned by variant that appears to be our best choice moving forward. We greatly appreciate the help.
