Has your colleague found any faster solutions to loading the genotype data? Currently it appears we cannot use the BGEN files with hail because of the compression type (discussion here) and for pVCFs compression format also is an issue (only works with forse=true, which is slow and the docs say is highly discouraged). The only other format is PLINK which I’ll try.
RossDeVito
17
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| Issues writing matrix table from filtered pVCF (UK Biobank data) | 1 | 495 | September 22, 2022 | |
| Small MatrixTable hangs on write into Google bucket | 13 | 980 | September 5, 2019 | |
| Can't export to plink/bgen/vcf on DNAnexus | 5 | 687 | September 28, 2022 | |
| How should I use Hail on the DNANexus RAP? | 10 | 2461 | March 5, 2025 | |
| Fail to retrieve row information of Hail matrix.table | 5 | 550 | July 22, 2022 |