Apologies in advance for misusing terminology; I am rather new to genetics work. My ultimate goal is to generate intermediate data so that I can perform my own GWAS with a prototype niche system I am developing that uses Hail for filtering and preprocessing. Previously I used the 1kg sample data from the GWAS tutorial and its annotations (isFemale and PurpleHair) for logistic regression with one covariate. This worked well for development purposes, but now I wish to use a lot more data to test my GWAS system. In the schema for your datasets, it looks like only the allele information is available, and I could not find where the matching annotations/variant data might be.
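For context, the tutorial-based setup described above can be sketched roughly as follows. This is a minimal sketch rather than my exact code: the function name `run_tutorial_gwas` and the `data_dir` parameter are made up for illustration, and the file names (`1kg.mt`, `1kg_annotations.txt`) and the `Sample` key column are assumed to match what the GWAS tutorial downloads.

```python
import os


def run_tutorial_gwas(data_dir="data/"):
    """Sketch of the tutorial-style workflow (hypothetical helper, not a Hail API)."""
    # Imported lazily so the sketch can be read or imported without Hail installed.
    import hail as hl

    hl.init()

    # Download the small 1kg subset and its annotation file used by the tutorial.
    hl.utils.get_1kg(data_dir)
    mt = hl.read_matrix_table(os.path.join(data_dir, "1kg.mt"))

    # The sample annotations live in a separate text file, keyed here by sample ID.
    annotations = hl.import_table(
        os.path.join(data_dir, "1kg_annotations.txt"), impute=True
    ).key_by("Sample")

    # Join the annotations onto the columns (samples) of the genotype matrix.
    mt = mt.annotate_cols(pheno=annotations[mt.s])

    # Wald logistic regression of PurpleHair on the alternate-allele count,
    # with an intercept term and isFemale as the single covariate.
    return hl.logistic_regression_rows(
        test="wald",
        y=mt.pheno.PurpleHair,
        x=mt.GT.n_alt_alleles(),
        covariates=[1.0, mt.pheno.isFemale],
    )
```

Calling `run_tutorial_gwas()` should return a Hail table of per-variant results. The step I do not know how to reproduce for the larger datasets is the `annotate_cols` join, since I cannot find an equivalent annotation file.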
My question is: where can I find the annotation/variant data and how do I load this data and merge it with the allele data?
Hopefully this makes sense. Please let me know if I am misunderstanding something and the answer is right under my nose!