Hello Everyone,
I am new to Hail and bioinformatics and am trying to learn the basics of VCF file analysis, so I am following the tutorial provided by Hail here.
Though all this works, I don’t understand the inner workings. For example, when we write a GROUP BY statement in SQL, we can see how the logic works from tutorials that explain the concept with a small sample dataset, which we can also validate manually.
Similarly, does Hail have a tutorial that explains how aggregate functions actually work, using small sample data? For someone like me who is new to this domain, functions like n_alt_alleles() and is_defined(mt.GT) are kind of a black box.
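To make the question concrete, this is the kind of small, hand-checkable example I have in mind. It is plain Python, not Hail, and it only reflects my current understanding: that n_alt_alleles() counts the non-reference alleles in a genotype call, and that aggregating is_defined(mt.GT) as a fraction gives the share of non-missing calls. The toy data and the helper names here are my own, so please correct me if the semantics are off.

```python
# Toy genotype calls for one variant across five samples (made-up data).
# Each call is a tuple of allele indices: 0 = reference, >0 = alternate.
# None stands in for a missing (undefined) genotype.
calls = [(0, 0), (0, 1), (1, 1), None, (1, 2)]


def n_alt_alleles(call):
    """Count non-reference alleles in one call; a missing call stays missing."""
    if call is None:
        return None
    return sum(1 for allele in call if allele > 0)


def fraction_defined(values):
    """Share of entries that are not missing (a call-rate style aggregate)."""
    return sum(v is not None for v in values) / len(values)


alt_counts = [n_alt_alleles(c) for c in calls]
call_rate = fraction_defined(calls)

print(alt_counts)  # [0, 1, 2, None, 2]
print(call_rate)   # 0.8
```

With five samples I can check every number by hand, which is what I would like to be able to do with the Hail versions too.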
Is it possible to use only the first 10 records from a huge VCF file in a matrix table? How can I get only the first 10 records from a large VCF file? I tried doing it but I am getting an error…
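One workaround I am considering, in case there is no direct way inside Hail, is to cut the file down before importing it. Below is a minimal sketch in plain Python, assuming an uncompressed text VCF (a compressed file would need to be opened differently). The function name head_vcf and the file paths are hypothetical.

```python
def head_vcf(src_path, dst_path, n_records=10):
    """Copy all header lines plus the first n_records data lines of a text VCF.

    Header lines start with '#'; everything after them is one variant per line.
    Returns the number of data lines actually written.
    """
    kept = 0
    with open(src_path) as src, open(dst_path, "w") as dst:
        for line in src:
            if line.startswith("#"):
                # Keep the full header so the output is still a valid VCF.
                dst.write(line)
                continue
            if kept >= n_records:
                # Stop early; the rest of the large file is never read.
                break
            dst.write(line)
            kept += 1
    return kept


# Hypothetical usage:
# head_vcf("big.vcf", "first10.vcf", n_records=10)
```

The small output file could then be imported the same way as the original, but I would still prefer to know whether Hail can do this on the matrix table itself.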