Performance of writing matrixtable on 0.2

tpoterba · September 29, 2018, 12:24am

For now, you can also get this down to O(N * log2 N) by doing a tree union, i.e. merging the tables pairwise over repeated rounds until one is left:

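import hail as hl

# Work on a copy so the original list `mts` is left untouched.
mts_ = mts[:]

iteration = 0
while len(mts_) > 1:
    iteration += 1
    print(f'iteration {iteration}')
    tmp = []
    for i in range(0, len(mts_), 2):
        pair = mts_[i:i+2]
        # union_cols is an instance method taking one other MatrixTable;
        # an unpaired table at the end is carried to the next round as-is.
        tmp.append(pair[0].union_cols(pair[1]) if len(pair) == 2 else pair[0])
    mts_ = tmp
[final_mt] = mts_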
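
To see the shape of the reduction without needing a Hail cluster, here is a small sketch of the same pairwise pattern on plain Python lists. The helper name `tree_reduce` and the `combine` callback are made up for illustration and are not part of Hail; it also returns the number of rounds, which grows like log2 of the number of inputs.

```python
from typing import Callable, List, Tuple, TypeVar

T = TypeVar("T")


def tree_reduce(items: List[T], combine: Callable[[T, T], T]) -> Tuple[T, int]:
    """Merge items pairwise, round by round, until one is left.

    Hypothetical helper (not a Hail API). Returns the merged result and
    the number of rounds that were needed.
    """
    if not items:
        raise ValueError("tree_reduce needs at least one item")
    level = list(items)  # copy so the caller's list is not modified
    rounds = 0
    while len(level) > 1:
        rounds += 1
        nxt = []
        for i in range(0, len(level), 2):
            pair = level[i:i + 2]
            # A leftover unpaired item is carried into the next round unchanged.
            nxt.append(combine(pair[0], pair[1]) if len(pair) == 2 else pair[0])
        level = nxt
    return level[0], rounds


# Stand-in for single-sample tables: each "table" is a list of sample IDs,
# and list concatenation plays the role of the column union.
merged, rounds = tree_reduce([[f"s{i}"] for i in range(5)], lambda a, b: a + b)
print(merged, rounds)
```

With real MatrixTables the same helper could presumably be called as `tree_reduce(mts, lambda a, b: a.union_cols(b))`, assuming all tables share the same row keys and schema.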
