Slurm Jobs and Hail

Has anyone used Slurm as a submission system for Hail on a local cluster? I’m in the middle of setting it up and wanted to make sure I understand how to let Hail know it has X CPUs available with Y amount of memory. Thanks!

Hi Nate,
The real question is setting up Spark; running Hail is pretty easy once you’ve got a Spark cluster running.

I found a page here: https://www.princeton.edu/researchcomputing/faq/spark-via-slurm/

Unfortunately, setting up Spark on top of HPC submission engines isn’t super simple, and I think this is going to be a problem for a lot of people.

It seems like the approach would be to write a Python script that imports Hail, and submit that as described in the link. Or, use a Jupyter notebook attached to a running Spark instance. I don’t see any particular roadblock here, though I have yet to try it. If the cluster can run Spark, what other roadblocks are you anticipating?
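Here’s a rough, untested sketch of what such a script might look like, assuming Hail 0.2-style `hl.init()` (with 0.1 the entry point is `HailContext()` instead). The script name and master URL are placeholders for whatever your Slurm-launched Spark cluster reports:

```python
# submit_hail_test.py -- minimal smoke-test script (untested sketch).
# Submitted to the cluster with something like: spark-submit submit_hail_test.py
import hail as hl

# Point Hail at the Spark master started inside the Slurm allocation;
# the URL below is a placeholder.
hl.init(master='spark://<master-host>:7077')

# Trivial check that the cluster is actually doing work:
# build a small table and count its rows.
ht = hl.utils.range_table(1000)
print(ht.count())
```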

Nope! Hail is really just a Java JAR and a Python library. If you submit a Hail Python script with spark-submit and supply the JAR, everything should work. Let us know if things do go well! You’re not the first to have trouble with HPC systems.
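On the original question about telling Hail how many CPUs and how much memory it has: that’s really Spark configuration rather than Hail configuration. One option, sketched below (not something I’ve run on Slurm myself), is to read the allocation out of Slurm’s environment variables and pass it through `hl.init`’s `spark_conf` argument; the same properties can equally be set with spark-submit’s `--executor-cores` and `--executor-memory` flags. The variable names and fallbacks here are assumptions that depend on how the job requests resources:

```python
# Sketch only: wire Slurm's allocation into Spark via Hail's spark_conf.
# SLURM_CPUS_PER_TASK / SLURM_MEM_PER_NODE are only exported when the job
# requests resources with --cpus-per-task / --mem; the fallbacks are arbitrary.
import os
import hail as hl

cpus = os.environ.get('SLURM_CPUS_PER_TASK', '4')
mem_mb = os.environ.get('SLURM_MEM_PER_NODE', '16000')  # value is in megabytes

hl.init(spark_conf={
    'spark.executor.cores': cpus,
    'spark.executor.memory': mem_mb + 'm',
    'spark.driver.memory': mem_mb + 'm',
})
```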

Great, thank you! I’ll try it out today and report back with what I find.