Hi,
I am working on ‘RHEL Server release 7.7’. I installed the recommended ‘miniconda3’, then created python 3.7 virtual environment:
conda create -n py37 python=3.7
Then I installed hail using pip the way it is recommended:
pip install hail
py-spark and ipython:
conda install -c conda-forge pyspark
conda install -c anaconda ipython
I have also downloaded ‘Spark2.4.4’, started one master and one slave. Then, I tried to run the basic script in various ways:
import hail as hl
mt = hl.balding_nichols_model(n_populations=3, n_samples=50, n_variants=100)
mt.count()
But it just gets stuck on the second line in Hail and Spark init and does not go anywhere further. No log output, no error.
Whether I set $SPARK_HOME or not, it does not fix it. I also set the path to HAIL jar directly when running it with spark-submit but the result is the same:
spark-submit --master spark://ai-grisnodedev1:7077 --verbose --conf spark.driver.port=40065 --driver-memory 4g --conf spark.driver.extraClassPath=/opt/seqr/.conda/envs/py37/lib/python3.7/site-packages/hail/hail-all-spark.jar --conf spark.executor.extraClassPath=./hail-all-spark.jar test_hail.py
Or
spark-submit --master spark://ai-grisnodedev1:7077 --verbose --conf spark.driver.port=40065 --driver-memory 4g --conf spark.driver.extraClassPath=/opt/seqr/.conda/envs/py37/lib/python3.7/site-packages/hail/hail-all-spark.jar --conf spark.executor.extraClassPath=/opt/seqr/.conda/envs/py37/lib/python3.7/site-packages/hail/hail-all-spark.jar test_hail.py
test_hail.py just contains the 3 lines of the sample code.