Sorry for the long delay on my reply, @atebbe
Let’s recall your Spark classpath settings:
```
spark.driver.extraClassPath …:./hail-all-spark.jar
spark.executor.extraClassPath …:./hail-all-spark.jar
```
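As a quick sanity check, you can print what the running context actually sees. This is just a sketch, assuming `sc` is the SparkContext behind your notebook session:

```python
# Sketch: print the effective classpath settings of the live SparkContext.
# Assumes `sc` is the SparkContext your Hail/Jupyter session is using.
for key in ("spark.driver.extraClassPath", "spark.executor.extraClassPath"):
    print(key, "=", sc.getConf().get(key, "<unset>"))
```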
These assert that, on both the driver and the executors, the jar is located in the working directory of the respective process. If you ssh to one of your executors and find the Spark job working directory (try looking in `/var/run/spark/work`), I suspect you will not find `hail-all-spark.jar` in that directory. While you’re at it, can you open a terminal in your Jupyter notebook and verify that `hail-all-spark.jar` is indeed in the working directory of your driver (the notebook kernel is your driver process)?
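If ssh-ing around is a pain, here is a rough sketch you could run from a notebook cell instead; it assumes `sc` is your SparkContext and that the jar is named exactly `hail-all-spark.jar`:

```python
import os

# Driver side: the notebook kernel's working directory.
print("driver:", os.getcwd(), os.path.exists("hail-all-spark.jar"))

def check(_):
    # Executor side: the working directory of the task's executor process.
    return (os.getcwd(), os.path.exists("hail-all-spark.jar"))

# Run the same check inside a handful of tasks and collect the distinct answers.
print("executors:", sc.parallelize(range(8)).map(check).distinct().collect())
```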
This StackOverflow post suggests that `addFile` is inappropriate for “runtime dependencies”.
So. Assuming the jar is indeed missing from the working directory of your executors, we need to figure out how to get it there.
First, try `sc._jsc.addJar` instead of `sc.addFile`.
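Something like this, where the path is a placeholder for wherever the jar actually lives on your driver machine; `addJar` registers the jar as a task dependency, whereas `addFile` only ships a plain file:

```python
# Sketch: register the jar with the underlying JavaSparkContext so it is
# shipped to executors as a task dependency. The path below is a placeholder.
sc._jsc.addJar("/path/to/hail-all-spark.jar")
```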
If that fails, Apache Toree suggests using the `%AddJar` magic invocation to add a jar.
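For reference, a Toree cell would look roughly like the line below; note that Toree is a Scala kernel and `%AddJar` expects a URL, so double-check the Toree docs before leaning on this from a Python notebook:

```
%AddJar file:///path/to/hail-all-spark.jar
```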