Ah, alright – same set up as Nate had when he got the same error. It looks like we never actually found the problem in #1419, it just started working on its own by recompiling. Could you try that and report back? Sorry it’s broken!
Gradle suite > Gradle test > is.hail.io.LoadBgenSuite.testBgenImportRandom FAILED
java.io.IOException at LoadBgenSuite.scala:137
Caused by: java.io.IOException at LoadBgenSuite.scala:137
Gradle suite > Gradle test > is.hail.stats.LogisticRegressionModelSuite.covariatesVsInterceptOnlyR FAILED
java.io.FileNotFoundException at LogisticRegressionModelSuite.scala:155
Running test: Test method covariatesVsInterceptOnlyTest(is.hail.stats.LogisticRegressionModelSuite)
Those two tests are failing because you don’t have the R and Plink test dependencies installed and on your path. See Running the Tests at bottom of Getting Started.
Do you get your original error if you increase the Java stack size as described here?
Following on with Jon’s commentary, we can more effectively debug if we have specific information about your system:
What operating system do the nodes of the HPC cluster run? If you can get on a node this command will give us the info we need:
uname -a
From the machine where you submit or invoke hail, can you post the output of:
echo $SPARK_HOME
echo $HAIL_HOME
echo $PYTHONPATH
Can you post the exact command you used to compile Hail? The minimal command for Spark 2.0.2 is listed below. I’m particularly interested in whether you specified -Dspark.version and to what it was set.
./gradlew shadowJar
Sorry again that you’re having so much trouble. Hopefully we can pin down exactly what Hail is tripping on and clean that up for you and for others in the future!
Alright, I think I know the source of this error. A Hail JAR is peculiar to a version of Spark. To work properly, you need to build with this invocation: