Hi, I am trying to run PC-Relate followed by hl.maximal_independent_set on a curated set of SNPs from the UK Biobank dataset. This resulted in the error message below, which I think means the task ran out of memory. There is 480GB already assigned to the job on the cluster, so I am unsure how to increas…

I would agree they are a little difficult to debug! From reading online, I suspect it is running out of memory. I have uploaded the log file here: log.log (Google Drive file).

Hi, I was wondering if you had any updates regarding this. The same problem is ongoing. Kind regards, Sam

Hi, just following up again. I would really like to be able to use your PC-Relate implementation on our local cluster. This is proving tricky due to a variety of ‘heartbeat’-type errors; I think it is running out of memory (there is ~3TB on this node). From what I can see the command seems to put a …

Hey @samkleeman1, sorry for the delay. The relevant exception is this one:

java.lang.OutOfMemoryError: Java heap space
    at scala.reflect.ManifestFactory$$anon$12.newArray(Manifest.scala:141)
    at scala.reflect.ManifestFactory$$anon$12.newArray(Manifest.scala:139)
    at breeze.linalg.DenseMatrix$.zer…

Okay, many thanks for the update. I am trying to run the script below. I have pre-computed the PCA vectors using the PLINK2 approx function to save computational time. I am still unclear whether I need to set executor memory on a local instance.

import hail as hl
import os
import pandas as pd
import sub…

Yes, the command is listed above; copied below:

os.environ["PYSPARK_SUBMIT_ARGS"] = "--conf spark.network.timeout=5m --conf spark.executor.heartbeatInterval=1m --conf spark.memory.fraction=1.0 --driver-memory 2880g --executor-memory 5g pyspark-shell"

Hmm, I haven’t tried setting it in the python script, but if it seems to work for you then let’s stick with it. How many executors do you have? Setting that to 5g seems really low. How many cores does each executor have? I’d reduce the driver memory dramatically. There’s not a lot of work that happ…

This is being run on a node with 96 cores and 3TB of RAM. So I guess this means I have either 96 executors with one core each or 1 executor with 96 cores; I am unsure, to be honest. I have found that altering driver memory seems to determine the amount of memory available in the Spark executors page, see a…

Nobody is ever sure with Spark ;). Hmm. It looks like you just have a driver and no executors? In that case, it would seem that you’ve already supplied all available memory to the job. Can you share the Hail log file? Nothing seems unusual about this to me. You should have lots of excess memor…
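To put rough numbers on the "driver and no executors" observation, here is a small arithmetic sketch using the figures quoted in the thread (96 cores, roughly 3TB of RAM, --driver-memory 2880g). Treating 3TB as 3072GB is an assumption, and `heap_per_core_gb` is a hypothetical helper, not part of Hail or Spark. With a single driver JVM, all tasks share one heap, so the per-core figure is only an average and not a hard partition.

```python
def heap_per_core_gb(driver_memory_gb: float, cores: int) -> float:
    """Average driver heap per core when every task runs inside one JVM."""
    return driver_memory_gb / cores


# Figures quoted in the thread; 3TB taken as 3 * 1024 GB (an assumption).
node_ram_gb = 3 * 1024
cores = 96
driver_memory_gb = 2880

per_core = heap_per_core_gb(driver_memory_gb, cores)
headroom_gb = node_ram_gb - driver_memory_gb  # left for the OS and off-heap use

print(f"average heap per core: {per_core} GB")   # 30.0 GB
print(f"RAM outside the JVM heap: {headroom_gb} GB")  # 192 GB
```

A single large allocation (such as the dense matrix in the stack trace) is not bounded by the per-core average, so this only indicates how much heap concurrent tasks have to share.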

Hl.maximal_independent_set - job 'cancelled because SparkContext was shut down'


danking January 22, 2021, 4:52pm 8

Ah, sorry, I didn’t realize you were running on a single machine. Are you already setting PYSPARK_SUBMIT_ARGS?
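For readers following along, a minimal sketch of setting PYSPARK_SUBMIT_ARGS from Python on a single machine. The helper `build_submit_args` is hypothetical, introduced only for illustration. The timeout values and the 2880g figure are copied from the thread; leaving out `--executor-memory` and `spark.memory.fraction` is an assumption made here (on a single machine with no separate executors, the driver heap is the one that matters), not a confirmed fix. The variable is read when the JVM is launched, so it has to be set before Hail starts Spark.

```python
import os


def build_submit_args(driver_memory_gb, conf=None):
    """Assemble a PYSPARK_SUBMIT_ARGS string (hypothetical helper)."""
    parts = []
    for key, value in (conf or {}).items():
        parts += ["--conf", f"{key}={value}"]
    # The string must end with "pyspark-shell" for PySpark to accept it.
    parts += ["--driver-memory", f"{driver_memory_gb}g", "pyspark-shell"]
    return " ".join(parts)


submit_args = build_submit_args(
    2880,  # driver heap in GB, value taken from the thread
    {
        "spark.network.timeout": "5m",
        "spark.executor.heartbeatInterval": "1m",
    },
)
os.environ["PYSPARK_SUBMIT_ARGS"] = submit_args
print(submit_args)

# Import and initialise Hail only after the variable is set, e.g.:
# import hail as hl
# hl.init()
```

The value actually in effect can be checked afterwards on the Spark UI's Environment and Executors pages.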
