Set up spark.driver.memory in hail0.2 via hailctl

shuang · July 21, 2020, 11:28am

Hi,

I tried:

hailctl dataproc start hail-test
–master-machine-type n1-highmem-8
–master-boot-disk-size 500
–num-workers 2
–worker-machine-type n1-highmem-16
–worker-boot-disk-size 500
–region europe-west1
–zone europe-west1-b
–max-idle 60m
–scopes cloud-platform
–properties “spark:spark.driver.extraJavaOptions=-Xss4M,spark:spark.executor.extraJavaOptions=-Xss4M,spark:spark.driver.memory=100g,spark:spark.driver.maxResultSize=100g,spark:spark.task.maxFailures=20,spark:spark.kryoserializer.buffer.max=2g”

But I noticed, when command run, it actually generate spark.driver.memory=41g

–properties=^|||^spark:spark.task.maxFailures=20|||spark:spark.driver.extraJavaOptions=-Xss4M|||spark:spark.ex
ecutor.extraJavaOptions=-Xss4M|||spark:spark.speculation=true|||hdfs:dfs.replication=1|||dataproc:dataproc.logging.
stackdriver.enable=false|||dataproc:dataproc.monitoring.stackdriver.enable=false|||spark:spark.driver.memory=41g|||
spark:spark.driver.maxResultSize=100g|||spark:spark.kryoserializer.buffer.max=2g
–initialization-actions=gs://hail-common/hailctl/dataproc/0.2.49/init_notebook.py \

However, my task aborted with error code 134, out-of-memory issue. (with 41g)

I noticed spark doc said I need to set tag --driver-memory, however this didn’t work via hailctl.

How could I specific spark.driver.memory to 100g and what is the limitation for it?

Thanks for your time and any help are welcome.
Best.

kumarveerapen · July 30, 2020, 3:24pm

Hello, @shuang

For some reason, I thought this was answered in a separate thread. Was it?

Otherwise, if I recall the --driver-memory specifications can go up to 96G. Unless if you have tried it and it still errored out, I can further look into this for you.

shuang · August 4, 2020, 1:26pm

Hi @kumarveerapen Sorry for my slow response and thx for you answer.

I noticed the reason is my master-machine-type is too small which limited my available spark.driver.memory.

kumarveerapen · August 7, 2020, 2:04am

Great!

Topic		Replies	Views
Java Heap Space out of memory Hail Query & hailctl	5	3688	August 10, 2020
Hail configuration Hail Query & hailctl	2	400	September 22, 2020
How do I increase the memory or RAM available to the JVM when I start Hail through Python? Hail Query & hailctl	2	5497	March 4, 2021
Ld_prune OutOfMemoryError: Java heap space Hail Query & hailctl	5	697	January 21, 2020
"Hail off-heap memory exceeded maximum threshold" error on large analysis job Hail Query & hailctl	1	310	April 18, 2023

Set up spark.driver.memory in hail0.2 via hailctl

Related topics