Spark tuning for Elasticsearch export?

nicklecompteBCH · February 14, 2020, 12:55am

Pleased to report that setting spark.dynamicAllocation.enabled = false fixed the Elasticsearch slowness/crashing problem without me having to fiddle with other memory settings. It seems that by default AWS EMR will turn this on, and (along with who knows what other AWS oddness) it caused problems in the export. Thanks to @pavlos for the tip (Small MatrixTable hangs on write into Google bucket).

Topic		Replies	Views
Could not able to export the data to ElasticSearch Hail Query & hailctl	25	4991	March 14, 2019
Memory issue in Hail Help [0.1]	12	1499	September 20, 2017
Export ElasticSearch error Hail Query & hailctl	4	567	November 4, 2020
PCA job aborted from SparkException Hail Query & hailctl	46	2660	July 28, 2020
Executor Lost Failure when writing out a MT for WGS pvcf Hail Query & hailctl	3	392	February 6, 2023

Spark tuning for Elasticsearch export?

Related topics