I hope we can use this thread to collect autopsies of dead workers. Specifically: why did the worker die, and should I increase memory or disk?
I want to point out that a useful diagnostic tool is the Spark UI, which is often available at localhost:4040 while the application is running (http://spark.apache.org/docs/latest/monitoring.html). We can start the conversation by having a look at it.
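
For anyone who would rather script this than click through the UI, the monitoring page linked above also describes a REST API served under /api/v1 on the same port. Below is a rough sketch, not a definitive tool, that lists executors which are no longer active along with their memory and disk figures. It assumes the driver UI is on localhost:4040; the helper names (fetch_json, dead_executors, main) are my own, and the removeReason field may be missing on older Spark versions, so it is read defensively.

```python
import json
from urllib.request import urlopen

# Assumption: the driver UI is reachable on the default port.
BASE = "http://localhost:4040/api/v1"


def fetch_json(url):
    """GET a URL and decode the JSON body."""
    with urlopen(url, timeout=5) as resp:
        return json.load(resp)


def dead_executors(executors):
    """Summarize every executor entry that is no longer active."""
    report = []
    for ex in executors:
        if ex.get("isActive", True):
            continue  # still alive, nothing to autopsy
        report.append({
            "id": ex.get("id"),
            "host": ex.get("hostPort"),
            "memory_used": ex.get("memoryUsed"),
            "max_memory": ex.get("maxMemory"),
            "disk_used": ex.get("diskUsed"),
            "failed_tasks": ex.get("failedTasks"),
            # May be absent depending on the Spark version.
            "remove_reason": ex.get("removeReason"),
        })
    return report


def main():
    """Print dead executors for each application known to this UI."""
    for app in fetch_json(f"{BASE}/applications"):
        url = f"{BASE}/applications/{app['id']}/allexecutors"
        for row in dead_executors(fetch_json(url)):
            print(app["id"], row)
```

Calling main() while the application is still running prints one line per dead executor. If memory_used sits close to max_memory that hints at memory pressure, while a large disk_used points toward disk, but treat both as starting points for the discussion rather than a diagnosis.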