I started using the cloud storage connector after Tim’s ATGU presentation. However, my computer got wiped, and I now need to install it again. I get this error when I try to install it:
curl -sSL https://broad.io/install-gcs-connector | python3
Traceback (most recent call last):
  File "<stdin>", line 7, in <module>
ModuleNotFoundError: No module named 'pyspark'
pyspark is a Python library that Hail depends on, so installing Hail should also install pyspark. This error suggests to me that you don’t have Hail installed yet, or at least not in the Python environment that gets picked up when you run python3.
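One way to check this is to ask the python3 interpreter itself which executable it is and whether it can find the two modules. The snippet below is only a rough sketch: the helper name `module_status` is made up for illustration, and it uses nothing beyond the standard library.

```python
import importlib.util
import sys


def module_status(names=("hail", "pyspark")):
    """Return {module name: True/False} for whether each top-level module
    can be found by the interpreter running this code (hypothetical helper)."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    # The path printed here is the environment the install script will also use
    # when it is piped into python3.
    print("interpreter:", sys.executable)
    for name, found in module_status().items():
        print(f"{name}: {'found' if found else 'NOT found'}")
```

Save it to a file and run it with the same python3 that you pipe the install script into. If hail shows as not found, installing it into that interpreter (for example with `python3 -m pip install hail`) should bring in pyspark too, after which the connector install can be retried.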