Hi,
I’m trying to access S3 data from Hail on a Dataproc cluster, but I’m running into the error below.
Logs
Py4JJavaError: An error occurred while calling o63.csv.
: org.apache.hadoop.fs.s3a.AWSClientIOException: doesBucketExist on bt-transient-bucket-rbnc: com.amazonaws.AmazonClientException: No AWS Credentials provided by BasicAWSCredentialsProvider EnvironmentVariableCredentialsProvider InstanceProfileCredentialsProvider : com.amazonaws.SdkClientException: The requested metadata is not found at http://ip/latest/meta-data/iam/security-credentials/: No AWS Credentials provided by BasicAWSCredentialsProvider EnvironmentVariableCredentialsProvider InstanceProfileCredentialsProvider : com.amazonaws.SdkClientException: The requested metadata is not found at http://ip/4/latest/meta-data/iam/security-credentials/
Could you please provide documentation or any details on how to access S3 data from a Hail Dataproc cluster?
Thanks.