Open arturzangiev opened 1 year ago
I managed to figure it out. It is clearly networking issue to do with AWS Elasticache. As I deployed to EMR the job successfully get executed. The thing I can't figure out now is why I can't execute it locally as I am on VPN and if I just use redis-cli I can access Elasticache fine. It looks like spark locally can't assign IP correctly.
Trying to read a dataframe from redis instance of AWS, but get stuck at stage 0.
[Stage 0:> (0 + 1) / 1]
Spark 3.3.1 Scala 2.12.15 Java 17.0.1 Python 3.8.14 pyspark 3.3.1 Macbook M1