Closed aaqibjavith closed 6 years ago
I could solve this issue by adding region and subnet.
This is my mrjob.conf
file
runners:
emr:
aws_access_key_id:
aws_secret_access_key:
region: us-east-2
subnet: subnet-6019a21c
ec2_key_pair: emr
ec2_key_pair_file: /home/ec2-user/.ssh/emr.pem
ssh_tunnel: true
instance_type: m4.large
master_instance_type: m4.large
num_core_instances: 2
# There's a newer AMI version but it has issues with the released stable mrjob
#ami_version: 3.0.4
interpreter: python2.7
bootstrap:
- sudo yum install -y python27 python27-devel gcc-c++
- sudo wget -S -T 10 -t 5 https://bootstrap.pypa.io/get-pip.py
- sudo python2.7 get-pip.py
- sudo /usr/local/bin/pip2.7 install mrjob simplejson warc phonenumbers boto --ignore-installed chardet
- sudo /usr/local/bin/pip2.7 install https://github.com/commoncrawl/gzipstream/archive/master.zip
Hope it helps
I am running sample wordcount script in emr with cluster id
j-XXXXXX
as followsIt gets failed. Error message is as follows
I cross checked the cluster id in EMR dashboard. It matches. Why I am getting this error? How do I fix this issue?