cloudera / cdh-twitter-example

Example application for analyzing Twitter data using CDH - Flume, Oozie, Hive

Oozie Job execution "Internal Server Error" #14

Open NaveenSrikanth opened 10 years ago

NaveenSrikanth commented 10 years ago

Hi,

I followed the same procedure you describe. The only difference is that I am not using Cloudera Manager; I am using CDH 4.7. When I run

    oozie job -oozie http://localhost:11000/oozie -config job.properties -run

I get the following error:

Error: HTTP error code: 500 : Internal Server Error.

Please help me get past this hurdle.

joey commented 10 years ago

Can you send me any logs from the Oozie server?

NaveenSrikanth commented 10 years ago

Hi Joey, it is resolved. There is a small flaw in your job.properties. This is the version that works for me:

    nameNode=hdfs://localhost:8020
    jobTracker=localhost:8021
    workflowRoot=${nameNode}/user/${user.name}/oozie-workflows

    # jobStart and jobEnd must be in UTC, because Oozie does not yet support
    # custom timezones
    jobStart=2012-09-10T22:00Z
    jobEnd=2014-01-01T23:00Z

    # This should be set to an hour boundary. In this case, it is set to 8 hours
    # before the jobStart, since PST is GMT-8
    initialDataset=2012-08-23T09:00Z

    # Timezone offset between UTC and the server timezone
    tzOffset=-8

    oozie.use.system.libpath=true
    oozie.coord.application.path=${nameNode}/user/${user.name}/oozie-workflows/coord-app.xml

Your job.properties has:

    nameNode=hdfs://hadoop1:8020
    jobTracker=hadoop1:8021

I got an unknown host exception for hadoop1. After I changed the host to localhost, it worked. The status is still running, Joey. Thanks a lot for the code.
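For anyone hitting the same 500 error, here is a minimal Python sketch of a pre-flight check for the hostnames in job.properties. It is not part of this repository or of Oozie. The helper names (`parse_properties`, `hosts_to_check`, `unresolved_hosts`, `check_file`) are hypothetical, and it assumes `nameNode` is a URI and `jobTracker` is a `host:port` pair, as in the file above. An unknown host is only one possible cause of a 500, so the Oozie server logs are still the place to confirm.

```python
import socket
from urllib.parse import urlparse


def parse_properties(text):
    """Parse simple key=value lines, skipping blank lines and # comments."""
    props = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        props[key.strip()] = value.strip()
    return props


def hosts_to_check(props):
    """Extract hostnames from nameNode (a URI) and jobTracker (host:port)."""
    hosts = {}
    if "nameNode" in props:
        hosts["nameNode"] = urlparse(props["nameNode"]).hostname
    if "jobTracker" in props:
        hosts["jobTracker"] = props["jobTracker"].rsplit(":", 1)[0]
    return hosts


def unresolved_hosts(hosts):
    """Return the entries whose hostname does not resolve on this machine."""
    bad = {}
    for key, host in hosts.items():
        try:
            socket.gethostbyname(host)
        except (socket.gaierror, TypeError):
            # gaierror: name lookup failed; TypeError: no hostname was parsed
            bad[key] = host
    return bad


def check_file(path):
    """Print one line per property whose host cannot be resolved."""
    with open(path) as f:
        props = parse_properties(f.read())
    for key, host in unresolved_hosts(hosts_to_check(props)).items():
        print(f"{key}: cannot resolve host {host!r}")
```

Calling `check_file("job.properties")` on the machine that runs the Oozie server should list `hadoop1` for both properties if that name is not resolvable there, which matches the unknown host exception described above.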

I would like to know more about how this works. How are the tweets retrieved? Which tweets are loaded into HDFS? It would be very helpful to know this.