kite-sdk / kite

Kite SDK
http://kitesdk.org/docs/current/
Apache License 2.0

CDK-1014: Fix support for Hive datasets on Kerberos enabled clusters. #378

Closed: joey closed this 9 years ago

rdblue commented 9 years ago

+1! (when tests pass)

Is there a way to test this?

joey commented 9 years ago

Ugh, this doesn't work with CDH4 due to API changes in Hive. Let me see what I can do.

joey commented 9 years ago

I still need to fix the CDH4 build. The latest patch adds support for Kerberos in the CopyCommand for Hive datasets. The fixes to the MR Input/OutputFormats aren't enough on their own, because you also need to distribute a delegation token to the remote job.

I'm not sure whether we want that fix in this commit, but I figured I'd push it to show you what I had to do to test it.

rdblue commented 9 years ago

+1

Thanks, Joey!

rdblue commented 9 years ago

That was fast, I was just going to merge it. Thanks!

joey commented 9 years ago

Ha ha. I hope that was ok. I know historically we've merged our own branches, but I'm happy to have that move to the reviewer (I think that's what Hadoop does) if that makes sense.

rdblue commented 9 years ago

Merging our own branches is great. I just wanted to get this in as soon as possible. I'm going to need to backport it.