ondra-m / ruby-spark

Ruby wrapper for Apache Spark
MIT License
227 stars 29 forks source link

Does this project have a future? #29

Open gnilrets opened 8 years ago

gnilrets commented 8 years ago

Hi, I'm really excited by Spark, but not so excited about transitioning my work to Python. I'd love to see this project succeed, but it looks like it might be stalled. Is that true?

I see there's a pull request for DataFrame support that hasn't been merged. Is there anything I could do to help move that along?

ondra-m commented 8 years ago

It is true (a little).

Sure, you can add:

ryan-boder-zz commented 8 years ago

Just wanted to say thanks. I'm using your work for a Spark presentation at a Ruby meetup. It looks like you've done a ton of great work here. I hope it grows into an active project.

ondra-m commented 8 years ago

Thanks but I have problem with your example on ruby 2.2.3

n = 10_000_000
hits = $sc.parallelize(1..n).map(lambda do |_|
  x = rand * 2 - 1
  y = rand * 2 - 1
  x**2 + y**2 < 1 ? 1 : 0
end).sum

# => Sourcify::NoMatchingProcError: Sourcify::NoMatchingProcError

Does it work to you? I have to save Proc/lambda on variable first.

ryan-boder-zz commented 8 years ago

It works for me on ruby 2.0.0p481 on OSX Mavericks. I'm not a ruby expert by any means, just learning it. I did notice in your Pi example you used a named function instead of a lambda, just wasn't sure why. Isn't this code correct with the lambda?

ondra-m commented 8 years ago

If it works for you, its ok :)