johnmyleswhite / ML_for_Hackers

Code accompanying the book "Machine Learning for Hackers"
http://shop.oreilly.com/product/0636920018483.do
3.67k stars 2.22k forks source link

Converting R -> Python #30

Open josephmisiti opened 9 years ago

josephmisiti commented 9 years ago

Hello,

Now that ggplot for Python has been around for a while (a few months anyways) - I am personally, for fun, converting the R examples into Python (via IPython Notebooks) using the expected libs: numpy, scipy, pandas, ggplot, statsmodels, etc. (maybe a few others).

My questions are the following:

  1. Is anyone is interested in helping?
  2. Are John/Drew interested in merging this into the master branch?

Depending on the answer to (2), I will try to document my code accordingly. I'm currently done with Chapters 1+2 and 1/2 of 3. I suspect the rest of the code might take me another two weeks if I am doing it by myself.

Thanks,

Joe Misiti @josephmisiti

upjohnc commented 9 years ago

I can pitch in some time. What are the thoughts on divvying up the work?

Chad Upjohn On Tue, Dec 30, 2014 at 10:28 joseph misiti notifications@github.com wrote:

Hello,

Now that ggplot for Python has been around for a while (a few months anyways) - I am personally, for fun, converting the R examples into Python (via IPython Notebooks) using the expected libs: numpy, scipy, pandas, ggplot, statsmodels, etc. (maybe a few others).

My questions are the following:

  1. Is anyone is interested in helping?
  2. Are John/Drew interested in merging this into the master branch?

Depending on the answer to (2), I will try to document my code accordingly. I'm currently done with Chapters 1+2 and 1/2 of 3. I suspect the rest of the code might take me another two weeks if I am doing it by myself.

Thanks,

Joe Misiti @josephmisiti https://github.com/josephmisiti

— Reply to this email directly or view it on GitHub https://github.com/johnmyleswhite/ML_for_Hackers/issues/30.

josephmisiti commented 9 years ago

Ideally -

I would like to get each chapter started, and then go from there - what I am running into right now is some of the charts they are using in the book are not yet supported with yhat's gglot: for example geom_jitter isnt fully functional yet and some of the histograms with time on the x-axis are not working - so if we really wanted to get ambitious we could submit pull requests there also. I plan on doing this but if you are not interested that is cool also. If you dont want to deal with ggplot we could also create the plots in matplotlib also.

upjohnc commented 9 years ago

I'd definitely want to use ggplot. The R version is great.

I'd say that you should submit a request for the geom_jitter. That's needed.

I assume that you have started on Chapter 1. If so, I can start on Chapter 2.

kaiwang0112006 commented 8 years ago

why not use matplotlib?