allr / purdue-fastr

Other
268 stars 40 forks source link

could not find function 'read.table' #6

Closed slihn closed 10 years ago

slihn commented 10 years ago

I am trying to use fastR to perform factor analysis (using lm). But I can't get pass read a csv file. Is this not supported yet? Or what am I missing?

Thanks, Steve

chrismattmann commented 10 years ago

Hi @slihn can you be more specific about the specific CSV file you are loading? How are you loading it? Do you have an example script?

mhaupt commented 10 years ago

Hello,

the FastR hosted on GitHub is not the most recent one; development continues on BitBucket (https://bitbucket.org/allr/fastr). Note that it is a work in progress and loading CSV is not supported at this time.

Regards,

Michael

chrismattmann commented 10 years ago

Hi @mhaupt any particular reason that the code moved to BitBucket?

janvitek commented 10 years ago

The github reflects the first version of the system developped at Purdue.

When Oracle took over the code base we moved to bitbucket as it was more familiar to the new developers.

On Aug 7, 2014, at 11:54 AM, Chris Mattmann notifications@github.com wrote:

Hi @mhaupt any particular reason that the code moved to BitBucket?

— Reply to this email directly or view it on GitHub.

mhaupt commented 10 years ago

Nope, this was not about familiarity. It was about technical compatibility. :-)

chrismattmann commented 10 years ago

Thanks for the explanation guys. I think your project is awesome and we are trying to figure out how to use it on our Regional Climate Model Evaluation System (RCMES) project at JPL http://rcmes.jpl.nasa.gov/ for model evaluation metrics.

janvitek commented 10 years ago

Hi Chris,

Can I ask you what draws you to a JVM based implementation of R? (I am always curious)

We may also be able to implement missing pieces if there are only a few of them.

-jan

On Aug 7, 2014, at 4:17 PM, Chris Mattmann notifications@github.com wrote:

Thanks for the explanation guys. I think your project is awesome and we are trying to figure out how to use it on our Regional Climate Model Evaluation System (RCMES) project at JPL http://rcmes.jpl.nasa.gov/ for model evaluation metrics.

— Reply to this email directly or view it on GitHub.

chrismattmann commented 10 years ago

Hi @janvitek thanks for the reply. I think the thing that drew me is that I a Java wonk (was one of the people who helped created Apache Hadoop and I co-created the Apache Tika project, http://tika.apache.org/). My team at JPL uses Java a lot (amongst other things like Python, etc.) and we work with scientists who use IDL, R, Matlab, etc. (Fortran, even) for science codes. One of the projects we are working on now (RCMES) has a relationship between statistical metrics for climate model outputs and remote sensing data, and using those metrics to compare them (bias, RMSE, etc.) and we were thinking of trying out FastR (which I found a presentation like 6 months ago on) for this since we thought maybe it would help bridge our data system needs with our science users, some of whom know R. Thanks!

janvitek commented 10 years ago

Chris,

If you are interested to discuss more, please contact me directly. We could try to see if you application could be a demonstrator for our work. In which case perhaps we could collaborate.

-jan

On Aug 7, 2014, at 4:45 PM, Chris Mattmann notifications@github.com wrote:

Hi @janvitek thanks for the reply. I think the thing that drew me is that I a Java wonk (was one of the people who helped created Apache Hadoop and I co-created the Apache Tika project, http://tika.apache.org/). My team at JPL uses Java a lot (amongst other things like Python, etc.) and we work with scientists who use IDL, R, Matlab, etc. (Fortran, even) for science codes. One of the projects we are working on now (RCMES) has a relationship between statistical metrics for climate model outputs and remote sensing data, and using those metrics to compare them (bias, RMSE, etc.) and we were thinking of trying out FastR (which I found a presentation like 6 months ago on) for this since we thought maybe it would help bridge our data system needs with our science users, some of whom know R. Thanks!

— Reply to this email directly or view it on GitHub.

slihn commented 10 years ago

Hi, @chrismattmann, my company uses scala to build a financial analytics platform. In the past few years, we had to port a lot of quant library into scala that have long existed elsewhere (most noticeably in R). In my opinion, that is such a waste of time. It would've been much easier if scala can pass data into R, call a few packages, and get the data back. But Java developers and R developers don't live together well. I think FastR is an attractive option to get buy-in from the java developers.

My particular experiment is to reproduce what's in http://www.calculatinginvestor.com/2011/04/19/fama-french-tutorial/ And extend it to capture the computations mentioned in the recent "Buffett's Alpha" paper. http://www.econ.yale.edu/~af227/pdf/Buffett%27s%20Alpha%20-%20Frazzini,%20Kabiller%20and%20Pedersen.pdf

Obviously I can't even read the data from a CSV file. If FastR doesn't have comprehensive I/O to the outside world, it will remain a toy in the lab.

janvitek commented 10 years ago

Hi slihn,

You have to start somewhere. We would be happy to help out with some of the work needed to evaluate the technology.

This said, you can also check out Renjin. It claims a much higher level of compatibility with full R.

-jan

Hi, @chrismattmann, my company uses scala to build a financial analytics platform. In the past few years, we had to port a lot of quant library into scala that have long existed elsewhere (most noticeably in R). In my opinion, that is such a waste of time. It would've been much easier if scala can pass data into R, call a few packages, and get the data back. But Java developers and R developers don't live together well. I think FastR is an attractive option to get buy-in from the java developers.

My particular experiment is to reproduce what's in http://www.calculatinginvestor.com/2011/04/19/fama-french-tutorial/ And extend it to capture the computations mentioned in the recent "Buffett's Alpha" paper. http://www.econ.yale.edu/~af227/pdf/Buffett%27s%20Alpha%20-%20Frazzini,%20Kabiller%20and%20Pedersen.pdf

Obviously I can't even read the data from a CSV file. If FastR doesn't have comprehensive I/O to the outside world, it will remain a toy in the lab.

— Reply to this email directly or view it on GitHub.

kalibera commented 10 years ago

Well if you are ok running native code, you can also use rJava, which gives you access to the official implementation of R from Java.

https://github.com/s-u/rJava

Note that you cannot get rid of native code completely anyway. Any R implementation will have to use (native) numerical libraries. Also many R packages include native code.

Tomas

On 08/08/2014 04:39 PM, Jan Vitek wrote:

Hi slihn,

You have to start somewhere. We would be happy to help out with some of the work needed to evaluate the technology.

This said, you can also check out Renjin. It claims a much higher level of compatibility with full R.

-jan

Hi, @chrismattmann, my company uses scala to build a financial analytics platform. In the past few years, we had to port a lot of quant library into scala that have long existed elsewhere (most noticeably in R). In my opinion, that is such a waste of time. It would've been much easier if scala can pass data into R, call a few packages, and get the data back. But Java developers and R developers don't live together well. I think FastR is an attractive option to get buy-in from the java developers.

My particular experiment is to reproduce what's in http://www.calculatinginvestor.com/2011/04/19/fama-french-tutorial/ And extend it to capture the computations mentioned in the recent "Buffett's Alpha" paper.

http://www.econ.yale.edu/~af227/pdf/Buffett%27s%20Alpha%20-%20Frazzini,%20Kabiller%20and%20Pedersen.pdf

Obviously I can't even read the data from a CSV file. If FastR doesn't have comprehensive I/O to the outside world, it will remain a toy in the lab.

— Reply to this email directly or view it on GitHub.

— Reply to this email directly or view it on GitHub https://github.com/allr/fastr/issues/6#issuecomment-51610617.

slihn commented 10 years ago

@kalibera, thanks for the suggestions. I still like fastR since it is supposed to be tightly integrated with java and is fast! That said, I may look into Renjin as well.

chrismattmann commented 10 years ago

Thanks for all the info guys. @janvitek would be happy to chat directly. feel free to reach out over email or Twitter, etc.