GlobalNamesArchitecture / gnparser

Split scientific names to meaningful elements with meta information
https://parser.globalnames.org/
MIT License
20 stars 2 forks source link

how do you differentiate author strings? #479

Closed sckott closed 5 years ago

sckott commented 5 years ago

hi @dimus - hope you don't mind a question about how you made this so I can help my R port of this?

Curious how you differentiate author strings from other parts of the name? I can see how it's easier when there's a year given, but how is it done when there's no year present?

dimus commented 5 years ago

@sckott probably easier to make a google hangout or skype call about it?

dimus commented 5 years ago

I am also wondering if we can do something with my new project: https://gitlab.com/gogna/gnparser

sckott commented 5 years ago

a call would be good. Either of those is good for me. What day and time? What time zone are you in?

sckott commented 5 years ago

Possibly with the go version. though I think we still don't have a nice Go/R integration that works across platforms, etc. R has great integrations for C and C++, and Rust actually works very well too. There is some experimentation with including Go in R https://github.com/rstats-go/gotest that I could try

dimus commented 5 years ago

I am connecting other languages via gRPC (https://rubygems.org/gems/gnparser) but for R there is no direct route to gRPC according to https://grpc.io/docs/. So connector to R would have to be in C++ I guess, and then binding to R? And it is RPC, so a bit less convenient and a little bit slower. I am considering to send Ruby and Scala parsers to limbo, and concentrate on Go version because of the speed and cleaner parsing rules https://gitlab.com/gogna/gnparser/blob/master/grammar/grammar.peg._

dimus commented 5 years ago

My timezone is CST. I am free today for the rest of the day, tomorrow I am free after 10CST, and on Friday I am free after 11CST. Google hangout is a bit easier, as it is better supported on LInux than Skype

sckott commented 5 years ago

Of those options C++ would be best for R.

Go does seem like a good single best implementation.

How about 130 pm PST today? 330 CST Hangouts works for me. I'm at myrmecocystus@gmail.com

dimus commented 5 years ago

Sounds good @sckott, talk to you soon

dimus commented 5 years ago

calling in a few as dmozzherin@gmail.com

On Wed, Jan 16, 2019 at 1:47 PM Scott Chamberlain notifications@github.com wrote:

Of those options C++ would be best for R.

Go does seem like a good single best implementation.

How about 130 pm PST today? 330 CST Hangouts works for me. I'm at myrmecocystus@gmail.com

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/GlobalNamesArchitecture/gnparser/issues/479#issuecomment-454914636, or mute the thread https://github.com/notifications/unsubscribe-auth/AAAvXILPYuj_UpCxSbd7eEBfG86ZNpYRks5vD4HPgaJpZM4aBpjh .

dimus commented 5 years ago

Closing this, now it relates to Go gnparser

dimus commented 4 years ago

@sckott my Ruby parser now uses Go gnparser compiled to c library: https://github.com/GlobalNamesArchitecture/biodiversity

sckott commented 4 years ago

Thanks - has the C library that's created from the Go version changed much or at all since early 2019 when I last tried it? Tried again briefly and I still can't get it to work

dimus commented 4 years ago

hm, the C-library appeared first time 6 months ago, so it is not the same thing we tried in early 2019 https://gitlab.com/gogna/gnparser/-/issues/69

To use it in Ruby I used FFI https://github.com/GlobalNamesArchitecture/biodiversity/blob/master/lib/biodiversity/parser.rb

sckott commented 4 years ago

Okay, right. I was looking at that file. In R, we have to do what FFI does in C, which I do not know :) I'll try it again at some point

dimus commented 4 years ago

You can also look at the Go to C functions here:

https://gitlab.com/gogna/gnparser/-/blob/master/binding/main.go

sckott commented 4 years ago

thanks