ResearchSoftwareInstitute / greendatatranslator

Green Team Data Translator Software Engineering and Development
BSD 3-Clause "New" or "Revised" License
2 stars 1 forks source link

September 2018 Hackathon Goals #135

Closed stevencox closed 5 years ago

stevencox commented 6 years ago

Green Team : September

ICEES: Scientific Use Cases

X-CTSA Translator Clinical Data Pipeline

(Socio)Environmental Exposures Knowledge Sources

Translator Platform Services

karafecho commented 6 years ago

@stevencox: Perhaps add OmniCorp to Translator Platform Services?

karafecho commented 5 years ago

Update from Emily on plan for ICEES integrated feature tables and X-CTSA clinical data pipeline, minor corrections provided by Kara:

stevencox commented 5 years ago

@karafecho @empfff @xu-hao @jameschump

empfff commented 5 years ago

@stevencox @karafecho @xu-hao @jameschump yep, so the non-open-access lines are internal milestones--but some of those (like the identified ICEES tables) are dependencies for the open-access stuff (the final ICEES tables), so I thought they belonged on this list.

We can share the CAMP FHIR code--I just didn't want it going too far before we got a green team publication out of it. I defer to the group.

stevencox commented 5 years ago

Ten four to all of that.

karafecho commented 5 years ago

@stevencox @empfff : For what it's worth, I agree with Emily that we should not share the CAMP FHIR code until we have a publication (or at least a submitted manuscript).

stevencox commented 5 years ago

I'm fine with doing that.

Also, at some point, I'd like to have an over-nachos discussion about this for my edification. Is hiding code really the secret to publication? I expect we'll want to publish something about Robokop (@cbizon) - that's all way public. I'm pretty sure @balhoff's stuff is all entirely open. He likes to publsh things. That lot doesn't seem to fear being scooped.

balhoff commented 5 years ago

I wasn't going to stick my nose in... however it's my preference to develop in the open from the first commit. If someone else has time to look at the code you are probably lucky, and maybe they will even end up collaborating.

karafecho commented 5 years ago

I don't think anything has to be published in a peer-reviewed journal article, but I do think one needs to demonstrate ownership of a product and restrict broad dissemination (meaning large-scale socialization or advertising) if one wishes to publish a peer-reviewed journal article. WRT the latter point, most (regarded) journals simply will not publish 'old news', especially these days. So, yeah, there's a balance between being completely open and being a tad bit guarded. The balance depends on your personal goals. For instance, academics remain 'old school'. Chris and Jim are not traditional academics (i.e., tenure-track faculty).

Furthermore, open-source anything opens the door to academic and industry 'theft' of products plus IP. Sorry, but my cynical view of the world is rooted in experience.

karafecho commented 5 years ago

And I'm not the only one who shares these beliefs.

Here's Charles' perspective on a potential meeting with Gamma and BD (excerpt from an email): BD should be interesting (too bad Perry is retired, at least that's what his linkedin says). I think BD could be interested in Reasoner (probably not Translator) - but, putting my cynical hat on it could come across as yet another biomed knowledge system (e.g., Pathway Studio, Watson) without a clear differentiator. Being even more cynical, if they like the idea a lot it looks like there are zero barriers to them stealing it (I suspect the code license allows them to freely sell Reasoner/Translator).

balhoff commented 5 years ago

Open source is absolutely no barrier to peer-reviewed publication. Traditional journals usually want some sort of scientific application to carry a software article, and that will extend well beyond the source code itself. For direct publishing of the software as a product itself, JOSS is an interesting new choice. Since we typically publish software with a license like MIT or BSD, no one can steal it—they are freely given a license to use it.

The free and open nature of Translator is to me the primary and very important differentiator from those other systems.

cbizon commented 5 years ago

?I agree with Jim 100%.

Chris


From: Jim Balhoff notifications@github.com Sent: Wednesday, September 5, 2018 6:33 PM To: ResearchSoftwareInstitute/greendatatranslator Cc: Chris Bizon; Mention Subject: Re: [ResearchSoftwareInstitute/greendatatranslator] September 2018 Hackathon Goals (#135)

Open source is absolutely no barrier to peer-reviewed publication. Traditional journals usually want some sort of scientific application to carry a software article, and that will extend well beyond the source code itself. For direct publishing of the software as a product itself, JOSShttp://joss.theoj.org is an interesting new choice. Since we typically publish software with a license like MIT or BSD, no one can steal it-they are freely given a license to use it.

The free and open nature of Translator is to me the primary and very important differentiator from those other systems.

- You are receiving this because you were mentioned. Reply to this email directly, view it on GitHubhttps://github.com/ResearchSoftwareInstitute/greendatatranslator/issues/135#issuecomment-418902973, or mute the threadhttps://github.com/notifications/unsubscribe-auth/AKPyHiDBGYlXfWcHYvPzCHZxoRvSN5viks5uYFEjgaJpZM4UjDkB.

karafecho commented 5 years ago

I guess my point relates more to the application space for a particular piece of software code. My argument is that we keep the initial application of CAMP FHIR somewhat restricted until some sort of publication is released (JOSS?), one that all team members agree upon and provides proper attribution to team members. I agree that subsequent applications of the same (or slightly adapted) software code also are publishable in peer-reviewed journal articles, but Green Team loses provenance over the original code the more things get recycled. In fact, many academic groups make their livelihood off of this sort of thing.

I'd also like to remind you that the Translator CTS commentaries are being considered 'confidential' until they are published.

I fully support the Translator program (and I REALLY support the program, in probably more ways than you are aware of). I also completely agree with you that the open nature of the program is the main differentiator between Translator and other academic or commercial programs/products.

However, I recognize that the Translator program relies on people, not just software, and therein lies the challenge. The Translator program is by no means free of questionable behavior by team members. Trust me.

I guess my concerns are that the nuances of peer review, publication, IP, and licensing are tricky and in flux. All I'm suggesting is to be a bit guarded. Open, yet guarded, if that makes sense.

Apologies for the cynicism, but my experience warrants it.

Kara

karafecho commented 5 years ago

Especially when it comes to open access to clinical data.

We're breaking new ground here, so let's be a bit guarded.

stevencox commented 5 years ago

@karafecho - I get the concern about bad actors and poaching and all of that. What's our timeline for submitting an abstract?

Here's Harold Solbrig's proposed clinical profile specification scheme published - on GitHub. It overlaps substantially with our work (FHIR). And uses crepe which seems to do OMOP.

As @balhoff said, for the kind of code we're building here, there are no secrets algorithms. We'll be better off and stay closer to the cutting edge the more open we're able to be.

With Orange willing to share fairly innovative work like Harold's, we stand to look awkward showing up with secret code to a Hackathon whose purpose is sharing and collaboration.

karafecho commented 5 years ago

@stevencox : Your thoughts are well taken.

However, you all need to consider sustainability of the Translator program. NCATS has provided a one-year extension in funding during the 'feasibility' phase of the program and no guarantee of future funding thereafter. (For what it's worth, the OTA award mechanism CANNOT by definition extend beyond the feasibility phase of the program.)

'Open-everything' is fine, but you'd be foolish to NOT consider additional options.

In terms of sustainability, there's room to accommodate 'open', 'semi-open', AND 'closed'. We just need to be careful and define the 'space', 'time-frame', and 'scope' of 'openness'.

Just my two cents...

karafecho commented 5 years ago

One other thing: In terms of attribution, I relinquished first-authorship on the Translator CTS commentaries in exchange for attributing authorship to 'The Biomedical Data Translator Consortium'. And convincing the journal that all contributors deserved recognition took a bit of work.

I point this out only to demonstrate that I truly am a 'team player', just a pragmatic one. ;-)

karafecho commented 5 years ago

One other-other thing: sharing 'stuff' broadly among Translator team members, as @stevencox suggests, is perfectly fine and in the spirit of the program, but the terms and conditions need to be clearly defined. In fact, this very issue has been the topic of discussion among several Green Team members in response to recent events.

empfff commented 5 years ago

Interesting thread. I'll just add that we do have an abstract on CAMP FHIR already in to AMIA, waiting for a decision--we should hear back by Halloween. If we are accepted, at that point I feel entirely comfortable sharing the code. For me, the code isn't what needs protecting prior to publication--it's the idea/concept behind it. (Converting relational data to JSON, which is all CAMP FHIR really does, isn't the novel part--but WHY we're doing it and HOW we're applying it is.)

The angel on my right shoulder tends to side with the "everything should be open from day 1" crowd. The devil on my left has seen some pretty gauche behavior from certain academics, and is much more of a cynic. I usually split the difference by saying I'll happily share everything and more once at least one abstract or manuscript is in the works. I don't think anyone loses out that way, but maybe that's just me.

stevencox commented 5 years ago

Thank you all for an excellent Hackathon, y'all!

@karafecho @empfff @jameschump @balhoff @xu-hao @lstillwe @ColinKCurtis @StanAhalt @arunacs @karlgustafson