Closed mattodd closed 10 years ago
I haven't had time to write it up yet, but please check out this page, which visualizes the SD file of Oct 10:
http://apps.ideaconsult.net:8080/ambit2/dataset/1898501?page=0&pagesize=100
You can select the "Properties" checkbox to visualize all the SD properties, as shown in the second screenshot:
I have spoken with @vedina about options to automate updating the data set daily...
Another possible option: JSDraw by Scilligence has a java script solution to sdf display. It can be embedded in html and might look like this: http://www.elncloud.com/jsdrawapp/jsdraw/SDFList.htm
This is all on good lines but I can add 1) make sure the InChIKeys are open to the Googlebot 2) I would have thought monthly ChEMBL uploads were fine or even just before a new release so they flow to PubChem a couple of weeks later. 3) At least for the SD masterfiles keep the result-linked and designed virtuals plus made-but-not-yet-tested cleanly seperate (but can merge as well of course for clustering display ect). Since I have an academic ChemAxon licence I'd probably try JChem for Excel or the new MSWord integration but I'm generally agnostic about communal web-centric choices on the display side. As Egon can give us expert support directly with Ambit thats a bonus for a start.
http://ideaconsult.github.io/examples-ambit/molbrowser/
There is a command line application, which can convert any SDF file into JSON and a folder with images. The page is a static one with some jQuery scripts and a bit of styling. The entire directory structure can be hosted at any HTTP server.
Here's a screenshot of that page:
As some of the posts above point out: it's pretty easy to create and use cheminformatics tools to pull the data from github, add whatever info is required, and build the kind of web page you want. I'm sure a lot of us cheminfogeeks would be happy to help out with that.
It's probably also no problem to provide a relatively easy to use desktop tool for grabbing, manipulating, and displaying the data. If I were doing this for a chemist, I'd probably use Knime. It's not that tough to use and has a great selection of tools for munging and manipulating data. For a more expert user (or someone willing to invest some more time in learning), I'd use an IPython notebook.
The trickier part is having the tool hosted on a reliable server somewhere that the community can access and keeping the contents up-to-date. Here, unfortunately, I don't have much in the way of ideas to offer.
Coming back to Chris's question above: how often does the web page need to be updated?
"reliable server and keeping the contents up-to-date" - with only a static page with scripts to host, the choice of HTTP servers is very broad (no application server needed nor database backend). JSON and images the page in my post relies on can be updated with a simple cron script any time the source SDF file is changed.
And for a very small dataset like this one, I would say github is the perfect place. Version control as an added value to hosting ;)
Oh yeah, that's definitely true for static files, but someone either has to run the scripts to generate those files and upload them to the server or you need another server somewhere that's running the cron job.
@greglandrum Not necessarily a server - run the scripts anywhere (laptop, desktop) and check the files into github (or server of choice). More or less similar to compiling source files.
That tool looks pretty cool. If the files are purely static and we don't need anything too special we could simply host the site from an S3 bucket (however the cron job would still, as mentioned above have to run on something).
Another option is to host the web server, and the cron job on the same server, this could either be as an AWS instance (a t1.micro would be sufficient), or alternately we have free access to use Nectar ( http://nectar.org.au/ ) an Australian cloud for researchers that would provide a box with better specs than a low cost AWS instance.
@mike - yes - all the files behind the molbrowser page are static and currently on github gh-pages. Great idea to host on a cloud service!
Ping @vedina could you please add a LICENSE to the repo folder that outlines usage/license? Cheers.
Thanks, much appreciated.
Nice examples everyone, thank you. Just to clarify, is someone actively constructing something, or is the ball back in my court? @greglandrum and @cdsouthan I'm not sure how often we'd update the sd file. We add only those compounds that have been biologically evaluated, so it's periodic. At a best guess, weekly? And yes, Knime is a very good advanced tool for playing with data, but we just need something for the interested med chemist. Structures, some info like an ID, potencies. I think it's important people can follow the trail from the data to more info about the compounds, but if that's not already in the sd file we can handle it separately later.
I think we have enough to work with at the moment. At this stage the app by @vedina looks neat, but I'm concerned that Google won't be able to index it properly (due to the nature of how the content is served through javascript). The priority of having the data indexable by Googlebot should exceed any concern about the webpages being static. I'm in favour of a slightly more built out webapp that makes indexing for Google easier, in addition to making the user experience better for chemists browsing the site (good visualisations, search etc). We have the resources to host something like this so I think we should take advantage of that.
@mike - true, the JSON data as it is now will not be indexed properly by Google. One option is to modify the application to directly generate HTML tables instead of JSON, this is quite straightforward. The same jQuery datatable can be used for rendering, but the HTML will contain all the values.
In fact Google might index the JSON pages with a bit of more efforts. Google seem to have a patent for indexing and searching JSON objects ...
@mattodd I can't help on the IT construction side but it looks like you have very able folk engaging. Re updating, I know OSM keeps up a cracking pace but longer result consolidation intervals have advantages. Besides, if Google gets to the ELNs and folk add the InChIKeys for what they are designing, and the more novel reagents you have de facto real-time surfacing anway. My suggestion would be that @georgeisyourman tips you off for the last two week window before ChEMBLMalaria > ChEMBL18 loads. You then put the call out for any new collaborator results, shut the fume cuboards, order the pizzas, crack a few tubes, and have a data catch-up session, filling in the tables the way GP needs. As mentioned before, you should then not only hit the ChEMBL release but also flow into PubChemBioassay a couple of weeks later. Its important that the MMV and OSM codes come through (via the SIDs) as searchable synonyms in the CID records. Note also you should also update any data and code name additions in the pre-existing ChEMBL records e.g. newer assay results. These will automatically refresh in the SIDs (there may be issues with multiple results needing new assay rows but maybe G&I can take a look)
@vedina I wouldn't be too surprised if they can index JSON, I'm more concerned about them indexing the entry page that generates the data dynamically based on loading the contents of the JSON file.
Sure, Chris, but we <3 automation. If there were a model here for deposition of data in the simplest fashion possible, followed by an auto-upload to something as powerful as Chembl then we'd all win. Absolutely take your point about ELN needing the identifiers - we're trying to get better at that.
On 28 October 2013 00:33, cdsouthan notifications@github.com wrote:
@mattodd https://github.com/mattodd I can't help on the IT construction side but it looks like you have very able folk engaging. Re updating, I know OSM keeps up a cracking pace but longer result consolidation intervals have advantages. Besides, if Google gets to the ELNs and folk add the InChIKeys for what they are designing, and the more novel reagents you have de facto real-time surfacing anway. My suggestion would be that @georgeisyourman tips you off for the last two week window beore ChEMBLMalaria > ChEMBL18 loads. You then put the call out for any new collaborator results, shut the fume cuboards, order the pizzas, crack a few tubes, and have a data catch-up session ,filling in the tables the way GP wants, As mentioned before, you should then not only hit the ChEMBL release but flow into PubChemBioassay a couple of weeks later. Its important that the MMV and OSM codes come through (via the SIDs) as searchable synonyms in th e CID re cords. Note also you should also update any data and code changes in the pre-existing ChEMBL records e.g. newer assay results (there may be issues with multiple results needing new assay rows but maybe G&I can take a look)
— Reply to this email directly or view it on GitHubhttps://github.com/OpenSourceMalaria/OSM_To_Do_List/issues/99#issuecomment-27169150 .
MATTHEW TODD | Senior Lecturer and Honours Coordinator School of Chemistry | Faculty of Science
THE UNIVERSITY OF SYDNEY Rm 519, F11 | The University of Sydney | NSW | 2006 T +61 2 9351 2180 | F +61 2 9351 3329 | M +61 415 274104 E matthew.todd@sydney.edu.au | W http://sydney.edu.au/science/chemistry/research/todd.html
CRICOS 00026A This email plus any attachments to it are confidential. Any unauthorised use is strictly prohibited. If you receive this email in error, please delete it and any attachments.
@mattodd - well, try uploading to an Ambit instance - nothing more than SDF upload through a web form or a REST API and all substructure and similarity searches are enabled once the file is uploaded.
@ngn thank you - I wish I understood that you just wrote, but sadly don't. It's like if I asked you to kugelrohr out the desired compound before doing a CHN to confirm purity then proceed with the transfer hydrog in the next step. Any chance you could let me know what you mean by doing..?
On 28 October 2013 22:19, ngn notifications@github.com wrote:
@mattodd https://github.com/mattodd - well, try uploading to an Ambit instance - nothing more than SDF upload through a web form or a REST API and all substructure and similarity searches are enabled once the file is uploaded.
— Reply to this email directly or view it on GitHubhttps://github.com/OpenSourceMalaria/OSM_To_Do_List/issues/99#issuecomment-27203722 .
MATTHEW TODD | Senior Lecturer and Honours Coordinator School of Chemistry | Faculty of Science
THE UNIVERSITY OF SYDNEY Rm 519, F11 | The University of Sydney | NSW | 2006 T +61 2 9351 2180 | F +61 2 9351 3329 | M +61 415 274104 E matthew.todd@sydney.edu.au | W http://sydney.edu.au/science/chemistry/research/todd.html
CRICOS 00026A This email plus any attachments to it are confidential. Any unauthorised use is strictly prohibited. If you receive this email in error, please delete it and any attachments.
@mattodd I refer exactly to the same solution as suggested by @egonw at the second comment.
Sorry for the self advertising.
Ambit is a chemical database (MySQL) with REST web service API doi:10.1186/1758-2946-3-18 . LGPL2 licence. Substructure and similarity search
It runs under Apache Tomcat container and is distributed as a web application archive, which means anyone can host it. Unlike ChEMBL the database is distributed without default content - that's it you could upload any chemical file through a web form or a HTTP POST call. Bioclipse can do this (among other things as running calculations and variety of predictions through the same API).
What I had in mind is 1) installing a Tomcat server 2) deploy ambit.war 3) upload SDF files (or CML, SMILES, InChI, etc). This could be an existing installation or one dedicated to OSM, at a location of choice.
Guys - without wishing to hassle anyone, and acknowledging that @miike just started a new repo on this, can I ask if anyone is actively pursuing a solution, such as a demo visualisation of an sd file that is suitable for the uninitiated? If a med chemist comes to the main project hub and we direct them to a commonly-held sd file of all compounds, is there a link we can share with them that allows browsing of the structures and associated data. People here have suggested excellent possibilities, and I wondered if a leader had emerged about which we could start becoming more specific? i.e. that we could start trialling on non-IT-expert medicinal chemists as a user group.
On 28 October 2013 22:48, ngn notifications@github.com wrote:
@mattodd https://github.com/mattodd I refer exactly to the same solution as suggested by @egonw https://github.com/egonw at the second commenthttps://github.com/OpenSourceMalaria/OSM_To_Do_List/issues/99#issuecomment-26993796.
Sorry for the self advertising.
Ambit http://ambit.sourceforge.net/download_ambitrest.html is a chemical database (MySQL) with REST web servicehttps://github.com/ideaconsult/examples-ambit/tree/master/ambit-json-docsAPI doi:10.1186/1758-2946-3-18 http://www.jcheminf.com/content/3/1/18 . LGPL2 licence. Substructurehttp://apps.ideaconsult.net:8080/ambit2/ui/query?option=smarts&type=smiles&threshold=0.8&funcgroups=%5BC%5D%5BNX3H2%2CNX4H3%3B%21%24%28NC%3D%5B%21%236%5D%29%5D&search=%5BC%5D%5BNX3H2%2CNX4H3%3B%21%24%28NC%3D%5B%21%236%5D%29%5D&pagesize=10and similarity searchhttp://apps.ideaconsult.net:8080/ambit2/ui/query?option=similarity&type=smiles&threshold=0.8&search=OC%5BC%40H%5D1O%5BC%40H%5D%28C%5BC%40%40H%5D1O%29N2C%3DC%28F%29C%28%3DO%29NC2%3DO&pagesize=10
It runs under Apache Tomcat http://tomcat.apache.org/ container and is distributed as a web application archivehttp://en.wikipedia.org/wiki/WAR_fileformat(Sun), which means anyone can host it. Unlike ChEMBL the database is distributed without default content - that's it you could upload any chemical file through a web formhttp://apps.ideaconsult.net:8080/ambit2/ui/uploadstrucor a HTTP POST call. Bioclipse http://www.bioclipse.net/ can do this (among other things as running calculations and variety of predictions through the same API).
What I had in mind is 1) installing a Tomcat server 2) deploy ambit.war 3) upload SDF files (or CML, SMILES, InChI, etc). This could be an existing installationhttp://apps.ideaconsult.net:8080/ambit2/dataset?page=0&pagesize=100or one dedicated to OSM, on a location of choice.
— Reply to this email directly or view it on GitHubhttps://github.com/OpenSourceMalaria/OSM_To_Do_List/issues/99#issuecomment-27205154 .
MATTHEW TODD | Senior Lecturer and Honours Coordinator School of Chemistry | Faculty of Science
THE UNIVERSITY OF SYDNEY Rm 519, F11 | The University of Sydney | NSW | 2006 T +61 2 9351 2180 | F +61 2 9351 3329 | M +61 415 274104 E matthew.todd@sydney.edu.au | W http://sydney.edu.au/science/chemistry/research/todd.html
CRICOS 00026A This email plus any attachments to it are confidential. Any unauthorised use is strictly prohibited. If you receive this email in error, please delete it and any attachments.
I've started work on this, I just haven't made enough progress to warrant pushing anything to that repo yet. For those interested it will be a mongodb backed node.js app. Creating a quick and dirty visualisation doesn't take too long (and can actually be done in basically one line using openbabel) but I'd like to take the time to develop an awesome, user friendly system that has some cool features (like pulling in external data from ChEMBL).
just to tell @mike there is external data pulling in ambit with a REST API ...
I'm working with the guys at knime.com and we will have something coming "soon". Hopefully that means in the next week or so.
On Thu, Nov 21, 2013 at 12:39 PM, miike notifications@github.com wrote:
I've started work on this, I just haven't made enough progress to warrant pushing anything to that repo yet. For those interested it will be a mongodb backed node.js app. Creating a quick and dirty visualisation doesn't take too long (and can actually be done in basically one line using openbabel) but I'd like to take the time to develop an awesome, user friendly system that has some cool features (like pulling in external data from ChEMBL).
— Reply to this email directly or view it on GitHubhttps://github.com/OpenSourceMalaria/OSM_To_Do_List/issues/99#issuecomment-28976997 .
@greglandrum Have you guys got a repo for that? Just want to make sure we aren't doubling up on work.
That was my thought there too.
On 22 November 2013 09:02, miike notifications@github.com wrote:
@greglandrum https://github.com/greglandrum Have you guys got a repo for that? Just want to make sure we aren't doubling up on work.
— Reply to this email directly or view it on GitHubhttps://github.com/OpenSourceMalaria/OSM_To_Do_List/issues/99#issuecomment-29029073 .
MATTHEW TODD | Senior Lecturer and Honours Coordinator School of Chemistry | Faculty of Science
THE UNIVERSITY OF SYDNEY Rm 519, F11 | The University of Sydney | NSW | 2006 T +61 2 9351 2180 | F +61 2 9351 3329 | M +61 415 274104 E matthew.todd@sydney.edu.au | W http://sydney.edu.au/science/chemistry/research/todd.html
CRICOS 00026A This email plus any attachments to it are confidential. Any unauthorised use is strictly prohibited. If you receive this email in error, please delete it and any attachments.
Hi All,
We are in the process of deploying a KNIME Server for use on projects such as this. We should be able to write out to any backend, and can do cheminfo. reports both on scheduled tasks and as part of ad hoc requests if this is useful. This is deployed to the EC2 but we will point some dns at it once we get to that stage (hopefully soon).
For now, you can run workflows from the link below, and make requests for reports/visualizations directly to me if you don't want to do them yourselves in KNIME.
I'm not sure how this fits into your ongoing efforts, but we should be able to stitch things together.
http://54.216.45.3:8080/webportal/#/OpenSourceMalaria/ScheduledJobs/OpenMalariaStructures (u&p): OpenMalariaGuest
I would upload an example report, but don't see a way to attach a PDF. Is there somewhere decent to upload it (sorry, I'm new here)?
Best Regards,
Aaron Hart, KNIME.com ps. Nice to meet you all!
Hi @Aaron-KNIME, looks cool. Is there a repo somewhere with the source?
Hi Aaron,
This is a really great development that will make it much easier for more to contribute substantially to this project and share the results of their analyses under the same platform. I've got already lots of ideas for useful workflows leveraging public domain data, for example.
I tried to login using OpenMalaria and Guest as username and password respectively (also lowercase) but I got a 'login failed' error.
Am I missing something?
Best regards,
George
On 26 November 2013 15:59, Aaron-KNIME notifications@github.com wrote:
Hi All,
We are in the process of deploying a KNIME Server for use on projects such as this. We should be able to write out to any backend, and can do cheminfo. reports both on scheduled tasks and as part of ad hoc requests if this is useful. This is deployed to the EC2 but we will point some dns at it once we get to that stage (hopefully soon).
For now, you can run workflows from the link below, and make requests for reports/visualizations directly to me if you don't want to do them yourselves in KNIME.
I'm not sure how this fits into your ongoing efforts, but we should be able to stitch things together.
http://54.216.45.3:8080/webportal/#/OpenSourceMalaria/ScheduledJobs/OpenMalariaStructures (u&p): OpenMalaria
Guest
I would upload an example report, but don't see a way to attach a PDF. Is there somewhere decent to upload it (sorry, I'm new here)?
Best Regards,
Aaron Hart, KNIME.com ps. Nice to meet you all!
[image: report page]https://f.cloud.github.com/assets/5878985/1623421/a4af98ba-56b3-11e3-977c-c8b92c0abf9f.png
— Reply to this email directly or view it on GitHubhttps://github.com/OpenSourceMalaria/OSM_To_Do_List/issues/99#issuecomment-29304261 .
Hi George,
I had accidentally embedded the report preview in the middle of the login credentials. I fixed it immediately in the post, but it looks like the broken version got sent to everyone by email. The correct u&p are OpenMalariaGuest. If you want actually use KNIME, I can create you a real account on the server.
Regards,
Aaron
On 27 Nov 2013, at 13:29, George Papadatos notifications@github.com wrote:
Hi Aaron,
This is a really great development that will make it much easier for more to contribute substantially to this project and share the results of their analyses under the same platform. I've got already lots of ideas for useful workflows leveraging public domain data, for example.
I tried to login using OpenMalaria and Guest as username and password respectively (also lowercase) but I got a 'login failed' error.
Am I missing something?
Best regards,
George
On 26 November 2013 15:59, Aaron-KNIME notifications@github.com wrote:
Hi All,
We are in the process of deploying a KNIME Server for use on projects such as this. We should be able to write out to any backend, and can do cheminfo. reports both on scheduled tasks and as part of ad hoc requests if this is useful. This is deployed to the EC2 but we will point some dns at it once we get to that stage (hopefully soon).
For now, you can run workflows from the link below, and make requests for reports/visualizations directly to me if you don't want to do them yourselves in KNIME.
I'm not sure how this fits into your ongoing efforts, but we should be able to stitch things together.
http://54.216.45.3:8080/webportal/#/OpenSourceMalaria/ScheduledJobs/OpenMalariaStructures (u&p): OpenMalaria
Guest
I would upload an example report, but don't see a way to attach a PDF. Is there somewhere decent to upload it (sorry, I'm new here)?
Best Regards,
Aaron Hart, KNIME.com ps. Nice to meet you all!
[image: report page]https://f.cloud.github.com/assets/5878985/1623421/a4af98ba-56b3-11e3-977c-c8b92c0abf9f.png
— Reply to this email directly or view it on GitHubhttps://github.com/OpenSourceMalaria/OSM_To_Do_List/issues/99#issuecomment-29304261 .
— Reply to this email directly or view it on GitHub.
Hi Mike,
KNIME is built on top of eclipse and we host most everything ourselves at knime.org.
For workflow development, I would just grab the desktop application from http://www.knime.org/node/81
There are various extensions that can be used along with KNIME. The ones that we develop and support can be found (along with sources) as part of an eclipse update site from here: http://www.knime.org/update/2.8/
Finally, we use a lot of community contributed code as well, which can be found (also with sources) here: http://tech.knime.org/community
For getting started with cheminformatics work, I would recommend the RDKit nodes for KNIME. There is a lot of functionality there, and the nodes are quite easy to use.
Regards,
Aaron
On 26 Nov 2013, at 21:43, miike notifications@github.com wrote:
Hi @Aaron-KNIME, looks cool. Is there a repo somewhere with the source?
— Reply to this email directly or view it on GitHub.
Just a heads up to people following this thread that we'll likely have an open online meeting of the project in the next few days to which everyone is invited. @miike has a nice demo of a visualisation tool we can look at as part of that meeting. Agenda growing here: http://malaria.ourexperiment.org/uri/48f Meeting will be one hour, probably early morning EU time. Exact day/time coming.
On 28 November 2013 01:00, George Papadatos notifications@github.comwrote:
OK thanks - I'm in now.
Yes please, I would really like an account on the server so I can deploy cool workflows.
George EMBL-EBI
On 27 November 2013 12:51, Aaron-KNIME notifications@github.com wrote:
Hi George,
I had accidentally embedded the report preview in the middle of the login credentials. I fixed it immediately in the post, but it looks like the broken version got sent to everyone by email. The correct u&p are OpenMalariaGuest. If you want actually use KNIME, I can create you a real account on the server.
Regards,
Aaron
On 27 Nov 2013, at 13:29, George Papadatos notifications@github.com wrote:
Hi Aaron,
This is a really great development that will make it much easier for more to contribute substantially to this project and share the results of their analyses under the same platform. I've got already lots of ideas for useful workflows leveraging public domain data, for example.
I tried to login using OpenMalaria and Guest as username and password respectively (also lowercase) but I got a 'login failed' error.
Am I missing something?
Best regards,
George
On 26 November 2013 15:59, Aaron-KNIME notifications@github.com wrote:
Hi All,
We are in the process of deploying a KNIME Server for use on projects such as this. We should be able to write out to any backend, and can do cheminfo. reports both on scheduled tasks and as part of ad hoc requests if this is useful. This is deployed to the EC2 but we will point some dns at it once we get to that stage (hopefully soon).
For now, you can run workflows from the link below, and make requests for reports/visualizations directly to me if you don't want to do them yourselves in KNIME.
I'm not sure how this fits into your ongoing efforts, but we should be able to stitch things together.
http://54.216.45.3:8080/webportal/#/OpenSourceMalaria/ScheduledJobs/OpenMalariaStructures
(u&p): OpenMalaria
Guest
I would upload an example report, but don't see a way to attach a PDF. Is there somewhere decent to upload it (sorry, I'm new here)?
Best Regards,
Aaron Hart, KNIME.com ps. Nice to meet you all!
[image: report page]<
https://f.cloud.github.com/assets/5878985/1623421/a4af98ba-56b3-11e3-977c-c8b92c0abf9f.png>
— Reply to this email directly or view it on GitHub<
https://github.com/OpenSourceMalaria/OSM_To_Do_List/issues/99#issuecomment-29304261>
.
— Reply to this email directly or view it on GitHub.
— Reply to this email directly or view it on GitHub< https://github.com/OpenSourceMalaria/OSM_To_Do_List/issues/99#issuecomment-29381925>
.
— Reply to this email directly or view it on GitHubhttps://github.com/OpenSourceMalaria/OSM_To_Do_List/issues/99#issuecomment-29386034 .
MATTHEW TODD | Senior Lecturer and Honours Coordinator School of Chemistry | Faculty of Science
THE UNIVERSITY OF SYDNEY Rm 519, F11 | The University of Sydney | NSW | 2006 T +61 2 9351 2180 | F +61 2 9351 3329 | M +61 415 274104 E matthew.todd@sydney.edu.au | W http://sydney.edu.au/science/chemistry/research/todd.html
CRICOS 00026A This email plus any attachments to it are confidential. Any unauthorised use is strictly prohibited. If you receive this email in error, please delete it and any attachments.
Closing this for now due to more recent Issues linked above. Meeting held before Christmas. Minutes and actions here: http://malaria.ourexperiment.org/osddmalaria_meeting_/8407
How do we best view a collection of molecular structures and their biological activities?
This is a central challenge we have in OSM. It would be good if the casual medicinal chemist could simply browse the project's structures.
Contributors are making molecules every day and regularly but less frequently receiving potencies or other bio/chemdata. We need to be able to share the structures and the activities most effectively. We want the data to be easily shared but also easily browsed. So we need a sheet/something with:
1) Structures, i.e. 2D pictures of the molecules that are human-friendly 2) Associated informatics data (e.g. InChI) that are machine-friendly 3) Potencies or other data 4) Any associated ID numbers 5) A weblink or two to where the molecule is featured/made
It's useful I think for the project to have a discrete place where the data are kept, just to maintain identity - i.e. not just to be subsumed by a larger database. Or at least for the project's structures to be group-able if they are part of a larger database. But really this is a problem about human visualization.
The initial solution was a Google sheet, but we found that beyond about 50 structures the sheet didn't handle the images well.
An alternative is a shared Excel sheet, but as I understand it we would need a plugin to handle the chemical structures. That's do-able, but not if we expect all the readers to have the same plugin.
The current solution is an sd file - a succinct and easy-to-update text file that contains all the information:
https://github.com/OpenSourceMalaria/OSM_compounds
HOWEVER, reading the data (i.e. browsing the structures) is not easy to do for the casual observer.
So what is needed? Well, we're batch-uploading data to Chembl (https://www.ebi.ac.uk/chembl/malaria/doc/inspect/CHEMBL2113921). If we could do an auto-upload to Chembl (daily) then this would be problem solved, since Chembl is very cool and are doing cool things with visualization:
http://chembl.blogspot.com.au/2013/10/some-gamification-of-chembl.html http://chembl.blogspot.ch/2013/10/chembl-web-service-update-3-image.html
But another possible solution is for us to be able to set up a system where: the sd file is displayed on a webpage with a static address (can be bookmarked). When the sd file is updated, that would lead to a new rendering of the webpage when it is loaded. The page would need to have the structures, and be displayed in an active way such that the data can be re-ordered on demand, like in a spreadsheet.
The main sd file has now been joined by an Excel sheet and sd file of lots of exciting new molecules for the latest series the project's looking at:
http://malaria.ourexperiment.org/uri/438
We coincidentally need to combine these two sd files, and we need to browse the new structures because we need to think about which molecules to make next in that new series.
I know that there are solutions that are appropriate for cheminformaticians. We need solutions for people who are happy with email and web browsers only.
Any ideas, anyone?
Egon Willighagen spoke about possible solutions using the sd file during the previous OSM project meeting (http://youtu.be/GxJnWg_eR2I)
This post at:
ELN: http://malaria.ourexperiment.org/uri/43e
G+: https://plus.google.com/u/0/b/114702323662314783325/114702323662314783325/posts/4EDdrZqzuuj
Github: https://github.com/OpenSourceMalaria/OSM_To_Do_List/issues/99