samplchallenges / SAMPL8

Challenge details, inputs, and (eventually) results for the SAMPL8 series of challenges
https://samplchallenges.github.io
MIT License
22 stars 11 forks source link

The SAMPL8 Blind Prediction Challenges for Computational Chemistry

DOI

Challenge details, inputs, and results for the SAMPL8 series (phase) of challenges. Each individual SAMPL8 challenge may be broken up into multiple stages.

See the SAMPL website for information on the Statistical Assessment of the Modeling of Proteins and Ligands (SAMPL) series of challenges as a whole. This repository focuses specifically on the SAMPL8 series of challenges. Additionally, see the SAMPL community on Zenodo for content related to the SAMPL series of challenges. If you wish to use Zenodo to post your presentation slides, datasets, and/or other SAMPL related material, and get a DOI for them, please upload to the community here so that your content will be listed.

Because these files are available publicly, we have no record of who downloads them. Therefore, you should sign up for notifications. Specifically, if you want to receive updates if we uncover any problems, it is imperative that you either (a) sign up for the SAMPL e-mail list, or (b) sign up for notifications of changes to this GitHub repository (the Watch button, above); ideally you would do both.

Please note that some aspects of the SAMPL7 series of challenges are still ongoing, but as we are launching a new host-guest challenge that marks the beginning of the SAMPL8 series of challenges, so we have opened up this repository.

Acknowledging and citing SAMPL

If you've benefitted from our work on the SAMPL series of challenges, please be sure to acknowledge our SAMPL NIH grant in any publications/presentations. This funded host-guest experiments, as well as our work organizing and administrating these challenges. You may acknowledge SAMPL by saying something like, "We appreciate the National Institutes of Health for its support of the SAMPL project via R01GM124270 to David L. Mobley (UC Irvine)."

We also ask you to cite the SAMPL dataset(s) you used. These are versioned on Zenodo, and the latest DOI is here: DOI . Click through for access to all data releases. You may cite these sets by their DOI.

Of course, we also appreciate it if you cite any overview/experimental papers relevant to the particular SAMPL challenge you participated in.

What's here

What's coming

This challenge is concluded; analysis results are still forthcoming for the physical properties challenge.

Changes and Data Set Versions

Releases

Changes not in a release

Challenge construction

Overview

The SAMPL8 phase of challenges included two new host-guest challenges (CB8 and Gibb's deep cavity cavitands). We are currently running our physical properties challenge with GSK (details below) including pKa and logD prediction.

The CB8 challenge

The CB8 "drugs of abuse" challenge focused on binding of CB8 to seven guests which are drugs of abuse, including morphine, hydromorphine, methamphetamine, cocaine and others. Binding has been experimentally characterized, a provisional patent filed, and the Isaacs group has prepared a paper for publication available here. Experimental results/data is available in this repository.

Deadline: The deadline for CB8 submissions was September 15, 2020. The submission format is available.

The GDCC challenge

The GDCC challenge focused on binding of two Gibb Deep Cavity Cavitand (GDCC) hosts (related to the familiar OctaAcid) to five guests. Binding has been experimentally characterized and these compounds form the basis of this challenge, as detailed in this repository.

Deadline: The deadline for GDCC submissions was Feb. 21, 2021 (updated from Feb. 4, 2021). The submission format is available. Additionally, SAMPL8 GDCC predictions may be submitted here. Challenge is closed and experimental results/data is available here

GDCC experimental publication is available. doi:10.1021/acs.jpcb.2c00628

The GSK physical properties challenges

We recently concluded work with GSK for a physical properties challenge. The challenge involved:

We first ran a pKa challenge on the 24 compounds, and then a logD challenge.

Details on dataset collection are available in this talk from the GCC/EuroSAMPL workshop and then further described in this preprint at chemRxiv (DOI 10.33774/chemrxiv-2021-8gd90).

GSK physical properties challenge molecules in Tripos MOL2, SDF, and PDB file format are now available (3/10/21). Enumerated microstates of each molecule are available (as of 5/4/21).

Generally, the challenge structure resembles that of the SAMPL7 physical properties challenge.

Submission deadlines were in August, 2021, as discussed in the challenge details. Submission links were available from the files giving instructions for each challenge component.

Following the challenge, the pKa and logD for several controversial compounds were re-measured in Paul Czodrowski's laboratory with a Sirius T3. The updated pKa values (which have not been used in analysis) are available in the physical_properties/experimental_data directory. It is likely that these new values are superior to those originally provided, but at this time we do not plan to update the analysis here to reflect the new values since these were provided long after the challenge closed.

MANIFEST

SAMPL-related

If you give a SAMPL-related talk or presentation or an analysis of its data, and are willing to share publicly, please consider posting on Zenodo and linking it to the SAMPL Zenodo community.

LICENSE

This material here is made available under CC-BY and MIT licenses, as appropriate:

In other words, we are happy to have you reuse any of the materials here for any purpose as long as proper credit/citation is given.