NMRLipids / Databank

NMRlipids databank
GNU General Public License v3.0
4 stars 31 forks source link

We cannot match experiments on "total lipid concentration" because this number is useful only for a narrow group of experiments #164

Closed comcon1 closed 8 months ago

comcon1 commented 1 year ago

The total concentration of lipid components ([lipid]= Nlipid*[water]/Nwater, where [water] = 55.5 M) doesn't make sense as well as the number of water molecules per lipid for some kind of simulations. I suppose one should not match experiments based on comparing this number.

Number of water mols per lipid makes sense only if all lipids in your mixture have an equal amount of tails. Once you have POPC + TOCL mixture - this number immediately becomes absolutely unusable. However, there is a large amount of people who work only with two-fatty-acid lipids and that number makes sense for them. But we already have CL:PC mixtures merged into the databank (from me and from Milka Doktorova), and I'm going to upload more CL data and also some LipidA data, which has even 6 fatty acids.

There are two possibilities of how it can be fixed:

"total concentration" has an additional disadvantage: it is not a concentration. This equation gives you a concentration only when Nlipid/Nwater is a very-very-very small number. It's OK for liposome experiments. But in the case of solid-state NMR experiments, it is NEVER that. I consider this as some water-per-lipid derivative but becoming much less intuitive than water-per-lipid itself.

I decided that it would be good to initiate that discussion before we have a lot of data with non-two-fatty-acid lipids...

ohsOllila commented 1 year ago

I do not see the problem here. In experiments you should know weights of water and lipids that are put in the system, and based on molecular weights you should be able to calculate the ratio in term of molecules. On the other hand, from simulations it is easy to calculate molecule numbers. If we go for mass percentage here, we need to include molecular weights in the databank. If we go for number of water molecules per one fatty acid chain, we need to define how many chains there are per molecule.

However, if this causes confusion, we should maybe clarify this in the documentation then?

comcon1 commented 1 year ago

The problem is better seen if you have two experiments with, e.g., TOCL:DOPC. Both have 30 water per lipid. One run is for the mixture 90% CL. Another run is for 10% CL. For the second, the hydration will be close to "full". The first one is dehydrated.

Also, similar question is, how to measure % of lipid. In CL:PC mixture, 30 mass % of TOCL means that 30% of surface approximately is covered with CL. You can principally compare that bilayer to PG:PC membrane with 30 mass % of PG. If you provide molar % - it is unclear, how the CL:PC membrane looks like, one should recalculate in head to find it out.

ohsOllila commented 1 year ago

I still do not see the problem. This value does not say whether it is fully hydrated or not. It is just used to check that hydration level is the same in experiments and simulations. Actual value matters only for low hydration levels. If systems are "fully hydrated", then the value does not have to be exactly the same. There is no area involved in this number.

Or do you mean that we should re-evaluate the value for "fully hydrated" system? Currently it is like this: "For molar concentrations of water, the exact hydration level is considered only for systems with molar water-to-lipid ratio below 25, otherwise the systems are considered as fully hydrated.".

pbuslaev commented 1 year ago

Number of waters per lipid is a typical "hydration" measure in MD simulations that is used routinely across different software. For example, CHARMM-GUI membrane builder uses this measure and it is reported in many simulation papers. I think, that it is better to stick to current definition.

comcon1 commented 1 year ago

OK. But then indeed, as Samuli mentioned, the definition of "fully hydrated" should be reconsidered. Because 25 water-per-lipid for hexaacyl LipidA and 25 for DMPE are numbers from different universes.

ohsOllila commented 1 year ago

Ok, I think that we can solve this when have the first experimental data where this problem occurs. Or do you already have it? This does not affect any additions of simulations.

comcon1 commented 8 months ago

Ok, I think that we can solve this when have the first experimental data where this problem occurs. Or do you already have it? This does not affect any additions of simulations.

Yes, as I see there are a lot of issues. So let's close this one until it comes up with a real problem. I am mining that kind of data (with very different lipids) right now. If I succeed with this, I will anyway reopen this question having actual data to refer.