Open gelzinyte opened 4 years ago
On Mar 31, 2020, at 8:31 AM, gelzinyte notifications@github.com wrote:
For example, I was trying to fit 2-body distance descriptor to methane dataset:
gap_fit atoms_filename=ch4.xyz sparse_separate_file=F gap={distance_2b cutoff=3.0 covariance_type=ard_se delta=2 n_sparse=10 theta_uniform=0.8 sparse_method=uniform add_species} default_sigma={0.01 0.1 0.0 0.0} sparse_jitter=1e-10
.n_sparse=10 but there aren't 10 different environments for each species in the fitting database, so it's perhaps unsurprising that it fails in a weird way. Have you tried with a more realistic database, say a few tens of perturbed configurations (with correct energies/forces etc)?
Noam
|| |U.S. NAVAL| |RESEARCH| LABORATORY Noam Bernstein, Ph.D. Center for Materials Physics and Technology U.S. Naval Research Laboratory T +1 202 404 8628 F +1 202 404 7546 https://www.nrl.navy.mil https://www.nrl.navy.mil/
n_sparse automatically gets reduced the to maximum available number. the problem occurs when that is zero. it fails with a lapack error
-- Gábor
Gábor Csányi Professor of Molecular Modelling Engineering Laboratory, University of Cambridge Pembroke College Cambridge
Pembroke College supports CARA. A Lifeline to Academics at Risk. http://www.cara.ngo/
On 31 Mar 2020, at 13:56, bernstei notifications@github.com wrote:
On Mar 31, 2020, at 8:31 AM, gelzinyte notifications@github.com wrote:
For example, I was trying to fit 2-body distance descriptor to methane dataset:
gap_fit atoms_filename=ch4.xyz sparse_separate_file=F gap={distance_2b cutoff=3.0 covariance_type=ard_se delta=2 n_sparse=10 theta_uniform=0.8 sparse_method=uniform add_species} default_sigma={0.01 0.1 0.0 0.0} sparse_jitter=1e-10
.n_sparse=10 but there aren't 10 different environments for each species in the fitting database, so it's perhaps unsurprising that it fails in a weird way. Have you tried with a more realistic database, say a few tens of perturbed configurations (with correct energies/forces etc)?
Noam
|| |U.S. NAVAL| |RESEARCH| LABORATORY Noam Bernstein, Ph.D. Center for Materials Physics and Technology U.S. Naval Research Laboratory T +1 202 404 8628 F +1 202 404 7546 https://www.nrl.navy.mil https://www.nrl.navy.mil/ — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub, or unsubscribe.
On Mar 31, 2020, at 9:09 AM, gabor1 notifications@github.com wrote:
n_sparse automatically gets reduced the to maximum available number. the problem occurs when that is zero. it fails with a lapack error
So is it the lack of a C-C distance? Might be nice to test for that and either skip it or give a more meaningful error message.
Noam
|| |U.S. NAVAL| |RESEARCH| LABORATORY Noam Bernstein, Ph.D. Center for Materials Physics and Technology U.S. Naval Research Laboratory T +1 202 404 8628 F +1 202 404 7546 https://www.nrl.navy.mil https://www.nrl.navy.mil/
yes. if we manually only include C-H and H-H only, it runs fine. the silly thing is that it only fails on evaluation. the training code is quite happy to create an xml file with no sparse points and an empty sparseX file
-- Gábor
Gábor Csányi Professor of Molecular Modelling Engineering Laboratory, University of Cambridge Pembroke College Cambridge
Pembroke College supports CARA. A Lifeline to Academics at Risk. http://www.cara.ngo/
On 31 Mar 2020, at 14:11, bernstei notifications@github.com wrote:
On Mar 31, 2020, at 9:09 AM, gabor1 notifications@github.com wrote:
n_sparse automatically gets reduced the to maximum available number. the problem occurs when that is zero. it fails with a lapack error
So is it the lack of a C-C distance? Might be nice to test for that and either skip it or give a more meaningful error message.
Noam
|| |U.S. NAVAL| |RESEARCH| LABORATORY Noam Bernstein, Ph.D. Center for Materials Physics and Technology U.S. Naval Research Laboratory T +1 202 404 8628 F +1 202 404 7546 https://www.nrl.navy.mil https://www.nrl.navy.mil/ — You are receiving this because you commented. Reply to this email directly, view it on GitHub, or unsubscribe.
Is it possible to use distance_2b
with different parameters for different bonds, namely 3.0 AA cutoff for C-C and 2.0 AA for H-H? And how to manually only include few bond types in the system, such as only C-H and H-H in CH4, by using gap_fit
?
yes of course. just stop using "add_species=T", and give each pair descriptor explicitly. You can start by copying the descriptor strings that are reported in the output file when you did use the add_species=T command, and then you can modify them one by one, or omit the ones you don't need.
yes of course. to do this most easily, create a run with "add_species=T", which uses the same settings for all species. Then take a look at the output file of that training, which will list the descriptors created automatically as a result of the add_species, you will find all species pairs that are in your dataset, now separately specified. Copy those specifications into your gap string (instead of the pervious generic one), not how add_species is NOT part of these. Adjust the settings for each descriptor (one for each species pair) as desired.
-- Gábor
Gábor Csányi Professor of Molecular Modelling Engineering Laboratory, University of Cambridge Pembroke College Cambridge
Pembroke College supports CARA. A Lifeline to Academics at Risk. http://www.cara.ngo/
On 10 Jul 2020, at 23:32, Jiayan Xu notifications@github.com wrote:
Is it possible to use distance_2b with different parameters for different bonds, namely 3.0 AA cutoff for C-C and 2.0 AA for H-H? And how to manually only include few bond types in the system, such as only C-H and H-H in CH4, by using gap_fit?
— You are receiving this because you commented. Reply to this email directly, view it on GitHub, or unsubscribe.
For example, I was trying to fit 2-body distance descriptor to methane dataset:
and calling quip with
gave an error.
Explicitly calling C-H and H-H two-body distance descriptors only works, though, with command:
This
ch4.xyz
file only has only one CH4 configuration:-Elena