How to train small energy difference accurately in MLFF
Posted: Fri Jun 14, 2024 5:35 am
Dear VASP Developers,
I am currently working on training a machine learning force field (MLFF) for the charge density wave material, NbSe2, where the low-symmetry CDW structure is energetically more favorable than the high-symmetry pristine structure. [Please see the attached figure for reference]. However, I've encountered significant challenges during the training process.
Initially, I attempted on-the-fly training, but this approach resulted in poor predictions for the blue DFT energy landscape curve, producing a positive parabola instead of the expected negative one for the high-symmetry pristine structure. I then manually selected around 1000 structures near the potential well as training data and employed the select mode (ML_LMLFF=.TRUE., ML_MODE=select). Unfortunately, as illustrated by the red curve in the attachment, the depth of the potential well is still greatly underestimated.
Although I've consulted the best practices on your website wiki/index.php/Best_practices_for_machi ... rce_fields and even increased the radius cutoffs (ML_RCUT1 = 9, ML_RCUT2 = 7), there has been little improvement in the MLFF results.
Could you please advise on the most effective strategies to enhance my MLFF for NbSe2? Any guidance or suggestions would be immensely appreciated.
Thank you very much for your assistance!
Best regards,
Yubi
I am currently working on training a machine learning force field (MLFF) for the charge density wave material, NbSe2, where the low-symmetry CDW structure is energetically more favorable than the high-symmetry pristine structure. [Please see the attached figure for reference]. However, I've encountered significant challenges during the training process.
Initially, I attempted on-the-fly training, but this approach resulted in poor predictions for the blue DFT energy landscape curve, producing a positive parabola instead of the expected negative one for the high-symmetry pristine structure. I then manually selected around 1000 structures near the potential well as training data and employed the select mode (ML_LMLFF=.TRUE., ML_MODE=select). Unfortunately, as illustrated by the red curve in the attachment, the depth of the potential well is still greatly underestimated.
Although I've consulted the best practices on your website wiki/index.php/Best_practices_for_machi ... rce_fields and even increased the radius cutoffs (ML_RCUT1 = 9, ML_RCUT2 = 7), there has been little improvement in the MLFF results.
Could you please advise on the most effective strategies to enhance my MLFF for NbSe2? Any guidance or suggestions would be immensely appreciated.
Thank you very much for your assistance!
Best regards,
Yubi