ML EPS REG: Difference between revisions

From VASP Wiki
No edit summary
No edit summary
 
Line 19: Line 19:


All eigenvalues satisfying <math>\lambda_{i} / \lambda_{\mathrm{max}} </math> > {{TAG|ML_EPS_REG}} are included in the above equations, whereas smaller eigenvalues are disregarded (they are anyway potentially inaccurate because of loss of significance).
All eigenvalues satisfying <math>\lambda_{i} / \lambda_{\mathrm{max}} </math> > {{TAG|ML_EPS_REG}} are included in the above equations, whereas smaller eigenvalues are disregarded (they are anyway potentially inaccurate because of loss of significance).
The smaller the value of {{TAG|ML_EPS_REG}}, the smaller the error of the fit can potentially become. But at the same time, the danger of over fitting increases. We determined empirically that the default of {{TAG|ML_EPS_REG}}=1E-14 is fairly safe. If a user desires higher accuracy, the user can try to set {{TAG|ML_EPS_REG}}=1E-15 (lower values are not recommended), but the user should very carefully monitor the convergence of the calculations as described below.


If at any point during iterating the above equations, the quadratic norm of errors (eight column of <code>REGR/REGRF</code> in [[ML_LOGFILE]]) becomes too large (more than 1.2 times larger than in previous iterations), the code assumes that numerical issues
If at any point during iterating the above equations, the quadratic norm of errors (eight column of <code>REGR/REGRF</code> in [[ML_LOGFILE]]) becomes too large (more than 1.2 times larger than in previous iterations), the code assumes that numerical issues

Latest revision as of 08:48, 16 May 2022

ML_EPS_REG = [real]
Default: ML_EPS_REG = 1E-15 

Description: Initial value for the threshold of the eigenvalues of the covariance matrix in the evidence approximation.


This threshold is used to determine which eigenvalues of the covariance matrix are used in the optimization of the regularization parameters and determined by the following equations

.

All eigenvalues satisfying > ML_EPS_REG are included in the above equations, whereas smaller eigenvalues are disregarded (they are anyway potentially inaccurate because of loss of significance).

If at any point during iterating the above equations, the quadratic norm of errors (eight column of REGR/REGRF in ML_LOGFILE) becomes too large (more than 1.2 times larger than in previous iterations), the code assumes that numerical issues (loss of significance) have occurred, and then ML_EPS_REG is automatically doubled. Furthermore, if the regression does not converge within 10 steps, ML_EPS_REG is also increased by a factor of 4. The maximum allowed iteration depths is 50 (the iteration number is the second entry of REGR/REGRF in ML_LOGFILE). When 50 iterations are reached, no force field is created and there is most likely something seriously wrong in the calculation.

The seventh entry of REGR/REGRF in the ML_LOGFILE shows the ratio of the regularization () and the largest eigenvalue. Usually this number is a number with many varying digits. If this number becomes a "well rounded" number (e.g. 1.00000000E-14), this is an indication that the cap for the current ML_EPS_REG is reached. That means that regularization becomes crucial.

Related tags and articles

ML_LMLFF, ML_IALGO_LINREG, ML_IREG, ML_SIGV0, ML_SIGW0

Examples that use this tag