K-point integration: Difference between revisions

From VASP Wiki
No edit summary
No edit summary
 
(29 intermediate revisions by 5 users not shown)
Line 11: Line 11:
<math>
<math>
  \sum_n \frac{1}{\Omega_{\mathrm{BZ}}} \int_{\Omega_{\mathrm{BZ}}}
  \sum_n \frac{1}{\Omega_{\mathrm{BZ}}} \int_{\Omega_{\mathrm{BZ}}}
   \epsilon_{n\bold{k}} \, \Theta(\epsilon_{n\bold{k}}-\mu) \, d \bold{k},
   \epsilon_{n\bold{k}} \, \Theta(\epsilon_{n\bold{k}}-\mu) \, d \bold{k},  
</math>
</math>


where <math>\Theta(x)</math>  is the Dirac step function. Due to our finite
where <math>\Theta(x)</math>  is the Dirac step function. Due to our finite
computer resources this integral has to be evaluated using
computer resources this integral has to be evaluated using
a discrete set of k-points\cite{bal73}:
a discrete set of k-points{{cite|baldereschi:prb:1973}}{{cite|chadi:prb:1973}}{{cite|monkhorst:prb:1976}}:


<math>
<math>
Line 34: Line 34:
accurately using a low number of k-points (this is the
accurately using a low number of k-points (this is the
case for semiconductors and insulators).
case for semiconductors and insulators).
For metals the trick is now to replace the step function <math>\Theta(\epsilon_{n\bold{k}}-\mu)</math>
by a (smooth) function <math>f(\{\epsilon_{n\bold{k}}\})</math>
resulting in a much faster convergence speed
without destroying the accuracy of the sum.
Several methods have been proposed to solve this dazzling problem.
== Linear tetrahedron methods ==
Within the linear tetrahedron method, the term <math>\epsilon_{n\bold{k}}</math> is interpolated
linearly between two k-points. Bloechl{{cite|bloechl:prb:1994}} has recently revised
the tetrahedron method to give effective weights <math>f(\{\epsilon_{n\bold{k}}\})</math>
for each band and k-point. In addition Bloechel was
able to derive a correction formula which removes the
quadratic error inherent in the linear tetrahedron method
(linear  tetrahedron method with Bloechel corrections).
The linear tetrahedron is more or less fool proof and requires
a minimal interference by the user.
The main drawback is that the Bloechels method is not variational with respect to
the partial occupancies if the correction terms are included,
therefore the calculated forces might be wrong
by a few percent. If accurate forces are required
we recommend a finite temperature method.
== Finite temperature approaches - smearing methods ==
In this case the step function is simply replaced by a smooth
function, for example the Fermi-Dirac function{{cite|mermin:pra:1965}}
<math>
f(\frac{\epsilon-\mu}{\sigma}) = \frac{1}{\exp(\frac{\epsilon-\mu}{\sigma})+1}.
</math>
or a Gauss like function{{cite|devita:phd:1992}}
<math>
f(\frac{\epsilon-\mu}{\sigma}) = \frac{1}{2} \left ( 1- {\mathrm{erf}}
      \left [ \frac{\epsilon - \mu}{\sigma} \right ] \right ).
</math>
is one used quite frequently in the context of solid state calculations.
Nevertheless, it turns out that the total energy is no longer
variational (or minimal) in this case. It is necessary to replace the
total energy by some generalized free energy
<math>
F =E - \sum_{n\bold{k}} w_{\bold{k}} \sigma S(f_{n\bold{k}}).
</math>
The calculated forces are now the derivatives of this free energy <math>F</math>
(see {{TAG|Forces}}).
In conjunction with  Fermi-Dirac statistics
the free energy might be interpreted as the free energy
of the electrons at some finite temperature  <math>\sigma=k_{\mathrm{B}} T</math>,
but the physical significance remains unclear in the case
of Gaussian smearing. Despite this problem, it  is
possible to obtain an accurate
extrapolation for <math>\sigma \to 0</math>
from results at finite <math>\sigma</math>  using the formula
<math>
E(\sigma \to 0) =E_0=  \frac{1}{2} ( F +E) .
</math>
In this way we get a "physical" quantity from a
finite temperature calculation, and the  Gaussian smearing method
serves as an mathematical tool to obtain faster convergence with respect
to the number of k-points.
For Al this method  converges even faster than the
linear  tetrahedron method with Bloechel corrections.
== Improved functional form by Methfessel and Paxton ==
The method described previously has two shortcomings:
*The forces calculated by VASP are a derivative of the free electronic energy F (see {{TAG|Forces}}). Therefore the forces can not be used to obtain the equilibrium groundstate, which corresponds to an energy-minimum of <math>E(\sigma \to 0)</math>. Nonetheless the error in the forces is generally small and acceptable.
*The parameter <math>\sigma</math> must be chosen with great care. If <math>\sigma</math> is too large the energy  <math>E(\sigma \to 0)</math> will converge  to the wrong value even for an infinite k-point mesh. If <math>\sigma</math>  is too small the convergence speed with the number of k-points will deteriorate. An optimal choice for <math>\sigma</math> for several cases is given in table 1. Aluminium possesses an extremely simple DOS, Lithium and Tellurium are also simple nearly free electron metals, therefore <math>\sigma</math> might be large. For Copper <math>\sigma</math> is restricted by the fact that the d-band lies approximately 0.5 eV beneath the Fermi-level. Rhodium and Vanadium posses a fairly complex structure in the DOS at the Fermi-level, <math>\sigma</math> must be small. The only way to get a good  math>\sigma</math> is by performing several calculations with different k-point meshes and different parameters for <math>\sigma</math>.
{| class="wikitable"
|+ Table 1: Typical convenient settings  for <math>\sigma</math> for different metals.
|-
! Sigma(eV)
|-
| Aluminium
| 1.0
|-
| Lithium
| 0.4
|-
| Tellurium
| 0.8
|-
| Copper, Palladium
| 0.4
|-
| Vanadium
| 0.2
|-
| Rhodium
| 0.2
|-
| Potassium
| 0.3
|}
These problems can be solved by adopting a slightly different
functional form for <math>f(\{\epsilon_{n\bold{k}}\})</math>. This is possible
by expanding the step function in a complete orthonormal set of functions
(method of Methfessel and Paxton {{cite|methfessel:prb:1989}}).
The Gaussian function is only the first approximation (<math>N</math>=0)
to the  step function, further successive approximations (<math>N</math>=1,2,...) are easily obtained.
In similarity to the Gaussian method, the energy  has to be replaced
by a generalized free energy functional
<math>
F =E - \sum_{n\bold{k}} w_{\bold{k}} \sigma S(f_{n\bold{k}}).
</math>
In contrast to the Gaussian method
the entropy term  <math>\sum_{n\bold{k}} w_{\bold{k}} \sigma S(f_{n\bold{k}})</math> will be very small
for reasonable values of <math>\sigma</math>
(for instance for the values given in table 1).
The  <math>\sum_{n\bold{k}} w_{\bold{k}} \sigma S(f_{n\bold{k}})</math>
is a simple error estimation for the difference between the
free energy <math>F</math> and the "physical" energy <math>E(\sigma \to 0)</math>.
<math>\sigma</math> can be increased till this error estimation gets too large.
== References ==
<references/>
----
[[Category:Electronic minimization]][[Category:Crystal momentum]][[Category:Theory]]

Latest revision as of 11:33, 21 October 2024

In this section we discuss partial occupancies. A must for all readers.

First there is the question why to use partial occupancies at all. The answer is: partial occupancies help to decrease the number of k-points necessary to calculate an accurate band-structure energy. This answer might be strange at first sight. What we want to calculate is, the integral over the filled parts of the bands

where is the Dirac step function. Due to our finite computer resources this integral has to be evaluated using a discrete set of k-points[1][2][3]:

Keeping the step function we get a sum

which converges exceedingly slow with the number of k-points included. This slow convergence speed arises only from the fact that the occupancies jump form 1 to 0 at the Fermi-level. If a band is completely filled the integral can be calculated accurately using a low number of k-points (this is the case for semiconductors and insulators).

For metals the trick is now to replace the step function by a (smooth) function resulting in a much faster convergence speed without destroying the accuracy of the sum. Several methods have been proposed to solve this dazzling problem.

Linear tetrahedron methods

Within the linear tetrahedron method, the term is interpolated linearly between two k-points. Bloechl[4] has recently revised the tetrahedron method to give effective weights for each band and k-point. In addition Bloechel was able to derive a correction formula which removes the quadratic error inherent in the linear tetrahedron method (linear tetrahedron method with Bloechel corrections). The linear tetrahedron is more or less fool proof and requires a minimal interference by the user.

The main drawback is that the Bloechels method is not variational with respect to the partial occupancies if the correction terms are included, therefore the calculated forces might be wrong by a few percent. If accurate forces are required we recommend a finite temperature method.

Finite temperature approaches - smearing methods

In this case the step function is simply replaced by a smooth function, for example the Fermi-Dirac function[5]

or a Gauss like function[6]

is one used quite frequently in the context of solid state calculations. Nevertheless, it turns out that the total energy is no longer variational (or minimal) in this case. It is necessary to replace the total energy by some generalized free energy

The calculated forces are now the derivatives of this free energy (see Forces). In conjunction with Fermi-Dirac statistics the free energy might be interpreted as the free energy of the electrons at some finite temperature , but the physical significance remains unclear in the case of Gaussian smearing. Despite this problem, it is possible to obtain an accurate extrapolation for from results at finite using the formula

In this way we get a "physical" quantity from a finite temperature calculation, and the Gaussian smearing method serves as an mathematical tool to obtain faster convergence with respect to the number of k-points. For Al this method converges even faster than the linear tetrahedron method with Bloechel corrections.

Improved functional form by Methfessel and Paxton

The method described previously has two shortcomings:

  • The forces calculated by VASP are a derivative of the free electronic energy F (see Forces). Therefore the forces can not be used to obtain the equilibrium groundstate, which corresponds to an energy-minimum of . Nonetheless the error in the forces is generally small and acceptable.
  • The parameter must be chosen with great care. If is too large the energy will converge to the wrong value even for an infinite k-point mesh. If is too small the convergence speed with the number of k-points will deteriorate. An optimal choice for for several cases is given in table 1. Aluminium possesses an extremely simple DOS, Lithium and Tellurium are also simple nearly free electron metals, therefore might be large. For Copper is restricted by the fact that the d-band lies approximately 0.5 eV beneath the Fermi-level. Rhodium and Vanadium posses a fairly complex structure in the DOS at the Fermi-level, must be small. The only way to get a good math>\sigma</math> is by performing several calculations with different k-point meshes and different parameters for .
Table 1: Typical convenient settings for for different metals.
Sigma(eV)
Aluminium 1.0
Lithium 0.4
Tellurium 0.8
Copper, Palladium 0.4
Vanadium 0.2
Rhodium 0.2
Potassium 0.3


These problems can be solved by adopting a slightly different functional form for . This is possible by expanding the step function in a complete orthonormal set of functions (method of Methfessel and Paxton [7]). The Gaussian function is only the first approximation (=0) to the step function, further successive approximations (=1,2,...) are easily obtained. In similarity to the Gaussian method, the energy has to be replaced by a generalized free energy functional

In contrast to the Gaussian method the entropy term will be very small for reasonable values of (for instance for the values given in table 1). The is a simple error estimation for the difference between the free energy and the "physical" energy . can be increased till this error estimation gets too large.

References