A hybrid CPU/GPU method for Hartree-Fock self-consistent-field calculation.
J Chem Phys
; 159(10)2023 Sep 14.
Article
em En
| MEDLINE
| ID: mdl-37681693
The calculation of two-electron repulsion integrals (ERIs) is a crucial aspect of Hartree-Fock calculations. In computing the ERIs of varying angular momentum, both the central processing unit (CPU) and the graphics processing unit (GPU) have their respective advantages. To accelerate the ERI evaluation and Fock matrix generation, a hybrid CPU/GPU method has been proposed to maximize the computational power of both CPU and GPU while overlapping the CPU and GPU computations. This method employs a task queue where each task corresponds to ERIs with the same angular momentum. The queue begins with ERIs of low angular momentum, which are computationally efficient on GPUs, and ends with ERIs of high angular momentum, which are better suited for CPU computation. CPUs and GPUs dynamically grab and complete tasks from the start and end of the queue using OpenMP dynamic scheduling until all tasks are finished. The hybrid CPU/GPU computation offers the advantage of enabling calculations with arbitrary angular momentum. Test calculations showed that the hybrid CPU/GPU algorithm is more efficient than "GPU-only" when using a single GPU. However, as more GPUs are involved, the advantage diminishes or disappears. The scaling exponents of the hybrid method were slightly higher than "GPU-only," but the pre-exponent factor was significantly lower, making the hybrid method more effective overall.
Texto completo:
1
Coleções:
01-internacional
Base de dados:
MEDLINE
Idioma:
En
Ano de publicação:
2023
Tipo de documento:
Article