Journal of Chemical Physics, Vol.121, No.14, 6608-6614, 2004
Linear scaling computation of the Fock matrix. VII. Parallel computation of the Coulomb matrix
We present parallelization of a quantum-chemical tree-code [J. Chem. Phys. 106, 5526 (1997)] for linear scaling computation of the Coulomb matrix. Equal time partition [J. Chem. Phys. 118, 9128 (2003)] is used to load balance computation of the Coulomb matrix. Equal time partition is a measurement based algorithm for domain decomposition that exploits small variation of the density between self-consistent-field cycles to achieve load balance. Efficiency of the equal time partition is illustrated by several tests involving both finite and periodic systems. It is found that equal time partition is able to deliver 91%-98% efficiency with 128 processors in the most time consuming part of the Coulomb matrix calculation. The current parallel quantum chemical tree code is able to deliver 63%-81% overall efficiency on 128 processors with fine grained parallelism (less than two heavy atoms per processor). (C) 2004 American Institute of Physics.