Next Article in Journal
Certain Matrix Riemann–Liouville Fractional Integrals Associated with Functions Involving Generalized Bessel Matrix Polynomials
Next Article in Special Issue
Many-Electron QED with Redefined Vacuum Approach
Previous Article in Journal
Properties of an Alternative Off-Shell Formulation of 4D Supergravity
Previous Article in Special Issue
Atomic Cascade Computations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Scalable Codes for Precision Calculations of Properties of Complex Atomic Systems

1
Department of Physics and Astronomy, University of Delaware, Newark, DE 19716, USA
2
Petersburg Nuclear Physics Institute of NRC “Kurchatov Institute”, Leningrad District, 188300 Gatchina, Russia
*
Author to whom correspondence should be addressed.
Symmetry 2021, 13(4), 621; https://doi.org/10.3390/sym13040621
Submission received: 7 March 2021 / Revised: 29 March 2021 / Accepted: 6 April 2021 / Published: 8 April 2021
(This article belongs to the Special Issue Development of New Methods in Atomic and Molecular Theory)

Abstract

:
High precision atomic data are indispensable for studies of fundamental symmetries, tests of fundamental physics postulates, developments of atomic clocks, ultracold atom experiments, astrophysics, plasma science, and many other fields of research. We have developed a new parallel atomic structure code package that enables computations that were not previously possible due to system complexity. This code package also allows much quicker computations to be run with higher accuracy for simple systems. We explored different methods of load-balancing matrix element calculations for many-electron systems, which are very difficult due to the intrinsic nature of the computational methods used to calculate them. Furthermore, dynamic memory allocation and MPI parallelization have been implemented to optimize and accelerate the computations. We have achieved near-perfect linear scalability and efficiency with the number of processors used for calculation, paving the way towards the future where most open-shell systems will finally be able to be treated with good accuracy. We present several examples illustrating new capabilities of the newly developed codes, specifically correlating up to all 60 electrons in the highly charged Ir 17 + ion and predicting certain properties of Fe 16 + .

1. Introduction

Studies of the fundamental symmetries with atoms and ions require knowledge of atomic properties as well as calculations to extract potential new physics from the experiments [1]. For example, the most accurate atomic calculation for a heavy atom was carried out to study Cs parity violation [2,3]. A precision (0.35% accuracy) experiment [4] had to be supplemented by theory computation of a comparable accuracy in order to extract the value of the weak charge and test the standard model of elementary particles. The theory accuracy is more than an order of magnitude worse in Yb, not sufficient for a similar analysis of the Yb experiments [5,6], motivating studies with multiple isotopes [6]. While Yb has two valence electrons, low-lying excitations from the closed shells cannot so far be accurately treated by any high-precision methods with significantly lower attainable precision. Theory accuracy is even worse for more compacted systems such as Dy [7] or Sm of interest to parity violation and other studies. Studies of CP-violation with atoms also require computations of the enhancement factors [8]. Studies of Lorentz symmetry violations need computations of the Lorentz invariance violating matrix elements [9]. Moreover, the development of experiments with new systems requires accurate knowledge of many atomic properties. For example, development of even more precise atomic clocks [10] that are used to test the Einstein equivalence principle, search for the variation of the fundamental constants and dark matter [1], is accelerated by strong theory-experimental collaborations.
In the present work, we improve on the capabilities of an ab initio atomic structure code package for calculating atomic properties of complex many-electron systems. The methods used here, including configuration interaction (CI) and the combination of CI with either many body perturbation theory (CI+MBPT) [11,12] or the linearized coupled cluster method (CI+all-order) [13], are very broadly applicable to any atom in the periodic table.
A common problem of the CI method is that the number of configurations for a many-electron atom grows exponentially on the number of correlated electrons, and for systems with more than 4–5 electrons, it is possible to include only a small fraction of them. A standard approach is to divide all electrons into core and valence electrons. In the framework of the CI method, only valence–valence interactions are treated explicitly, while core excitations are neglected. This essentially limits the accuracy of the method when it is applied to a heavy many-electron atom or ion. A way to circumvent this problem was suggested by Dzuba, Flambaum, and Kozlov in Ref. [11]. They used an idea that a CI method treats the interactions between valence electrons, while MBPT can be effectively used to account for core–core and core–valence correlations. Combining these two methods, it is possible to acquire benefits from both approaches and attain higher computational accuracy. A similar approach, combining CI and MBPT, was suggested by Savukov and Johnson [14]. Another program package based on the CI+MBPT method discussed in Ref. [11] was developed by Dzuba and co-authors (see Refs. [15,16] and the references therein). The version of the CI+MBPT code package applied by us was modified for public use and published in Computer Physics Communications in 2015 by Kozlov et al. [12]. Although not published yet, the inclusion of the all-order method which provides accurate solutions for a large number of properties of atoms and ions with up to 5–6 valence electrons has been completed and made fully compatible with this code package.
Numerous problems that were not tractable before have been solved with our newly developed parallel codes, including problems in astrophysics, metrology of highly charged ions, neutral atoms, and negative ions. Our focus will be on the computational developments that enable these new large-scale calculations. In particular, we implemented a message passing interface (MPI) to parallelize computationally expensive portions of the programs. This allows us to fully take advantage of modern high performance computing facilities, such as the University of Delaware high performance Caviness community cluster, where we developed and tested our programs. We use the new package of parallel programs to consider two cases of particular experimental significance: Ir 17 + , which was proposed for the development of novel atomic clocks, and Fe 16 + , which has lines essential for plasma diagnostics tools for astrophysics. We found great success in the parallelization efforts, as the new programs enabled precision calculations of these systems beyond what was desired.
The organization of the rest of this paper is as follows: in Section 2, a summary of atomic structure theory and the methods used in this work is presented. In Section 3, we describe main programs of the CI/CI+MBPT/CI+all-order code package and the computational developments done in this work. In Section 4, the application of the newly developed code package to highly charged Ir 17 + and Fe 16 + ions is discussed. Finally, we conclude and describe future prospects of this work in Section 5.

2. Theory and Methods

In this section, we describe the methods used in our programs and applications. For any many-electron system, we can divide all electrons into core and valence electrons. In this way, we can separate the electron–electron correlation problem into one describing the valence–valence correlations under the frozen-core approximation, and another describing the core–core and core–valence correlations. In the initial approximation, we start from the solution of the restricted Dirac–Hartree–Fock (HFD) equations in the central field approximation to construct one-electron orbitals for the core and valence electrons. Virtual orbitals can be constructed from B-splines or by other means to account for correlations. The valence–valence correlation problem is solved using the CI method, while core–core and core–valence correlations are included using either MBPT or the all-order method. In either case (CI+MBPT/CI+all-order), we form an effective Hamiltonian in the valence CI space, then diagonalize the effective Hamiltonian using the CI method to find energies and wave functions for the low-lying states.

2.1. The CI Method

The CI method is a standard ab initio method for calculating atomic properties of many-electron systems. In the valence space, the CI wave function is constructed as a linear combination of all distinct states of a specified angular momentum J and parity
ψ = i c i Φ i ,
where the set Φ i are Slater determinants generated by exciting electrons from some reference configurations to higher orbitals.
Varying the coefficients c i results in a generalized eigenvalue problem
j Φ i | H | Φ j c j = E c i ,
which can be written in matrix form and diagonalized to find the energies and wave functions of the low-lying states. The energy matrix of the CI method can be obtained as a projection of the exact Hamiltonian H onto the CI subspace H CI  [11]
H CI = E core + i > N core h i CI + j > i > N core V i j ,
where E core is the energy of the frozen core, N core is the number of core electrons, h i CI accounts for the kinetic energy of the valence electrons and their interaction with the central field, and  V i j accounts for the valence–valence correlations.
Having obtained from CI the many-electron states | J M and | J M with the total angular momenta J , J and their projections M , M , one can form density transition matrix in terms of the one-electron states | n l j m [12]
ρ ^ = ρ n l j m , n l j m | n l j m n l j m | ,
where
ρ n l j m , n l j m = J M | a n l j m a n l j m | J M .
Here, un-primed indices refer to the initial state and primed indices refer to the final state. The many-electron matrix element can then be written as
J M | T q L | J M = Tr ρ n l j m , n l j m n l j m | T q L | n l j m ,
where the trace sums over all quantum numbers ( n l j m ) and ( n l j m ) , and  T q L is the spherical component of the tensor operator of rank L. Using the Wigner–Eckart theorem, one can reduce the many-electron matrix element to
J T L J = Tr ρ n l j , n l j L n l j T L n l j ,
where  
ρ n l j , n l j L = ( 1 ) J M J L J M q M 1 m m ( 1 ) j m j L j m q m ρ n l j m , n l j m .
We have developed new parallel programs based on these methods: conf realizes the CI method, which forms the CI Hamiltonian and uses Davidson’s algorithm of diagonalization [17] to find low-lying energies and wave functions; dtm calculates reduced matrix elements (7) of one-electron operators by forming the reduced density transition matrices (8). We discuss the challenges of developing the parallel programs in Section 3.

2.2. Selecting Important Configurations with Valence Perturbation Theory (CI+PT)

As the number of configurations contributing to the CI wave function grows exponentially with the number of valence electrons, efficient selection of the most important configurations from a set of configurations becomes the main challenge of accurate computations. To significantly reduce the number of configurations, we further developed a method suggested in Ref. [18] to predict important configurations based on a set of configurations with known weights. This method can be used to optimize the CI space by identifying the most important configurations from a list of CI configurations using perturbation theory (PT) [18].
All second-order corrections are taken into account and added to the energy calculated from CI to obtain the total energy, E CI = E 0 + E 1 , while first-order corrections to the wave functions are stored for use in subsequent CI calculations. This process of using CI on a small subspace, calculating corrections via PT, and reordering the list of configurations in descending weights, can be repeated several times to form the most optimal CI subspace. Once the energy differences between subsequent CI calculations are relatively small, it can be assumed that convergence has been met.
We have developed a new parallel program conf_pt that realizes the CI+PT method. The parallel version enables computations of extremely large problems, with tests running up to 400 million determinants.

2.3. Including Core Correlations with Other Methods (CI+MBPT/CI+All-Order Method)

The CI+MBPT [11,13,19] and CI+all-order [13] methods include core–core and core–valence correlations using the combination of CI with second-order MBPT and the linearized coupled cluster method, respectively. The many-electron Schrödinger equation can be written as
H eff Ψ = E Ψ ,
where the effective Hamiltonian has the form
H eff = H CI + Σ .
Here, H CI is the CI Hamiltonian described by Equation (3), and  Σ = Σ 1 + Σ 2 is the core–valence correlation potential obtained from either MBPT or the all-order method, where Σ 1 and Σ 2 are the one- and two-electron parts of the core–valence correlation potential, respectively. Following the methods described in Section 2.1, Equation (9) is solved by diagonalizing the effective Hamiltonian using the CI method to obtain energies and wave functions of the low-lying states, then the wave functions can be used to calculate matrix elements of desired one-electron operators.

3. Computational Developments

A schematic of the main programs in the CI/CI+MBPT/CI+all-order code package is illustrated in Figure 1. First, the program hfd solves the restricted HFD equations to construct one-electron orbitals, and the program bass constructs the basis set required by all subsequent codes. If including core correlations, either the all-order block of codes (allcore-ci, valsd-ci, sdvw-ci and second-ci) or the second-order MBPT code (second-ci) are used to calculate the corrections to the Hamiltonian. The set of configurations are generated using the program add and then radial integrals are calculated with the program basc. The main program conf, which realizes the CI method as discussed in Section 2.1, is then ran to calculate wave functions and energies. The CI+PT code conf_pt can optionally be utilized here to optimize the set of configurations or improve the results of CI. Matrix elements of desired observables can then be calculated using the program dtm and polarizabilities using the program pol-ci.
Of the programs listed in the scheme in Figure 1, the CI program (conf) and the matrix elements program (dtm) have been parallelized with the message passing interface (MPI) [20,21]. The CI+PT program (conf_pt) is an auxiliary program that has also been parallelized with MPI. There are several key features of our recently developed parallel codes over previous serial (non-parallel) programs published in Ref. [12]. For all parallel programs, we focused on improving readability and usability, reducing the total memory footprint, implementing dynamic memory allocation where possible, and parallelizing the computationally expensive portions of code using MPI.
The newly developed parallel programs enable computations that were not possible before due to lack of memory or prohibitive computational times. Calculations for smaller and simpler systems can now be done with much more accuracy in a shorter amount of time, allowing a broader range of effects to be investigated in a short period of time. With the original serial version of the programs, calculations of properties of very small systems of 1–2 valence electrons could last up to a few days, while calculations for more complex systems with 3–6 valence electrons could last up to weeks. Some problems involving systems with more than six valence electrons were simply intractable with this code due to the large amount of memory required or extremely long computation times. The main objectives of the new developments were to optimize memory usage in the programs while speeding up computational times by implementing MPI.

3.1. Parallelization of Codes

Our main focus will be on the developments of the parallel CI, CI+PT, and matrix element programs. Each of these methods requires calculating matrix elements between a pair of determinants, which is very difficult to parallelize due to the nature of the problem. In the CI program, the CI Hamiltonian matrix, as well as the matrix of the operator J 2 , are constructed by calculating these matrix elements. Similarly, the CI and PT blocks of the Hamiltonian matrix in the CI+PT program, and the density or transition matrix in the matrix element program, are constructed in the same manner. Construction of each of these matrices is essentially a different variant of the same problem.
We will describe only the parallel implementation to the CI program, but the methods discussed here are applicable in general to the other parallel programs. The main difficulty here is the intrinsic unpredictability of the computational method, i.e., one cannot guess how many non-zero matrix elements there are without explicitly comparing all pairs of determinants. When forming the matrices and calculating the matrix elements, one has to compare electron occupancies between determinants for each configuration. The Slater–Condon rules [22,23] describe the number of operations that are done for each matrix element, which depends on the number of differences between the pair of determinants forming the matrix element.
The construction of the matrix follows a two-loop structure: an outer loop iterates over the total number of determinants and an inner loop iterates over the total number of configurations. Determinants belonging to each configuration are paired and the value of the matrix element is calculated if there are less than 3 differences between the pair of determinants. We separate the computation into two stages: (i) a comparison stage to find pairs of determinants with less than three differences between them and (ii) a calculation stage to calculate the value of the matrix element. After all determinants have been compared, one final loop is done through the total number of non-zero matrix elements to calculate their values. With this implementation, we have achieved near-perfect linear scalability and efficiency in our tests with up to 500 computing cores. It is also possible, and much simpler, to distribute the workload by dividing the outer loop by the number of cores. However, this results in very uneven load-balancing which is exacerbated for larger computations. This is not the case when dealing with smaller matrices such as in the case of the density and transition matrices.
In the serial implementation, non-zero matrix elements were written to disk due to lack of memory of the computers used when the codes were first developed. However, with modern computers, we are able to store the non-zero matrix elements in memory, which also has the advantage of reducing the computation time due to the removal of file input and output (I/O) to disk. The serial program wrote each matrix element to disk with 24 bytes: eight bytes for the counter, four bytes for each index, and eight bytes for the value of the non-zero matrix element. The parallel program removes the redundant eight bytes for the id, reducing the memory requirement of storing the Hamiltonian matrix by 33%.
Since the matrix elements of the CI Hamiltonian are distributed across cores, the Davidson procedure had to be modified to take advantage of the parallelization. The Davidson procedure diagonalizes the CI Hamiltonian matrix to find energies and wave functions of the low-lying states. In the present CI code, only the calculation of matrix-vector products has been parallelized due to it being the only computationally expensive portion of the algorithm. Since each core holds a portion of the Hamiltonian matrix in memory, the Davidson algorithm had to be modified so each core would only be responsible for computing products of their stored matrix elements with the wave function.

3.2. Speedup of Parallel Programs

The performance of the parallel conf program was tested using the Ir 17 + ion. We describe Ir 17 + as the ion with 30 valence electrons in the open 4 f , 4 d , and  4 p shells with an [ 8 s p d f g ] basis set. The basis set is designated by the highest principal quantum number for each partial wave included. For example, the [ 8 s p d f g ] basis set includes 1 8 s , 2 8 p , 3 8 d , 4 8 f , and  5 8 g orbitals. These test calculations included 24,895 relativistic even-parity configurations and 17,431,323 determinants. As reference, the total computation time of the original serial calculation was just short of two weeks.
In Table 1, we compare runtimes of the formation of the CI Hamiltonian matrix (FormH) and the Davidson iterative procedure (Dvdsn), as well as the total computational time, for increasing number of computing cores. Comparing a large number of cores to the base N = 50 case, we see that there is a near-perfect linear scalability up to 500 cores for the FormH subroutine, but very minimal performance gain for Dvdsn from 200 to 500 cores. In Figure 2, we plot the speedup of the main subroutines as a function of the total number of cores, with respect to the ideal speedup of 100% efficiency.
The implementation of the Davidson procedure in the parallel CI program does not scale well since only the calculation of matrix–vector products was parallelized, leaving a large majority of the algorithm serial. Since the Davidson procedure typically does not run as long as the formation of the Hamiltonian, the performance of the Davidson procedure was deemed sufficient for our problems, leaving remaining optimizations to a future project. Despite the inefficiency of Dvdsn, we can see its relatively small impact on the total speedup of the program in Figure 2.
The efficiency of the total speedup attained with the parallel CI program is about 70–80% with the number of cores. The speedup achieved with the parallel CI+PT program and the parallel matrix element program is very similar, with the parallel CI+PT program averaging around 75–85% efficiency and the parallel matrix element program averaging around 80–90% efficiency with the number of cores. These parallel programs have been routinely used in our group to calculate various atomic properties of many-electron systems.

3.3. Selection of Important Configurations

In addition to the parallel codes, we have developed algorithms and auxiliary codes for significantly reducing the number of configurations by efficiently identifying dominant configurations from a list of configurations. It is necessary to be able to select the most important configurations for the valence CI space, especially when the dimensionality of the CI problem becomes intractably large.
The first algorithm involves using the CI+PT method to predict important configurations based on a set of already selected configurations with known weights, as discussed in Section 2.2. Initially, a configuration list with no weights is constructed by allowing excitations from a few basic configurations. A small subspace is chosen from the top of the list to generate initial weights for each of the chosen configurations in the list using CI, then weights of all other configurations in the list are calculated using PT. We then reorder the initial configuration list by descending weights, then repeat this process until convergence is met, i.e., the CI space has been saturated as it has taken into account the most important configurations. The convergence is checked by comparing energies computed from consecutive CI and PT procedures.
The second algorithm explores the importance of the configurations based on the correlations between electrons that are present in the configurations. We find that contributions of electrons in orbitals of the same partial wave are very regular. For example, if our calculations of Fe 16 + show that the contribution of the 1 s 2 2 s 2 p 5 6 s 2 configuration is negligible, then all similar configurations where the two last electrons have higher principal quantum number, e.g., 1 s 2 2 s 2 p 5 6 s 7 s , 1 s 2 2 s 2 p 5 6 s 8 s , 1 s 2 2 s 2 p 5 7 s 2 , can be omitted. However, if the 1 s 2 2 s 2 p 6 6 f configuration has a large weight, then other configurations with 7 f and 8 f electrons should be included.

4. Applications

4.1. Optical Clocks Based on Highly Charged Ions

Highly charged ions (HCI) such as Ir 17 + are of particular interest for the development of novel atomic clocks due to its very high sensitivity to the variation of the fine structure constant α and related dark matter searches [24,25,26,27]. There are many advantages to creating optical clocks with HCI, such as enhanced sensitivity to the variation of α , heavily suppressed systematic effects, and estimated potential clock uncertainty beyond the 10 18 limit [28,29,30]. Recent developments in quantum logic techniques for HCI spectroscopy have made rapid progress in the development of HCI possible [31].
In the case of Ir 17 + , theoretical calculations are particularly difficult due to atomic configurations with holes in the 4 f shell leading to a very large uncertainty in prior theoretical results, with over 50% for the predicted clock transition energy [32,33]. The clock transitions were expected to be observed in recent experiments done in the last six years. However, no clock transitions or any E 1 transitions have been found, even though the predicted transition rates were well within experimental capabilities. Our new results explain the lack of observations of the E 1 transitions and provide a pathway towards detection of clock transitions based on highly charged ions. Here, we will only summarize the results of our application of the new parallel code to Ir 17 + . A detailed discussion of the computation can be found in Ref. [34].
We find that the best initial approximation is achieved by solving restricted DHF equations with partially filled shells, namely [ 1 s 2 4 d 10 ] 4 f 13 5 s . The hybrid CI+MBPT/CI+all-order approaches described in Section 2.3 can not be used with such an initial approximation. Therefore, we treat the inner shells with the CI method, leading to an exponential increase in the number of configurations. We found that, while the weights of most configurations are small, the number of important configurations are still very large.
We start with the most straightforward CI computation that includes single and double excitations from the 4 f and 5 s valence shells, similar to Ref [26]. The excitations are allowed to each of the basis set orbitals up to [ 7 s p d f g ] . Next, we allow excitations of any of the 24 electrons from the 4 d 10 4 f 13 5 s shells to the same basis set orbitals. We find drastic changes in the energies of the E 1 transitions when excitations are allowed from the 4 d shell. Due to such large contributions, we continued to include more and more electrons of the inner shells into the CI valence space, until all 60 electrons have been included. Both single and double excitations were allowed from the 4 f , 4 d , 4 p , 4 s and 3 d shells, while only single excitations are included for all other shells. We tested that the double excitation contribution is small for these inner shells, allowing us to omit them at the present level of accuracy. The results, obtained with different numbers of shells included in the CI valence space, are given in Table 2. Contributions from increased size of the basis set, triple excitations, full Breit, and QED, were found to be small at the present level of accuracy. We found that by far the largest effect comes from the inclusion of the inner electron shells into the CI. Three basis sets of increasing sizes [ 7 s p d f g ] , [ 8 s p d f g ] , and [ 10 s p d f g ] were used to test basis set convergence.
While allowing excitations from the 4 d shell drastically reduced the energies of the E 1 transitions, allowing excitations from the 4 p shell drastically reduced several E 1 matrix elements and rates to well below the detector ability. We have identified several other transitions at different wavelengths for the future E 1 transition searches. As soon as any E 1 transition is detected, we will be able to obtain a better prediction for the proposed clock transition wavelength.
The largest Ir 17 + run with the latest version of the parallel CI program included 96,622 configurations and 58,224,918 determinants, while the largest number of determinants included in the old serial CI program was about 5,000,000. The largest run calculated and stored over 100 billion Hamiltonian matrix elements, with the whole run requiring a total of 2880 GB of memory, which was the maximum amount available to us at the time. Distributed across 80 cores, this run took two days and 18 h to complete. A larger number of cores were not used due to the large amount of memory required to store the basis set, the Hamiltonian matrix, and subsequent arrays for the Davidson procedure.

4.2. Calculation of the 3C/3D Line Intensity Ratio in Fe XVII

Some of the brightest lines of the spectra of many hot astrophysical objects arise from Fe 16 + around 15 Å, namely the resonance line 3 C ( [ ( 2 p 5 ) 1 / 2 3 d 3 / 2 ] J = 1 [ 2 p 6 ] J = 0 ) and the intercombination line 3 D ( [ ( 2 p 5 ) 3 / 2 3 d 5 / 2 ] J = 1 [ 2 p 6 ] J = 0 ) . They are crucial for plasma diagnostics of electron temperatures, elemental abundances, ionization conditions, velocity turbulences, and opacities. For the last four decades, there has been a persistent discrepancy between the observed intensity ratios and advanced plasma models, diminishing the utility of high-resolution X-ray observations. A recent experiment measured the most accurate 3 C / 3 D oscillator strength to date, in an attempt to explain this puzzle [35].
We carried out a precision calculation with our newly developed MPI version of the CI program, allowing us to drastically increase the number of included configurations to over 230,000. This calculation correlated all 10 electrons, included full Breit and QED corrections, and predicted the transition rates with 1–2% accuracy. Our calculations ruled out incomplete inclusion of the electronic correlations in theoretical calculations as the potential explanation of the puzzle. A detailed study of the latest experiment and theoretical work done can be found in Ref. [35].
We started with all possible single and double excitations from the 2 s 2 2 p 6 , 2 s 2 2 p 5 3 p even and 2 s 2 2 p 5 3 s , 2 s 2 2 p 5 3 d , 2 s 2 p 6 3 p , 2 s 2 2 p 5 4 d , 2 s 2 2 p 5 5 d odd configurations, correlating eight electrons with a short [ 5 s p d f 6 g ] basis set. Separate calculations were done to establish the effects of triple and quadruple excitations, as well as full correlation with the 1 s 2 shell. We found negligible corrections to both energies and matrix elements as illustrated by Table 3. The differences between our theoretical energies and experimental values were found to be less than those of the NIST database by 3000 cm 1 . The energies from the revised analysis of the Fe 16 + spectra given in Table 3 are estimated to be accurate to about 90 cm 1 . The last line of Table 3 shows the difference of the 3 C and 3 D energies in eV, with the final value 13.44(5) eV.

4.3. Other Applications

The methods and codes developed in this work have also been utilized in various other calculations involving HCI, neutral atoms and negative ions. Recent works include exploring the development of optical clocks using highly charged Cf 15 + and Cf 17 + ions [37], predicting the lowest-lying odd-parity energy levels in neutral actinium [38], predicting quasibound states of negatively charged La [39], and calculating E 2 transition properties of Bi [40].

5. Conclusions and Further Developments

We have developed a new parallel atomic structure code package that has opened a lot of new possibilities for high-precision calculations of atomic properties of various complex many-electron systems. The efficiency and accuracy of our programs have been validated by solving many problems, involving astrophysics, in metrology of highly charged ions, and negative ions.
Although our parallel programs have displayed great success in efficiency and accuracy, there is still plenty of developmental work to be done. This includes optimization of current parallel algorithms, parallelization of remaining problematic serial codes (e.g., the Davidson procedure in the CI program), optimization of memory usage, completion of documentation, including check-pointing, making the codes user-friendly, and more.
The programs developed here are part of a larger project developing an online portal for high-precision atomic data and computation. The portal will feature a database of all high-precision data calculated in the last couple of decades, as well as wave functions of a large number of atoms and ions stored on a back-end server. Users will be able to request transition properties and polarizabilities for systems not in the database without having to download any codes or learn any inputs. Requested data will be generated automatically through code executions on a back-end server and then updated on the database. All codes will also be released to the public, optimized and user-friendly.

Author Contributions

Conceptualization, M.S. and C.C.; methodology, M.S., S.P. and C.C.; software, C.C.; validation, C.C.; formal analysis, C.C.; investigation, C.C.; resources, M.S.; data curation, C.C.; writing—original draft preparation, C.C.; writing—review and editing, S.P. and M.S.; visualization, C.C.; supervision, M.S.; project administration, M.S.; funding acquisition, M.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by U.S. NSF Grant No. PHY-1620687 and Office of Naval Research Grant No. N00014-17-1-2252. S.G.P. acknowledges support by the Russian Science Foundation under Grant No. 19-12-00157.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to the unavailability of the parallel programs at the present time.

Acknowledgments

This research was supported in part through the use of the Caviness and DARWIN HPC systems at the University of Delaware. The authors wish to thank Jeffrey Frey for helpful discussions and contributions on optimizing parts of the CI program.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Safronova, M.S.; Budker, D.; DeMille, D.; Kimball, D.F.J.; Derevianko, A.; Clark, C.W. Search for new physics with atoms and molecules. Rev. Mod. Phys. 2018, 90, 025008. [Google Scholar] [CrossRef] [Green Version]
  2. Porsev, S.G.; Beloy, K.; Derevianko, A. Precision determination of electroweak coupling from atomic parity violation and implications for particle physics. Phys. Rev. Lett. 2009, 102, 181601. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Porsev, S.G.; Beloy, K.; Derevianko, A. Precision determination of weak charge of 133Cs from atomic parity violation. Phys. Rev. D 2010, 82, 036008. [Google Scholar] [CrossRef] [Green Version]
  4. Wood, C.S.; Bennett, S.C.; Cho, D.; Masterson, B.P.; Roberts, J.L.; Tanner, C.E.; Wieman, C.E. Measurement of Parity Nonconservation and an Anapole Moment in Cesium. Science 1997, 275, 1759. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Tsigutkin, K.; Dounas-Frazer, D.; Family, A.; Stalnaker, J.E.; Yashchuk, V.V.; Budker, D. Observation of a Large Atomic Parity Violation Effect in Ytterbium. Phys. Rev. Lett. 2009, 103, 071601. [Google Scholar] [CrossRef]
  6. Antypas, D.; Fabricant, A.; Stalnaker, J.E.; Tsigutkin, K.; Flambaum, V.V.; Budker, D. Isotopic variation of parity violation in atomic ytterbium. Nat. Phys. 2018, 15, 120–123. [Google Scholar] [CrossRef] [Green Version]
  7. Leefer, N.; Bougas, L.; Antypas, D.; Budker, D. Towards a new measurement of parity violation in dysprosium. arXiv 2014, arXiv:1412.1245. [Google Scholar]
  8. Porsev, S.G.; Safronova, M.S.; Kozlov, M.G. Electric Dipole Moment Enhancement Factor of Thallium. Phys. Rev. Lett. 2012, 108, 173001. [Google Scholar] [CrossRef] [Green Version]
  9. Sanner, C.; Huntemann, N.; Lange, R.; Tamm, C.; Peik, E.; Safronova, M.S.; Porsev, S.G. Optical clock comparison for Lorentz symmetry testing. Nature 2019, 567, 204–208. [Google Scholar] [CrossRef] [Green Version]
  10. Ludlow, A.D.; Boyd, M.M.; Ye, J.; Peik, E.; Schmidt, P.O. Optical atomic clocks. Rev. Mod. Phys. 2015, 87, 637–701. [Google Scholar] [CrossRef]
  11. Dzuba, V.A.; Flambaum, V.V.; Kozlov, M.G. Combination of the many-body perturbation theory with the configuration-interaction method. Phys. Rev. A 1996, 54, 3948–3959. [Google Scholar] [CrossRef] [Green Version]
  12. Kozlov, M.G.; Porsev, S.G.; Safronova, M.S.; Tupitsyn, I.I. CI-MBPT: A package of programs for relativistic atomic calculations based on a method combining configuration interaction and many-body perturbation theory. Comput. Phys. Commun. 2015, 195, 199–213. [Google Scholar] [CrossRef] [Green Version]
  13. Safronova, M.S.; Kozlov, M.G.; Johnson, W.R.; Jiang, D. Development of a configuration-interaction plus all-order method for atomic calculations. Phys. Rev. A 2009, 80, 012516. [Google Scholar] [CrossRef] [Green Version]
  14. Savukov, I.M.; Johnson, W.R. Combined CI+MBPT calculations of energy levels and transition amplitudes in Be, Mg, Ca, and Sr. Phys. Rev. A 2002, 65, 042503. [Google Scholar] [CrossRef] [Green Version]
  15. Dzuba, V.A. VN-M approximation for atomic calculations. Phys. Rev. A 2005, 71, 032512. [Google Scholar] [CrossRef] [Green Version]
  16. Dzuba, V.A.; Flambaum, V.V. Core-valence correlations for atoms with open shells. Phys. Rev. A 2007, 75, 052504. [Google Scholar] [CrossRef] [Green Version]
  17. Davidson, E.R. The iterative calculation of a few of the lowest eigenvalues and corresponding eigenvectors of large real-symmetric matrices. J. Comput. Phys. 1975, 17, 87–94. [Google Scholar] [CrossRef]
  18. Rakhlina, Y.G.; Kozlov, M.G.; Porsev, S.G. The energy of electron affinity to a zirconium atom. Opt. Spectrosc. 2001, 90, 817–821. [Google Scholar] [CrossRef]
  19. Dzuba, V.A.; Kozlov, M.G.; Porsev, S.G.; Flambaum, V.V. Using effective operators in calculating the hyperfine structure of atoms. Zh. Eksp. Teor. Fiz. 1998, 114, 1636–1645, [Sov. Phys. JETP 1998, 87, 885]. [Google Scholar] [CrossRef]
  20. Gropp, W.; Lusk, E.; Skjellum, A. Using MPI; MIT Press Ltd.: Cambridge, MA, USA, 2014. [Google Scholar]
  21. Gropp, W.; Hoefler, T.; Thakur, R.; Lusk, E. Using Advanced MPI: Modern Features of the Message-Passing-Interface; The MIT Press: Cambridge, MA, USA, 2015. [Google Scholar]
  22. Slater, J.C. The Theory of Complex Spectra. Phys. Rev. 1929, 34, 1293–1322. [Google Scholar] [CrossRef]
  23. Condon, E.U. The Theory of Complex Spectra. Phys. Rev. 1930, 36, 1121–1133. [Google Scholar] [CrossRef]
  24. López-Urrutia, J.C. The visible spectrum of highly charged ions: A window to fundamental physics. Can. J. Phys. 2008, 86, 111–123. [Google Scholar] [CrossRef]
  25. Berengut, J.C.; Dzuba, V.A.; Flambaum, V.V. Enhanced laboratory sensitivity to variation of the fine-structure constant using highly-charged ions. Phys. Rev. Lett. 2010, 105, 120801. [Google Scholar] [CrossRef]
  26. Berengut, J.C.; Dzuba, V.A.; Flambaum, V.V.; Ong, A. Electron-Hole Transitions in Multiply Charged Ions for Precision Laser Spectroscopy and Searching for Variations in α. Phys. Rev. Lett. 2011, 106, 210802. [Google Scholar] [CrossRef] [Green Version]
  27. Kozlov, M.G.; Safronova, M.S.; Crespo López-Urrutia, J.R.; Schmidt, P.O. Highly charged ions: Optical clocks and applications in fundamental physics. Rev. Mod. Phys. 2018, 90, 045005. [Google Scholar] [CrossRef] [Green Version]
  28. Berengut, J.C.; Dzuba, V.A.; Flambaum, V.V.; Ong, A. Optical transitions in highly charged californium ions with high sensitivity to variation of the fine-structure constant. Phys. Rev. Lett. 2012, 109, 070802. [Google Scholar] [CrossRef] [PubMed]
  29. Derevianko, A.; Dzuba, V.A.; Flambaum, V.V. Highly Charged Ions as a Basis of Optical Atomic Clockwork of Exceptional Accuracy. Phys. Rev. Lett. 2012, 109, 180801. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  30. Dzuba, V.A.; Derevianko, A.; Flambaum, V.V. High-precision atomic clocks with highly charged ions: Nuclear-spin-zero f12-shell ions. Phys. Rev. A 2012, 86, 054501, Erratum in 2013, 87, 029906. [Google Scholar] [CrossRef] [Green Version]
  31. Schmoger, L.; Versolato, O.O.; Schwarz, M.; Kohnen, M.; Windberger, A.; Piest, B.; Feuchtenbeiner, S.; Pedregosa-Gutierrez, J.; Leopold, T.; Micke, P.; et al. Coulomb crystallization of highly charged ions. Science 2015, 347, 1233–1236. [Google Scholar] [CrossRef] [PubMed]
  32. Windberger, A.; Crespo López-Urrutia, J.R.; Bekker, H.; Oreshkina, N.S.; Berengut, J.C.; Bock, V.; Borschevsky, A.; Dzuba, V.A.; Eliav, E.; Harman, Z.; et al. Identification of the Predicted 5s-4f Level Crossing Optical Lines with Applications to Metrology and Searches for the Variation of Fundamental Constants. Phys. Rev. Lett. 2015, 114, 150801. [Google Scholar] [CrossRef]
  33. Safronova, U.I.; Flambaum, V.V.; Safronova, M.S. Transitions between the 4f-core-excited states in Ir16+, Ir17+, and Ir18+ ions for clock applications. Phys. Rev. A 2015, 92, 022501. [Google Scholar] [CrossRef] [Green Version]
  34. Cheung, C.; Safronova, M.; Porsev, S.; Kozlov, M.; Tupitsyn, I.; Bondarev, A. Accurate Prediction of Clock Transitions in a Highly Charged Ion with Complex Electronic Structure. Phys. Rev. Lett. 2020, 124, 163001. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Kühn, S.; Shah, C.; López-Urrutia, J.R.C.; Fujii, K.; Steinbrügge, R.; Stierhof, J.; Togawa, M.; Harman, Z.; Oreshkina, N.S.; Cheung, C.; et al. High Resolution Photoexcitation Measurements Exacerbate the Long-Standing Fe XVII Oscillator Strength Problem. Phys. Rev. Lett. 2020, 124, 225001. [Google Scholar] [CrossRef] [PubMed]
  36. Kramida, A.; (National Institute of Standards and Technology, Gaithersburg, MD 20899, USA). Preliminary Critical Analysis of Fe XVII Spectral Data. Private communication, 2019. [Google Scholar]
  37. Porsev, S.G.; Safronova, U.I.; Safronova, M.S.; Schmidt, P.O.; Bondarev, A.I.; Kozlov, M.G.; Tupitsyn, I.I.; Cheung, C. Optical clocks based on the Cf15+ and Cf17+ ions. Phys. Rev. A 2020, 102, 012802. [Google Scholar] [CrossRef]
  38. Zhang, K.; Studer, D.; Weber, F.; Gadelshin, V.M.; Kneip, N.; Raeder, S.; Budker, D.; Wendt, K.; Kieck, T.; Porsev, S.G.; et al. Detection of the Lowest-Lying Odd-Parity Atomic Levels in Actinium. Phys. Rev. Lett. 2020, 125, 073001. [Google Scholar] [CrossRef]
  39. Safronova, M.S.; Cheung, C.; Kozlov, M.G.; Spielman, S.E.; Gibson, N.D.; Walter, C.W. Predicting quasibound states of negative ions: La- as a test case. Phys. Rev. A 2021, 103, 022819. [Google Scholar] [CrossRef]
  40. Walter, C.; Spielman, S.; Ponce, R.; Gibson, N.; Yukich, J.; Cheung, C.; Safronova, M. Observation of an Electric Quadrupole Transition in a Negative Ion: Experiment and Theory. Phys. Rev. Lett. 2021, 126, 083001. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The scheme of the CI/CI+MBPT/CI+all-order code package.
Figure 1. The scheme of the CI/CI+MBPT/CI+all-order code package.
Symmetry 13 00621 g001
Figure 2. Speedup of the main subroutines in the parallel CI program.
Figure 2. Speedup of the main subroutines in the parallel CI program.
Symmetry 13 00621 g002
Table 1. The runtime and speedup of the subroutines FormH and Dvdsn of the parallel CI program (conf) are presented for increasing number of cores N, relative to 50 cores (n = 50). The runtimes presented are given in seconds (s). These large test runs were done with Ir 17 + with 24,895 relativistic configurations and 17.4 × 10 6 determinants. Note that the total times include all serial subroutines outside of FormH and Dvdsn.
Table 1. The runtime and speedup of the subroutines FormH and Dvdsn of the parallel CI program (conf) are presented for increasing number of cores N, relative to 50 cores (n = 50). The runtimes presented are given in seconds (s). These large test runs were done with Ir 17 + with 24,895 relativistic configurations and 17.4 × 10 6 determinants. Note that the total times include all serial subroutines outside of FormH and Dvdsn.
Runtime (s)Speedup (from N = 50)
NFormHDvdsnTotalFormHDvdsnTotal
5022,571134324,218111
10012,843151414,5931.80.91.7
200580095769273.91.43.5
300381067846105.92.05.3
400291359636467.82.36.6
500229253529589.92.58.2
Table 2. Energies of Ir 17 + (in cm 1 ) obtained using CI with increasing number of open shells. Only configurations obtained by exciting 4 f and 5 s electrons are included in the “ 5 s 4 f only” column. Contributions from exciting electrons from the 4 d shell are given in the column labeled “ 4 d contr.”, and contributions of all other shells are given separately in the next columns. The results with all 60 electrons correlated by the CI are listed in the column “All shells open”. Sum of all other corrections is given in the column labeled “Other”—see Ref. [34] for detailed explanation.
Table 2. Energies of Ir 17 + (in cm 1 ) obtained using CI with increasing number of open shells. Only configurations obtained by exciting 4 f and 5 s electrons are included in the “ 5 s 4 f only” column. Contributions from exciting electrons from the 4 d shell are given in the column labeled “ 4 d contr.”, and contributions of all other shells are given separately in the next columns. The results with all 60 electrons correlated by the CI are listed in the column “All shells open”. Sum of all other corrections is given in the column labeled “Other”—see Ref. [34] for detailed explanation.
Configuration 5 s 4 f Only 4 d 4 p 4 s 3 d 1 s 2 s 3 s 3 p 2 p All ShellsOtherFinal
Contr.Contr.Contr.Contr.Contr.Contr.Contr.Open
4 f 13 5 s 3 F 4 o 00000000000
3 F 3 o 47143115148−3204781−44777
3 F 2 o 25,170−75141375−225−425,220−3425,186
4 f 14 1 S 0 90735797−931−19941097−240−54912,757−37512,382
4 f 12 5 s 2 3 H 6 36,362−85494601848−40318329414430,339−5630,283
3 F 4 46,303−8680−51858−41018425114439,645−8139,564
3 H 5 59,883−86384541858−32618332414353,882−8453,798
3 F 2 68,786−8751−1881690−3842531916461,662−23361,429
Table 3. Contributions to the energies (in cm 1 ) of Fe 16 + calculated with increased basis set sizes and number of configurations. Contributions from triple excitations, excitations from the 1 s 2 shell, and QED contributions are given in their respective columns. The last line gives the 3 C 3 D energy difference in eV.
Table 3. Contributions to the energies (in cm 1 ) of Fe 16 + calculated with increased basis set sizes and number of configurations. Contributions from triple excitations, excitations from the 1 s 2 shell, and QED contributions are given in their respective columns. The last line gives the 3 C 3 D energy difference in eV.
Configuration [ 5 spdf 6 g ] Triples 1 s 2 + [ 12 spdfg ] + [ 17 dfg ] QEDFinalDiff. [36]
2 p 6 1 S 0 0000000
2 p 5 3 p 3 S 1 6,087,18562543876772676,092,1590.02%
2 p 5 3 p 3 D 2 6,116,210−21242886701436,119,8420.03%
2 p 5 3 p 3 D 3 6,129,041−23253015711946,132,8640.03%
2 p 5 3 p 1 P 1 6,138,383−11412825704826,142,0250.03%
2 p 5 3 s 25,842,248−1010834087357875,847,2760.03%
2 p 5 3 s 15,857,770−107033037087845,862,6260.03%
2 p 5 3 s 15,953,697−1074336471710425,958,8830.03%
2 p 5 3 d 3 P 1 6,466,575−11162384665876,469,7170.03%
2 p 5 3 d 3 P 2 6,481,385−13162250658866,484,3830.03%
2 p 5 3 d 3 F 4 6,482,549−12271745622976,485,0280.03%
2 p 5 3 d 3 F 3 6,488,573−14261740607846,491,0160.03%
2 p 5 3 d 1 D 2 6,502,481−17211696627886,504,8950.03%
2 p 5 3 d 3 D 3 6,511,163−18181762604876,513,6170.02%
2 p 5 3 d 3 D 1 6,548,550−16−3174761613465510290.02%
2 p 5 3 d 3 F 2 6,589,977−162217296293356,592,6760.02%
2 p 5 3 d 3 D 2 6,596,316−171419476413346,599,2350.03%
2 p 5 3 d 1 F 3 6,600,744−171918036103436,603,5010.03%
2 p 5 3 d 1 P 1 6,656,872−8−5217436192886,659,4620.02%
3 C 3 D (eV)13.43020.0009−0.0061−0.00050.00040.019113.44400.15%
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Cheung, C.; Safronova, M.; Porsev, S. Scalable Codes for Precision Calculations of Properties of Complex Atomic Systems. Symmetry 2021, 13, 621. https://doi.org/10.3390/sym13040621

AMA Style

Cheung C, Safronova M, Porsev S. Scalable Codes for Precision Calculations of Properties of Complex Atomic Systems. Symmetry. 2021; 13(4):621. https://doi.org/10.3390/sym13040621

Chicago/Turabian Style

Cheung, Charles, Marianna Safronova, and Sergey Porsev. 2021. "Scalable Codes for Precision Calculations of Properties of Complex Atomic Systems" Symmetry 13, no. 4: 621. https://doi.org/10.3390/sym13040621

APA Style

Cheung, C., Safronova, M., & Porsev, S. (2021). Scalable Codes for Precision Calculations of Properties of Complex Atomic Systems. Symmetry, 13(4), 621. https://doi.org/10.3390/sym13040621

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop