Complexity of Molecular Nets: Topological Approach and Descriptive Statistics

Banaru, Alexander M.; Aksenov, Sergey M.

doi:10.3390/sym14020220

Open AccessArticle

Complexity of Molecular Nets: Topological Approach and Descriptive Statistics

by

Alexander M. Banaru

^1,2,* and

Sergey M. Aksenov

^2,3,*

¹

Faculty of Chemistry, Moscow State University, Vorobievy Hills, 119991 Moscow, Russia

²

Laboratory of Arctic Mineralogy and Materials Sciences, Kola Science Centre, Russian Academy of Sciences, 14 Fersman Str., 184209 Apatity, Russia

³

Geological Institute, Kola Science Centre, Russian Academy of Sciences, 14 Fersman Str., 184209 Apatity, Russia

^*

Authors to whom correspondence should be addressed.

Symmetry 2022, 14(2), 220; https://doi.org/10.3390/sym14020220

Submission received: 5 November 2021 / Revised: 13 December 2021 / Accepted: 12 January 2022 / Published: 24 January 2022

(This article belongs to the Special Issue Mathematical Crystallography 2021)

Download

Browse Figures

Versions Notes

Abstract

:

The molecular net complexity (H_molNet) is an extension of the combinatorial complexity (H_mol) of a crystal structure introduced by Krivovichev. It was calculated for a set of 4152 molecular crystal structures with the composition of C_xH_yO_z characterized by the structural class P2₁/c, Z = 4 (1). The molecular nets were derived from the molecular Voronoi–Dirichlet Polyhedra (VDP_mol). The values of the molecular coordination number (CN_mol) and critical coordination number (CN_crit) are discussed in relation with the complexity of the crystal structures. A statistical distribution of the set of molecular crystals based on the values of CN_mol, CN_crit, and the complexity parameters is obtained. More than a half of the considered structures has CN_mol = 14 and CN_mol′ = 9 with the Wyckoff set of edges e⁵dcba. The average multiplicity of intermolecular contacts statistically significantly decreases from 1.58 to 1.51 upon excluding all contacts except those bearing the molecular net. The normalized value of H_molNet is of the logistic distribution type and is distributed near 0.85H_molNet with a small standard deviation. The contribution of H_mol into H_molNet ranges from 35 to 95% (mean 79%, SD 6%), and the subset of bearing intermolecular contacts accounts for 41 to 100% (mean 62%, SD 11%) of the complexity of the full set of intermolecular contacts.

Keywords:

information measure; complexity; crystal structure; crystallographic net; coordination number

1. Introduction

According to the approach initially developed by Shannon in his theory of communication [1], the complexity of a message consisting of symbols depends on the probability of occurrence of each symbol in the message. In particular, quantifying information content of a message in bits corresponds as the function:

H = \sum_{i = 1}^{k} L (p_{i})

(1)

L (p_{i}) = \{\begin{matrix} 0 (p_{i} = 0), \\ - p_{i} \log_{2} p_{i} (p_{i} > 0) \end{matrix}

(2)

where p_i is the probability of i-th symbol to appear in the given message. Any graph with certain types of vertices may be considered as a message, as well. Finite graphs corresponding to the molecules belong to a wide class of so-called chemical graphs, and the approaches of measuring information content for them were introduced in the 1950s by Trucco [2] and in the early 2000s reviewed by Bonchev [3]. Commonly, the vertices of a chemical graph G are referred to as equivalent if they belong to the same orbit of the automorphism group of the graph Aut(G), which is isomorphic to the maximal symmetry group of the graph. The information measures of molecules and their ensembles was recently reviewed by Sabirov and Shepelevich [4]. Information content of molecules is of a specific interest due to the studying of chemical reactions [5], molecular aggregation processes [6], searching the reason for the first bioorganic molecules to appear [7], etc.

A crystal structure can also be represented by a finite graph called quotient graph introduced by Chung et al. in the 1980s [8]. In fact, the quotient graph is a “molecule” of a non-molecular crystal. A quotient graph of the crystal structure maps atoms onto the vertices and chemical bonds onto the edges or loops and reflects the connectivity of the reduced unit cell of the structure. The quotient graph is a useful tool to enumerate nets occurring in crystal structures [9] and perform a topological analysis of underlying nets [10,11]. The cyclomatic number of the quotient graph equals the dimensionality n of the Euclidean space Eⁿ in which the net derived from the quotient graph may be embedded being periodic in the same number of linearly independent directions. In such a space, the deletion of any edge lattice of the net leads to a disconnected net, and the net is referred to as minimal [12,13,14]. For instance, the diamondoid net is minimal in E³, while the quartz net is minimal in E⁴. Embeddings of some typical nets in E³ were enumerated in Reticular Chemistry Structure Resource (RCSR) [15], where each net is characterized by the maximal possible symmetry achieved by a barycentric placement of the vertices [16,17].

The amount of information stored by the quotient graph of the crystal structure was introduced by Krivovichev [18,19] to quantify the information content of the crystal. In this case, the probability p_i from (2) is calculated as p_i = m_i/v, where m_i is the multiplicity of the i-th crystallographic orbit occupied by atoms; v—the number of atoms in the reduced unit cell. Later, Hornfeck [20] complemented this measure by terms considering the degrees of freedom associated with a translational motion of an atom along the Wyckoff position and Kaußler and Kieslich [21] adapted this measure to positionally disordered crystals. However, for molecular crystals the information content calculated using this approach indicates the complexity of the molecule itself instead of the crystal structure. The possible scheme of avoiding this problem was proposed in [22]:

H_{molNet} = H (2 N, {CN}_{mol}) + \frac{2 N}{2 N + {CN}_{mol}} H_{mol} + \frac{{CN}_{mol}}{2 N + {CN}_{mol}} H_{edge};

(3)

H (2 N, {CN}_{mol}) = - \frac{2 N}{2 N + {CN}_{mol}} \log_{2} \frac{2 N}{2 N + {CN}_{mol}} - \frac{{CN}_{mol}}{2 N + {CN}_{mol}} \log_{2} \frac{{CN}_{mol}}{2 N + {CN}_{mol}};

(4)

H_{molNet, tot} = (N + {CN}_{mol} / 2) H_{molNet}

(5)

where N is the number of atoms in the molecule, CN_mol—the molecular coordination number, H_edge—the information content of the molecule, H_edge—the information content of the edge net of the molecular net, H_molNet—the combination of H_mol and H_edge with the property of strong additivity [20,23]. The value of H_molNet is meaningful even for high symmetric molecular structures with the only orbit occupied by the atoms (i.e., I₂, S₆, and α-N₂) [22]. It should be noted that the molecule in the crystal structure is commonly distorted, and the only symmetry operation retained in a molecule (in more than 90% cases) is the inversion center [24], which requires for the preserving of dense packing according to Kitaigorodskii [25]. Generally, a more conformational lability of the molecule promotes a more diverse set of contacts in the coordination shell and should result in the increasing of the molecular net complexity. On the other hand, certain molecular fragments have the opportunity to form a specific intermolecular interaction, such as H-bonds, π … π interactions, Hal…Hal, etc. In such a case the small subset of interactions often predominates in the crystal structure and, in fact, is bearing the entire net of the intermolecular contacts. However, the subset of bearing contacts may include excessive interactions and thus be redundant. The portion of the bearing subnet complexity attributable to the target engineered interactions may serve an indicator of effectiveness of a crystal engineering technique, as the latter aims to reproduce targeted bearing contacts.

In this work the formalism (3)–(5) previously discussed in [26] is tested for the set of more than 4000 homomolecular crystals with the general formula C_xH_yO_z of a structural class P2₁/c, Z = 4(1) (such notation indicates that there is exactly one symmetrically unique molecule occupying a general orbit in the space group P2₁/c). This structural class is of the special interest as the most widespread among organic crystals and corresponding to ~1/3 of all homomolecular structures and more than 1/2 of homomolecular racemates [27]. The aim of this work is to investigate the partitioning of intermolecular contacts from the coordination shell of the molecule into equivalence classes and to obtain the distribution type and the descriptive statistics of H_molNet.

2. Methods

The initial set of the crystal structures was extracted from Cambridge Structural Database (CSD) [28] using the following restrains: the presence of atomic coordinates, the absence of errors and/or disorder, and R-factor < 5%. Out of 4249 high-quality molecular crystal structures [26] selected from CSD ver. 5.41 (with updates), the set of 4152 structures without duplicates was retained for further investigation. The criteria of considering a structure as a duplicate were the same cell dimensions (with the tolerance of 2σ), the same chemical composition, space group and Wyckoff sequence.

The construction of molecular nets was carried out using the ToposPro program [29] by calculating the solid angles of the molecular Voronoi-Dirichlet polyhedron (VDP_mol). According to Blatov [30], VDP_mol is the superposition of atomic VDPs in a molecule, and the solid angle (Ω) corresponding to an intermolecular contact arises from interatomic contacts as:

Ω = \frac{\sum Ω_{i j}}{Ω_{Σ}} \times 100 %

(6)

where Ω_ij is the solid angle for the intermolecular contact ij, and Ω_Σ—the sum of solid angles for all the interatomic contacts for the given molecule with the adjacent ones. Interatomic contacts with Ω_ij < 1.5% of 4π steradians are omitted. In the same way, intermolecular contacts with Ω < 1.5% in this work have been omitted, while the adjacent molecules with Ω ≥ 1.5% are considered as the coordination shell of the initial molecule (Figure 1). As a rule, for non-specific van der Waals interactions the descending order of Ω corresponds to the decrease of interaction energy, allowing to avoid energy calculations for the assessment of supramolecular arrangement [26].

To derive the molecular net, the atoms were pulled to the mass center of the molecule. The molecular coordination number (CN_mol), which includes only symmetrically independent intermolecular contacts, is marked by a prime (CN_mol′), i.e., acrylic acid (ACRLAC04 [31]) has CN_mol = 12 (cuboctahedron) and CN_mol′ = 8. The subset of bearing contacts generating so-called critical net for a given molecule was defined in [32]. In a monosystemic crystal structure the center of gravity of each molecule is connected with the centers of gravity of CN_mol adjacent molecules, and VDP faces have the following order of the solid angles: Ω₁ > Ω₂ > Ω₃ > … > Ω_n (symmetrically equivalent contacts have the same Ω). For any value of n, there is 1 ≤ k ≤ n such that if all edges corresponding to the solid angles Ω_k, Ω_k₊₁, …, Ω_n are removed from the net, the resulted net becomes disconnected. The value max(k) is called a “critical coordination number with a prime” (CN_crit′). If all symmetrically equivalent contacts are considered, the corresponding value is called a “critical coordination number without a prime” (CN_mol). For instance, acrylic acid (ACRLAC04) has CN_crit = 5 (square pyramid) and CN_mol′ = 4. To derive a CN_crit, firstly, the edges of the net of intermolecular contacts, for which Ω > 15%, were removed from the net. In all cases, this led to reduction of the net’s dimensionality from 3D to 2D, 1D, or 0D. Then the contacts with Ω = 14.5–15.0% were returned to the adjacency matrix of the centers of gravity of the molecules, and a check was performed to establish the dimensionality of the net again. If the dimensionality was 3D, the returned contacts were referred as Ω_crit = Ω_max(k), and the constructed 3D net was considered a net of bearing contacts. If the dimensionality of the net did not increase to 3D, then the contacts with Ω = 14.0–14.5% were added to the adjacency matrix, and the dimensionality of the net was checked again. This procedure was repeated with the step of 0.5% until Ω_crit was found for all the structures. The less step values are not reliable since the measurement error is about 0.5%. The obtained distribution of the crystal structures of the considered set is close to normal (Figure 2).

The nets of intermolecular contacts in the most symmetrical embedding in E³ are classified either in accordance with RCSR [15] or TopCryst database [33] (when RCSR classification is lacking). The nets those remain unclassified in RCSR and TopCryst database up to date are characterized by a point symbol. The net for the crystal structure of acrylic acid has the RCSR code fcu (cubic closest packing), while the net of bearing contacts—sqp (Figure 3). For a CN-coordinated net there are CN(CN–1)/2 angles. The shortest cycle in each angle should be identified. The point symbol in the form A^a.B^b…C^c indicates that there are a angles that are A-cycles, b angles that are B-cycles, etc. (A < B < … < C) [34]. For instance, the fcu net has 12∙11/2 = 66 angles in each vertex, and its point symbol is 3²⁴.4³⁶.5⁶, while the sqp net has 5∙4/2 = 10 angles in each vertex, and its point symbol is 4⁴.6⁶.

If there are p sorts of vertices and q sorts of edges in the net, then the net is called p,q-transitive. For instance, the fcu and sqp nets are 1,1 and 1,2-transitive, respectively. In fact, p and q denote the minimal number of orbits occupied by the molecular centers of gravity and the contacts, respectively, and interrelate with the molecular net complexity for its most symmetric embedding in E³.

The complexity of a molecular net was calculated using (3)–(5). The structural information content (SIC = 0–1) [4] meaning the same as a normalized informational complexity [19] and was calculated as follows:

SIC = H/max(H)

(7)

where max(H) is the maximal possible value of H, when each vertex constitutes its own equivalence class: max(H_mol) = log₂N; max(H_edge) = log₂CN_mol; max(H_molNet) = log₂(2N + CN_mol).

The molecule of acrylic acid has N = 9 atoms and all of them are symmetrically unique (the Wyckoff set e⁹ in the space group P2₁/c), m_i = 4, v = 36. Consequently, H_mol = −9∙4/36∙log₂(4/36) = 3.170 bits/atom. The edge net of the CN_mol-coordinated molecular net is generated by the midpoints added to each edge of the molecular net. Two midpoints are connected if and only if they are adjacent to the same vertex, and the vertices of the initial net are removed. The final net (edge net) is 2(CN_mol − 1) = CN_edge-connected. The edge net for acrylic acid is 22-connected and contains 8 symmetrically independent vertices with the Wyckoff sequence e⁴dcba, v = 24, H_edge = −16/24∙log₂(16/24) − 4∙2/24∙log₂(2/24) = 2.918 bits/contact; H(2N, CN_mol) = H(18, 12) = 0.971 bits/d.f. (per a degree of freedom), H_molNet = 4.040 bits/d.f., SIC_molNet = 0.823, H_molNet,tot = 4.040∙15 = 60.60 bits/molecule. Note that if just bearing contacts are included in the net, then the edge net would be 8-connected and contain only 4 of 8 independent vertices with the Wyckoff sequence edca, v = 10, H_edge = −4/10∙log₂(4/10) − 3∙2/10∙log₂(2/10) = 1.922 bits/contact. This net is characterized by the unknown topological type.

The discriminatory power of H, based on the probability of two unrelated objects being characterized as the same type, was calculated according to the following equation [35]:

D = 1 - \frac{1}{N (N - 1)} \sum_{j = 1}^{s} x_{j} (x_{j} - 1)

(8)

where N is the number of the tested crystal structures, s the number of different types of structures with respect to H, and x_j the number of objects belonging to the j-th type. The correlations between calculated values were sought in the Mathematica software ver. 11.0 [36].

3. Results and Discussion

Crystal structures of the analyzed set are distributed over CN_mol, generally, in accordance with the earlier results obtained by Carugo et al. [37]. More than a half of the crystal structures have CN_mol = 14, and the second ranked value CN_mol = 16. The most frequent values of CN_crit are 5, 4, and 6, but unlike CN_mol there is no sharp peak on any of the values (Figure 4). More than a half of the structures is characterized by CN_mol′ = 9 (with e⁵—2355 structures; with e⁶—84 structures; with e⁴—13 structures; e⁷ba—1 structure with refcode HINSOM [38]), and most abundant CN_crit′ is its least value 3 (eba, ecb, e—856 structures; e²a, e²b, e²—737 structures; e³—81 structures).

In the structural class P2₁/c, Z = 4(1) each molecule can form contacts with a multiplicity 1 or 2. The former corresponds to a so-called involution, a symmetry element of the order 2 (the midpoint of a contact occupies the Wyckoff position e). The only involution presence in the space group P2₁/c is the inversion center

\bar{1}

(the midpoint of a contact occupies the Wyckoff position a, b, c or d). All other contacts are formed via a screw axis 2₁, or a glide plane c, or a translation along some direction. It is easy to show that the average multiplicity in between 1 and 2 equals to v/2CN′. According to two-sample t-test, the difference of the mean values for all contacts and for those bearing the net is statistically significant (p-value < 0.001). Moreover, the minimal multiplicity is 1.375 (3 structures) for the hole net of molecular contacts unlike 1.000 (1 structure with refcode KOLRAF [39]) for the critical subnet (Table 1). That is why the subnet of bearing contacts is, in average, more enriched by the inversion centers than the hole molecular net. As shown above, 2355 structures have the Wyckoff sequence e⁵dcba (or similar) for the edge net, and mean multiplicity in this case is (5∙2 + 4)/9 = 1.556. Motherwell [40] previously studied the projection patterns formed by projecting coordination shell of a molecule into 2D in different space groups with none of the special positions occupied. The majority of projection patterns in P2₁/c contained at least one contact via an inversion center.

The distribution of molecular nets in the considered series over the topological types is, generally, in accordance with the trend previously found by Carugo et al. for 105 549 packings of small molecules [37]. The most widespread topological type is bcu-x, a type derived from the body-centered cubic lattice where the coordination shell of the atom is extended by the second coordination shell (CN_mol = 8 + 6). This topological type has the least topological density TD₁₀ that reflects the total number of vertices in the first 10 of coordination shells, among all 14-coordinated nets reported for centrosymmetric [41] and non-centrosymmetric [32] crystalline hydrocarbons, some inorganic molecular crystals [22] (i.e., 14T191 in the orthorhombic sulfur, α-S₈), and those with the most popularity amongst all small molecular crystals [37]. Recently studied crystal structure of 2-(tert-butyl)-4-chloro-6-phenyl-1,3,5-triazine with 2 symmetrically independent molecules [42] is characterized by the 14T319 type topology (after neglecting contacts with Ω ≤ 2%), which occupies the opposite side of 14-coordinated molecular nets with respect to TD₁₀ (Table 2). The more 2nd or 3rd CN does not mean the more 4th and 5th CNs. For instance, in the 2nd coordination sphere there are 54 vertices in 14T134 topological type and 53 in 14T10; nevertheless, TD₁₀ for 14T10 is slightly higher. Remark that the TopCryst database has been extended last years by many new topological types with large CNs, including CN = 14. Thus, the previously found in 2019 a 14T134 topological type [32] in the crystal structure of spiropentane (refcode VAJGOC [43]) has no reference code in the TopCryst database. The corresponding molecular net in the most symmetric embedding in E³ is 1,6-transitive and has the space group R

\bar{3}

c with the only general position occupied by centers of gravity of molecules.

Consider three typical examples of molecular nets realized in α-methyl-trans-cinnamic acid (refcode: BEJVOB [45]), 5-methoxyindan-1-one (refcode: KACSOX01 [46]), which are both isomers with the chemical formula C₁₀H₁₀O₂ (Figure 5), and (1RS,3SR,4SR)-trispiro(2.0.0.2.1.1)nonane-1-carboxylic acid with the chemical formula C₁₀H₁₂O₂ (refcode: FAFDEW [47]). The Wyckoff sequences for the molecules are: e²² for BEJVOB and KACSOX01, and e²⁴ for FAFDEW. This leads to a slightly different values of H_mol: H_mol = 4.459 bits/atom for BEJVOB and KACSOX01, and H_mol = 4.585 bits/atom for FAFDEW. All other structures from the set of 4152 structural files show exactly the same distribution of atoms over general positions, i.e., they have the maximal H_mol for the given N (SIC = 1). Indeed, if a molecular center of gravity occupies a general position, then no atom is able to occupy an inversion center, otherwise the other atoms should be related by the inversion center and the molecule would either occupy the special position or have a symmetry-induced disorder (the latter was restricted by the structure selection). The linear correlation coefficient between N and the molecular mass in the analyzed set is 0.936, between N and H_mol − 0.959, and between the molecular mass and H_mol − 0.889. Consequently, there is a strong positive linear correlation between the molecular mass, N, and H_mol.

There are three crystal structures with CN_mol = 14, but characterized by the different topological types (Table 3): bcu-x, gpu-x, and tcg-x. Furthermore, all the structures have CN_mol′ = 9 and the same Wyckoff sequence for the midpoints of intermolecular contacts (e⁵dcba). This means the same H_edge = 3.093 as for the other 2352 structures of the Wyckoff sequence which contains e⁵, including e⁵dcba (2324 structures).

The critical nets for BEJVOB, KACSOX01, and FAFDEW are of different topological type. It was shown in [26] that the value Δ = CN_crit′ − minCN_crit′ adopts almost normal discrete distribution, where 92% of structures demonstrate Δ ≤ 2 (for the set of crystalline hydrocarbons this portion was even more 95% [48]). In the space group P2₁/c there are 3 generators in a minimal generating subset [49]. If a molecule occupies some special position of the space group with a site-symmetry group containing a generating element of the space group (

\bar{1}

in P2₁/c), then a fewer number of intermolecular contacts along the other symmetry elements could be sufficient for generating of a molecular net. However, for the structural class P2₁/c, Z = 4(1) the value minCN_crit′ = 3. For KACSOX01 and FAFDEW the critical molecular nets are parsimonic (CN_crit′ = 3), while for BEJVOB the net is not parsimonic (CN_crit′ = 5). The last one contains two redundant contacts via the inversion centers. Any pair of two inversion centers separated by a translation generate this translation; however, if it is accompanied by a contact with the multiplicity 2 along the same direction, the pair of inversion centers becomes redundant. Conversely, a sole contact with the multiplicity 2 in the critical net cannot be redundant because a triplet of inversion centers would never generate a 3D-space group instead of a plane group with the triplet belonging to the plane. As a result of the redundancy the critical net in BEJVOB is more complex than in KACSOX01 and FAFDEW (H_edge,crit = 2.252, 1.500 and 1.522 bits/atoms, respectively), i.e., about a half of the molecular net information content for KACSOX01 and FAFDEW, and more than 2/3 of that for BEJVOB. The nets are shown in Figure 6.

The topological types of the molecular and critical nets, which are subnets of the former, are shown in Figure 7. Surprisingly, for BEJVOB the prototype molecular net bcu-x has 2 kinds of edges, while the prototype critical net sxa has three kinds of edges because some Wyckoff positions are split when the symmetry group descends from

I m \bar{3} m

(bcu-x) to Cmme (sxa). The group Cmme has five elements in a minimal generating set [49], and there are Z = 4 (mm2) equivalent vertices in sxa. As the point group mm2 has two generators, the vertex configuration of sxa can be generated by 5 − 2 = 3 “contacts” of the vertices, therefore, the net sxa could be realized even for CN_crit′ = 3. On the contrary, another similar 6-coordinated net sxb of the strucutral class Cccm, Z = 4(2/m) could not be realized in any space group at CN_crit′ = minCN_crit′, since Cccm is generated by just a pair of elements. Recently, sxb was found in a metal-organic framework (MOF) [Mg₃(btdc)₃(dmf)₄] [50], which was synthesized by a topotactic reaction from [Mg₃(btdc)₃(dmf)₄]∙DMF of the pcu type upon heating, thus, the former MOF is not parsimonic in principle.

The set of the combinatorically distinctive critical nets depends on the topology of the initial molecular nets. For bcu-x, gpu-x, and tcg-x in the three above mentioned crystal structures, all subsets of edges, which may correspond to a CN_crit′ = minCN_crit′ = 3, were enumerated (Table 4). As all the initial nets have edges with the Wyckoff sequence e⁵dcba, there are 4 involutions and 5 contacts with the multiplicity 2. In BEJVOB (bcu-x), KACSOX01 (gpu-x) and FAFDEW (tcg-x) there are four contacts along the pair of screw axes 2₁, four contacts along the pair of glide c-planes, and two contacts along the translation vector, but their combination with the four involutions in different topological types is different. This leads to a different number and types of the critical subnets.

Apparently, the complexity of the partition of subnets into the combinatorically distinctive Wyckoff sequences (e, e², and e³), as well as into the topological types (dia, cds, dmp, etc.), can be easily measured in terms of (1) and (2), but this is out of the topic of this work. In fact, the coordination shell of a molecule may be referred as fuzzy [51], because upon the crystallization different subsets of bearing contacts arise simultaneously. In summary, the subnets of gpu-x are obviously more diverse and include such exotic topological types as 4-coordinated 4T19 (2 subnets) and 5-coordinated 5T12 (2 subnets). Meanwhile, the leading topological type of the subnet in all cases is the diamondoid type dia. In the structural class P2₁/c, Z = 4(1) dia, as any other 4-coordinated subnet, is formed by two involutions and two contacts with the multiplicity 2. The formation of dia is limited by two combinatorically different options [48]. In the first one, the generators are the glide c-plane and the inversion centers located at a distance of b/4 from each other along Y. The second option entails the screw axis 2₁ located at the distance of c/4 from one of the inversion centers (Figure 8, top). If one of the inversion centers in the first dia subtype is shifted by b/2, the subnet transforms into the cds type, for the second option it transforms into dmp. The bnn subnet, as dia, exists in two different subtypes (Figure 8, bottom). Each subtype has the only contact via the inversion center and 2 contacts along the translation a. The only difference is the last contact with the multiplicity 2, either along the glide c-plane or along the screw axis 2₁. However, if the contacts along a are replaced by the contacts along the 2₁ axis located at a distance of (a/2 + c/4) from the initial inversion center, then the bnn subtypes are transformed into nov and sqp, respectively. Finally, the pcu subnet of each initial net is generated by three contacts with the multiplicity 2.

Of course, among the critical nets in P2₁/c, Z = 4(1) there are those having CN_crit′ > minCN_crit′, for instance, noz (5-coordinated), acs, bsn, sxd (6-coordinated), the net of simple hexagonal packing hex (8-coordinated), the body-centered cubic net with unextended coordination shell bcu (8-coordinated), etc. Nevertheless, these topological types may be represented as an extension of some of the 5 minimal nets in E³ without collisions and with equal vertex degrees (CNs): dia, cds, ths, pcu, and srs [13]. The last minimal net of this kind, the 3-coordinated srs, was not observed in any crystal structure for the bearing contacts so far. Similarly, the 3-coordinated net ths was not observed in P2₁/c, Z = 4(1), but it is possible in some other monoclinic structural classes such as C2/c, Z = 8(1). Up to date, in the TopCryst database it was exemplified not by a molecular crystal, but by a MOF of the crystal structure with the refcode RAGFAJ [52]. The quotient graph of any critical net, including a redundant one, may be derived by an addition of an edge to the undirected quotient graph of some minimal net (Figure 9). Remark that Δ = CN_crit′ − minCN_crit′ = 0 does not necessarily corresponds to a minimal net, because the deletion of an edge lattice and the deletion of the symmetrically equivalent edges are not exactly the same processes. The deletion of all equivalent edges implies the deletion of translationally equivalent edges, but the converse is not true. As a result, a series of not minimal nets such as bnn, sqp, dmp, nov (Figure 8) also corresponds to Δ = 0.

The contribution of H_edge,crit into H_edge varies from 33.9 to 100% (Table 5). Indeed, there are 4 crystal structures with CN_crit = CN_mol and H_edge,crit = H_edge, these are extremal cases with the most redundant critical net. The contribution of H_mol, H_edge and H(2N, CN_mol) into H_molNet, is, on the average, is 78.9, 9.5 and 11.6%, respectively, with the value of σ being a few percent, i.e., the complexity of the molecular net is substantially defined by the value of H_mol, but the impact of H_edge and H(2N, CN_mol) is meaningful. The differences of min and max values for the contributions of H_edge and H(2N, CN_mol) into H_molNet are much more than σ, that means the outliers being not numerous. The values of SIC, calculated using (7), also show different variances. As it was mentioned above, since there are no atoms in a special position, SIC_mol = 1 for all the structures. As the maximal multiplicity of a contact is 2, theoretically, the minimal SIC_{edge, crit} = − CN_crit/2∙2/CN_crit∙log₂(2/CN_crit)/log₂CN_crit = 1 − 1/log₂CN_crit. All the structures with average multiplicity 2 in the set have CN_crit = 6, consequently, the minimal SIC_edge,crit = 1 − 1/log₂6 = 0.613. The maximal SIC_{edge, crit} = 1 corresponds to the average multiplicity 1 in the structure of (6-methoxycarbonylmethoxynaphthalen-1-yloxy)acetic acid methyl ester (refcode: KOLRAF) [39] with the Wyckoff sequence of edges dcba and the critical net dia. The values of SIC_edge and SIC_molNet have much smaller σ than SIC_edge,crit.

The distribution of the crystal structures by H_mol and H_molNet is shown in Figure 10. Both values are best approximated by a logistic distribution applicable to the modeling of the degrees of pneumoconiosis in coal miners, chronic obstructive respiratory disease prevalence on smoking, survival time of diagnosed leukemia patients, etc. [53]. Generally, it has the probability density function:

f (x; μ; β) = \frac{e^{- (x - μ) / β}}{β {(1 + e^{- (x - μ) / β})}^{2}}

(9)

For H_mol μ ≈ 5.252, β ≈ 0.30; for H_molNet μ ≈ 5.572, β ≈ 0.25. Thus, the difference of the expected values μ is about 0.320 bits/d.f., and the variance for H_mol is greater than for H_molNet.

The discriminatory powers D for H_mol, H_edge,crit, H_edge, H(2N, CN_mol), and H_molNet are listed in Table 6. The simple combinatorial complexity H_mol distinguishes only 99 values, whereas H_molNet—531 values. Surprisingly, H_edge has the least D = 0.6372 and distinguishes only 26 values, while H_edge,crit has even greater D = 0.8762 and s = 28. The reason of such substantial difference of D at a small difference of s is the abnormality of distribution. As shown above, 2355 structures have e⁵dcba or similar Wyckoff sequence for the edge net (in this case H_edge = 1.556 bits/contact). Meanwhile, the most widespread Wyckoff sequence for the critical net is eba (or similar)—856 structures (H_edge,crit = 1.500 bits/contact), i.e., with about three times less probability. The H(2N; CN_mol) has a remarkably high value of s and D in comparison with H_mol, because CN_mol may vary at equal N, i.e., at equal H_mol.

4. Conclusions

For molecular crystals, unlike those with infinite chains, layers, or frameworks, the simple combinatorial information content is of limited usefulness. When each atom occupies its own crystallographic orbit, the value of H_mol reflects only the number of atoms in a molecule. On the contrary, the information content of the molecular net H_edge combined with H_mol gives a hybrid function H_molNet dependent not only on the number of atoms in a molecule, but on the molecular coordination number CN_mol and the number of orbits occupied by the midpoints of the molecular contacts CN_mol′. In comparison with H_mol, this hybrid function has a greater discriminatory power and is more favorable for molecular crystals. The edge net complexity H_edge and that originated from mixing two sources of information H(2N; CN_mol) add, on the average, a little more than 10% H_mol each. The normalized values of H_edge and H_molNet (SIC_edge and SIC_molNet) are distributed near 0.80–0.85 with a small standard deviation. The distribution of both H_mol and H_molNet is approximately logistic.

More than a half of 4152 considered structures have CN_mol = 14 and CN_mol′ = 9 with the Wyckoff set of edges e⁵dcba. The average multiplicity of intermolecular contacts statistically significantly decreases upon excluding all contacts except those bearing the molecular net, i.e., the critical net is more saturated with involutions (the inversion centers) than the initial net. The critical net contains more than 40% information of the molecular net, and H_edge,crit has a more discriminatory power.

The minimal possible CN_crit′ is the invariant of a structural class. Each molecular coordination shell may be split in the finite number of critical coordination shells, from which the complexity of the fuzzy coordination shell arises.

Author Contributions

Conceptualization, A.M.B.; methodology, A.M.B.; writing—original draft preparation, A.M.B.; writing—review and editing, A.M.B. and S.M.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Russian Science Foundation, grant number 20-77-10065.

Conflicts of Interest

The authors declare no conflict of interest.

References

Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef] [Green Version]
Trucco, E. On the information content of graphs: Compound symbols; Different states for each point. Bull. Math. Biophys. 1956, 18, 237–253. [Google Scholar] [CrossRef]
Bonchev, D. Shannon’s Information and Complexity. In Complexity: Introduction and Fundamentals; CRC Press: London, UK, 2003; pp. 157–187. ISBN 978-0-429-16545-0. [Google Scholar]
Sabirov, D.S.; Shepelevich, I.S. Information Entropy in Chemistry: An Overview. Entropy 2021, 23, 1240. [Google Scholar] [CrossRef]
Sabirov, D.S. Information entropy changes in chemical reactions. Comput. Theor. Chem. 2018, 1123, 169–179. [Google Scholar] [CrossRef]
Sabirov, D.S. Information entropy of mixing molecules and its application to molecular ensembles and chemical reactions. Comput. Theor. Chem. 2020, 1187, 112933. [Google Scholar] [CrossRef]
Sabirov, D.S. Information entropy of interstellar and circumstellar carbon-containing molecules: Molecular size against structural complexity. Comput. Theor. Chem. 2016, 1097, 83–91. [Google Scholar] [CrossRef]
Chung, S.J.; Hahn, T.; Klee, W.E. Nomenclature and generation of three-periodic nets: The vector method. Acta Crystallogr. Sect. A 1984, 40, 42–50. [Google Scholar] [CrossRef] [Green Version]
Klee, W.E. Crystallographic nets and their quotient graphs. Cryst. Res. Technol. 2004, 39, 959–968. [Google Scholar] [CrossRef]
Eon, J.-G. From symmetry-labeled quotient graphs of crystal nets to coordination sequences. Struct. Chem. 2012, 23, 987–996. [Google Scholar] [CrossRef]
Eon, J.-G. Topological features in crystal structures: A quotient graph assisted analysis of underlying nets and their embeddings. Acta Crystallogr. Sect. A Found. Adv. 2016, 72, 268–293. [Google Scholar] [CrossRef]
Beukemann, A.; Klee, W.E. Minimal nets. Z. Krist.-New Cryst. Struct. 1992, 201, 37–51. [Google Scholar] [CrossRef]
Bonneau, C.; Delgado-Friedrichs, O.; O’Keeffe, M.; Yaghi, O.M. Three-periodic nets and tilings: Minimal nets. Acta Crystallogr. Sect. A Found. Crystallogr. 2004, 60, 517–520. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Eon, J.G. Euclidian embeddings of periodic nets: Definition of a topologically induced complete set of geometric descriptors for crystal structures. Acta Crystallogr. Sect. A Found. Crystallogr. 2011, 67, 68–86. [Google Scholar] [CrossRef] [PubMed]
O’Keeffe, M.; Peskov, M.A.; Ramsden, S.J.; Yaghi, O.M. The Reticular Chemistry Structure Resource (RCSR) Database of, and Symbols for, Crystal Nets. Acc. Chem. Res. 2008, 41, 1782–1789. [Google Scholar] [CrossRef]
Delgado-Friedrichs, O.; O’Keeffe, M. Identification of and symmetry computation for crystal nets. Acta Crystallogr. Sect. A 2003, 59, 351–360. [Google Scholar] [CrossRef]
Sunada, T. Lecture on topological crystallography. Jpn. J. Math. 2012, 39, 1–39. [Google Scholar] [CrossRef]
Krivovichev, S.V. Topological complexity of crystal structures: Quantitative approach. Acta Crystallogr. Sect. A Found. Crystallogr. 2012, 68, 393–398. [Google Scholar] [CrossRef] [PubMed]
Krivovichev, S.V. Which inorganic structures are the most complex? Angew. Chem.-Int. Ed. 2014, 53, 654–661. [Google Scholar] [CrossRef]
Hornfeck, W. On an extension of Krivovichev’ s complexity measures. Acta Crystallogr. Sect. A Found. Adv. 2020, 76, 534–548. [Google Scholar] [CrossRef]
Kaußler, C.; Kieslich, G. crystIT: Complexity and configurational entropy of crystal structures via information theory. J. Appl. Crystallogr. 2021, 54, 306–316. [Google Scholar] [CrossRef] [PubMed]
Banaru, A.M.; Aksenov, S.M.; Krivovichev, S.V. Complexity Parameters for Molecular Solids. Symmetry 2021, 13, 1399. [Google Scholar] [CrossRef]
Csiszár, I. Axiomatic Characterizations of Information Measures. Entropy 2008, 10, 261–273. [Google Scholar] [CrossRef] [Green Version]
Pidcock, E.; Motherwell, W.D.S.; Cole, J.C. A database survey of molecular and crystallographic symmetry. Acta Crystallogr. Sect. B Struct. Sci. 2003, 59, 634–640. [Google Scholar] [CrossRef] [PubMed]
Slovokhotov, Y.L. Organic crystallography: Three decades after Kitaigorodskii. Struct. Chem. 2019, 30, 551–558. [Google Scholar] [CrossRef]
Banaru, A.M.; Aksenov, S.M.; Banaru, D.A. Critical Molecular Coordination Numbers in the Structural Class P2(1)/c, Z = 4(1). Mosc. Univ. Chem. Bull. 2021, 78, 325–333. [Google Scholar] [CrossRef]
Zorky, P.M. Symmetry, pseudosymmetry and hypersymmetry of organic crystals. J. Mol. Struct. 1996, 374, 9–28. [Google Scholar]
Groom, C.R.; Bruno, I.J.; Lightfoot, M.P.; Ward, S.C. The Cambridge Structural Database. Acta Crystallogr. B Struct. Sci. Cryst. Eng. Mater. 2016, 72, 171–179. [Google Scholar] [CrossRef] [PubMed]
Blatov, V.A.; Shevchenko, A.P.; Proserpio, D.M. Applied Topological Analysis of Crystal Structures with the Program Package ToposPro. Cryst. Growth Des. 2014, 14, 3576–3586. [Google Scholar] [CrossRef]
Blatov, V.A. Voronoi–dirichlet polyhedra in crystal chemistry: Theory and applications. Crystallogr. Rev. 2004, 10, 249–318. [Google Scholar] [CrossRef]
Oswald, I.D.H.; Urquhart, A.J. Polymorphism and polymerisation of acrylic and methacrylic acid at high pressure. CrystEngComm 2011, 13, 4503–4507. [Google Scholar] [CrossRef] [Green Version]
Gridin, D.M.; Banaru, A.M. Coordination Numbers and Topology of Crystalline Hydrocarbons. Mosc. Univ. Chem. Bull. 2020, 75, 354–367. [Google Scholar] [CrossRef]
The Samara Topological Data Center TopCryst. Available online: https://topcryst.com/ (accessed on 13 January 2022).
Blatov, V.A.; O’Keeffe, M.; Proserpio, D.M. Vertex-, face-, point-, Schläfli-, and Delaney-symbols in nets, polyhedra and tilings: Recommended terminology. CrystEngComm 2010, 12, 44–48. [Google Scholar] [CrossRef] [Green Version]
Hunter, P.R.; Gaston, M.A. Numerical index of the discriminatory ability of typing systems: An application of Simpson’s index of diversity. J. Clin. Microbiol. 1988, 26, 2465–2466. [Google Scholar] [CrossRef] [Green Version]
Kroll, L.S. Mathematica—A System for Doing Mathematics by Computer. Wolfram Research. Am. Math. Mon. 1989, 96, 855–861. [Google Scholar] [CrossRef]
Carugo, O.; Blatova, O.A.; Medrish, E.O.; Blatov, V.A.; Proserpio, D.M. Packing topology in crystals of proteins and small molecules: A comparison. Sci. Rep. 2017, 7, 13209. [Google Scholar] [CrossRef]
Starck, F.; Jones, P.G.; Herges, R. Synthesis of Photoresponsive Polyethers. Eur. J. Org. Chem. 1998, 1998, 2533–2539. [Google Scholar] [CrossRef]
Mondal, P.; Karmakar, A.; Singh, W.M.; Baruah, J.B. Crystal packing in some flexible carboxylic acids and esters attached to a naphthalene ring. CrystEngComm 2008, 10, 1550–1559. [Google Scholar] [CrossRef]
Motherwell, W.D.S. Architecture of packing in molecular crystals. CrystEngComm 2017, 19, 6869–6882. [Google Scholar] [CrossRef]
Banaru, A.M.; Gridin, D.M. Coordination Numbers and Critical Topology of Centrosymmetric Hydrocarbons. J. Struct. Chem. 2019, 60, 1885–1895. [Google Scholar] [CrossRef]
Song, X.; Tang, Z.; Zuo, Z.; Duan, J. The crystal structure of 2-(tert-butyl)-4-chloro-6-phenyl-1,3,5-triazine, C13H14Cl1N3. Z. Krist.-New Cryst. Struct. 2018, 233, 779–781. [Google Scholar] [CrossRef]
Boese, R.; Blaeser, D.; Gomann, K.; Brinker, U.H. Spiropentane as a tensile spring. J. Am. Chem. Soc. 1989, 111, 1501–1503. [Google Scholar] [CrossRef]
Shirazi, M.; Soltani, M.-R.; Jahanabadi, Z.; Abdollahifar, M.-A.; Tanideh, N.; Noorafshan, A. Stereological comparison of the effects of pentoxifylline, captopril, simvastatin, and tamoxifen on kidney and bladder structure after partial urethral obstruction in rats. Korean J. Urol. 2014, 55, 756–763. [Google Scholar] [CrossRef]
Bryan, R.F.; White, D.H. α-Methyl-trans-cinnamic acid (m.p. 355 K). Acta Crystallogr. Sect. B 1982, 38, 1332–1334. [Google Scholar] [CrossRef]
Abid, O.-R.; Qadeer, G.; Rama, N.H.; Wong, W.-Y. 5-Methoxyindan-1-one. Acta Crystallogr. Sect. E 2007, 63, o165–o166. [Google Scholar] [CrossRef]
de Meijere, A.; Khlebnikov, A.F.; Kozhushkov, S.I.; Kostikov, R.R.; Schreiner, P.R.; Wittkopp, A.; Rinderspacher, C.; Menzel, H.; Yufit, D.S.; Howard, J.A.K. The First Enantiomerically Pure [n]Triangulanes and Analogues: σ-[n]Helicenes with Remarkable Features. Chem.-Eur. J. 2002, 8, 828–842. [Google Scholar] [CrossRef]
Banaru, A.M.; Banaru, D.A. Zorkii structural classes and critical topology of molecular crystals. J. Struct. Chem. 2020, 61, 1485–1502. [Google Scholar] [CrossRef]
Lord, E.A.; Banaru, A.M. Number of generating elements in space group of a crystal. Mosc. Univ. Chem. Bull. 2012, 67, 50–58. [Google Scholar] [CrossRef]
Dubskikh, V.A.; Lysova, A.A.; Samsonenko, D.G.; Dybtsev, D.N.; Fedin, V.P. Topological polymorphism and temperature-driven topotactic transitions of metal–organic coordination polymers. CrystEngComm 2020, 22, 6295–6301. [Google Scholar] [CrossRef]
Banaru, A.M. A Fuzzy Set of Generating Contacts in a Molecular Agglomerate. Mosc. Univ. Chem. Bull. 2019, 74, 101–105. [Google Scholar] [CrossRef]
Li, S.-L.; Wang, J.; Zhang, F.-Q.; Zhang, X.-M. Light and Heat Dually Responsive Luminescence in Organic Templated CdSO4-type Halogeno(cyano)cuprates with Disorder of Halogenide/Cyanide. Cryst. Growth Des. 2017, 17, 746–752. [Google Scholar] [CrossRef]
Nadarajah, S.; Kotz, S. A generalized logistic distribution. Int. J. Math. Math. Sci. 2005, 2005, 894212. [Google Scholar] [CrossRef] [Green Version]

Figure 1. VDP_mol as the sum of atomic VDPs (represented by different colors) and the 1-st molecular coordination shell in the crystal structure of acrylic acid (CSD-refcode: ACRLAC04).

Figure 2. The distribution of the set of crystal structures by Ω_crit rounded to half-integer %.

Figure 3. A fragment of the net of all intermolecular contacts fcu type (left) and that of bearing intermolecular contacts sqp type (right) for the crystal structure of acrylic acid.

Figure 4. The distribution of crystal structures of the analyzed set over CN_mol and CN_crit (top), CN_mol′ and CN_crit′ (bottom).

Figure 5. The structural formulas of isomeric α-methyl-trans-cinnamic acid (refcode: BEJVOB), 5-methoxyindan-1-one (refcode: KACSOX01), and (1RS,3SR,4SR)-trispiro(2.0.0.2.1.1)nonane-1-carboxylic acid (refcode: FAFDEW).

Figure 6. A fragment of the molecular nets of BEJVOB, KACSOX01, and FAFDEW, view along Y (β-setting). The edges of the critical nets are shown—by black solid lines, those disposable for the molecular net—by blue dashed lines, the molecular centers of gravity—by blue circles.

Figure 7. The topological types of BEJVOB (left), KACSOX01 (center), and FAFDEW (right) for the molecular (top) and critical (bottom) nets.

Figure 8. A scheme of the critical subnets of the molecular nets in P2₁/c, Z = 4(1) at CN_crit = 4 (top) and 5 (bottom). Molecules inverted and turned to the viewer by the reverse side, are shaded up. Molecules shifted by ±t/2 towards the viewer, are outlined by a dashed line. Double lines denote contacts between the initial molecule and two adjacent ones located above and below.

Figure 9. The types of critical subnets in P2₁/c, Z = 4(1) derived from minimal nets (shown in red) by an addition of an edge into the quotient graph of the most symmetric embedding in E³.

Figure 10. The distributions of 4152 structures by H_mol (top) and H_molNet (bottom) with the step of 0.100 bits/d.f. and their approximation by the logistic distribution.

Table 1. The min, max, mean value, and standard deviation of the multiplicity of the contact in the molecular net of all the intermolecular contacts (molecular) and the bearing ones only (critical) for the set of 4152 crystal structures.

The Net	Min	Max	Mean	σ
molecular	1.375	2.000	1.582	0.060
critical	1.000	2.000	1.512	0.186

Table 2. The topological types found in crystalline hydrocarbons [32,44], some inorganic molecular crystals [22], and 2-(tert-butyl)-4-chloro-6-phenyl-1,3,5-triazine, ordered by the increase of TD₁₀.

Topological Type	Point Symbol	Coordination Sphere					TD₁₀
Topological Type	Point Symbol	1st	2nd	3rd	4th	5th	TD₁₀
bcu-x	3³⁶.4⁴⁸.5⁷	14	50	110	194	302	4641
gpu-x	3³⁶.4⁴⁶.5⁹	14	52	114	202	314	4831
tcg-x	3³⁶.4⁴⁶.5⁹	14	52	116	204	318	4893
14T34	3³³.4⁵¹.5⁷	14	53	117	208	324	4996
14T5	3³⁶.4⁴⁵.5¹⁰	14	53	120	212	332	5106
14T6	3³⁶.4⁴⁵.5¹⁰	14	53	120	213	335	5138
14T134	3³⁴.4⁴⁷.5¹⁰	14	54	122	216	338	5201
14T10	3³⁶.4⁴⁵.5¹⁰	14	53	122	218	339	5238
14T37	3³³.4⁴⁵.5¹⁰	14	53	121	217	339	5239
14T65	3³³.4⁵¹.5⁷	14	54	122	218	342	5301
14T9	3³⁶.4⁴⁵.5¹⁰	14	53	123	221	344	5329
14T24	3³⁶.4⁴⁶.5⁹	14	52	120	218	344	5339
14T3	3³⁶.4⁴⁴.5¹¹	14	54	124	222	348	5373
14T8	3³⁶.4⁴⁴.5¹¹	14	54	126	226	354	5475
14T319	3³⁰.4⁵⁰.5¹¹	14	58	130	232	362	5581
14T18	3³⁶.4⁴⁴.5¹¹	14	54	130	242	382	5947
14T191	3³³.4⁴⁷.5¹¹	14	59	141	256	402	6246

Table 3. Topological types and structural characteristics of the molecular net in BEJVOB, KACSOX01, and FAFDEW.

Refcode in CSD	BEJVOB [45]	KASSOX01 [46]	FAFDEW [47]
Formula	C₁₀H₁₀O₂	C₁₀H₁₀O₂	C₁₀H₁₂O₂
Name	α-methyl-trans-cinnamic acid	5-methoxyindan-1-one	(1RS,3SR,4SR)-trispiro(2.0.0.2.1.1)nonane-1-carboxylic acid
Temperature	room	room	100 K
R-factor	4.10	3.90	3.75
Structural class	P2₁/c, Z = 4(1)	P2₁/c, Z = 4(1)	P2₁/c, Z = 4(1)
The molecular net
CN_mol	14	14	14
Type (transitivity)	bcu-x (1,2)	gpu-x (1,4)	tcg-x (1,6)
CN_mol′	9	9	9
Wyckoff sequence of intermolecular contacts	e⁵dcba	e⁵dcba	e⁵dcba
H_edge, bits/contact	3.093	3.093	3.093
The critical net
CN_crit	6	4	5
Type (transitivity)	sxa (1,3)	dia (1,1)	bnn (1,2)
CN_crit′	5	3	3
Wyckoff sequence of bearing contacts	edcba	edc	e²b
H_edge,crit, bits/contact	2.252	1.500	1.522

Table 4. Possible critical subnets of bcu-x, gpu-x, and tcg-x with the Wyckoff sequence of edges e⁵dcba in P2₁/c.

CN_crit	Wyckoff Sequences of Edges	Subnets	Nets
CN_crit	Wyckoff Sequences of Edges	Subnets	Bcu-X	Gpu-X	Tcg-X
4	e.	dia	18	8	20
		cds	–	2	–
		dmp	–	4	–
		4T19	–	2	–
5	e².	sqp	4	4	8
		nov	8	6	10
		bnn	16	–	16
		5T12	–	2	–
6	e³.	pcu	7	2	8
Total			53	30	62

Table 5. The min, max, mean value, and standard deviation of the contributions of H (%) and SIC for the set of 4152 crystal structures.

Value	Min	Max	Mean	σ
%
H_mol in H_molNet	35.5	94.3	78.9	6.4
H_edge,crit in H_edge	41.2	100.0	62.4	11.7
H_edge in H_molNet	1.9	39.2	9.5	3.6
H(2N, CN_mol) in H_molNet	3.8	25.8	11.6	2.8
SIC
SIC_mol	1.000	1.000	1.000	0.000
SIC_edge,crit	0.613	1.000	0.740	0.057
SIC_edge	0.759	0.864	0.811	0.009
SIC_molNet	0.813	0.879	0.853	0.008

Table 6. The number s of distinctive values H and the discriminatory power D over the set of 4152 crystal structures.

Value	s	D, %
H_mol	99	97.76
H_edge,crit	28	87.62
H_egde	26	63.72
H(2N; CN_mol)	389	98.93
H_molNet	531	99.13

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Banaru, A.M.; Aksenov, S.M. Complexity of Molecular Nets: Topological Approach and Descriptive Statistics. Symmetry 2022, 14, 220. https://doi.org/10.3390/sym14020220

AMA Style

Banaru AM, Aksenov SM. Complexity of Molecular Nets: Topological Approach and Descriptive Statistics. Symmetry. 2022; 14(2):220. https://doi.org/10.3390/sym14020220

Chicago/Turabian Style

Banaru, Alexander M., and Sergey M. Aksenov. 2022. "Complexity of Molecular Nets: Topological Approach and Descriptive Statistics" Symmetry 14, no. 2: 220. https://doi.org/10.3390/sym14020220

APA Style

Banaru, A. M., & Aksenov, S. M. (2022). Complexity of Molecular Nets: Topological Approach and Descriptive Statistics. Symmetry, 14(2), 220. https://doi.org/10.3390/sym14020220

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Complexity of Molecular Nets: Topological Approach and Descriptive Statistics

Abstract

1. Introduction

2. Methods

3. Results and Discussion

4. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI