<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Mater.</journal-id>
<journal-title>Frontiers in Materials</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Mater.</abbrev-journal-title>
<issn pub-type="epub">2296-8016</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmats.2019.00120</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Materials</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>A Simple Approach to Atomic Structure Characterization for Machine Learning of Grain Boundary Structure-Property Models</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Snow</surname> <given-names>Brandon D.</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/715612/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Doty</surname> <given-names>Dustin D.</given-names></name>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Johnson</surname> <given-names>Oliver K.</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/697074/overview"/>
</contrib>
</contrib-group>
<aff><institution>Department of Mechanical Engineering, Brigham Young University</institution>, <addr-line>Provo, UT</addr-line>, <country>United States</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Norbert Huber, Helmholtz Centre for Materials and Coastal Research (HZG), Germany</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Peter Larsen, Massachusetts Institute of Technology, United States; Liam Huber, Max-Planck-Institut f&#x000FC;r Eisenforschung GmbH, Germany</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Oliver K. Johnson <email>ojohnson&#x00040;byu.edu</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Computational Materials Science, a section of the journal Frontiers in Materials</p></fn></author-notes>
<pub-date pub-type="epub">
<day>31</day>
<month>05</month>
<year>2019</year>
</pub-date>
<pub-date pub-type="collection">
<year>2019</year>
</pub-date>
<volume>6</volume>
<elocation-id>120</elocation-id>
<history>
<date date-type="received">
<day>03</day>
<month>03</month>
<year>2019</year>
</date>
<date date-type="accepted">
<day>09</day>
<month>05</month>
<year>2019</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2019 Snow, Doty and Johnson.</copyright-statement>
<copyright-year>2019</copyright-year>
<copyright-holder>Snow, Doty and Johnson</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract><p>Grain boundaries (GBs) have a significant influence on the properties of crystalline materials. Machine learning approaches present an attractive route to develop atomic structure-property models for GBs because of the complexity of their structure. However, the application of such techniques requires an appropriate descriptor of the atomic structure. Unfortunately, common crystal structure identification techniques cannot be applied to characterize the structure of the vast majority of GB atoms (50&#x02013;98% are classified as &#x0201C;other&#x0201D;). This suggests a critical need for atomic structure descriptors capable of identifying arbitrary atomic environments. In this work we present a simple procedure that facilitates the identification of arbitrary atomic structures present in GBs. We apply this approach to characterize the atomic structure of the 388 GBs from the Olmsted data set (Olmsted et al., <xref ref-type="bibr" rid="B20">2009</xref>). We show how this approach facilitates visualization of GB atomic structures in a way that reveals important structural information. We test the recently proposed hypothesis that &#x003A3;3 GBs contain facets of the GBs that form the corners of the corresponding GB plane fundamental zone. Finally, we briefly demonstrate how the structure descriptors resulting from our approach can be used as inputs to machine learning approaches for the development of atomic structure-property models for GBs.</p></abstract>
<kwd-group>
<kwd>grain boundary</kwd>
<kwd>machine learning</kwd>
<kwd>atomic structure identification</kwd>
<kwd>common neighbor analysis</kwd>
<kwd>faceting</kwd>
</kwd-group>
<counts>
<fig-count count="10"/>
<table-count count="2"/>
<equation-count count="0"/>
<ref-count count="36"/>
<page-count count="12"/>
<word-count count="8326"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>1. Introduction</title>
<p>Grain boundaries (GBs) play an important role for many material properties, such as hydrogen embrittlement (Bechtle et al., <xref ref-type="bibr" rid="B6">2009</xref>), creep (Gertsman and Tangri, <xref ref-type="bibr" rid="B11">1997</xref>; Watanabe et al., <xref ref-type="bibr" rid="B34">2009</xref>), corrosion resistance (Shimada et al., <xref ref-type="bibr" rid="B30">2002</xref>; Tan et al., <xref ref-type="bibr" rid="B33">2008</xref>), and conductivity (Zhang et al., <xref ref-type="bibr" rid="B36">2006</xref>). While the structure of GBs is most often characterized experimentally by their five macroscopic crystallographic degrees of freedom (Ashby et al., <xref ref-type="bibr" rid="B2">1978</xref>), it is the atomic structure that fundamentally governs their properties (Katritzky and Fara, <xref ref-type="bibr" rid="B15">2005</xref>). Atomistic simulation has been used to investigate the atomic structure of GBs and how it correlates with their observed properties (Zhang et al., <xref ref-type="bibr" rid="B35">2009</xref>). However, the atomic structure of GBs is much more complicated than their crystallographic structure and traditional crystal identification descriptors are not designed to classify the structure of the vast majority of atoms present at GBs. As an example, we analyzed the 388 GBs constructed by Olmsted et al. (<xref ref-type="bibr" rid="B20">2009</xref>) using common crystal structure identification methods: bond-angle analysis (BAA) (Ackland and Jones, <xref ref-type="bibr" rid="B1">2006</xref>), common neighbor analysis (CNA) (Faken and J&#x000F3;nsson, <xref ref-type="bibr" rid="B9">1994</xref>), and polyhedral template matching (PTM) (Larsen et al., <xref ref-type="bibr" rid="B17">2016</xref>). <xref ref-type="table" rid="T1">Table 1</xref> provides the percentage of the GB atoms that were unclassified (i.e., classified as &#x0201C;other&#x0201D;/unknown structures) by each technique across all 388 GBs and across the subset of 41 &#x003A3;3 GBs. The fact that 50&#x02013;98% of the GB atoms remain unclassified, makes it difficult to identify atomic structure-property relationships for GBs, and suggests a critical need for new techniques that can describe the complex atomic structure of GBs.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Comparison of characterization methods applied to the Olmsted GB data set (Olmsted et al., <xref ref-type="bibr" rid="B20">2009</xref>).</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th/>
<th valign="top" align="center"><bold>CNA</bold></th>
<th valign="top" align="center"><bold>PTM</bold></th>
<th valign="top" align="center"><bold>BAA</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">% Unclassified Atoms: All 388 GBs</td>
<td valign="top" align="center">98%</td>
<td valign="top" align="center">89%</td>
<td valign="top" align="center">50%</td>
</tr>
<tr>
<td valign="top" align="left">% Unclassified Atoms: &#x003A3;3 GBs</td>
<td valign="top" align="center">95%</td>
<td valign="top" align="center">54%</td>
<td valign="top" align="center">40%</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>For all crystal structure identification methods, a large portion of the grain boundary atoms remain unclassified. Note that the FCC atoms are excluded from this calculation. Because the listed methods are restricted to finding a pre-defined set of reference structures, the GB atoms are only classified if they are either HCP, BCC, ICO, or SC. This is useful in the case of the coherent twin which is 100% HCP, but additional measures are needed to characterize any other grain boundary</italic>.</p>
</table-wrap-foot>
</table-wrap>
<p>Due to the complex and high-dimensional nature of GB atomic structures, machine learning and related statistical approaches provide an attractive route for the development of atomic structure-property models. However, the inability to resolve atomic structure within GBs complicates such an effort because the effect of distinct atomic environments cannot be extracted if these environments cannot be distinguished. If it were possible to fully characterize the atomic structure of GBs, dimensionality reduction techniques such as feature selection (e.g., decision trees) and feature transformation (e.g., principle component analysis) could be applied to identify the atomic environments that govern properties of interest. Labeled data from simulations could then be provided to train supervised machine learning algorithms, and predictive models could be developed that would significantly expand our understanding of atomic structure-property relations for GBs.</p>
<p>As demonstrated above, common crystal structure identification techniques are insufficient for this task. Consequently, several authors have developed methods to identify arbitrary non-crystalline atomic structures for applications such as developing interatomic potentials (Bart&#x000F3;k et al., <xref ref-type="bibr" rid="B5">2013</xref>), analyzing colloidal crystallization (Reinhart et al., <xref ref-type="bibr" rid="B26">2017</xref>), and characterizing grain boundaries (Banadaki and Patala, <xref ref-type="bibr" rid="B4">2017</xref>; Rosenbrock et al., <xref ref-type="bibr" rid="B28">2017</xref>; Priedeman et al., <xref ref-type="bibr" rid="B23">2018</xref>). A brief summary of their work is given in section 2. While these methods are effective, they are also significantly more complex than simple crystal structure identification techniques that are in common use. The major contribution of the present work is to bridge this gap.</p>
<p>By employing a simple version of common neighbor analysis (CNA) and leveraging information that is already available&#x02014;but which is normally discarded&#x02014;we develop an approach that (i) can characterize arbitrary atomic environments, while also being both (ii) simple to implement, and (iii) built upon a descriptor that is already familiar to the atomistic modeling community. We demonstrate that, in spite of its simplicity, it can be employed for predictive purposes as part of a machine learning strategy to develop GB structure-property models. We anticipate that the simplicity and effectiveness of this approach will facilitate the development of predictive structure-property models for GBs as well as other applications that involve lower symmetry atomic structures such as those present in metallic glasses.</p></sec>
<sec id="s2">
<title>2. Background</title>
<p>There has been great interest in characterizing atomic structures recently and over the last decade and several reviews are available in the literature (Stukowski, <xref ref-type="bibr" rid="B32">2012</xref>; Priedeman, <xref ref-type="bibr" rid="B22">2018</xref>), so only a brief description is given here.</p>
<sec>
<title>2.1. Identification of Crystalline Atomic Environments</title>
<p>Common methods used to identify crystalline structures include the centrosymmetry parameter (Kelchner et al., <xref ref-type="bibr" rid="B16">1998</xref>), common neighbor analysis (CNA) (Faken and J&#x000F3;nsson, <xref ref-type="bibr" rid="B9">1994</xref>), polyhedral template matching (PTM) (Larsen et al., <xref ref-type="bibr" rid="B17">2016</xref>), and Voronoi cell analysis methods (Bernal, <xref ref-type="bibr" rid="B7">1959</xref>; Rahman, <xref ref-type="bibr" rid="B25">1966</xref>; Bernal and Finney, <xref ref-type="bibr" rid="B8">1967</xref>; Finney, <xref ref-type="bibr" rid="B10">1970</xref>; Hsu and Rahman, <xref ref-type="bibr" rid="B13">1979</xref>; Sheng et al., <xref ref-type="bibr" rid="B29">2006</xref>; Lazar et al., <xref ref-type="bibr" rid="B18">2015</xref>).</p>
<p>The centrosymmetry parameter is a measure of the distance to an atom&#x00027;s <italic>n</italic> nearest neighbors to determine whether or not an atom is within a bulk crystal or a defect. CNA, PTM, and Voronoi analysis methods all classify the atomic structure of an atom by comparison of its local environment to a library of known structures, usually face-centered cubic (FCC), hexagonal close-packed (HCP), body-centered cubic (BCC), icosahedral (ICO), and, for some of these methods, simple cubic (SC).</p>
<p>These methods provide valuable tools for identifying the location, and in some cases the types, of defects present in an atomistic model. However, as with all tools (including those that we present in this paper), each method has certain drawbacks and limitations. The main disadvantages of the centrosymmetry parameter are that the number of neighbors, <italic>n</italic>, is a user-defined parameter, and the centrosymmetry parameter doesn&#x00027;t give any insight into what the local structure is if it is part of a defect. While some of the limitations of CNA have been reduced by the introduction of an adaptive cutoff radius (Stukowski, <xref ref-type="bibr" rid="B32">2012</xref>), the method is typically just used to determine whether an atom belongs to one of a small set of predetermined environments. PTM uses a more robust Voronoi method to identify neighbors, but it too relies on comparison with a small library of known environments. Voronoi analysis generally characterizes local environments by the number of faces with a particular number of edges, but this approach fails to distinguish between some common environments (FCC and HCP) (Bernal, <xref ref-type="bibr" rid="B7">1959</xref>; Rahman, <xref ref-type="bibr" rid="B25">1966</xref>; Bernal and Finney, <xref ref-type="bibr" rid="B8">1967</xref>; Finney, <xref ref-type="bibr" rid="B10">1970</xref>; Hsu and Rahman, <xref ref-type="bibr" rid="B13">1979</xref>; Sheng et al., <xref ref-type="bibr" rid="B29">2006</xref>). The recently developed Voronoi Topology (VoroTop) technique (Lazar et al., <xref ref-type="bibr" rid="B18">2015</xref>) uses planar graph representations to address this issue by including information about the arrangement of the faces, but requires a large database of nearly degenerate variants of the known Voronoi cells to compare against, since small atomic displacements can significantly affect the Voronoi cell topology. As with the other crystal structure identification methods, the VoroTop technique has primarily employed a small library of known structures. Additional environments can be added to these libraries, but this must be done manually.</p></sec>
<sec>
<title>2.2. Identification of Non-crystalline Atomic Environments</title>
<p>To adequately analyze the local atomic structure of defects, such as GBs, a method is needed that can classify atoms without <italic>a priori</italic> knowledge of the structures present (i.e., without reliance on a small precomputed list of known structures). Several recent publications have presented methods to identify arbitrary local environments (Bart&#x000F3;k et al., <xref ref-type="bibr" rid="B5">2013</xref>; Banadaki and Patala, <xref ref-type="bibr" rid="B4">2017</xref>; Reinhart et al., <xref ref-type="bibr" rid="B26">2017</xref>; Rosenbrock et al., <xref ref-type="bibr" rid="B28">2017</xref>; Priedeman et al., <xref ref-type="bibr" rid="B23">2018</xref>), and a brief description of each is given here.</p>
<p>Bart&#x000F3;k et al. (<xref ref-type="bibr" rid="B5">2013</xref>) developed an atomic structure descriptor based on the superposition of Gaussian kernels centered at atomic positions, referred to as the SOAP kernel/descriptor. SOAP is unique in that it is a continuous descriptor (making it robust against small changes in atomic positions) unlike most other descriptors that are discrete in nature. SOAP has recently been applied to characterize GBs by Rosenbrock et al. (<xref ref-type="bibr" rid="B28">2017</xref>) and Priedeman et al. (<xref ref-type="bibr" rid="B23">2018</xref>).</p>
<p>Banadaki and Patala (<xref ref-type="bibr" rid="B4">2017</xref>) presented the polyhedral unit model, which compares the neighborhood around voids in atomic structures (at which vertices in the Voronoi tessellation are centered) against an exhaustive library of configurations of close-packed spheres for up to 12 spheres. A benefit of the polyhedral unit model is that an RMSD value can be calculated to quantify how close of a match particular structures are to their reference structures, but the resulting polyhedra are centered on a void as opposed to an atom which is the more common representation of an atomic environment.</p>
<p>Reinhart et al. (<xref ref-type="bibr" rid="B26">2017</xref>) developed an algorithm called Neighborhood Graph Analysis (NGA), which implemented CNA with an adaptive cutoff radius to produce CNA signatures for arbitrary environments present in colloidal crystallization simulations. The adaptive cutoff however, produces an asymmetric neighborhood graph (i.e., atom B may be a neighbor to atom A, but that does not imply atom A will be in the neighborhood set of atom B) which can artificially increase the number of unique environments (i.e., there is an over-partitioning of the configuration space). This is compensated for by employing a machine learning algorithm to determine relationships between otherwise discrete signatures and consolidate similar environments that have different signatures. Reinhart et. al subsequently developed a modified version of their original algorithm, which they call the &#x0201C;fast NGA&#x0201D; (fNGA) algorithm (Reinhart and Panagiotopoulos, <xref ref-type="bibr" rid="B27">2018</xref>), which defines neighbors using a Delaunay triangulation (similar to PTM), and which uses graphlets to dramatically reduce the computational cost of the consolidation step. The present work can be seen as a simplified version of Reinhart&#x00027;s original approach.</p>
<p>While all of these methods are effective at classifying non-crystalline atomic environments, they are complex and in some cases computationally expensive. In this paper we present a comparatively simple alternative based on CNA to identify arbitrary local environments without the use of a predetermined library of structures. Because of its simplicity and the fact that it only requires some minor post-processing (code provided in <xref ref-type="supplementary-material" rid="SM1">Supplementary Material</xref>) of traditional CNA data that is already ubiquitously available in existing software packages, our approach can be easily adopted. While our method, like others, suffers from over-partitioning of the space of unique atomic environments, we show that it is, nevertheless, possible to gain insight into important structure-property relationships. We demonstrate the usefulness of this technique by characterizing the unique atomic environments (UAEs) present in the 388 GBs of the Olmsted data set (Olmsted et al., <xref ref-type="bibr" rid="B20">2009</xref>). We also test the recent hypothesis (Banadaki and Patala, <xref ref-type="bibr" rid="B3">2016</xref>) that the structures of &#x003A3;3 GBs may decompose into facets of the GBs occupying the corners of the corresponding GB plane fundamental zone (FZ). Finally, we give a brief example of how the UAEs identified using our approach might serve as inputs to machine learning strategies for the development of atomic structure-property models for GBs.</p></sec></sec>
<sec sec-type="methods" id="s3">
<title>3. Methods</title>
<sec>
<title>3.1. Traditional Common Neighbor Analysis</title>
<p>In the traditional CNA method, a set of three indices (<italic>j, k, l</italic>) is defined, which describes the topology of the graph formed by the nearest neighbor atoms (see <xref ref-type="fig" rid="F1">Figure 1</xref>). The three indices are computed for each neighboring atom to define their relationship to the central atom. The first index <italic>j</italic> enumerates the number of shared nearest neighbors (e.g., in <xref ref-type="fig" rid="F1">Figure 1</xref> the four light purple atoms are nearest neighbors of both the central atom and the dark purple atom, so for the dark purple atom <italic>j</italic> &#x0003D; 4). The index <italic>k</italic> enumerates the number of bonds between shared nearest neighbors (e.g., in <xref ref-type="fig" rid="F1">Figure 1</xref> there are two dashed purple lines indicating two distinct bonds between shared nearest neighbors, so for the dark purple atom <italic>k</italic> &#x0003D; 2). Finally, the index <italic>l</italic> enumerates the number of bonds in the longest bond-chain formed by shared neighbors (e.g., in <xref ref-type="fig" rid="F1">Figure 1</xref> the dashed purple lines do not share an atom, so the longest bond-chain between shared nearest neighbors is 1, giving <italic>l</italic> &#x0003D; 1 for the dark purple atom). CNA indices are calculated for each atom pair. The local environment (i.e., &#x0201C;atomic structure&#x0201D;) of a particular atom is then defined by the set of CNA indices of all of its nearest neighbors. As has been done in prior literature (Stukowski, <xref ref-type="bibr" rid="B32">2012</xref>; Reinhart et al., <xref ref-type="bibr" rid="B26">2017</xref>), we refer to this as an atom&#x00027;s CNA signature to distinguish it from the atom&#x00027;s CNA indices. For example, the CNA signature of an atom whose local structure corresponds to an FCC lattice would be denoted {12 &#x000D7; (4, 2, 1)}, indicating that it possesses 12 nearest neighbors, each with CNA indices of (4, 2, 1). An atom with a less symmetric local environment, such as one belonging to a GB might have a CNA signature of {2 &#x000D7; (3, 1, 1), 3 &#x000D7; (4, 2, 1), 2 &#x000D7; (4, 2, 2), 2 &#x000D7; (4, 3, 3), } {1 &#x000D7; (4, 4, 4), 2 &#x000D7; (5, 4, 4)}, indicating a total of twelve nearest neighbors, but which have different CNA indices.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>Illustration of the process for determining CNA indices and the CNA signature, concept inspired by Reinhart et al. (<xref ref-type="bibr" rid="B26">2017</xref>). In <bold>(A)</bold> an atom is shown (central yellow atom, which has been reduced in size for visual clarity) together with its nearest neighbors. The corresponding graph representation is provided in <bold>(B)</bold>. The light colored symbols represent the nearest neighbors shared with the central atom (four for the purple neighbor and four for the green neighbor). Solid lines represent bonds between neighbors of the central atom, while dashed lines represent bonds between shared neighbors (two for both the purple and green neighbors). For the purple neighbor the shared bonds (dashed lines) are not connected, so <italic>k</italic> &#x0003D; 1, but for the green neighbor the shared bonds are connected so <italic>k</italic> &#x0003D; 2. Because of the symmetry of this graph, there are six neighbors with the same indices (4, 2, 1) as the purple atom (represented by circles) and six with the same indices (4, 2, 2) as the green (represented by squares). Consequently the CNA signature for the central atom is {6 &#x000D7; (4, 2, 1), 6 &#x000D7; (4, 2, 2)}, which represents an HCP atomic environment.</p></caption>
<graphic xlink:href="fmats-06-00120-g0001.tif"/>
</fig>
<p>We note that neighbors can be identified using various methods, the primary ones being a fixed cutoff radius or an adaptive cutoff (Stukowski, <xref ref-type="bibr" rid="B32">2012</xref>; Reinhart et al., <xref ref-type="bibr" rid="B26">2017</xref>). In this work we chose to use a fixed cutoff of 3.5&#x000C5; (which falls between the first and second nearest neighbors for the FCC lattice, see <bold>Figure 3A</bold>). The fixed cutoff was chosen both because of its simplicity and because it resulted in fewer unique signatures than the adaptive methods (2205 vs. 3716) for the structures that we analyzed.</p>
<p>Once the CNA signature of every atom has been computed, atomic structures are identified by comparison with the CNA signatures of a predefined library of known structures, typically limited to FCC, HCP, BCC, and ICO. In standard usage, any atom whose CNA signature does not match that of one of the predefined structural templates remains unclassified and is labeled as &#x0201C;other.&#x0201D; This is sufficient to identify the <italic>location</italic> of defects because &#x0201C;other&#x0201D; atoms typically are found at defects. However, it is generally insufficient to resolve the <italic>structure</italic> of those defects. Because GBs consist of mostly &#x0201C;other&#x0201D; atoms, their internal atomic structure cannot typically be resolved. Furthermore, if two GBs both contain all &#x0201C;other&#x0201D; atoms, it is difficult to distinguish between them.</p></sec>
<sec>
<title>3.2. Fully-Leveraged CNA</title>
<p>To address this issue, we note that the information necessary to distinguish &#x0201C;other&#x0201D; atoms from one another is already available and encoded in their respective CNA signatures, it is just typically ignored in standard practice. To exploit this information one must simply identify all of the unique CNA signatures; these define distinct atomic structure classes; in some sense this list constitutes an extended structure library. Atoms are then classified using this extended structure library. However, it is constructed at the time of analysis and is compatible with arbitrary atomic structures (one does not need to know what structures they are looking for <italic>a priori</italic>). Furthermore, the &#x0201C;other&#x0201D; category is entirely eliminated as all atoms are classified and belong to one of the UAEs that were identified.</p>
<p>To extract the complete CNA signatures for each atom in the structures that we analyzed, there are built-in functions that can be run as part of a pipeline in the Open Visualization Tool (OVITO) (Stukowski, <xref ref-type="bibr" rid="B31">2010</xref>), and an example python script is available in the online <ext-link ext-link-type="uri" xlink:href="https://ovito.org/manual/python/introduction/examples.html&#x00023;example-compute-cna-bond-indices">OVITO documentation</ext-link>. We modified this script for our particular application, and we provide our modified version in the accompanying <xref ref-type="supplementary-material" rid="SM1">Supplementary Material</xref>. Once extracted, the unique CNA signatures were then identified in MATLAB and each was assigned a unique numerical class ID (we also provide this code in the <xref ref-type="supplementary-material" rid="SM1">Supplementary Material</xref>), which was subsequently imported into OVITO as a custom particle property, allowing for color-coding and visualization.</p></sec></sec>
<sec id="s4">
<title>4. Results and Discussion</title>
<sec>
<title>4.1. Classifying &#x0201C;Other&#x0201D; Atoms in GBs</title>
<p>We applied the fully-leveraged CNA approach to characterize all of the atoms in the 388 GBs from the Olmsted data set (Olmsted et al., <xref ref-type="bibr" rid="B20">2009</xref>), which contains atomic structures for a total of 388 GBs in Al with variation across all five crystallographic degrees of freedom, including 41 &#x003A3;3 GBs. Here we present the results of that analysis. The vast majority of the atoms belong to the grain interiors and are FCC, and could be easily characterized by existing methods. We, therefore, focus on the GB atoms, which are generally classified as &#x0201C;other&#x0201D;/unidentified structures by reference structure based techniques. We define an atom as belonging to the GB if at least one of the nearest neighbors is not FCC. This results in all of the non-FCC atoms, as well as many FCC atoms inside or adjacent to the GB being identified with it (for some tilt GBs, if the dislocation spacing is sufficiently large there will be FCC atoms in the GB plane, that are entirely surrounded by other FCC atoms, which would not be counted as GB atoms by this definition.). Using this definition, there are a total of 462,955 GB atoms, out of a total of 11,922,451 atoms contained in the Olmsted data set (the non-GB atoms belong to the bulk crystal and are all FCC). While some GBs properly contain FCC atoms in their interior (e.g., low-angle GBs have FCC atoms between dislocations), the focus of this work is on characterizing non-FCC atoms. Consequently, we will present our results in two ways: (i) relative to all 462,955 GB atoms (FCC and non-FCC), and (ii) relative to only the non-FCC GB atoms (of which there are 227,401).</p>
<p><xref ref-type="fig" rid="F2">Figure 2A</xref> shows the distribution of GB atomic environments across all 388 GBs for the fully-leveraged CNA approach. This shows that out of the nearly 500,000 GB atoms (across all 388 GBs), there are 2205 unique CNA signatures. However, noting the log-scale in the y-axis, only 448 signatures are needed to account for approximately 90% of the non-FCC GB atoms (see <xref ref-type="fig" rid="F2">Figure 2B</xref>), and only 167 are needed if the GB atoms with FCC structure are included<xref ref-type="fn" rid="fn0001"><sup>1</sup></xref>. While this still represents a non-negligible number of unique environments, it is a considerable reduction in dimensionality for a general set of grain boundaries, which would otherwise require a total of at least 682, 203 parameters to describe the atomic configurations (3 parameters for each atom Rosenbrock et al., <xref ref-type="bibr" rid="B28">2017</xref>).</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p><bold>(A)</bold> Histogram of UAEs found in the 388 Omlsted GBs. Note that this is on a log scale and there are approximately 5 &#x000D7; 10<sup>5</sup> GB atoms. <bold>(B)</bold> Cumulative sum of the proportion of atoms that can be described using a given number of UAEs. Approximately 90% of the non-FCC GB atoms can be described by one of the 448 most prevalent UAEs (only 167 UAEs are required if the GB atoms with FCC structure are included).</p></caption>
<graphic xlink:href="fmats-06-00120-g0002.tif"/>
</fig>
<p>We note that, using an alternative spatially continuous descriptor, smooth overlap of atomic positions (SOAP) (Bart&#x000F3;k et al., <xref ref-type="bibr" rid="B5">2013</xref>), Rosenbrock et al. initially found 800,000 UAEs for the same 388 GBs in Ni, using a neighborhood cutoff distance of 5&#x000C5; (Rosenbrock et al., <xref ref-type="bibr" rid="B28">2017</xref>). In the SOAP method, as well as other methods such as PTM, a similarity measure is employed, enabling two structures that differ by only a small perturbation to still be classified as the same environment, which is one way to correct for the overpartitioning phenomenon. After using a similarity metric within a machine learning framework the original 800,000 UAEs were consolidated to only 145 distinct UAEs. We note that, as with any similarity based consolidation approach, the resulting number of unique environments depends on the user specified similarity threshold.</p>
<p>The simple approach to UAE identification embodied in the fully-leveraged CNA does not employ a similarity threshold, so it is expected that the UAE space will be over-partitioned. This manifests itself in the relatively long-tailed distribution of UAEs in <xref ref-type="fig" rid="F2">Figure 2</xref>, which are produced by small deviations in atomic position that cause a single environment to produce multiple CNA signatures (i.e., UAEs that are not frequently observed are most likely slightly distorted versions of other UAEs). The underlying cause of this phenomenon is the difficulty in unambiguously defining atomic neighbors in non-crystalline regions. To illustrate this, compare the radial distribution function (RDF) for bulk FCC with that of a grain boundary, as shown in <xref ref-type="fig" rid="F3">Figure 3</xref>. The clear separation of the first and second peaks&#x02014;corresponding to the first and second nearest neighbors, respectively&#x02014;in the RDF of the FCC lattice (<xref ref-type="fig" rid="F3">Figure 3A</xref>) facilitates the selection of an appropriate neighbor cutoff radius. However, as expected, the RDF for the grain boundary atoms (<xref ref-type="fig" rid="F3">Figure 3B</xref>) does not show a clear separation between first and second neighbors, making CNA sensitive to small perturbations of atomic position and changes in the cutoff radius. This also means that the number of UAEs identified by the fully-leveraged CNA approach of the present work depends on the user chosen cutoff radius. This challenge exists for any method that attempts to characterize GB atoms, because there is no clear choice as to which atoms should be included in the neighborhood, and the resulting structures are likely to over-partition the UAE space.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p><bold>(A)</bold> The radial distribution function (RDF) for an FCC lattice calculated in OVITO and <bold>(B)</bold> the RDF for the grain boundary atoms (a <inline-formula><mml:math id="M1"><mml:mrow><mml:mi>&#x003A3;</mml:mi><mml:mn>63</mml:mn><mml:mtext>&#x0205F;</mml:mtext><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mn>5</mml:mn><mml:mover accent='true'><mml:mn>1</mml:mn><mml:mo stretchy='true'>&#x000AF;</mml:mo></mml:mover><mml:mover accent='true'><mml:mn>3</mml:mn><mml:mo stretchy='true'>&#x000AF;</mml:mo></mml:mover></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> GB was used as a representative example).The distinct peaks in the bulk FCC make it easy to choose an appropriate cutoff distance for neighbor identification, however the more continuous nature of the GB RDF causes CNA to be more sensitive to small perturbations in atomic position and changes in the cutoff.</p></caption>
<graphic xlink:href="fmats-06-00120-g0003.tif"/>
</fig>
<p>As mentioned earlier, work has been done by Reinhart et al. (<xref ref-type="bibr" rid="B26">2017</xref>) to establish a machine learning approach to identify environments that have similar structure but different CNA signatures and combine them into a single environment (i.e., clustering in the UAE space). This effectively implements a similarity metric for CNA, and was successful in its application to surfaces of colloidal crystals. However, this process is computationally expensive and does not result in a single universal partitioning of the UAE space, so the repartitioning would need to be recalculated (or at least updated) for every new data set to be characterized. In spite of the overpartitioning that results from the simple fully-leveraged CNA approach, and in the absence of environment consolidation, we find that useful analysis can still be performed to evaluate GB structure-property models as will be described in section 4.3.</p>
<p>For the subset of &#x003A3;3 GBs, the number of UAEs is reduced considerably. <xref ref-type="fig" rid="F4">Figure 4</xref> shows the distribution of atomic environments found in the subset of 41 &#x003A3;3 GBs, for which there were only 117 unique CNA signatures. Moreover, the vast majority of the GB atoms (roughly 90%) correspond to one of just 44 UAEs (or only 29 UAEs if GB atoms with FCC structure are included). This kind of dimensionality reduction for descriptions of GB atomic structure may make inference of GB atomic structure-property models significantly more tractable. Furthermore, this information can be used to compare the structural similarity of different GBs as will be discussed in section 4.3.</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p><bold>(A)</bold> Histogram of UAEs found in &#x003A3;3 GBs. The large spike at environment 1 corresponds to the FCC structural type and is due to the inclusion of the first layer of FCC atoms as part of the GB. <bold>(B)</bold> Cumulative sum of the proportion of &#x003A3;3 GB atoms that can be described using a given number of UAEs. Approximately 90% of the &#x003A3;3 atoms correspond to one of the 44 most prevalent UAEs.</p></caption>
<graphic xlink:href="fmats-06-00120-g0004.tif"/>
</fig></sec>
<sec>
<title>4.2. Visualization</title>
<p>Without resorting to the more advanced machinery of SOAP or Reinhart&#x00027;s machine learning approach, most analysis of atomic structures relies on the simpler reference structure based crystal structure identification techniques. Because they were designed to identify crystalline regions, and not GBs, 50%&#x02212;98% of the GB atoms in the Olmsted data set are, unsurprisingly, classified as &#x0201C;other&#x0201D; by the reference structure based techniques, making the atomic structure of these GBs largely opaque to classical analysis. As revealed by our fully-leveraged CNA technique, the fact that only 44 UAEs dominate the &#x003A3;3 GBs studied here suggests the possibility of discovering new GB structural information for very little computational effort, and within the familiar CNA framework. We illustrate this through visualization, by coloring GB atoms according to their UAE identifier. As an example, <xref ref-type="fig" rid="F5">Figures 5A,B</xref> provides a rendering of a <inline-formula><mml:math id="M2"><mml:mi>&#x003A3;</mml:mi><mml:mn>3</mml:mn><mml:mtext>&#x000A0;</mml:mtext><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mn>5</mml:mn></mml:mrow><mml:mo>&#x00304;</mml:mo></mml:mover><mml:mn>1</mml:mn><mml:mover accent="true"><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:mo>&#x00304;</mml:mo></mml:mover></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> GB with atoms colored according to standard practice (using the traditional CNA approach). The FCC atoms (in green) are identified, but all of the atoms at the GB are classified as &#x0201C;other&#x0201D;/unidentified environments. In contrast, <xref ref-type="fig" rid="F5">Figure 5C</xref> shows the same GB atoms colored using the atomic environment classes identified by our fully-leveraged CNA technique. It is evident that this GB contains a structured arrangement of atomic environments and is quasi-two dimensional. This new approach reveals structure that was previously unresolvable using the common crystal structure identification techniques, and for far less computational effort than the more advanced techniques.</p>
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p><bold>(A)</bold> OVITO rendering of a <inline-formula><mml:math id="M3"><mml:mi>&#x003A3;</mml:mi><mml:mn>3</mml:mn><mml:mtext>&#x000A0;</mml:mtext><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mn>52</mml:mn><mml:mover accent="true"><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>&#x00304;</mml:mo></mml:mover></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> GB, with atoms colored by traditional CNA. Green atoms are FCC, white are &#x0201C;other&#x0201D;. Notice that all of the GB atoms remain unidentified. <bold>(B)</bold> All FCC atoms removed. <bold>(C)</bold> Atoms colored by the UAE IDs obtained from our fully-leveraged CNA procedure.</p></caption>
<graphic xlink:href="fmats-06-00120-g0005.tif"/>
</fig>
<p>In addition to the ability to easily obtain important structural information for a single GB, coloring each atom according to its local environment facilitates identification of structural similarity among different GBs. In the case of &#x003A3;3 GBs, it has been hypothesized that GBs may form facets whose structure corresponds to that of the GBs that occupy the corners of the relevant boundary plane fundamental zone (FZ) (Banadaki and Patala, <xref ref-type="bibr" rid="B3">2016</xref>). However, a test of this hypothesis would require comparison of the atomic structures of various GBs, which would be difficult using reference structure based descriptors that leave nearly all of those atoms unclassified. For example, the top row of <xref ref-type="fig" rid="F6">Figure 6</xref> shows three different &#x003A3;3 GBs that are near each other in the FZ. While terrace-like features are apparent, it is unclear whether these represent facets of the same structure. Using the fully-leveraged CNA procedure, the bottom row of <xref ref-type="fig" rid="F6">Figure 6</xref> makes it clear that each of these GBs do in fact contain very similar environments, giving some evidence in support of the faceting hypothesis. A more complete analysis of faceting in &#x003A3;3 GBs, enabled by the fully-leveraged CNA technique, is provided in section 4.3.</p>
<fig id="F6" position="float">
<label>Figure 6</label>
<caption><p>Visualization of three &#x003A3;3 GBs (boundary plane is indicated in brackets), (above) with atoms colored using traditional CNA available in OVITO, and (below) colored by the UAEs found during the fully-leverage CNA procedure.</p></caption>
<graphic xlink:href="fmats-06-00120-g0006.tif"/>
</fig>
<p>Visualizing a grain boundary in this manner also highlights higher-order defects, or defects inside of other defects (note the dark purple environments that decorate the ledges in <xref ref-type="fig" rid="F6">Figure 6</xref>).</p></sec>
<sec>
<title>4.3. Application</title>
<p>Here we apply the fully-leveraged CNA technique to investigate the relationship between atomic structure and GB properties. As mentioned previously, it was recently hypothesized by Banadaki and Patala (<xref ref-type="bibr" rid="B3">2016</xref>) that &#x003A3;3 GBs may be composed of facets whose structure corresponds to that of the 3 GBs that define the corners of the &#x003A3;3 GB plane FZ. Based on this hypothesis, Banadaki and Patala developed a structure-property model to predict the GB energy of an arbitrary &#x003A3;3 GB as a weighted average of the GB energies of the FZ corners. This model showed good agreement with GB energies calculated by MD for many cases. However, the GB structures were never analyzed to test whether the hypothesized structural faceting actually occurred. The fully-leveraged CNA approach presented here provides an opportunity to test this hypothesis.</p>
<p>The total number of UAEs found in each of the GBs that define the corners of the &#x003A3;3 GB plane FZ are provided in <xref ref-type="table" rid="T2">Table 2</xref>. It is notable that the UAEs appearing in each of the corner GBs form disjoint sets. This implies that they are in some sense orthogonal structures, which might at first appear to support the possibility of faceting. However, the total number of environments (117) found across all of the 41 &#x003A3;3 GBs is greater than the total number of environments found in the FZ corners (12), and, as shown in <xref ref-type="fig" rid="F7">Figure 7</xref>, these additional environments are not concentrated at ledges between facets, but constitute significant portions of the non-corner GBs.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Summary of UAEs found in the &#x003A3;3 fundamental zone.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>GB Plane</bold></th>
<th valign="top" align="center"><bold>&#x00023; of UAEs</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">[111]</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left"><inline-formula><mml:math id="M4"><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mn>2</mml:mn><mml:mover accent="true"><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>&#x00304;</mml:mo></mml:mover><mml:mover accent="true"><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>&#x00304;</mml:mo></mml:mover></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula></td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td valign="top" align="left"><inline-formula><mml:math id="M5"><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mn>10</mml:mn><mml:mover accent="true"><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>&#x00304;</mml:mo></mml:mover></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula></td>
<td valign="top" align="center">5</td>
</tr> <tr style="border-top: thin solid #000000;">
<td valign="top" align="left">TOTAL</td>
<td valign="top" align="center">12</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>These environments are present in many of the non-corner GBs, but other environments are also present, increasing the overall GB energy</italic>.</p>
</table-wrap-foot>
</table-wrap>
<fig id="F7" position="float">
<label>Figure 7</label>
<caption><p>Rendering of the 41 &#x003A3;3 GBs from the Olmsted data set, with atoms colored by their UAE ID. Position in the FZ is relative and approximate (exact placement would cause some images to overlap). Colors were selected manually for the most frequently occurring UAEs in an effort to maximize visual differences between atoms of different UAE ID that are near each other; however, some less frequently observed UAEs do share the same color.</p></caption>
<graphic xlink:href="fmats-06-00120-g0007.tif"/>
</fig>
<p>Several key observations can be derived from <xref ref-type="fig" rid="F7">Figure 7</xref>. First, there are in fact some regions of the FZ where the GBs are made of facets of the corner GBs. In particular, GBs near the [111] coherent twin (&#x003B8; &#x0003D; &#x003D5; &#x0003D; 0) show obvious facets whose structure is that of the coherent twin. Also, GBs along the right boundary of the FZ (&#x003B8; &#x0003D; 90&#x000B0;) show some evidence of faceting (this behavior near the <inline-formula><mml:math id="M6"><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mn>2</mml:mn><mml:mover accent="true"><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>&#x00304;</mml:mo></mml:mover><mml:mover accent="true"><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>&#x00304;</mml:mo></mml:mover></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> corner was also noted by Banadaki and Patala, <xref ref-type="bibr" rid="B4">2017</xref>), though for many of these GBs the structure of these facets does not correspond to any of the FZ corners. As for the rest of the FZ, there is no clear evidence of faceting for the Olmsted Al GBs. It is important to note, however, that the ability of a GB to facet in an atomistic model may depend on the size of the simulation cell that was employed to construct it (see Race et al., <xref ref-type="bibr" rid="B24">2014</xref>; Humberson and Holm, <xref ref-type="bibr" rid="B14">2017</xref>, for a discussion of the impact of simulation cell size), so that it is possible that if larger simulation cells were used, faceting might be observed more generally. Moreover, it has been shown that there can be many metastable atomic structures for the same GB (Han et al., <xref ref-type="bibr" rid="B12">2017</xref>), some of which have nearly degenerate energies. Thus, it is also possible that there are distinct iso-energetic configurations, or that the atomic structures in this data set may not be the lowest energy configurations, which might otherwise exhibit the hypothesized faceting structure. Indeed, Banadaki and Patala found atomic structures for the &#x003A3;3 GBs with considerably lower energies in many cases (Banadaki and Patala, <xref ref-type="bibr" rid="B3">2016</xref>), which may have exhibited faceting more generally, and this may be one explanation for the better fit of the faceting model&#x00027;s energy predictions to their data than to the Olmsted data (see <bold>Figure 9</bold>). Regardless of whether or not the atomic structures in the Olmsted data set are ground state structures or (at least in some cases) metastable structures, the fully-leveraged CNA approach can be applied to characterize the atomic structure that is present, whatever it happens to be. Furthermore, if ground state structures were available, our fully-leveraged CNA approach would easily identify more general faceting if it were to occur in those structures.</p>
<p>Although structural faceting does not occur generally for the Olmsted atomic structures, relatively smooth trends in the composition of UAEs are observed across the FZ. <xref ref-type="fig" rid="F8">Figure 8</xref> shows the fraction of atoms in each GB whose atomic environments match those of each of the FZ corners. For all three corners, smooth trends in atomic environment composition are observed along &#x003B8; &#x0003D; 90&#x000B0; from <inline-formula><mml:math id="M7"><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mn>2</mml:mn><mml:mover accent="true"><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>&#x00304;</mml:mo></mml:mover><mml:mover accent="true"><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>&#x00304;</mml:mo></mml:mover></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> to <inline-formula><mml:math id="M8"><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mn>10</mml:mn><mml:mover accent="true"><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>&#x00304;</mml:mo></mml:mover></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> (for the <inline-formula><mml:math id="M9"><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mn>2</mml:mn><mml:mover accent="true"><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>&#x00304;</mml:mo></mml:mover><mml:mover accent="true"><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>&#x00304;</mml:mo></mml:mover></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> corner it is smooth, but not monotonic, see <xref ref-type="fig" rid="F8">Figure 8B</xref>). Smooth trends also occur along &#x003D5; &#x0003D; 0 from [111] to <inline-formula><mml:math id="M10"><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mn>2</mml:mn><mml:mover accent="true"><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>&#x00304;</mml:mo></mml:mover><mml:mover accent="true"><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>&#x00304;</mml:mo></mml:mover></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> and near the coherent twin. Furthermore, as the crystallographic distance to one of the corner GBs increases, the proportion of atomic environments belonging to that corner decreases. This suggests that in the absence of faceting (which represents a sort of structural segregation behavior) there may be a sort of mixing behavior of atomic environments from each of the FZ corners for these GB structures.</p>
<fig id="F8" position="float">
<label>Figure 8</label>
<caption><p><bold>(A&#x02013;C)</bold> Fraction of GB atoms whose local environment belongs to the set of UAEs present in each of the respective corners of the fundamental zone and <bold>(D)</bold> fraction of environments not present in any of the FZ corners.</p></caption>
<graphic xlink:href="fmats-06-00120-g0008.tif"/>
</fig>
<p>Because we do not observe structural faceting generally, it is not surprising that the faceting model does not predict the energies of the Olmsted data set well. However, for some regions of the FZ there are also deviations between the faceting model&#x00027;s predictions and the calculated GB energies for the lower energy atomic structures obtained by Banadaki and Patala. It is notable that where these deviations do occur, they are almost exclusively underpredictions. Our observations here may partially explain this behavior. The faceting model predicts GB energy as a weighted average of the energies of the GBs at the FZ corners, which ignores the energetic contribution of the line defects that will likely exist at the junction of distinct facets, and underpredictions are therefore consistent with this omission. These line defects are likely to be composed of atomic environments that are not present in the FZ corners, and which may have higher cohesive energies. In fact, we find that the non-corner atomic environments have an average cohesive energy<xref ref-type="fn" rid="fn0002"><sup>2</sup></xref> that is 3.5 &#x000D7; 10<sup>&#x02212;21</sup> J (0.022 eV) higher than the average for the atomic environments that belong to the FZ corners. This may seem like a small difference, but because many GBs contain a large portion of non-corner environments (a median of 49% of the GB atoms) the cumulative effect can be significant. A rough estimate is illustrative: if 50% of a GB&#x00027;s atoms (e.g., 500 of 1000) are non-corner environments and possess the average non-corner environment cohesive energy (&#x02212;5.30 &#x000D7; 10<sup>&#x02212;19</sup> J or &#x02212;3.31 eV) then with a GB area of 1800 &#x000C5;<sup>2</sup> (the average cross-section for an Olmsted simulation cell) the non-corner environments would contribute approximately 0.097 J/m<sup>2</sup> to the GB energy, which is similar to the magnitude of the underpredictions shown in <xref ref-type="fig" rid="F9">Figure 9</xref>.</p>
<fig id="F9" position="float">
<label>Figure 9</label>
<caption><p>Comparison of calculated GB energies for GBs that lie along the edges of the &#x003A3;3 fundamental zone with the predictions of Banadaki and Patala&#x00027;s faceting model (Banadaki and Patala, <xref ref-type="bibr" rid="B3">2016</xref>). Two data sets of calculated GB energies are included: (open squares) those from Olmsted et al. (<xref ref-type="bibr" rid="B20">2009</xref>) and (semi-transparent filled circles) those from Banadaki and Patala (<xref ref-type="bibr" rid="B3">2016</xref>). The predictions of the UAE model are also included (filled squares), for comparison with the Olmsted simulations (open squares).</p></caption>
<graphic xlink:href="fmats-06-00120-g0009.tif"/>
</fig>
</sec>
<sec>
<title>4.4. Simple UAE Model</title>
<p>This suggests that a model based on atomic environments, might provide improved predictions for GB energy. We note that important work in this area has already been performed by Rosenbrock et. al within the SOAP framework (Rosenbrock et al., <xref ref-type="bibr" rid="B28">2017</xref>). The rigorous development of such a model is beyond the scope of the present work, whose primary objective has been to present a simple atomic structure characterization technique (the fully-leveraged CNA approach) that enables characterization of GB atomic structure that was unresolvable using crystal structure identification approaches. Nevertheless, we provide a simple and brief example of how the resulting UAEs might be incorporated into machine learning or other model development approaches.</p>
<p>We treat the fraction of each UAE as a predictor (independent) variable and the energy of the GB as the response (dependent) variable. This implies a 2205 dimensional space (corresponding to the 2205 UAEs observed across all 388 GBs). We employ PCA to perform feature transformation and selection and find that only 84 principle components (linear combinations of the original variables) are required to explain 95% of the variance in the data. Thus, we have reduced the dimensionality of the problem from 2205 to 84 dimensions. Using these 84 transformed variables, we employ 5-fold cross-validation to train a simple linear regression model. Comparison of the resulting model to the calculated GB energies for all 388 GBs is provided in <xref ref-type="fig" rid="F10">Figure 10</xref>, with the subset of &#x003A3;3 GBs highlighted. Comparison of the model predictions to the Olmsted simulations for the subset of &#x003A3;3 GBs as a function of boundary plane orientation is also provided in <xref ref-type="fig" rid="F9">Figure 9</xref> (compare filled vs. open squares). The resulting model predictions agree well with the calculated values, and the model predicts the correct GB energy with less than 10% error for 89.69% of the 388 GBs (and 92.68% of the &#x003A3;3 GBs). We note, in particular, the improved predictions of the UAE model across the &#x003B8; &#x0003D; 90&#x000B0; arc of the FZ from <inline-formula><mml:math id="M11"><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mn>2</mml:mn><mml:mover accent="true"><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>&#x00304;</mml:mo></mml:mover><mml:mover accent="true"><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>&#x00304;</mml:mo></mml:mover></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> to <inline-formula><mml:math id="M12"><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mn>10</mml:mn><mml:mover accent="true"><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>&#x00304;</mml:mo></mml:mover></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> (the green filled squares agree well with the green open squares in the right panel of <xref ref-type="fig" rid="F9">Figure 9</xref>) as compared to the faceting model (solid green line).</p>
<fig id="F10" position="float">
<label>Figure 10</label>
<caption><p>Comparison of the predictions of the model&#x02014;trained using the UAE fractions as variables&#x02014;with the true calculated GB energies from the Olmsted data set (Olmsted et al., <xref ref-type="bibr" rid="B20">2009</xref>).</p></caption>
<graphic xlink:href="fmats-06-00120-g0010.tif"/>
</fig></sec></sec>
<sec sec-type="conclusions" id="s5">
<title>5. Conclusion</title>
<p>In this work, we have presented an atomic structure characterization technique (the fully-leveraged CNA approach) that (i) can characterize arbitrary atomic environments, while also being both (ii) simple to implement, and (iii) built upon a descriptor that is already familiar to the atomistic modeling community. This enables characterization of GB atomic structure that was previously unresolvable using crystal structure identification techniques, and for lower computational effort than more advanced techniques. We show that it is possible to describe GB atomic structure in terms of the proportion of the unique atomic environments (UAEs) resulting from the use of our method.</p>
<p>We find that a relatively small number of UAEs account for a large proportion of the GB atoms, suggesting the possibility of a significant dimensionality reduction in the description of GB atomic structure. Specifically, we found that to describe 90% of the non-FCC GB atoms present in the 388 GBs of the Olmsted data set, only 448 UAEs (CNA signatures) are required, and for the subset of 41 &#x003A3;3 GBs only 44 UAEs are necessary. This dimensionality reduction suggests that these UAEs can act as atomic structure descriptors that might be incorporated into machine learning approaches to develop improved GB structure-property models.</p>
<p>We demonstrated how visualization of the UAEs reveals important GB structural information. As an example, we investigated the possible description of &#x003A3;3 GBs as being composed of facets of the GBs occupying the corners of the corresponding boundary plane fundamental zone (FZ). We found that for the Olmsted data set such faceting does occur in certain regions of the FZ, but not generally. Instead, an apparent mixing of atomic environments from the GBs defining the FZ corners was observed, together with the appearance of numerous environments not present in the FZ corners. These observations are consistent with the good agreement of the faceting model with calculated GB energies for some regions of the FZ, as well as the observed underprediction in other regions.</p>
<p>Finally, we provided a brief example to illustrate how the UAE fractions can be used as GB atomic structure descriptors that can serve as input to machine learning approaches for the development of GB atomic structure-property models.</p></sec>
<sec id="s6">
<title>Data Availability</title>
<p>The datasets for this study will not be made publicly available because some data has been used with permission. All other data is available upon request to the corresponding author.</p></sec>
<sec id="s7">
<title>Author Contributions</title>
<p>OJ designed the project and trained the final model. BS and DD developed all of the analysis codes and performed the calculations and analysis. All authors contributed to preparation of the manuscript.</p>
<sec>
<title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p></sec></sec>
</body>
<back>
<ack><p>We are grateful to Dr. Eric R. Homer and Dr. Srikanth Patala for fruitful discussions, and to Dr. David Olmsted for permission to use the Al GB structures.</p>
</ack>
<sec sec-type="supplementary-material" id="s8">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fmats.2019.00120/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fmats.2019.00120/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.zip" id="SM1" mimetype="application/zip" xmlns:xlink="http://www.w3.org/1999/xlink"/></sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ackland</surname> <given-names>G. J.</given-names></name> <name><surname>Jones</surname> <given-names>A. P.</given-names></name></person-group> (<year>2006</year>). <article-title>Applications of local crystal structure measures in experiment and simulation</article-title>. <source>Phys. Rev. B</source> <volume>73</volume>:<fpage>054104</fpage>. <pub-id pub-id-type="doi">10.1103/PhysRevB.73.054104</pub-id></citation></ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ashby</surname> <given-names>M. F.</given-names></name> <name><surname>Spaepen</surname> <given-names>F.</given-names></name> <name><surname>Williams</surname> <given-names>S.</given-names></name></person-group> (<year>1978</year>). <article-title>The structure of grain boundaries described as a packing of polyhedra</article-title>. <source>Acta Metal.</source> <volume>26</volume>, <fpage>1647</fpage>&#x02013;<lpage>1663</lpage>. <pub-id pub-id-type="doi">10.1016/0001-6160(78)90075-5</pub-id></citation></ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Banadaki</surname> <given-names>A. D.</given-names></name> <name><surname>Patala</surname> <given-names>S.</given-names></name></person-group> (<year>2016</year>). <article-title>A simple faceting model for the interfacial and cleavage energies of &#x003A3;3 grain boundaries in the complete boundary plane orientation space</article-title>. <source>Comput. Mater. Sci.</source> <volume>112</volume>, <fpage>147</fpage>&#x02013;<lpage>160</lpage>. <pub-id pub-id-type="doi">10.1016/j.commatsci.2015.09.062</pub-id></citation></ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Banadaki</surname> <given-names>A. D.</given-names></name> <name><surname>Patala</surname> <given-names>S.</given-names></name></person-group> (<year>2017</year>). <article-title>A three-dimensional polyhedral unit model for grain boundary structure in fcc metals</article-title>. <source>npj Comput. Mater.</source> <volume>3</volume>:<fpage>13</fpage>. <pub-id pub-id-type="doi">10.1038/s41524-017-0016-0</pub-id></citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bart&#x000F3;k</surname> <given-names>A. P.</given-names></name> <name><surname>Kondor</surname> <given-names>R.</given-names></name> <name><surname>Cs&#x000E1;nyi</surname> <given-names>G.</given-names></name></person-group> (<year>2013</year>). <article-title>On representing chemical environments</article-title>. <source>Phys. Rev. B Condens. Matter Mater. Phys.</source> <volume>87</volume>, <fpage>1</fpage>&#x02013;<lpage>16</lpage>. <pub-id pub-id-type="doi">10.1103/PhysRevB.87.184115</pub-id></citation></ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bechtle</surname> <given-names>S.</given-names></name> <name><surname>Kumar</surname> <given-names>M.</given-names></name> <name><surname>Somerday</surname> <given-names>B.</given-names></name> <name><surname>Launey</surname> <given-names>M.</given-names></name> <name><surname>Ritchie</surname> <given-names>R.</given-names></name></person-group> (<year>2009</year>). <article-title>Grain-boundary engineering markedly reduces susceptibility to intergranular hydrogen embrittlement in metallic materials</article-title>. <source>Acta Mater.</source> <volume>57</volume>, <fpage>4148</fpage>&#x02013;<lpage>4157</lpage>. <pub-id pub-id-type="doi">10.1016/J.ACTAMAT.2009.05.012</pub-id></citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bernal</surname> <given-names>J.</given-names></name></person-group> (<year>1959</year>). <article-title>A geometrical approach to the structure of liquids</article-title>. <source>Nature</source> <volume>183</volume>. <fpage>141</fpage>&#x02013;<lpage>147</lpage>. <pub-id pub-id-type="doi">10.1038/183141a0</pub-id></citation></ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bernal</surname> <given-names>J. D.</given-names></name> <name><surname>Finney</surname> <given-names>J. L.</given-names></name></person-group> (<year>1967</year>). <article-title>Random close-packed hard-sphere model. ii. geometry of random packing of hard spheres</article-title>. <source>Discuss. Faraday Soc.</source> <volume>43</volume>, <fpage>62</fpage>&#x02013;<lpage>69</lpage>. <pub-id pub-id-type="doi">10.1039/DF9674300062</pub-id></citation></ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Faken</surname> <given-names>D.</given-names></name> <name><surname>J&#x000F3;nsson</surname> <given-names>H.</given-names></name></person-group> (<year>1994</year>). <article-title>Systematic analysis of local atomic structure combined with 3D computer graphics</article-title>. <source>Comput. Mater. Sci.</source> <volume>2</volume>, <fpage>279</fpage>&#x02013;<lpage>286</lpage>. <pub-id pub-id-type="doi">10.1016/0927-0256(94)90109-0</pub-id></citation></ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Finney</surname> <given-names>J. L.</given-names></name></person-group> (<year>1970</year>). <article-title>Random packings and the structure of simple liquids. I. The geometry of random close packing</article-title>. <source>Proc. R. Soc. Lond. Ser. A Math. Phys. Sci.</source> <volume>319</volume>, <fpage>479</fpage>&#x02013;<lpage>493</lpage>. <pub-id pub-id-type="doi">10.1098/rspa.1970.0189</pub-id></citation></ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gertsman</surname> <given-names>V.</given-names></name> <name><surname>Tangri</surname> <given-names>K.</given-names></name></person-group> (<year>1997</year>). <article-title>Modelling of intergranular damage propagation</article-title>. <source>Acta Mater.</source> <volume>45</volume>, <fpage>4107</fpage>&#x02013;<lpage>4116</lpage>. <pub-id pub-id-type="doi">10.1016/S1359-6454(97)00083-9</pub-id></citation></ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Han</surname> <given-names>J.</given-names></name> <name><surname>Vitek</surname> <given-names>V.</given-names></name> <name><surname>Srolovitz</surname> <given-names>D. J.</given-names></name></person-group> (<year>2017</year>). <article-title>The grain-boundary structural unit model redux</article-title>. <source>Acta Mater.</source> <volume>133</volume>, <fpage>186</fpage>&#x02013;<lpage>199</lpage>. <pub-id pub-id-type="doi">10.1016/J.ACTAMAT.2017.05.002</pub-id></citation></ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hsu</surname> <given-names>C. S.</given-names></name> <name><surname>Rahman</surname> <given-names>A.</given-names></name></person-group> (<year>1979</year>). <article-title>Interaction potentials and their effect on crystal nucleation and symmetry</article-title>. <source> J. Chem. Phys.</source> <volume>71</volume>, <fpage>4974</fpage>&#x02013;<lpage>4986</lpage>. <pub-id pub-id-type="doi">10.1063/1.438311</pub-id></citation></ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Humberson</surname> <given-names>J.</given-names></name> <name><surname>Holm</surname> <given-names>E. A.</given-names></name></person-group> (<year>2017</year>). <article-title>Anti-thermal mobility in the &#x003A3;3 [111] 60 {11 8 5} grain boundary in nickel: mechanism and computational considerations</article-title>. <source>Scripta Mater.</source> <volume>130</volume>, <fpage>1</fpage>&#x02013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.1016/J.SCRIPTAMAT.2016.10.032</pub-id></citation></ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Katritzky</surname> <given-names>A. R.</given-names></name> <name><surname>Fara</surname> <given-names>D. C.</given-names></name></person-group> (<year>2005</year>). <article-title>How chemical structure determines physical, chemical, and technological properties: an overview illustrating the potential of quantitative structure property relationships for fuels science</article-title>. <source>Energy Fuels</source> <volume>19</volume>, <fpage>922</fpage>&#x02013;<lpage>935</lpage>. <pub-id pub-id-type="doi">10.1021/ef040033q</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kelchner</surname> <given-names>C. L.</given-names></name> <name><surname>Plimpton</surname> <given-names>S. J.</given-names></name> <name><surname>Hamilton</surname> <given-names>J. C.</given-names></name></person-group> (<year>1998</year>). <article-title>Dislocation nucleation and defect structure during surface indentation</article-title>. <source>Phys. Rev. B</source> <volume>58</volume>, <fpage>11085</fpage>&#x02013;<lpage>11088</lpage>. <pub-id pub-id-type="doi">10.1103/PhysRevB.58.11085</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Larsen</surname> <given-names>P. M.</given-names></name> <name><surname>Schmidt</surname> <given-names>S.</given-names></name> <name><surname>Schi&#x000F8;tz</surname> <given-names>J.</given-names></name></person-group> (<year>2016</year>). <article-title>Robust structural identification via polyhedral template matching</article-title>. <source>Model. Simul. Mater. Sci. Eng.</source> <volume>24</volume>:<fpage>055007</fpage>. <pub-id pub-id-type="doi">10.1088/0965-0393/24/5/055007</pub-id></citation></ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lazar</surname> <given-names>E. A.</given-names></name> <name><surname>Han</surname> <given-names>J.</given-names></name> <name><surname>Srolovitz</surname> <given-names>D. J.</given-names></name></person-group> (<year>2015</year>). <article-title>A topological framework for local structure analysis in condensed matter</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>112</volume>, <fpage>E5769</fpage>&#x02013;<lpage>E5776</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1505788112</pub-id><pub-id pub-id-type="pmid">26460045</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>X.</given-names></name> <name><surname>Ercolessi</surname> <given-names>F.</given-names></name> <name><surname>Adams</surname> <given-names>J.</given-names></name></person-group> (<year>2004</year>). <article-title>Aluminium interatomic potential from density functional theory calculations with improved stacking fault energy</article-title>. <source>Model. Simul. Mater. Sci. Eng.</source> <volume>12</volume>, <fpage>665</fpage>&#x02013;<lpage>670</lpage>. <pub-id pub-id-type="doi">10.1088/0965-0393/12/4/007</pub-id></citation></ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Olmsted</surname> <given-names>D. L.</given-names></name> <name><surname>Foiles</surname> <given-names>S. M.</given-names></name> <name><surname>Holm</surname> <given-names>E. A.</given-names></name></person-group> (<year>2009</year>). <article-title>Survey of computed grain boundary properties in face-centered cubic metals: I. Grain boundary energy</article-title>. <source>Acta Mater.</source> <volume>57</volume>, <fpage>3694</fpage>&#x02013;<lpage>3703</lpage>. <pub-id pub-id-type="doi">10.1016/j.actamat.2009.04.007</pub-id></citation></ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Plimpton</surname> <given-names>S.</given-names></name></person-group> (<year>1995</year>). <article-title>Fast parallel algorithms for short-range molecular dynamics</article-title>. <source>J. Comput. Phys.</source> <volume>117</volume>, <fpage>1</fpage>&#x02013;<lpage>19</lpage>. <pub-id pub-id-type="doi">10.1006/jcph.1995.1039</pub-id></citation></ref>
<ref id="B22">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Priedeman</surname> <given-names>J. L.</given-names></name></person-group> (<year>2018</year>). <source>Quantifying Grain Boundary Atomic Structures Using the Smooth Overlap of Atomic Positions</source> (<publisher-loc>Master&#x00027;s thesis</publisher-loc>). <publisher-name>Brigham Young University, USA</publisher-name>.</citation></ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Priedeman</surname> <given-names>J. L.</given-names></name> <name><surname>Rosenbrock</surname> <given-names>C. W.</given-names></name> <name><surname>Johnson</surname> <given-names>O. K.</given-names></name> <name><surname>Homer</surname> <given-names>E. R.</given-names></name></person-group> (<year>2018</year>). <article-title>Quantifying and connecting atomic and crystallographic grain boundary structure using local environment representation and dimensionality reduction techniques</article-title>. <source>Acta Mater.</source> <volume>161</volume>, <fpage>431</fpage>&#x02013;<lpage>443</lpage>. <pub-id pub-id-type="doi">10.1016/j.actamat.2018.09.011</pub-id></citation></ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Race</surname> <given-names>C. P.</given-names></name> <name><surname>Von Pezold</surname> <given-names>J.</given-names></name> <name><surname>Neugebauer</surname> <given-names>J.</given-names></name></person-group> (<year>2014</year>). <article-title>Role of the mesoscale in migration kinetics of flat grain boundaries</article-title>. <source>Phys. Rev. B</source> <volume>89</volume>:<fpage>214110</fpage>. <pub-id pub-id-type="doi">10.1103/PhysRevB.89.214110</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rahman</surname> <given-names>A.</given-names></name></person-group> (<year>1966</year>). <article-title>Liquid structure and self-diffusion</article-title>. <source>J. Chem. Phys.</source> <volume>45</volume>, <fpage>2585</fpage>&#x02013;<lpage>2592</lpage>. <pub-id pub-id-type="doi">10.1063/1.1727978</pub-id></citation></ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Reinhart</surname> <given-names>W. F.</given-names></name> <name><surname>Long</surname> <given-names>A. W.</given-names></name> <name><surname>Howard</surname> <given-names>M. P.</given-names></name> <name><surname>Ferguson</surname> <given-names>A. L.</given-names></name> <name><surname>Panagiotopoulos</surname> <given-names>A. Z.</given-names></name></person-group> (<year>2017</year>). <article-title>Machine learning for autonomous crystal structure identification</article-title>. <source>Soft Matter</source> <volume>13</volume>, <fpage>4733</fpage>&#x02013;<lpage>4745</lpage>. <pub-id pub-id-type="doi">10.1039/C7SM00957G</pub-id><pub-id pub-id-type="pmid">28621795</pub-id></citation></ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Reinhart</surname> <given-names>W. F.</given-names></name> <name><surname>Panagiotopoulos</surname> <given-names>A. Z.</given-names></name></person-group> (<year>2018</year>). <article-title>Automated crystal characterization with a fast neighborhood graph analysis method</article-title>. <source>Soft Matter</source> <volume>14</volume>, <fpage>6083</fpage>&#x02013;<lpage>6089</lpage>. <pub-id pub-id-type="doi">10.1039/C8SM00960K</pub-id><pub-id pub-id-type="pmid">29989134</pub-id></citation></ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rosenbrock</surname> <given-names>C. W.</given-names></name> <name><surname>Homer</surname> <given-names>E. R.</given-names></name> <name><surname>Cs&#x000E1;nyi</surname> <given-names>G.</given-names></name> <name><surname>Hart</surname> <given-names>G. L. W.</given-names></name></person-group> (<year>2017</year>). <article-title>Discovering the building blocks of atomic systems using machine learning: application to grain boundaries</article-title>. <source>npj Comput. Mater.</source> <volume>3</volume>:<fpage>29</fpage>. <pub-id pub-id-type="doi">10.1038/s41524-017-0027-x</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sheng</surname> <given-names>H.</given-names></name> <name><surname>Luo</surname> <given-names>W.</given-names></name> <name><surname>Alamgir</surname> <given-names>F. M.</given-names></name> <name><surname>Bai</surname> <given-names>J. M.</given-names></name> <name><surname>Ma</surname> <given-names>E. P.</given-names></name></person-group> (<year>2006</year>). <article-title>Atomic packing and short-to-medium-range order in metallic glasses</article-title>. <source>Nature</source> <volume>439</volume>, <fpage>419</fpage>&#x02013;<lpage>425</lpage>. <pub-id pub-id-type="doi">10.1038/nature04421</pub-id><pub-id pub-id-type="pmid">16437105</pub-id></citation></ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shimada</surname> <given-names>M.</given-names></name> <name><surname>Kokawa</surname> <given-names>H.</given-names></name> <name><surname>Wang</surname> <given-names>Z.</given-names></name> <name><surname>Sato</surname> <given-names>Y.</given-names></name> <name><surname>Karibe</surname> <given-names>I.</given-names></name></person-group> (<year>2002</year>). <article-title>Optimization of grain boundary character distribution for intergranular corrosion resistant 304 stainless steel by twin-induced grain boundary engineering</article-title>. <source>Acta Mater.</source> <volume>50</volume>, <fpage>2331</fpage>&#x02013;<lpage>2341</lpage>. <pub-id pub-id-type="doi">10.1016/S1359-6454(02)00064-2</pub-id></citation></ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stukowski</surname> <given-names>A.</given-names></name></person-group> (<year>2010</year>). <article-title>Visualization and analysis of atomistic simulation data with OVITO&#x02013;the Open Visualization Tool</article-title>. <source>Model. Simul. Mater. Sci. Eng.</source> <volume>18</volume>:<fpage>015012</fpage>. <pub-id pub-id-type="doi">10.1088/0965-0393/18/1/015012</pub-id></citation></ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stukowski</surname> <given-names>A.</given-names></name></person-group> (<year>2012</year>). <article-title>Structure identification methods for atomistic simulations of crystalline materials</article-title>. <source>Model. Simul. Mater. Sci. Eng.</source> <volume>20</volume>:<fpage>045021</fpage>. <pub-id pub-id-type="doi">10.1088/0965-0393/20/4/045021</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tan</surname> <given-names>L.</given-names></name> <name><surname>Sridharan</surname> <given-names>K.</given-names></name> <name><surname>Allen</surname> <given-names>T.</given-names></name> <name><surname>Nanstad</surname> <given-names>R.</given-names></name> <name><surname>McClintock</surname> <given-names>D.</given-names></name></person-group> (<year>2008</year>). <article-title>Microstructure tailoring for property improvements by grain boundary engineering</article-title>. <source>J. Nucl. Mater.</source> <volume>374</volume>, <fpage>270</fpage>&#x02013;<lpage>280</lpage>. <pub-id pub-id-type="doi">10.1016/J.JNUCMAT.2007.08.015</pub-id></citation></ref>
<ref id="B34">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Watanabe</surname> <given-names>T.</given-names></name> <name><surname>Tsurekawa</surname> <given-names>S.</given-names></name> <name><surname>Zhao</surname> <given-names>X.</given-names></name> <name><surname>Zuo</surname> <given-names>L.</given-names></name></person-group> (<year>2009</year>). <article-title>The coming of grain boundary engineering in the 21st century</article-title>, in <source>Microstructure and Texture in Steels</source> (<publisher-loc>London</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>43</fpage>&#x02013;<lpage>82</lpage>.</citation></ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>H.</given-names></name> <name><surname>Srolovitz</surname> <given-names>D. J.</given-names></name> <name><surname>Douglas</surname> <given-names>J. F.</given-names></name> <name><surname>Warren</surname> <given-names>J. A.</given-names></name></person-group> (<year>2009</year>). <article-title>Grain boundaries exhibit the dynamics of glass-forming liquids</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>106</volume>, <fpage>7735</fpage>&#x02013;<lpage>7740</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0900227106</pub-id><pub-id pub-id-type="pmid">19416913</pub-id></citation></ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>Q. G.</given-names></name> <name><surname>Cao</surname> <given-names>B. Y.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Fujii</surname> <given-names>M.</given-names></name> <name><surname>Takahashi</surname> <given-names>K.</given-names></name></person-group> (<year>2006</year>). <article-title>Influence of grain boundary scattering on the electrical and thermal conductivities of polycrystalline gold nanofilms</article-title>. <source>Phys. Rev. B</source> <volume>74</volume>:<fpage>134109</fpage>. <pub-id pub-id-type="doi">10.1103/PhysRevB.74.134109</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="fn0001"><p><sup>1</sup>Including the GB atoms that have FCC structure only adds one UAE, but because GB atoms that possess FCC structure make up a significant percentage of the total GB atoms, fewer UAEs are required to represent 90% of the total GB atoms.</p></fn>
<fn id="fn0002"><p><sup>2</sup>We computed the cohesive energies in LAMMPS (Plimpton, <xref ref-type="bibr" rid="B21">1995</xref>) using the same potential that Olmsted et. al used to produce these Al structures (the Ercolessi &#x00026; Adams potential for aluminum Liu et al., <xref ref-type="bibr" rid="B19">2004</xref>).</p></fn>
</fn-group>
<fn-group>
<fn fn-type="financial-disclosure"><p><bold>Funding.</bold> This work has been supported by the Department of Mechanical Engineering at Brigham Young University.</p>
</fn>
</fn-group>
</back>
</article> 