# Citrine Informatics

## Founded Year

2013

## Stage

Series C | Alive

## Total Raised

$68.55M

## Valuation

$0000

## Last Raised

$16M | 6 mos ago

## About Citrine Informatics

Citrine Informatics offers a materials informatics platform for data-driven materials and chemicals development. The platform helps scientists and engineers accelerate product development by combining smart materials data infrastructure and artificial intelligence (AI) to speed up the development of materials, facilitate product portfolio optimization, and codify research intellectual property (IP). It serves the materials and chemicals industry. The company was founded in 2013 and is based in Redwood City, California.

## Citrine Informatics's Products & Differentiators

Citrine Platform

The Citrine Platform is cloud-based software that uses artificial intelligence and machine learning to accelerate the discovery and development of new materials and chemicals. It allows customers to predict the properties of new materials and optimize existing materials for specific applications, and it can help them identify materials and applications they may not have considered before. By reducing the need for expensive and time-consuming experimentation, it saves customers significant time and resources and helps them innovate faster and more efficiently in the materials and chemicals space.

## Research containing Citrine Informatics

CB Insights Intelligence Analysts have mentioned Citrine Informatics in **1 CB Insights research brief**, most recently on **Dec 7, 2021**.

## Expert Collections containing Citrine Informatics

Expert Collections are analyst-curated lists that highlight the companies you need to know in the most important technology spaces.

Citrine Informatics is included in **4 Expert Collections**, including **Advanced Materials**.

### Advanced Materials

1,282 items

Startups developing new or improved materials (chemicals, alloys, etc.) that provide physical or functional advantages to basic materials.

### Artificial Intelligence

10,627 items

This collection includes startups selling AI SaaS, using AI algorithms to develop their core products, and those developing hardware to support AI workloads.

### AI 100

100 items

The winners of the 4th annual CB Insights AI 100.

### Advanced Manufacturing

4,564 items

Companies focused on the technologies to increase manufacturing productivity, ranging from automation & robotics to AR/VR to factory analytics & AI, plus many more.

## Citrine Informatics Patents

Citrine Informatics has filed 5 patents.

The 3 most popular patent topics include:

- Machine learning
- Classification algorithms
- Artificial neural networks

Application Date | Grant Date | Title | Related Topics | Status |
---|---|---|---|---|
10/20/2020 | 5/11/2021 | | Machine learning, Classification algorithms, Artificial neural networks, Industrial design, Electronic design automation | Grant |

## Latest Citrine Informatics News

Apr 5, 2023

Abstract

Severe lattice distortion is a prominent feature of high-entropy alloys (HEAs) and is considered a reason for many of those alloys’ properties. Nevertheless, accurate characterizations of lattice distortion are still scarce and cover only a tiny fraction of the giant HEA composition space, owing to high experimental and computational costs. Here we present a physics-informed statistical model to efficiently produce high-throughput lattice distortion predictions for refractory non-dilute/high-entropy alloys (RHEAs) in a 10-element composition space. The model offers improved accuracy over conventional methods for fast estimates of lattice distortion by making predictions based on physical properties of interatomic bonding rather than atomic size mismatch of pure elements. The modeling of lattice distortion also enables a predictive model for yield strengths of RHEAs, validated by various sets of experimental data. Combined with our previous model of intrinsic ductility, a data-mining design framework is demonstrated for efficient exploration of strong and ductile single-phase RHEAs.

Introduction

High-entropy alloys (HEAs), or compositionally complex alloys under a broader definition, have attracted a great deal of attention as a promising material solution for addressing the urgent societal goals of reduced carbon emissions and increased energy efficiency. Many HEAs have already been demonstrated to have remarkable yield strengths 1 , 2 , 3 , 4 , 5 , 6 , 7 , 8 , great corrosion 9 , 10 , 11 , 12 , 13 , 14 and fatigue resistance 15 , 16 , 17 , reasonable ductility 3 , 4 , 5 , 6 , 18 , 19 , 20 , 21 , 22 , and/or thermal stability 3 , 23 , 24 . More excitingly, the multicomponent chemistry grants HEAs a nearly unlimited design space, making this group of alloys full of potential as revolutionary structural materials for both lightweighting and high-temperature applications.
The outstanding performance of HEAs is recognized to originate largely from their unique non-dilute, multicomponent chemistry and atomic structures at the fundamental level 25 , 26 , 27 , 28 . A signature characteristic of the atomic structures of HEAs is severe lattice distortion 29 , 30 . In HEAs, multiple types of atoms with different atomic sizes and electronegativities are randomly mixed in a single-crystal lattice. The size mismatch and charge transfer between individual atoms result in significant variations in interatomic distances and angles. As a result, the atoms cannot reside perfectly on the ideal lattice sites, giving rise to considerable lattice distortion throughout the entire crystal. With respect to the perfect lattice, the distorted lattice of HEAs generates atomic strains that fluctuate from one atom to another, which produces intense interactions with the strain field of dislocations. Consequently, every atom in an HEA can be considered a point-pinning obstacle that retards dislocation movement, leading to an outstanding solid-solution-type strengthening effect 31 , 32 , 33 , 34 , 35 . For instance, extremely high yield strengths have been achieved in HEAs by elaborate manipulation of lattice distortion 4 , 36 , 37 , 38 , 39 . Additionally, lattice distortion is recognized to play essential roles in the phase stability 40 , 41 , sluggish diffusion 42 , 43 , electrical 44 , 45 , and thermal conductivities 45 of HEAs. Therefore, a deep understanding of lattice distortion could assist the design of high-performance HEAs without wandering in the endless compositional space.

Lattice distortion in HEAs can be accurately characterized for individual alloy compositions through various experimental methods, including neutron diffraction, synchrotron X-ray diffraction, and high-resolution transmission electron microscopy 4 , 37 , 38 , 46 .
Additionally, first-principles calculations based on density functional theory (DFT) have been extensively applied to study lattice distortion owing to the accurate description of interatomic bonding. The supercell method based on the special quasi-random structure (SQS) 47 is widely employed to mimic the multicomponent random mixing in HEAs 33 , 47 , 48 , 49 . The alloy’s atomic structure at the ground state can be straightforwardly obtained by performing a relaxation calculation. Based on the relaxed structure, lattice distortion can be accurately quantified via a variety of analysis approaches, such as radial or pair distribution function 50 , 51 , bond length distribution and fluctuation 36 , 49 , local atomic volume 52 , least-square atomic strain 53 , and mean square average atomic displacement (MSAD) 33 . Nevertheless, like the experimental approaches, most of the time the DFT calculations were only able to selectively study a small group of discrete alloy compositions due to large computational costs. Apart from the accurate experimental characterization and DFT-based computation, several mathematical models have been proposed for predicting lattice distortion simply according to the alloy composition. However, those models usually oversimplified lattice distortion as the difference in the pure-element atomic size of the constituent elements by assuming that each atom retains its original size even in a complex solid-solution environment. This assumption fails to account for the changes in atomic radii due to charge transfer among constituent atoms, which thus becomes problematic for predicting lattice distortion in the systems containing elements with similar atomic sizes but different electronegativities 37 , 54 . The lack of a generally applicable model for rapid and accurate predictions of lattice distortion for arbitrary alloy compositions significantly limits our ability to explore the large design spaces of HEA efficiently. 
In the present work, a surrogate model based on physics-informed statistical learning and training data from first-principles calculations is developed to efficiently predict the lattice distortion in the body-centered-cubic (bcc) refractory HEAs (RHEAs) with compositions arbitrarily varying in chemical space composed of ten elements. The study is centered on RHEAs because they are not studied as comprehensively as the face-centered-cubic (fcc) 3d-transition-metal HEAs though lattice distortion is indeed found to be significant in individual cases and critically contribute to the mechanical strength 4 , 35 , 36 . The lattice distortion is quantified using the root mean squared atomic displacement (RMSAD), which measures the average displacement of relaxed atoms from their ideal positions in the undistorted crystal lattice 33 , 55 . On the basis of physical descriptors that characterize atomic bonds of pure metals and their binary ordered compounds, the developed surrogate model delivers high-throughput predictions of lattice distortion quantified in a way that previously can be only done for individual compositions by costly experiments or first-principles calculations. The high-throughput prediction enables a systematic overview of lattice distortion in the alloys with different levels of chemical complexity, from which the physical mechanism of HEA’s unique cocktail effect on lattice distortion is revealed. Based on the predictions of the lattice distortion model, it is further found that the RMSAD parameter quantitatively correlates with the room-temperature yield strengths of the bcc refractory alloys in a general linear form. Accordingly, a strengthening model is established to effectively predict yield strengths of RHEAs according to their lattice distortion. 
Together with our previous surrogate models on intrinsic ductility 56 and CALculation of PHAse Diagram (CALPHAD)-based phase stability prediction, a high-throughput alloy design framework is demonstrated to perform comprehensive screening of potential alloy compositions with a synergy of high strength and good ductility in a large compositional space that is far from fully explored.

Results and discussion

Initial data exploration

As schematically illustrated in Fig. 1 , due to lattice distortion, atoms at equilibrium positions in a solid-solution alloy are locally displaced away from their ideal positions in a perfect, undistorted lattice. A mean squared average of those atomic displacements (i.e., the MSAD) has been demonstrated to be an accurate quantitative measure of lattice distortion in 3d-transition-metal HEAs both experimentally and computationally 33 , 57 , 58 . In the present work, the square root of the mean squared atomic displacement (i.e., the RMSAD parameter) is used to quantify lattice distortion because of its strong linear correlation with yield strength 33 , 57 . Through a DFT relaxation calculation, the RMSAD parameter for a given alloy composition can be straightforwardly derived, as described in detail in the method section. To generate the training and testing data for the development of a surrogate predictive model for lattice distortion, we computed the RMSAD parameters of 215 individual alloy compositions as listed in Supplementary Table 1 . Prior to using those DFT data for direct statistical regressions, we first utilized them to perform an initial data exploration, aiming to gain enough physical insight into lattice distortion to better develop the surrogate model. We limited the exploration process to the training dataset, which only contains binary and ternary alloys.
As a result, the subsequent development of the surrogate model based on the data exploration is completely blind to the validation and testing datasets (i.e., quaternary and quinary alloys), and those datasets were thus valid to use for testing the model’s ability to make extrapolative predictions.

Fig. 1: A schematic shows the local atomic displacements due to lattice distortion. The red dots stand for the equilibrium atomic positions in a bcc solid-solution alloy with lattice distortion, while the black dots stand for the ideal positions in an undistorted lattice. The schematic corresponds to a projection view along the \([0\bar{1}1]\) direction. The big solid circles in different colors represent individual types of atoms in the alloy.

The standard deviation of local atomic-bond lengths was naturally considered as a starting point for exploring RMSAD, because an ideal, undistorted bcc lattice would have uniform local bond lengths for every atom, and any displacement of an atom from its ideal lattice site would introduce variance to the interatomic-bond lengths between the displaced atom and its neighbors. In the present work, we calculated the length of every first-nearest neighbor (FNN) atomic bond in the relaxed SQSs using Voronoi-tessellation analysis, as implemented in the Pymatgen Python package 59 , and then derived the standard deviation of the bond lengths (\(\sigma_{\mathrm{SQS}}^{L}\)) for comparison with the RMSAD parameter obtained for the same structure. A scatter plot shown in Fig. 2a depicts a strong positive correlation between \(\sigma_{\mathrm{SQS}}^{L}\) and RMSAD for the alloy samples in the training dataset, with a correlation coefficient, r = 0.94, and significance level, p < 0.001. The correlation indicates that the lattice distortion in solid-solution alloys is tightly connected to the length divergence of individual local atomic bonds.
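The quantities compared in this passage can be illustrated with a minimal sketch. It assumes that ideal and relaxed atomic positions are already available as matched Cartesian coordinates in Å, with periodic wrapping and any rigid shift removed, and that bond lengths have been extracted beforehand; the function names and the use of a population standard deviation are illustrative assumptions, not the authors' implementation.

```python
import math
from statistics import pstdev


def rmsad(ideal_positions, relaxed_positions):
    """Root-mean-squared atomic displacement between matched site lists.

    Assumes Cartesian coordinates in angstroms and identical atom ordering.
    """
    if not ideal_positions or len(ideal_positions) != len(relaxed_positions):
        raise ValueError("position lists must be non-empty and equal in length")
    squared = [
        sum((r - i) ** 2 for i, r in zip(ideal, relaxed))
        for ideal, relaxed in zip(ideal_positions, relaxed_positions)
    ]
    return math.sqrt(sum(squared) / len(squared))


def bond_length_std(bond_lengths):
    """Population standard deviation of first-nearest-neighbor bond lengths."""
    return pstdev(bond_lengths)


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


# Toy two-atom example (not data from the article): one atom is displaced
# by 0.5 angstrom, the other stays on its ideal site.
toy_rmsad = rmsad([(0, 0, 0), (1, 0, 0)], [(0.3, 0.4, 0), (1, 0, 0)])
```

In practice one `rmsad` value and one `bond_length_std` value would be computed per relaxed structure, and `pearson_r` would then be applied across the training compositions.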
In addition, there are two apparent outliers to be off from the correlation displayed in Fig. 2a , which are NbTi2 and NbHf3. A common feature of the two alloys is their high content of the group IV element, giving valence electron concentrations (VECs) that are at the lower end of those represented in the training dataset. This trend implies that the d-band filling effect may also play a key role to affect the lattice distortion in bcc refractory solid-solution alloys. Fig. 2: Correlations between the root-mean-squared atomic displacement (RMSAD) parameter and fundamental properties of atomic bonds in refractory solid-solution alloys. a RMSAD vs. standard deviation of lengths of the first-nearest neighbor (FNN) atomic bonds in the relaxed SQS’s (\(\sigma _{{{{\mathrm{SQS}}}}}^L\)); b RMSAD vs. average valence electron concentration (\(u^{{{{\mathrm{VEC}}}}}\)); c RMSAD vs. standard deviation of the number of valence electrons of the constituent elements relative to VEC (\(\sigma ^{{{{\mathrm{VEC}}}}}\)); and d RMSAD vs. standard deviation of atomic bond lengths estimated from single-element bcc and binary B2 crystals (\(\sigma _{{{{\mathrm{B}}}}2/{{{\mathrm{bcc}}}}}^L\)) based on Eq. ( 6 ). Accordingly, a scatter plot of VEC against RMSAD is displayed in Fig. 2b for the alloys in the training dataset. It was found that RMSAD showed a typical dependence on the average filling fraction of the d-band. Specifically, the RMSAD parameter becomes lower around VEC of 5.5 e−/atom, which corresponds to an averagely half-filled d-band, and becomes higher when the VEC is outside the range of 5.0–6.0 e−/atom, at which the d-band deviates from the half-filled state. In addition to this general d-band filling dependence, Fig. 2b also highlights an interesting phenomenon that RMSAD can still vary greatly for the alloys with the same VEC. A typical example is the Ti-Ru binary system. As presented in Fig. 
2b , although all the three studied Ti-Ru alloys have VECs within the range of 5.0–6.0 e−/atom, they tend to have more severe lattice distortion (i.e., a larger RMSAD), compared to other alloys with the same VECs. It should be noted that the bcc solid-solution phase is metastable for studied Ti-Ru compositions 60 . The observed severe lattice distortion actually reflects the strong phase-transformation tendency from the disordered solid solution to ordered intermetallic compounds. Another obvious distinction of the studied Ti-Ru alloys from others with similar VECs is that its constituent elements, Ti and Ru, have very different numbers of valence electrons. Considering the strong localization tendency of valence d electrons, one can expect that the local distribution of the d electrons at the Ti-Ti, Ti-Ru, and Ru-Ru atomic bonds in the alloy are inhomogeneous, giving local fluctuations in the VEC. Given this insight, the standard deviation of the number of valence electrons of the constituent elements relative to VEC (\(\sigma ^{{{{\mathrm{VEC}}}}}\)) is also calculated and plotted to against RMSAD in Fig. 2c . As expected, \(\sigma ^{{{{\mathrm{VEC}}}}}\) shows a general positive correlation with RMSAD, suggesting that in addition to the bond-length divergence, the divergence in the d-band filling fraction at local atomic bonds, which is induced by the variance of constituent element valences, could also be an important contributor to lattice distortion. A physically interpretable surrogate model for RMSAD prediction Although the DFT-SQS calculations provide a general approach to assess the lattice distortion of each individual alloy composition by predicting the RMSAD parameter, the relatively large size of SQS supercells make the calculation computationally expensive and consequently, ill-suited for high-throughput screening. 
Therefore, to fully explore the complex compositional space of RHEAs, it is necessary to develop surrogate models for more efficient prediction of RMSAD. The data exploration discussed above provides a theoretical basis to raise a set of physical descriptors and an interpretable linear model to quantitatively describe RMSAD. As shown in Fig. 2a , the RMSAD parameter displays a strong linear correlation with the standard deviation of the FNN bond lengths in relaxed SQS’s (\(\sigma _{{{{\mathrm{SQS}}}}}^L\)). However, \(\sigma _{{{{\mathrm{SQS}}}}}^L\) cannot be directly used as a descriptor for the prediction of RMSAD because its obtainment requires expensive DFT calculations to relax the SQS for each alloy composition of interest. A surrogate approach is thus needed to efficiently estimate the variance in lengths of FNN atomic bonds in a solid-solution alloy. As discussed later in the methods section, the FNN atomic bonds in the single-element bcc and binary B2 crystals can serve as an approximation to those in the solid-solution lattice if only the atomic-pair interactions in the FNN shell are considered. We expect that this assumption is particularly reasonable for the transition metal alloy systems as the valence d electrons are strongly localized. As a validation, the lengths of the FNN atomic bonds derived from the DFT-relaxed SQS’s are compared with their respective lengths in the single-element bcc and binary B2 crystals by showing the histogram of relative deviations in Supplementary Fig. 1 . By comparing across a considerable amount of FNN bonds in the SQS’s of 68 binary and ternary alloys, the mean error and root mean square error (RMSE) are only 0.0009 Å and 0.0312 Å (−0.0567% mean relative error and 2.258% root mean square relative error), respectively, which validates our assumption. 
Therefore, the standard deviation of atomic bond lengths in a solid-solution alloy can be effectively estimated from the lengths of the FNN bonds in the single-element bcc and binary B2 structures using the bond-counting approach described in the method section. Correspondingly, a descriptor for the bond length standard deviation, \(\sigma _{{{{\mathrm{bcc}}}}/{{{\mathrm{B}}}}2}^L\), is derived. Based on Eq. ( 6 ), \(\sigma _{{{{\mathrm{bcc/B}}}}2}^L\) can be easily calculated for any alloy composition of interest without the need to run expensive DFT-SQS calculations. As shown in Fig. 2d , the correlation between the RMSAD and the bond-length standard deviation (\(\sigma _{{{{\mathrm{bcc}}}}/{{{\mathrm{B}}}}2}^L\)) calculated using Eq. ( 6 ) is also significant (p < 0.001). On the other hand, significant variability in RMSAD remains unaccounted for by \(\sigma _{{{{\mathrm{bcc}}}}/{{{\mathrm{B}}}}2}^L\) alone as evidenced by an increase in the number of outliers compared to \(\sigma _{{{{\mathrm{SQS}}}}}^L\), suggesting that it is insufficient on its own. The results of data exploration in Fig. 2 b, c suggest that the d-band filling effect could be another important contributor to lattice distortion in addition to the bond-length standard deviation. Therefore, it is natural to consider VEC as a possible descriptor to explain the discrepancies in the RMSAD vs. \(\sigma _{{{{\mathrm{bcc}}}}/{{{\mathrm{B}}}}2}^L\) correlation, especially given that all the outliers share VECs much lower than those of other samples. To test this hypothesis, the residuals of a linear regression of RMSAD on \(\sigma _{{{{\mathrm{bcc}}}}/{{{\mathrm{B}}}}2}^L\) is plotted to against the VEC of each sample in the training set in Supplementary Fig. 2a , where the residuals show a clear parabolic dependence on VEC with a vertex between VECs of 5 and 6. 
Based on this observation, we modified VEC into a new descriptor, \(u_{5.7}^{\mathrm{VEC}}\), with the form \((\mathrm{VEC} - X)^2\), where X is a constant tuned to be 5.7 e−/atom by selecting the value that maximizes the correlation coefficient (Pearson’s r) between the transformed feature and RMSAD (Supplementary Fig. 2b ). Similarly, another descriptor, \(\sigma_{5.7}^{\mathrm{VEC}}\), which describes the variance of VEC from 5.7 e−/atom, was created to capture the fluctuation of VEC at different local atomic bonds in the solid-solution lattice. Similar to the feature standard deviation described in Eq. ( 6 ), \(\sigma_{5.7}^{\mathrm{VEC}}\) is calculated as a modified variance of a discrete probability distribution as follows,

$$\sigma_{5.7}^{\mathrm{VEC}} = \sum_i x_i\left(\mathrm{VEC}_i - 5.7\right)^2$$

where \(x_i\) is the fraction of constituent element i in the alloy composition, and \(\mathrm{VEC}_i\) is its number of valence electrons. As expected, \(\sigma_{5.7}^{\mathrm{VEC}}\) also shows a strong correlation with RMSAD (Supplementary Fig. 2c ).

In addition to VEC, the bimodality of the electronic density of states (DOS) of the d-orbitals is also a key factor in determining the d-band filling effects 61 . The localized characteristics of d electrons cause their DOS to display considerable shape features that strongly depend on the lattice structure. For example, a perfect bcc-type lattice generally results in a bimodal DOS of the d-orbitals. Once the bcc lattice is distorted, such as due to the presence of crystal defects, the shape of the DOS becomes less bimodal 61 . An extreme situation is that the bcc lattice changes to the fcc/hcp (hexagonal-close-packed) lattice through phase transformation. Correspondingly, the shape of the DOS also changes from bimodal to unimodal.
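As a worked illustration of the two VEC-based descriptors and of the tuning of X, the sketch below evaluates them for NbTi2, one of the compositions named in the text. The valence-electron counts follow the usual group-number convention, and the function names, the candidate grid, and the toy RMSAD values are hypothetical; none of this is the authors' code or data.

```python
import math

# Valence-electron counts by periodic-table group (standard VEC convention).
VALENCE = {"Ti": 4, "Zr": 4, "Hf": 4, "Nb": 5, "Ru": 8}


def vec_descriptors(composition, center=5.7, valence=VALENCE):
    """Return (VEC, u_center^VEC, sigma_center^VEC) for a composition dict.

    `composition` maps element symbols to relative amounts, which are
    normalized to fractions x_i that sum to one.
    """
    total = sum(composition.values())
    fractions = {el: amount / total for el, amount in composition.items()}
    vec = sum(x * valence[el] for el, x in fractions.items())
    u_vec = (vec - center) ** 2
    sigma_vec = sum(x * (valence[el] - center) ** 2 for el, x in fractions.items())
    return vec, u_vec, sigma_vec


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def tune_center(vecs, rmsads, candidates):
    """Pick the candidate X that maximizes Pearson's r of (VEC - X)^2 vs. RMSAD."""
    return max(
        candidates,
        key=lambda c: pearson_r([(v - c) ** 2 for v in vecs], rmsads),
    )


# NbTi2: x_Nb = 1/3, x_Ti = 2/3, so VEC = (5 + 2 * 4) / 3.
vec, u_vec, sigma_vec = vec_descriptors({"Nb": 1, "Ti": 2})

# Synthetic check of the tuning step: toy RMSAD values built to be exactly
# parabolic in VEC around 5.7, so the search should recover that center.
toy_vecs = [4.3, 5.0, 5.5, 6.0, 6.5]
toy_rmsads = [0.1 + 0.05 * (v - 5.7) ** 2 for v in toy_vecs]
best_center = tune_center(toy_vecs, toy_rmsads, [5.0, 5.5, 5.7, 6.0])
```

For NbTi2 the low VEC of about 4.33 e−/atom puts both descriptors well above zero, in line with the text's remark that low-VEC alloys sit away from the half-filled d-band.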
More importantly, the change of bimodality can strongly influence the d-band-filling energy as well as the stability of the crystal lattice when the filling fraction varies. As shown in Fig. 3a , when the d-band is about half-filled, which corresponds to a VEC between 5 and 6, a bimodal DOS would have more occupied states far below the Fermi level (\(E_{\mathrm{f}}\)) and fewer occupied states close to \(E_{\mathrm{f}}\), compared to a unimodal DOS. This trend leads to a more negative band-filling energy and correspondingly makes the bcc lattice stable. Conversely, when the filling fraction of the d-band moves toward the band edges, a lower band-filling energy is expected for the less bimodal DOS, correspondingly stabilizing the non-bcc or distorted-bcc lattice (Fig. 3b ). This principle explains the effect of d-band bimodality on lattice distortion well. For instance, if two elements with fewer d electrons (e.g., Ti, Zr, Hf) are forced to form an atomic bond in an undistorted bcc lattice, the DOS of the local d-band between them will then have a bimodal shape but a filling fraction near the left band edge. As a result, the filling energy of the local d-band will be high, resulting in a negative effect on the lattice stability. To lower the band-filling energy and stabilize the bcc lattice, a local lattice distortion is thus necessary to reduce the bimodality of the local d-band DOS. In other words, a stronger local lattice distortion is expected between two atoms if their atomic bond in the undistorted lattice has a local d-band with a more bimodal DOS and a filling fraction farther from the half-filled state.

Fig. 3: Illustration of the difference in the filling energy of the d-bands with a unimodal and bimodal electronic DOS. a The filling fraction is close to the band center. b The filling fraction is close to the band edge. The position of the Fermi level is represented by the red dashed line.

The bimodality of a DOS can be quantitatively measured through Hartigan’s dip test 61 , 62 .
A completely unimodal DOS corresponds to a test statistic (i.e., the dip value) of 0, while a more bimodal DOS has a larger dip value. In the present work, the average bimodality of the d-band DOSs of the FNN atomic bonds (\(u_{\mathrm{B2/bcc}}^{\mathrm{dip}}\)) in a bcc solid-solution alloy at the undistorted state is estimated using Eq. ( 5 ) based on the bond-counting approach described in the method section. Considering the positive correlation of lattice distortion with both the filling fraction and bimodality of the local d-band DOS, we further weight the two VEC-associated descriptors, \(u_{5.7}^{\mathrm{VEC}}\) and \(\sigma_{5.7}^{\mathrm{VEC}}\), with the bimodality parameter \(u_{\mathrm{B2/bcc}}^{\mathrm{dip}}\) for a complete description of the d-band effect on lattice distortion. By testing the Pearson’s correlation with RMSAD, the square of \(u_{\mathrm{B2/bcc}}^{\mathrm{dip}}\) (i.e., \((u_{\mathrm{B2/bcc}}^{\mathrm{dip}})^2\)) turns out to be the optimal weighting factor, giving the largest correlation coefficients.

With the descriptors discussed above, a physically interpretable model is developed to efficiently predict the RMSAD parameter of arbitrary bcc solid-solution alloys within the 10-element composition space studied in the present work. The model expresses RMSAD in a linear form with three terms,

$$\mathrm{RMSAD} = a_1\,\sigma_{\mathrm{B2/bcc}}^{L} + a_2\left(u_{\mathrm{B2/bcc}}^{\mathrm{dip}}\right)^2 u_{5.7}^{\mathrm{VEC}} + a_3\left(u_{\mathrm{B2/bcc}}^{\mathrm{dip}}\right)^2 \sigma_{5.7}^{\mathrm{VEC}}$$ (1)

where a1, a2, and a3 are the fitting coefficients obtained by performing ordinary least squares (OLS) regression with the available DFT data. As shown in Fig.
4a , by only regression with the binary and ternary data, the resulting model can already accurately predict the RMSAD of the quaternary alloys in the validation dataset with an RMSE of 0.012 Å, which is comparable to its training RMSE, 0.015 Å. The regressed values of a1, a2, and a3 are summarized in Table 1 . Additionally, to confirm the stability and generalization of the model, we re-performed the regression on the data of binary, ternary, and quaternary alloys and tested the correspondingly obtained model with the DFT-calculated RMSAD of quinary alloys. As expected, the values of the fitting coefficients (i.e., a1, a2, and a3) are negligibly varied after additionally including the quaternary data in the regression, indicating the good stability of the proposed linear model. Additionally, as shown in Fig. 4b , the testing performance of the model on the quinary data is also promising, yielding an RMSE of 0.017 Å over 52 testing compositions. The results of Fig. 4 strongly suggest that the proposed linear model well captures the underlying physical mechanism of RMSAD, which grants the model outstanding capability to make accurate extrapolative predictions for multicomponent systems by only training with a finite amount of binary and ternary data. Fig. 4: Prediction performance of the statistical surrogate model for RMSAD. a Trained on binary/ternary data and validated on quaternary data. b Trained on binary/ternary/quaternary data and tested on quinary data. It is further interesting to point out that a1, a2, and a3 all have positive values, which is consistent with the physical motivation of the model. First, a positive a1 indicates that a more severe lattice distortion should be expected in an alloy if it has larger deviations in the lengths of local atomic bonds, which is consistent with the observations in Fig. 2 a, d. 
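The regression step for Eq. (1) can be sketched as an ordinary least-squares fit without an intercept, since the equation as printed has only the three descriptor terms. Everything below is illustrative: the descriptor values and "true" coefficients are synthetic, the function names are hypothetical, and the normal-equation solver is a stand-in for whatever OLS routine was actually used.

```python
import math


def eq1_features(sigma_len, u_dip, u_vec, sigma_vec):
    """Return the three regressors of Eq. (1) for one alloy."""
    return [sigma_len, u_dip ** 2 * u_vec, u_dip ** 2 * sigma_vec]


def fit_ols_no_intercept(rows, targets):
    """Least-squares coefficients from the normal equations (X^T X) a = X^T y."""
    n = len(rows[0])
    a = [[sum(r[i] * r[j] for r in rows) for j in range(n)] for i in range(n)]
    b = [sum(r[i] * t for r, t in zip(rows, targets)) for i in range(n)]
    # Gaussian elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda k: abs(a[k][col]))
        if abs(a[pivot][col]) < 1e-14:
            raise ValueError("regressors are linearly dependent")
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for k in range(col + 1, n):
            factor = a[k][col] / a[col][col]
            for j in range(col, n):
                a[k][j] -= factor * a[col][j]
            b[k] -= factor * b[col]
    # Back substitution.
    coeffs = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(a[i][j] * coeffs[j] for j in range(i + 1, n))
        coeffs[i] = (b[i] - tail) / a[i][i]
    return coeffs


def predict(coeffs, row):
    """Evaluate the linear model for one feature row."""
    return sum(c * x for c, x in zip(coeffs, row))


def rmse(predicted, observed):
    """Root-mean-square error between two equal-length sequences."""
    return math.sqrt(
        sum((p - o) ** 2 for p, o in zip(predicted, observed)) / len(observed)
    )


# Synthetic descriptor tuples (sigma_len, u_dip, u_vec, sigma_vec); not paper data.
toy_descriptors = [
    (0.10, 0.20, 1.0, 2.0),
    (0.20, 0.10, 0.5, 1.5),
    (0.05, 0.30, 2.0, 0.5),
    (0.15, 0.25, 0.2, 3.0),
]
true_coeffs = [1.0, 0.5, 0.25]  # arbitrary values used only to generate toy targets
rows = [eq1_features(*d) for d in toy_descriptors]
targets = [predict(true_coeffs, r) for r in rows]
fitted = fit_ols_no_intercept(rows, targets)
fit_error = rmse([predict(fitted, r) for r in rows], targets)
```

To mimic the extrapolation test described above, one would fit on rows from lower-order alloys only and evaluate `rmse` on held-out rows from higher-order alloys.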
In the previous classical models of lattice distortion, the standard deviation of the atomic radii of pure elements are commonly adopted to describe the variance of atomic bond lengths by assuming atoms are all rigid particles. This assumption fails to consider the bond-length changes due to the possible electron density overlap between heteroelements, which leads to an overestimation on the lattice distortion in some bcc refractory alloys 37 , 54 . In the present work, we tackled this problem by using a bond-counting approach (Eqs. ( 5 ) and ( 6 ) in the methods section). Particularly, the bond length between two heteroelements can be approximated from their binary B2 structure by taking into account the possible charge transfer when forming a bond. Second, a positive value of a2 is consistent with our above analysis of the d-band effects on lattice distortion; the lattice distortion in an alloy becomes stronger if the average filling fraction of the d orbitals of the alloy is more away from half-filled (i.e., a larger value of \(u_{5.7}^{{{{\mathrm{VEC}}}}}\)) or the shape of the local d-orbital DOSs is more bimodal (i.e., a larger value of \(u_{{{{\mathrm{B}}}}2/{{{\mathrm{bcc}}}}}^{{{{\mathrm{dip}}}}}\)). Third, the positive contribution of VEC standard deviation to lattice distortion is also confirmed by a positive a3 coefficient. Overall, the proposed model suggests that the lattice distortion of bcc refractory solid-solution alloys is mainly attributed to two key factors: 1) the significance of the variance in the lengths of local atomic bonds; and 2) the d-band effect described by the filling fraction and bimodality of d-orbital DOS. Prediction of yield strength based on the RMSAD parameter One of the primary motivations behind developing in-depth understanding of lattice distortion is its physical correlation with solid-solution strengthening, which consequently provides a possible route to design high-strength HEAs. 
As discussed in the introduction section, ubiquitous lattice distortion over the crystal lattice could introduce significant atomic strains with random spatial fluctuations, which can elastically interact with the strain field of dislocations and provide a pinning force to retard the dislocation movement. Outstanding yield strengths have been generally achieved in the HEAs with severe lattice distortion 3 , 4 , 63 . The correlation between the lattice distortion and yield strength has also been discussed previously in a few individual alloy systems, including fcc HEAs composed of 3d transition metal elements 33 , and bcc Nb-based 35 , 64 and fcc Ni-based solid-solution alloys 34 . In spite of the great success of these previous works, the correlation has so far been quantitatively confirmed only for a few equimolar alloy compositions in rather limited compositional spaces. Our surrogate model of lattice distortion provides an opportunity for performing a more comprehensive assessment in a much broader space, because the model enables a rapid prediction of RMSAD for any given alloy composition within the ten refractory elements studied. Towards this goal, we have tried to collect an exhaustive set of room-temperature experimental yield strength and hardness data from a recent HEA database developed by Citrine Informatics and from other literature sources 2 , 19 , 20 , 65 , 66 , 67 , 68 , 69 , 70 , 71 , 72 , 73 , 74 , 75 , 76 , 77 , 78 , 79 , 80 , 81 , 82 , 83 , 84 , 85 , 86 , 87 , 88 , 89 , 90 , 91 , 92 , 93 , 94 , 95 . In total, 83 yield strength and 28 hardness data points were collected, which covered 66 individual compositions (30 equiatomic, and 33 non-equiatomic cases), ranging from quaternary to senary systems (Supplementary Table 2 ). The data collection is limited to as-cast and annealed alloys, so that the strengthening effects due to processing, such as grain refinement and strain hardening, are minimized.
The details of our data-collection process are described in the methods section. Then, for each experimental data point, the surrogate model is employed to predict RMSAD from the alloy composition in order to investigate the potential correlation between lattice distortion and yield strength. A modified theory of solid-solution strengthening for random alloys has recently been developed by Maresca and Curtin 28. The theory expresses the athermal yield strength (τY) of a bcc solid-solution alloy as

$$\tau_{\mathrm{Y}} = 0.051\,\alpha^{-\frac{1}{3}}\mu\left(\frac{1 + v}{1 - v}\right)^{\frac{4}{3}} f^{\tau}\left[\frac{\sum_n c_n \Delta V_n^2}{b^6}\right]^{\frac{2}{3}}$$ (2)

In this equation, α is the dislocation line-tension parameter, which is a constant for a given type of dislocation, and \(\left(\frac{1 + v}{1 - v}\right)^{\frac{4}{3}}\) is a term of elastic anisotropy calculated from the Poisson's ratio (v), which is usually insensitive to variations in alloy composition 96. From the equation, the variation of the athermal yield strength with alloy composition is dictated by three contributions: the isotropic shear modulus of the alloy, μ; an effective parameter describing the gradient of the dislocation stress field, fτ; and the volume-misfit quantity, \(\frac{\sum_n c_n \Delta V_n^2}{b^6}\), which is closely related to the δ parameter, a commonly used parameter that describes lattice distortion based on the mismatch of atomic radii. Recently, the volume-misfit quantity has also been found to be strongly correlated with the RMSAD parameter in several HEA systems 55, which in turn evidences the correlation of yield strength with lattice distortion. Therefore, inspired by Eq.
(2) above, we propose a simplified linear expression based on RMSAD to model and predict the room-temperature yield strength of bcc refractory HEAs, which can be written as

$$\tau_{\mathrm{Y}} \approx a\,\mu\,\gamma_{\mathrm{USF}} \times \mathrm{RMSAD}$$ (3)

where μ is again the isotropic shear modulus, γUSF is the average unstable stacking fault (USF) energy of the {110} plane of the alloy, and a is a constant coefficient, which should be universal across different alloy compositions and systems. The derivation of Eq. (3) is based on the three main contributions to yield strength in Eq. (2): we use RMSAD to parameterize the local atomic misfit strain generated by lattice distortion, and we use the USF energy (γUSF) in place of fτ to reflect the gradient of the dislocation stress field. This substitution is made because deriving fτ is usually a complex process involving extensive data fitting, while the gradient of the dislocation stress field should be generally proportional to the width of the dislocation core, which essentially relates to the USF energy according to the Peierls-Nabarro model 96. Moreover, the three parameters in Eq. (3), μ, γUSF, and RMSAD, can be predicted at a given alloy composition using, respectively, Vegard's law 97, 98, a statistical learning model developed by us previously 56, and the present lattice distortion model, without any additional time-consuming calculations. Therefore, Eq. (3) naturally offers a high-throughput approach to predicting yield strength for efficient screening over large compositional spaces. To validate Eq. (3), the experimental yield strength data are normalized by the isotropic shear modulus and USF energy to evaluate their linear dependence on RMSAD. As shown in Fig.
5a, a linear relationship is observed between the normalized yield strength, denoted as \(\tau_{\mathrm{Y}}/(\mu \gamma_{\mathrm{USF}})\), and RMSAD, giving an R² value as high as 0.94. Additionally, the value of the constant a in Eq. (3) can be easily obtained via an OLS regression of \(\tau_{\mathrm{Y}}/(\mu \gamma_{\mathrm{USF}})\) on RMSAD, which gives 0.29 Å/eV. An RMSE of 135 MPa was obtained by comparing the predicted yield strengths with the experimental values. It is also well known that the hardness of most alloys generally shows a linear dependence on yield strength. Therefore, Eq. (3) should also be capable of modeling the hardness of bcc refractory HEAs, if it truly captures the underlying physics of solid-solution strengthening. As expected, a clear linear relationship is also observed between the normalized hardness, \(HV/(\mu \gamma_{\mathrm{USF}})\), and RMSAD, returning a prediction RMSE as small as 29.5 HV (Fig. 5b).

Fig. 5: Linear dependence of mechanical properties of RHEAs on the RMSAD parameter. a Normalized yield strength (τY) vs. RMSAD, and b normalized Vickers hardness (HV) vs. RMSAD. The normalization factor is the product of the alloy's isotropic shear modulus (μ) and USF energy (γUSF). The yield strength and hardness data are collected from previous experimental measurements in the literature. The unit of the y axis is \(\frac{\mathrm{GPa}}{\mathrm{GPa} \times \mathrm{J}/\mathrm{m}^2} = \mathrm{m}^2/\mathrm{J}\) in (a) and (HV · m²)/(GPa · J) in (b).

Data mining and interpretation

Based on the developed surrogate models, systematic data mining over a massive number of alloy compositions is performed to uncover the physical relations of lattice distortion to the phase stability, strength, and ductility of RHEAs.
First, the distribution of lattice distortion in bcc refractory alloys is mapped in a compositional space composed of ten major RHEA elements. Second, using the mapping results of lattice distortion, the room-temperature yield strengths of the mapped compositions are predicted with Eq. (3). Third, the predicted yield strength is combined with a previously developed ductility model 56 and the CALPHAD approach 99 to validate an integrated computational framework for the data-driven design of RHEAs with high strength and good ductility.

Systematic mapping of lattice distortion

Mapping of lattice distortion is performed for various binary, ternary, quaternary, and quinary alloys in a vast compositional space consisting of ten different elements, namely Ti, Zr, Hf, V, Nb, Ta, Mo, W, Re, and Ru. The compositions considered are limited such that every individual element makes up between 10 and 60 atomic percent (at%), to maintain focus on the concentrated, high-entropy region of the compositional space. For homogeneous sampling, the binary, ternary, and quaternary alloys are assessed at 5 at% compositional intervals for each element, while a 10 at% interval is applied to the quinary alloys. As such, a total of 76,816 compositions are investigated. Of those, 140 compositions are binary alloys; 5208 are ternary; 36,400 are quaternary; and 35,068 are quinary. At each mapped composition, we also use the CALPHAD approach 99 to evaluate the phase stability of the single-phase bcc solid solution at 800 °C, which is a common temperature for the homogenization processing of RHEAs. After the phase-stability evaluation, 5916 compositions are found to result in single-phase bcc solid solutions at 800 °C. Among them, 71 are binary; 1989 are ternary; 2012 are quaternary; and 1844 are quinary. The mapping results of lattice distortion are shown in Fig.
6a, b, where relative frequency histograms show the distribution of RMSAD for all the screened alloy compositions and for those possessing a single bcc phase at 800 °C, respectively. The compositions are grouped into binary/ternary (low/medium entropy) and quaternary/quinary (high entropy) categories to highlight the effect of the number of component elements, i.e., the cocktail effect of HEAs. In Fig. 6a, the RMSAD distributions are nearly identical between the binary/ternary and the quaternary/quinary groups. The mean values of the distributions are also similar: 0.1544 Å for the low/medium-entropy alloys and 0.1573 Å for the high-entropy alloys. However, when the solid-solution phase stability is considered, an obvious difference is observed in the RMSAD distributions between the low/medium-entropy and high-entropy alloys. As presented in Fig. 6b, with the single-phase constraint, the RMSAD distribution of the low/medium-entropy group is biased toward compositions with less lattice distortion. This feature is also evidenced by the skewness, which more than doubled (0.70 vs. 1.92), and by the reduced distribution mean (0.1544 vs. 0.0974 Å). In contrast, the distribution of RMSAD for the high-entropy group (quaternary/quinary alloys) does not show significant bias after the phase-stability restriction is applied; it still holds a mean RMSAD as high as 0.1502 Å and a weak skewness. Comparing the results of Fig. 6a, b suggests that while highly distorted binary and ternary bcc solid-solution alloys are predicted to be possible, a majority of them cannot remain thermodynamically stable at typical processing temperatures. On the other hand, the quaternary and quinary alloys with severe lattice distortions are likely to be stabilized by the increased configurational entropy from the inclusion of multiple principal elements.
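The entropy argument above can be illustrated with the ideal (Boltzmann) mixing entropy, \(S_{\mathrm{conf}} = -R\sum_i c_i \ln c_i\). This is a minimal sketch under the ideal-solution assumption only; the excerpt does not state this formula, the phase-stability results in the text come from CALPHAD calculations rather than this expression, and the function name is hypothetical.

```python
import math

R = 8.314462618  # molar gas constant, J/(mol K)


def configurational_entropy(fractions):
    """Ideal mixing entropy -R * sum(c ln c) for mole fractions that sum to 1.

    Assumption: ideal random solid solution (no short-range order).
    """
    if not math.isclose(sum(fractions), 1.0, rel_tol=0.0, abs_tol=1e-9):
        raise ValueError("mole fractions must sum to 1")
    return -R * sum(c * math.log(c) for c in fractions if c > 0)


# For an equimolar alloy with n components the expression reduces to R * ln(n),
# so the entropy grows as more principal elements are added.
for n in (2, 3, 4, 5):
    s = configurational_entropy([1.0 / n] * n)
    print(f"equimolar {n}-component alloy: {s:.2f} J/(mol K)")
```

Non-equimolar compositions give lower values than the equimolar limit for the same number of components, which is one reason the binary/ternary and quaternary/quinary groups are compared as classes in the text.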
There has been an enduring debate over whether severe lattice distortion is a feature unique to HEAs or whether it can generally exist in any kind of solid-solution alloy regardless of compositional complexity 100. The present observation clearly evidences the importance of compositional complexity in preserving solid-solution alloys with severe lattice distortion from phase decomposition.

Fig. 6: Distribution of lattice distortion in bcc refractory solid-solution alloys. Relative frequency histograms of lattice distortion in the alloys (a) before and (b) after screening for single-phase stability at 800 °C. The histogram colored in blue corresponds to the distribution of the RMSAD parameter of the binary and ternary alloys, while the histogram associated with the quaternary and quinary alloys is colored in orange. c–e Boxplots describing the distribution of physical descriptors that contribute to RMSAD in the alloys with severe lattice distortion (i.e., alloys with RMSAD > 0.15 Å).

To uncover the mechanism of the HEA cocktail effect on lattice distortion, the single-phase alloy compositions with an RMSAD larger than 0.15 Å are collected for further investigation, as marked by the red box in Fig. 6b. Here, 0.15 Å is chosen as an indicator of severe lattice distortion, as it represents the 84th percentile of the binary/ternary distribution and the 58th percentile of the quaternary/quinary distribution. In this collective dataset, a comparison between the binary/ternary and quaternary/quinary alloys can elucidate the effects of compositional complexity on severe lattice distortion. According to Eq. (1), a higher degree of lattice distortion can be achieved in an alloy that has a larger deviation in local atomic bond lengths (\(\sigma_{\mathrm{B2/bcc}}^{L}\) in Eq. (1)), a VEC further from 5.7 e−/atom (\(u_{5.7}^{\mathrm{VEC}}\) in Eq.
(1)), and/or a large variability in the number of valence electrons (\(\sigma_{5.7}^{\mathrm{VEC}}\) in Eq. (1)) among the constituent elements. The distributions of the \(\sigma_{\mathrm{B2/bcc}}^{L}\), \(u_{5.7}^{\mathrm{VEC}}\), and \(\sigma_{5.7}^{\mathrm{VEC}}\) parameters in the binary/ternary and quaternary/quinary alloys with RMSAD > 0.15 Å are visualized via boxplots in Fig. 6c, d, and e, respectively. It is interesting to note that the quaternary/quinary alloys generally have a larger \(\sigma_{\mathrm{B2/bcc}}^{L}\) than the binary/ternary alloys, while the opposite holds for the \(u_{5.7}^{\mathrm{VEC}}\) parameter. These differences are demonstrated to be statistically significant via Welch's t-tests 101. This observation suggests that the severe lattice distortion in the quaternary/quinary alloys is attributable mainly to the deviations in local atomic bond lengths of the constituent elements. However, significant differences in the valences and atomic volumes of the constituent elements in a solid-solution phase also lead to a strong driving force for phase decomposition, according to the classic Hume-Rothery rule. Thanks to the multicomponent chemistry, such a driving force in the quaternary and quinary alloys can be balanced by the stabilization effect induced by the increased configurational entropy. As a result, those alloys can still remain as a single-phase solid solution while bearing a large lattice distortion. On the other hand, it is understandable that the severe lattice distortion in the binary and ternary alloys is more associated with a larger \(u_{5.7}^{\mathrm{VEC}}\) parameter (Fig. 6d), since too large a \(\sigma_{\mathrm{B2/bcc}}^{L}\) would introduce phase decomposition, which cannot be sufficiently balanced by the entropy stabilization effect.
In fact, the \(u_{5.7}^{\mathrm{VEC}}\) parameter cannot be too large either, because the bcc lattice structure becomes unstable for a transition-metal alloy if its d-orbital band is away from the half-filled state 102. Therefore, a majority of the screened binary and ternary alloys with severe lattice distortion are thermodynamically unstable.

Lattice distortion induced solid solution strengthening

With the results of the RMSAD screening above, we can further evaluate how the yield strengths of the bcc refractory alloys are distributed across a large compositional space using the developed strengthening model (Eq. (3)). Here we restrict the yield-strength prediction to the same alloy compositions as in Fig. 6b, which are predicted to be stable as a single bcc solid-solution phase at 800 °C. The compositions with RMSAD less than 0.01 Å, however, are omitted. It is important to note that this exclusion disproportionately affects only a select few dilute compositions in the Mo-W binary series, which tend to exhibit low lattice distortion and yet could still be strong due to a high lattice friction stress, as reflected by their shear moduli (Supplementary Table 3). The predicted yield strengths are visualized as distribution histograms to reveal the difference between the binary/ternary and quaternary/quinary alloys. As shown in Fig. 7a, the quaternary/quinary alloys generally possess a higher yield strength and a narrower distribution than the binary/ternary alloys. The higher average strength of the quaternary/quinary alloys is likely a benefit of the higher levels of lattice distortion in those alloys (Fig. 6b), which contribute positively to yield strength according to Eq. (3). Additionally, in our strengthening model, the yield strength of a bcc refractory alloy is also proportional to its shear modulus and USF energy, two material features that depend strongly on VEC. As exhibited in Supplementary Fig.
3, an alloy with a lower VEC that is away from a half-filled d-band state generally has a smaller μ and γUSF. Therefore, achieving a high yield strength requires an alloy to have a large RMSAD and, at the same time, a VEC that is not too small. This combination of features is more readily achieved in the quaternary/quinary alloys due to the entropy-stabilization effect.

Fig. 7: Distribution of yield strength in the bcc refractory alloys screened to be stable as a single-phase solid solution at 800 °C. Relative frequency histograms of (a) room-temperature yield strength and (b) room-temperature specific yield strength.

The potential of RHEAs for lightweighting applications is also evaluated through a prediction of specific strength, which is obtained by combining the yield strength data above with a lattice-parameter model developed in our recent work for the prediction of theoretical density 103. The predicted specific strengths are also summarized as distribution histograms in Fig. 7b. Notably, the HEAs also yield higher specific strengths on average than the alloys with low/medium configurational entropies. The results in Fig. 7 suggest that more opportunities for lightweight, high-strength alloy compositions can be expected by increasing the number of components through the "high-entropy" design strategy.

Data-driven design of high-performance RHEAs

The current screening model for alloy yield strength can be seamlessly integrated with a ductility assessment model developed in our previous work for a data-driven exploration of strong and ductile bcc refractory alloys. Taking the alloy composition as input, the ductility model yields a rapid prediction of a so-called D parameter, which is defined as the ratio between the surface energy of a crack fracture plane and the unstable stacking fault energy of the common dislocation-slip plane.
The magnitude of the D parameter physically reflects the likelihood that an alloy is intrinsically ductile, based on Rice fracture mechanics 104. More importantly, the D parameter of RHEAs has been demonstrated to correlate positively with experimentally measured compression fracture strains, which makes it an effective indicator for assessing an alloy's ductility 56. Combining the strength and ductility models with the CALPHAD-based assessment of phase stability forms a data-driven design framework capable of massive searches in a vast design space for promising alloy compositions with enhanced ductility and strength. Figure 8a shows the flow chart of the framework. For an input alloy composition, the CALPHAD-based thermodynamic assessment is first carried out to check whether the bcc solid solution is stable as a single phase at a given processing temperature. If so, the strength and ductility models are applied sequentially to assess the mechanical properties.

Fig. 8: A high-throughput framework for screening of RHEAs with a good combination of ductility and strength. a A flowchart detailing the screening process of the framework. The RMSAD and yield strength are predicted using the surrogate models developed in the present work, while intrinsic ductility is predicted from the models in our previous work 56. The phase stability assessment is performed using the TCHEA4 database implemented in the Thermo-Calc software 99. b A scatter histogram of the yield strength (τY) and intrinsic ductility (D parameter) of the screened alloys that are single-phase solid solutions at 800 °C. Data points that overlap with more other points within a bin area are coded with a warmer color. Experimentally studied RHEAs are also identified to show where the current experimental literature on equimolar compositions exists and where there is potential to optimize properties by exploring non-equimolar compositions.
The red-dashed box marks a range of previously unexplored chemical compositions that are promising for achieving single-phase RHEAs with a combination of high strength (τY > 1 GPa) and reasonable ductility (D parameter > 3.3). c, d Scatterplots showing the effect of VEC on (c) yield strength and (d) the D parameter of single-phase RHEAs. In (c) and (d), RMSAD is color-mapped onto the points to show how VEC and RMSAD can be optimized in tandem for high-performance alloy design.
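The screening flow of Fig. 8a, together with Eq. (3), can be sketched in Python as below. This is an illustration under stated assumptions, not the authors' implementation: the `Models` container and every callable in it are hypothetical placeholders for the CALPHAD check and the surrogate models, `fit_a` assumes a zero-intercept least-squares fit (which Eq. (3) implies but the text does not state), and the unit convention (γUSF in eV/Å², RMSAD in Å) is inferred only from the reported unit of a, Å/eV.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence, Tuple

Composition = Dict[str, float]  # element symbol -> atomic fraction

A_COEFF = 0.29  # constant a of Eq. (3) as reported in the text, in Å/eV


def yield_strength(mu: float, gamma_usf: float, rmsad: float,
                   a: float = A_COEFF) -> float:
    """Eq. (3): tau_Y ~ a * mu * gamma_USF * RMSAD.

    Assumed units: gamma_usf in eV/Å^2 and rmsad in Å, so a*gamma*rmsad is
    dimensionless and the result carries the units of mu (e.g. GPa).
    """
    return a * mu * gamma_usf * rmsad


def fit_a(rmsad: Sequence[float], normalized_strength: Sequence[float]) -> float:
    """Zero-intercept least-squares slope of tau_Y/(mu*gamma_USF) on RMSAD."""
    sxx = sum(x * x for x in rmsad)
    sxy = sum(x * y for x, y in zip(rmsad, normalized_strength))
    return sxy / sxx


@dataclass
class Models:
    """Hypothetical stand-ins for the models named in the text."""
    is_single_phase_bcc: Callable[[Composition], bool]  # CALPHAD check
    shear_modulus: Callable[[Composition], float]       # mu
    usf_energy: Callable[[Composition], float]          # gamma_USF
    rmsad: Callable[[Composition], float]               # lattice distortion
    d_parameter: Callable[[Composition], float]         # intrinsic ductility


def screen(compositions: List[Composition], models: Models,
           min_strength: float = 1.0,
           min_d: float = 3.3) -> List[Tuple[Composition, float, float]]:
    """Keep single-phase compositions with tau_Y > min_strength and D > min_d.

    min_strength is in the units of the shear modulus (GPa assumed here);
    the default thresholds are those of the red-dashed box in Fig. 8b.
    """
    hits = []
    for comp in compositions:
        if not models.is_single_phase_bcc(comp):
            continue  # phase stability is checked first, as in Fig. 8a
        tau = yield_strength(models.shear_modulus(comp),
                             models.usf_energy(comp),
                             models.rmsad(comp))
        d = models.d_parameter(comp)
        if tau > min_strength and d > min_d:
            hits.append((comp, tau, d))
    return hits
```

Passing the models in as callables keeps the sketch independent of any particular thermodynamic database or surrogate-model library; real predictors would be wired in where the placeholders are.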

### Citrine Informatics Frequently Asked Questions (FAQ)

When was Citrine Informatics founded?

Citrine Informatics was founded in 2013.

Where is Citrine Informatics's headquarters?

Citrine Informatics's headquarters is located at 2629 Broadway Street, Redwood City.

What is Citrine Informatics's latest funding round?

Citrine Informatics's latest funding round is Series C.

How much did Citrine Informatics raise?

Citrine Informatics raised a total of $68.55M.

Who are the investors of Citrine Informatics?

Investors of Citrine Informatics include Innovation Endeavors, Prelude Ventures, Alumni Ventures, Presidio Ventures, ISAI and 14 more.

Who are Citrine Informatics's competitors?

Competitors of Citrine Informatics include Kebotix, Atinary Technologies, Multiscale Technologies, Polymerize, Turing Labs and 10 more.

What products does Citrine Informatics offer?

Citrine Informatics's products include Citrine Platform.

Who are Citrine Informatics's customers?

Customers of Citrine Informatics include Morrow Battery and Rolls Royce.

## Compare Citrine Informatics to Competitors

Intellegens uses machine learning to speed development in different fields, including materials, chemicals, manufacturing, and more. The company's deep learning technology extracts value from sparse, noisy, real-world data, saving time and cost by reducing experimental workloads and enabling optimized products and processes. It was founded in 2017 and is based in Cambridge, England.

Materials Zone is a Materials Informatics (AI/ML) Platform. Materials Zone’s AI/ML platform harvests, digitalizes, standardizes, and visualizes materials data, revealing insights and predictions that accelerate R&D, and optimize supply chain choices, manufacturing efficiency and business decisions. Materials Zone was founded in 2017 and is based in Kiryat Tivon, Israel.

Alchemy develops cloud-based software for the specialty chemical industry. It facilitates the commercialization of new formulations. Alchemy's configuration engine enables the rapid creation of a digital replica of a company's best practices. The company's product features include an electronic lab notebook (ELN), a laboratory information management system (LIMS), co-creation and collaborative sales, reporting and analytics, project and task management, lab process automation, ISO compliance automation and reporting, a customization console and self-administration, and an enterprise cloud platform. It was founded in 2017 and is based in San Francisco, California.

Uncountable is an artificial intelligence (AI)-powered web platform designed to support and optimize the work of scientists in research and development (R&D). It brings all R&D data together in one place for scientists to access, and its custom algorithms can suggest formulations to test, enabling complicated development projects to be tackled with less than half the resources. The company was founded in 2016 and is based in San Francisco, California.

Exponential Technologies offers software solutions that include a comprehensive research management system (RMS) and artificial intelligence (AI)-based design of experiment (DoE) software. The company was founded in 2019 and is based in Riga, Latvia.

NobleAI operates as a software development company. It builds software by incorporating scientific laws and constraints along with data-centric methods into an adapted neural network and delivers insights and solutions to complex problems. The company helps research and development organizations to accelerate their process of innovation. It was founded in 2018 and is based in San Francisco, California.
