Next Article in Journal
Experimental Investigation of the Peel Strength of Artificial Leather and Polypropylene Specimens
Next Article in Special Issue
Estimation and Prediction of the Polymers’ Physical Characteristics Using the Machine Learning Models
Previous Article in Journal
Highly Flexible and Foldable Paper-Based Thermoelectric Generator Prepared with Post-Treatment-Free PEDOT:PSS Hybrid Ink
Previous Article in Special Issue
Hysteresis Behavior Modeling of Magnetorheological Elastomers under Impact Loading Using a Multilayer Exponential-Based Preisach Model Enhanced with Particle Swarm Optimization
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Predicting the Performance of Functional Materials Composed of Polymeric Multicomponent Systems Using Artificial Intelligence—Formulations of Cleansing Foams as an Example

1
Department of Mechanical Engineering, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama 223-8522, Kanagawa, Japan
2
Kirin Central Research Institute, Kirin Holdings, 26-1, Muraoka-Higashi 2-Chome, Fujisawa 251-8555, Kanagawa, Japan
3
Research Institute, Fancl Corporation, 12-13 Kamishinano, Totsuka-ku, Yokohama 244-0806, Kanagawa, Japan
*
Author to whom correspondence should be addressed.
Polymers 2023, 15(21), 4216; https://doi.org/10.3390/polym15214216
Submission received: 6 September 2023 / Revised: 12 October 2023 / Accepted: 21 October 2023 / Published: 25 October 2023
(This article belongs to the Special Issue Scientific Machine Learning for Polymeric Materials)

Abstract

:
Cleansing foam is a common multicomponent polymeric functional material. It contains ingredients in innumerable combinations, which makes formulation optimization challenging. In this study, we used artificial intelligence (AI) with machine learning to develop a cleansing capability prediction system that considers the effects of self-assembled structures and chemical properties of ingredients. Over 500 cleansing foam samples were prepared and tested. Molecular descriptors and Hansen solubility index were used to estimate the cleansing capabilities of each formulation set. We used five machine-learning models to predict the cleansing capability. In addition, we employed an in silico formulation by generating virtual formulations and predicting their cleansing capabilities using an established AI model. The achieved accuracy was R2 = 0.770. Our observations revealed that mixtures of cosmetic ingredients exhibit complex interactions, resulting in nonlinear behavior, which adds to the complexity of predicting cleansing performance. Nevertheless, accurate chemical property descriptors, along with the aid of in silico formulations, enabled the identification of potential ingredients. We anticipate that our system will efficiently predict the chemical properties of polymer-containing blends.

1. Introduction

Quantitative structure–property relationships (QSPRs) are used in several fields, such as environmental chemistry [1], drug design [2], and materials science [3], demonstrating their versatility and ability in scientific research and industrial applications. The QSPR is a statistical and mathematical method for expressing the relationship between chemical structure and physical properties, which enables the rapid prediction of physical properties. This expectation is particularly high in materials science, where recent advances in synthetic technology have enabled the creation of nearly an infinite variety of polymers. The materials produced by this synthesis require highly desirable physical properties. In recent years, several materials have been improved by adding different mixtures. For example, adding nanoparticles or nanofillers to a material to tailor a nanocomposite improves its thermal response and ionic conductivity [4,5,6,7,8]. Several attempts have been made to predict the physical and chemical properties of such complex systems using QSPRs. In the context of the prediction of physical or chemical properties of mixtures, reports predicting the flash point [9], diffusion coefficient [10], boiling point [11], refractive indices [12], or toxicity [13] of binary mixtures are available. Surfactants are representative functional polymers, and studies related to surfactants were performed that predicted the properties such as critical micelle concentration, cloud point, and the hydrophilic–lipophilic balance of binary mixtures [14,15]. From the perspective of multicomponent mixtures comprising three or more substances, examples of predicting properties such as vapor–liquid critical volume are available [16].
In several reports, molecular descriptors were acquired, and mixtures were characterized using methods such as weighted averages. Subsequently, predictive models are often constructed using linear prediction models (or their derivatives), machine learning, or deep-learning approaches. However, in the case of multicomponent systems, particularly with large-molecular-weight materials, challenges still exist for QSPRs regarding prediction accuracy [17], and reports using predictive models for estimating mixture effects are scarce. The lack of extensive research on predicting the properties of multicomponent mixtures, particularly those involving surfactants, and applying these predictive models to infer mixture effects and facilitate product design underscores the novelty and significance of this study.
The cleansing foam used in cosmetics is a typical polymeric multicomponent system, and polymers, such as polyethylene glycol or polyglycerin, are often used as ingredients. It is used to wash excess sebum and dirt from the skin, and a makeup remover is used to remove makeup cosmetics. In recent years, a growing need has been observed for highly functional cleansing foams because people want to use only one product for cleansing foam and makeup removal for reasons of time (shortening procedures) and ecology (saving water and reducing chemical emissions into the environment) [18]. Solvent-based cleansing agents, such as makeup remover oils, are highly soluble in makeup products, which contain oil and pigments, resulting in excellent removability. However, solvent-based cleansing agents are associated with problems such as high environmental impact, high material costs, and a feeling of residual oiliness after rinsing [19]. By contrast, surfactant-based cleansing agents such as cleansing foams have excellent rinsing properties but weak oil removability because they are primarily water-based. In this study, the latter approach was used to improve the cleansing performance of the foams.
Cleansing foams are composed of numerous components. They contain several types of surfactants, polyols, pH adjusters, and water, making optimizing the formulations difficult because an infinite number of ingredient combinations are possible. Therefore, artificial intelligence (AI) using machine learning has been introduced into formulation design to construct a cleansing capability prediction system that considers the effects of surfactant self-assembly and chemical characteristics of ingredients.
The focus of this study is twofold. First, it aims to extend our understanding of property prediction for multicomponent mixtures. Second, it aims to determine high-performance mixing conditions using predictive models. To achieve these goals, we employed various machine-learning methods. In addition, in silico simulations have been introduced to assist human formulators in achieving desirable products during product development.

2. Materials and Methods

2.1. Evaluation of Cleansing Capability

Cleaning foams consisting of 537 samples of ionic surfactants, amphoteric surfactants, nonionic surfactants, polyols, a pH adjuster, and water were prepared by thorough mixing and stirring. Examples of ingredients and formulations are listed in Table 1 and Table 2, respectively. Each sample comprised ~20% ionic surfactants, 10% nonionic surfactants, 10% polyols, 1% citric acid, and 60% water by weight. To study the prepared samples, a waterproof eyeliner pencil was placed on a piece of white artificial leather that was dried for 30 min. Then, 0.1 mL of the corresponding cleansing foam sample was added to the dried eyeliner, rubbed 30 times, rinsed, and dried. A schematic of all the procedures is shown in Figure 1.
The cleansing capability was evaluated using the eyeliner pencil residual ratio, which was calculated using the color differences as follows:
Cleansing   capability   ( % ) = ( L 1 * L 2 * ) 2 + a 1 * a 2 * 2 + ( b 1 * b 2 * ) 2 ( L 1 * L 0 * ) 2 + a 1 * a 0 * 2 + ( b 1 * b 0 * ) 2 100
where L* indicates lightness, and a* and b* indicate chromaticity. (L*, a*, b*) represents the color space value measured using a colorimeter (CM-2600d, Konica Minolta, Inc., Tokyo, Japan). (L*0, a*0, b*0), (L*1, a*1, b*1), and (L*2, a*2, b*2) represent the color space values of the white artificial leather before applying the eyeliner pencil, after applying it, and after cleaning it, respectively [20].

2.2. Modeling of AI

2.2.1. Data Processing

We trained the AI on the prescribing data and modeled them using descriptors and Hansen dissolution parameters to incorporate chemical information. Figure 2 shows the data-processing flowchart.

2.2.2. Molecular Descriptors

A molecular descriptor is a numerical molecular property extracted from a chemical structure. Each type of molecular descriptor is related to a specific type of interaction between chemical groups in a particular molecule. Descriptors are used to predict the chemical properties of not only single chemicals but also chemical mixtures. In addition, descriptors have been applied to predict the critical micelle concentration (CMC) of gemini surfactants [21,22]. Therefore, we extracted information from ingredients and predicted the cleansing capabilities of the prepared formulations using molecular descriptors. The structural formula of each ingredient was determined using ChemDraw and converted into a Simplified Molecular Input Line Entry System (SMILES). Regarding the specification of the degree of polymerization, we adopted the representative degree of polymerization for each raw material. Descriptor values were then calculated from the SMILES of each ingredient using the chemoinformatic tools rdkit [23] and PaDEL-descriptor [24]. Entries with infinite or only one value were removed, and a k-NN imputer was applied to predict missing values. The weighted average of each ingredient was then calculated using the molar or weight fraction to estimate the descriptor values of the mixture ingredients.

2.2.3. Hansen Solubility Parameters

We applied Hansen solubility parameters (HSPs) to predict the cleansing capability. The HSPs were developed by Hansen to predict the ability of a material to dissolve in another material, forming a solution. The HSP distance between the solute and solvent is generally calculated to estimate whether a solute dissolves in a solvent. Some studies have used the HSPs to predict the properties of surfactants [25,26]. In this study, instead of the solute and solvent, we calculated the distance between each sample and obtained the cleansing samples with the highest cleansing capability. We adopted this procedure because the solute, the eyeliner in this study, was made of several ingredients, and it was difficult to identify its structural formula. The HSP distance was defined as {4*(dD1-dD2)2 + (dP1-dP2)2 + (dH1-dH2)2}0.5, where dD1, dP1, and dH1 are the values of each sample—the average calculated based on each component’s proportion by wight in the mixture—and dD2, dP2 and dH2 are the average values of the three highest cleansing capabilities in our samples. The HSPs would better estimate the effects of the interactions between the ingredients in a formulation than the descriptor method, in which measuring the nonlinear effect of the ingredient interactions is challenging. The HSP values were calculated using the Hansen Solubility Parameters in Practice (HSPiP) software of version 5.0.09. Because some HSPs cannot be calculated using HSPiP for molecules with high molecular weights, missing HSP values were imputed using a k-NN imputer for the descriptor calculation.

2.2.4. Modeling and Feature Selection

The number of explanatory variables was >1000 when descriptors and HSPs were used. Therefore, we applied machine learning to obtain laws to predict cleansing performance based on these numerous features. Three types of machine-learning algorithms are available: supervised learning, unsupervised learning, and reinforcement learning. Supervised learning (regression) was chosen for this study to predict the results within a continuous output. The input dataset is described in Section 2.1 and Section 2.2, and the output is the cleansing capability. In this study, we aimed to capture the inherent behavior of surfactants in cleansing forms, anticipating their utilization in more generic applications, such as predicting properties other than cleansing capability. Therefore, we did not focus on developing prediction models specialized for cleansing capability; instead, we adopted representative machine-learning models. We adopted two decision-tree-based models (random forest and extra tree regressors), two linear-based models (lasso and partial least squares), and one support-vector-machine-based model (support vector regressor). The hyperparameters are listed in Table 3. Each model exhibits unique characteristics: decision-tree-based models help capture nonlinear relationships in the data, and linear models are particularly suitable when a linear relationship is assumed between variables. Support-vector-machine-based models are suitable for predicting high-dimensional data. We aimed to develop versatile models by employing these diverse methodologies and uncover new potential for understanding surfactant behavior in cleansing forms. The hyperparameters were optimized using a grid-search method. All explanatory features were standardized with a mean of zero and standard deviation of one. Because numerous features cause noise in the modeling, we adopted the Boruta method [27] to reduce the noise from unimportant features.

2.2.5. Modeling Evaluation

Herein, we employed a machine-learning model and optimized its hyperparameters using grid-search cross-validation, which is a popular method for hyperparameter tuning that works systematically through multiple combinations of parameter tunings. The scoring metric used to evaluate the performance of the model was the coefficient of determination, denoted as R2, which represents the proportion of variance for a dependent variable that is explained by the independent variables. The R2 values were calculated as follows:
R 2 = 1 1 n ( y i y i ^ ) 2 1 n ( y i y ¯ ) 2 ,
where y i ^ , y i , and y ¯ represent the predicted, actual, and mean values of the actual output, respectively. The dataset comprised 537 samples, and ten-fold cross-validation was applied to calculate the accuracy. For the computations, 90% of the data were allocated as training data, and the remaining 10% as test data. This computation was conducted ten times, ensuring that all data were used as test data at some point. The average value of the ten R2 scores of the foldout data was accepted as the model performance. The modeling was executed five times with different random seeds, which were applied to the modeling of tree models and cross-validation split, and the averaged values were calculated as a result of accuracy.

2.3. In Silico Formulation

To evaluate whether the AI models could support human formulators, formulations were virtually created with a computer using the rules described below. We call this procedure the ‘in silico formulation’.
  • All ingredients were assigned to one of six categories (the same categories described in Table 1): anionic surfactants, amphoteric surfactants, nonionic surfactants, polyols, a pH adjuster (only citric acid), and a base (only water).
  • To compare the predicted and actual cleansing capabilities, the selection of anionic and amphoteric surfactants was restricted to one type: the anionic surfactant was restricted to potassium cocoyl glutamate, and amphoteric surfactant was restricted to lauramidopropyl hydroxysultaine.
  • Only one ingredient was selected from each category; for example, two nonionic surfactants could not be selected for one formulation.
  • The addition rates of each ingredient, except for the pH adjuster (citric acid) and water, were randomized for each category within the predefined ranges described in Table 4. The addition rate of citric acid was fixed with the value of 0.8 weight%, and the addition rate of water was calculated such that the sum of all the ingredients was 100%.
  • In the procedure, 105 formulations were made, which were predicted with the best model described in Section 2.2.4.
To validate the predictions made by the in silico formulation, the actual cleansing capabilities of some formulations were measured experimentally (the formulations of the measured samples are shown in the Section 3).

3. Results

3.1. Evaluation of AI Modeling

An AI model was established to predict the cleansing capability. The prediction accuracy of each model is listed in Table 5. The best prediction accuracy was obtained with the R2 value of 0.770. The prediction accuracy increased significantly with the use of the descriptors. The results of the best model, extra tree regressor, using molecular descriptors, Hansen solubility index, and feature extraction are shown in Figure 3. An R2 value of 0.770 translates to 15% when converted to a root-mean-square error (RMSE). We believe this accuracy level is sufficient for screening purposes, such as opting not to conduct low-predictive cleansing capability experiments before engaging in experiments using actual substances. Such preliminary filtering enables a more efficient allocation of resources to experiments with higher probabilities of success, thereby optimizing the overall research process.

3.2. In Silico Formulation and Actual Cleansing Capabilities

The cleansing capabilities of the in silico formulations are shown in Figure 4 and Figure 5. A box with light and dark gray color in these figures indicates the middle 50% of the data (that is, the middle two quartiles of the data distribution), and horizontal bars display all points within 1.5 times the interquartile range (that is, all points within 1.5 times the width of the adjoining box), or all points at the maximum or minimum extent of the data. As shown in Figure 4, eicosaglycerol hexacaprylate exhibited the highest cleansing capability at both the median value (middle horizontal line in each box) and best value (top horizontal line).
In Figure 5, the formulation data were stratified into two categories: those not using nonionic surfactants and those using nonionic surfactants. Next, each category was stratified into subcategories based on the polyols to estimate their interactions with nonionic surfactants and polyols. The addition of nonionic surfactants increased the cleansing capabilities, and hydrophobic PPG-9 diglyceryl ether or cyclohexylglycerin, which have lower inorganic and organic balance (IOB) values, boosted the cleansing capability more than glycerin.
Several formulations were selected to validate the predicted data obtained from the in silico formulation, and their cleansing capabilities were measured. The formulations and results are shown in Table 6 and Figure 6. Nonionic surfactants eicosaglycerol hexacaprylate and cyclohexylglycerin/PPG-9 diglyceryl ether, (A) and (B), showed the highest cleansing capabilities among the actual formulations. Formulations with other nonionic surfactants and cyclohexylglycerin/PPG-9, (C) and (D), showed lower cleansing capabilities. Formulations with glycerin, (E) and (F), showed much lower cleansing capabilities, regardless of the type of nonionic surfactant. These tendencies correspond to the results shown in Figure 4 and Figure 5.

4. Discussion

The use of descriptors increased the prediction accuracy of all the models, indicating that the chemical properties expressed as molecular descriptors successfully enabled the prediction of cleansing capabilities. HSPs improved the prediction accuracy for several models, but the accuracy was insufficient for models with descriptors, indicating that descriptors were more informative than HSPs.
Furthermore, weight% was suitable for linear-based models for the weighted average calculation, whereas no difference was observed for tree-based models. The mol% of the weighted average is potentially more accurate based on stoichiometry. However, because water constitutes > 97 mol% on average in the formulations owing to the high molecular weights of the surfactants, the influence of water was more dominant in the mol% calculation. Linear-based models were more affected by this influence than tree-based models. To predict cleansing capability, nonlinear behavior should also be considered owing to the interactions between surfactants and water molecules and their self-assembly. Tree-based models are typically more suitable for nonlinear predictions; therefore, their prediction accuracies are higher than those of linear-based models. The most accurate method is the elastic tree regressor. Because this method is based on decision tree models, its accuracy will decline when the data intended for inference fall into extrapolation regions relative to the training data. In such cases, additional experiments must be conducted to augment data and retrain the model. Although we employed descriptors to enhance the generalization capability of the model, it was estimated that the accuracy of predicting the cleansing capability of samples made with ingredients not present in the training data would be lower than the accuracy calculated in this study.
The raw ingredients used included polymer-based components such as PEG-20 glyceryl triisostearate; however, the prediction accuracy was maintained. In addition, as shown in Figure 4 and Table 6, the formulations using PEG-20 glyceryl triisostearate exhibited a lower cleansing performance than those using shorter-chain raw materials. This suggests that the length of the polymer may not be a key factor influencing the cleansing performance; instead, it is likely that the higher-order structure between the ingredients plays a more significant role in cleansing.
The in silico formulation helped us understand the effect not only of each material on cleansing capabilities but also of combinations of materials with the consequence of molecular interactions. The in silico formulation assisted in developing formulations with higher cleansing capabilities.
In this study, applying prediction and in silico formulation methods, we identified a cleansing foam formulation consisting of eicosaglycerol hexacaprylate and cyclohexylglycerin/PPG-9 that exhibited a high cleansing capability of >85% for the removal of waterproof eyeliners.

5. Conclusions

Using AI with machine learning, we built QSPR models that incorporated super-multicomponent ingredients, including polymers, to estimate the effects of surfactant self-assembly and chemical characteristics of the ingredients. An accuracy of R2 = 0.770 (RMSE = 15%) was obtained to predict the cleansing performance, which was sufficient for ingredient screening. Nonlinear behavior, i.e., interactions among cosmetic ingredients in formulations, makes it more difficult for formulators to predict their performance. However, a high accuracy was obtained by incorporating chemical characteristics with descriptors. Based on the molecular structure of the ingredients and surfactant self-assembly, this AI prediction model showed higher accuracy than conventional approaches, such as multiple linear regression. Using in silico formulations, formulators can obtain information on which ingredients should be selected to achieve the highest cleansing capabilities. These findings suggest that cleansing performance is not merely dependent on the polymer length of the ingredients but also on the higher-order structures resulting from the interactions between the ingredients. This prediction model and in silico formulation significantly reduce the effort required for cosmetic development. In summary, we achieved the following results:
  • A QSPR model was constructed for super-multicomponent ingredients, including polymers, achieving an accuracy of R2 = 0.770 (RMSE = 15%), sufficient for product development screening.
  • Using an in silico formulation, we predicted the optimal combination of the ingredients.
  • The application of these technologies reduces the developmental effort and optimizes the overall development process.
Herein, we demonstrated the applicability of QSPRs to multicomponent mixtures, focusing specifically on cleansing foams. Based on the insights provided by QSPRs, we successfully identified an optimal combination of ingredients suitable for product development. We believe this methodology has high generalizability, facilitating the discovery of ideal combinations with minimal experimentation in various fields, not limited to cosmetics but also in drug development or other material designs. Therefore, the QSPR-based approach can emerge as a potent tool, yielding significant benefits in these industries.

Author Contributions

Conceptualization, M.H. and N.A.; methodology, M.H. and H.M.; software, M.H.; validation, H.M., R.N. and N.A.; formal analysis, M.H.; investigation, M.H. and H.M.; resources, H.M.; data curation, H.M.; writing—original draft preparation, M.H.; writing—review and editing, H.M., R.N. and N.A.; visualization, M.H.; supervision, N.A.; project administration, N.A.; funding acquisition, M.H. and H.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. However, some data are not publicly available because the database comprising individual formulation data is utilized for product development.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Wu, J.; Wang, S.; Zhou, L.; Ji, X.; Dai, Y.; Dang, Y.; Kraft, M. Deep-learning architecture in QSPR modeling for the prediction of energy conversion efficiency of solar cells. Ind. Eng. Chem. Res. 2020, 59, 18991–19000. [Google Scholar] [CrossRef]
  2. Hayashi, Y.; Marumo, Y.; Takahashi, T.; Nakano, Y.; Kosugi, A.; Kumada, S.; Hirai, D.; Takayama, K.; Onuki, Y. In silico predictions of tablet density using a quantitative structure-property relationship model. Int. J. Pharm. 2019, 558, 351–356. [Google Scholar] [CrossRef] [PubMed]
  3. Mallakpour, S.; Hatami, M.; Golmohammadi, H. Prediction of inherent viscosity for polymers containing natural amino acids from the theoretical derived molecular descriptors. Polymer 2010, 51, 3568–3574. [Google Scholar] [CrossRef]
  4. Gordillo, M.A.; Benavides, P.A.; Ma, K.; Saha, S. Transforming an insulating metal–organic framework (MOF) into semiconducting MOF/gold nanoparticle (AuNP) and MOF/polymer/AuNP composites to gain electrical conductivity. ACS Appl. Mater. Interfaces 2022, 5, 13912–13920. [Google Scholar] [CrossRef]
  5. Xiu, X.; Mao, W.; Gong, J.; Liu, H.; Shao, Y.; Sun, L.; Wang, H.; Wang, C. Enhanced electrochemical performance of PEO-based composite polymer electrolyte with single-ion conducting polymer grafted SiO2 nanoparticles. Polymers 2023, 15, 394. [Google Scholar]
  6. Tekell, M.C.; Nikolakakou, G.; Glynos, E.; Kumar, S.K. Ionic conductivity and mechanical reinforcement of well-dispersed polymer nanocomposite electrolytes. ACS Appl. Mater. Interfaces 2023, 15, 30756–30768. [Google Scholar] [CrossRef]
  7. Ortac, B.; Mutlu, S.; Baskan, T.; Yilmaz, S.S.; Yilmax, A.H.; Erol, B. Thermal conductivity and phase-change properties of boron nitride–lead oxide nanoparticle-doped polymer nanocomposites. Polymers 2023, 15, 2326. [Google Scholar] [CrossRef]
  8. Yu, C.; Gong, X.; Wang, M.; Li, L.; Ren, S. Hyper-cross-linked nanoparticle reinforced composite polymer electrolytes with enhanced ionic conductivity and thermal stability for lithium-ion batteries. ACS Appl. Mater. Interfaces 2023, 5, 1509–1519. [Google Scholar] [CrossRef]
  9. Gaudin, T.; Rotureau, P.; Fayet, G. Mixture descriptors toward the development of quantitative structure–property relationship models for the flash points of organic mixtures. Ind. Eng. Chem. Res. 2015, 54, 6596–6604. [Google Scholar] [CrossRef]
  10. Abbasi, A.; Eslamloueyan, R. Determination of binary diffusion coefficients of hydrocarbon mixtures using MLP and ANFIS networks based on QSPR method. Chemom. Intellig. Lab. Syst. 2014, 132, 39–51. [Google Scholar] [CrossRef]
  11. Faramarzi, Z.; Abbasitabar, F.; Zare-Shahabadi, V.; Jahromi, H.J. Novel mixture descriptors for the development of quantitative structure−property relationship models for the boiling points of binary azeotropic mixtures. J. Mol. Liq. 2019, 296, 111854. [Google Scholar] [CrossRef]
  12. Khan, P.M.; Rasulev, B.; Roy, K. QSPR modeling of the refractive index for diverse polymers using 2D descriptors. ACS Omega 2018, 3, 13374–13386. [Google Scholar] [CrossRef] [PubMed]
  13. Chatterjee, M.; Roy, K. Prediction of aquatic toxicity of chemical mixtures by the QSAR approach using 2D structural descriptors. J. Hazard. Mater. 2021, 408, 124936. [Google Scholar] [CrossRef]
  14. Baghban, A.; Sasanipour, J.; Sarafbidabad, M.; Piri, A.; Razavi, R. On the Prediction of Critical Micelle Concentration for Sugar-Based Non-Ionic Surfactants. Chem. Phys. Lipids 2018, 214, 46–57. [Google Scholar] [CrossRef]
  15. Cheng, K.C.; Khoo, Z.S.; Lo, N.W.; Tan, W.J.; Chemmangattuvalappil, N.G. Design and Performance Optimisation of Detergent Product Containing Binary Mixture of Anionic-Nonionic Surfactants. Heliyon 2020, 6, e03861. [Google Scholar] [CrossRef] [PubMed]
  16. Sobati, M.A.; Abooali, D.; Maghbooli, B.; Najafi, H. A New Structure-Based Model for Estimation of True Critical Volume of Multi-Component Mixtures. Chemom. Intellig. Lab. Syst. 2016, 155, 109–119. [Google Scholar] [CrossRef]
  17. Muratov, E.N.; Varlamova, E.V.; Artemenko, A.G.; Polishchuk, P.G.; Kuz’min, V.E. Existing and developing approaches for QSAR analysis of mixtures. Mol. Inform. 2012, 31, 202–221. [Google Scholar] [CrossRef]
  18. Watanabe, K.; Sakurai, N.; Meno, T.; Yasuda, C.; Takahashi, S.; Hori, A.; Tsuchiya, K.; Sakai, K. Novel spontaneous cleansing feature of foam─Hybrid bicontinuous-microemulsion-type foamy makeup remover. J. Soc. Cosmet. Chem. Jpn. 2021, 55, 19–27. [Google Scholar] [CrossRef]
  19. Watanabe, K.; Masuda, M.; Nakamura, K.; Inaba, T.; Noda, A.; Yanagida, T.; Yanaki, T. A new makeup remover prepared with a system comprising dual continuous channels (bicontinuous phase) of silicone oil and water. IFSCC Mag. 2004, 7, 310–318. [Google Scholar] [CrossRef]
  20. Iwanaga, T.; Uchida, K.; Takeuchi, N.; Abe, Y. Development of oil-type make-up remover prepared with polyglycerol fatty acid esters. J. Soc. Cosmet. Chem. Jpn. 2005, 39, 186–194. [Google Scholar] [CrossRef]
  21. Jiao, L.; Wang, Y.; Qu, L.; Xue, Z.; Ge, Y.; Liu, H.; Lei, B.; Gao, Q.; Li, M. Hologram QSAR Study on the Critical Micelle Concentration of Gemini Surfactants. Colloids Surf. A Physicochem. Eng. Asp. 2020, 586, 124226. [Google Scholar] [CrossRef]
  22. Absalan, G.; Hemmateenejad, B.; Soleimani, M.; Akhond, M.; Miri, R. Quantitative Structure–micellization Relationship Study of Gemini Surfactants Using Genetic-PLS and Genetic-MLR. QSAR Comb. Sci. 2004, 23, 416–425. [Google Scholar] [CrossRef]
  23. RDKit: Open-Source Cheminformatics Software. Available online: https://www.knime.com/rdkit (accessed on 5 September 2023).
  24. Yap, C.W. PaDEL-descriptor: An open source software to calculate molecular descriptors and fingerprints. J. Comput. Chem. 2011, 32, 1466–1474. [Google Scholar] [CrossRef] [PubMed]
  25. Faasen, D.P.; Jarray, A.; Zandvliet, H.J.; Kooij, E.S.; Kwiecinski, W. Hansen solubility parameters obtained via molecular dynamics simulations as a route to predict siloxane surfactant adsorption. J. Colloid Interface Sci. 2020, 575, 326–336. [Google Scholar] [CrossRef] [PubMed]
  26. Afzal, O.; Alshammari, H.A.; Altamimi, M.A.; Hussain, A.; Almohaywi, B.; Altamimi, A.S. Hansen solubility parameters and green nanocarrier based removal of trimethoprim from contaminated aqueous solution. J. Mol. Liq. 2022, 361, 119657. [Google Scholar] [CrossRef]
  27. Kursa, M.B.; Jankowski, A.; Rudnicki, W.R. Boruta—A system for feature selection. Fundam. Inform. 2010, 101, 271–285. [Google Scholar] [CrossRef]
Figure 1. Schematic of the evaluation test to determine the cleansing capability of the prepared samples, Equation (1).
Figure 1. Schematic of the evaluation test to determine the cleansing capability of the prepared samples, Equation (1).
Polymers 15 04216 g001
Figure 2. Data-processing flow.
Figure 2. Data-processing flow.
Polymers 15 04216 g002
Figure 3. Experimental vs. predicted values of the cleansing capabilities of the cleansing foam formulations with the best model (10-fold cross validation was executed and all foldout data are shown in this plot).
Figure 3. Experimental vs. predicted values of the cleansing capabilities of the cleansing foam formulations with the best model (10-fold cross validation was executed and all foldout data are shown in this plot).
Polymers 15 04216 g003
Figure 4. Box plots of cleansing capabilities with the in silico formulation based on the best model shown in Section 3.1. Predicted data are stratified with nonionic surfactants.
Figure 4. Box plots of cleansing capabilities with the in silico formulation based on the best model shown in Section 3.1. Predicted data are stratified with nonionic surfactants.
Polymers 15 04216 g004
Figure 5. Box plots of predicted cleansing capabilities with the in silico formulation based on the best model shown in Section 3.1. Predicted data are stratified with nonionic surfactants and polyols.
Figure 5. Box plots of predicted cleansing capabilities with the in silico formulation based on the best model shown in Section 3.1. Predicted data are stratified with nonionic surfactants and polyols.
Polymers 15 04216 g005
Figure 6. Experimental vs. predicted values of the cleansing capabilities for in silico formulations.
Figure 6. Experimental vs. predicted values of the cleansing capabilities for in silico formulations.
Polymers 15 04216 g006
Table 1. Examples of ingredients used in the formulations prepared in this study.
Table 1. Examples of ingredients used in the formulations prepared in this study.
CategoryThe Number of
Ingredients
Ingredient Examples
Anionic surfactant8Potassium cocoyl glutamate
Potassium cocoyl glycinate
Amphoteric surfactant4Lauramidopropyl hydroxysultaine
Sodium cocoamphoacetate
Nonionic surfactant24Eicosaglycerol hexacaprylate
Decaglycerol isostearate
PEG-20 glyceryl triisostearate
Polyol33PPG-9 diglyceryl ether
Cyclohexylglycerin
Glycerin
pH adjuster1Citric acid
Base1Water
Table 2. Example of cleansing foam samples in this study.
Table 2. Example of cleansing foam samples in this study.
CategoryMaterial Name
Anionic surfactantPotassium cocoyl glutamate6713887
Amphoteric surfactantLauramidopropyl
Hydroxysultaine
594977
Nonionic surfactantsEicosaglycerol hexacaprylate1010 10
Decaglycerol isostearate 10
PEG-20 glyceryl
triisostearate
10 10
PolyolsPPG-9 diglyceryl ether 11 11
Cyclohexylglycerin9 14
Glycerin 1113
pH adjusterCitric acid0.80.80.80.80.80.8
BaseWater69.262.258.261.263.262.2
Table 3. Hyperparameter set for modeling.
Table 3. Hyperparameter set for modeling.
Model NameHyperparameterValues
Extra tree regressorThe number of trees10, 50, 100, 500, 1000, 2000
The number of features to consider when looking for the best split(the number of features)0.5,
log2(the number of features),
the number of features
The maximum depth of the tree10, 20, 30
Random forest regressorThe number of trees10, 50, 100, 500, 1000, 2000
The number of features to consider when looking for the best splitsqrt, log2, none
The maximum depth of the tree10, 20, 30
Support vector regressorC0.1, 1, 10
Epsilon0.01, 0.1, 1
KernelLinear, poly, rbf
Partial least squares regressorThe number of components1 to 20 (integer)
Lasso regressorAlpha0.0001, 0.001, 0.01, 0.1, 10
Table 4. Condition of in silico formulation.
Table 4. Condition of in silico formulation.
CategoryMaterial NameRandomize
the Addition Rate
Minimum
Weight% *
Maximum
Weight% *
Anionic surfactantPotassium cocoyl glutamateYes320
Amphoteric surfactantLauramidopropyl hydroxysultaineYes320
Nonionic surfactantEicosaglycerol hexacaplyrate
PPG-20 glyceryl triisosterarate etc.
(24 kinds in total)
Yes010
PolyolsPPG-9 diglyceryl ether
glycerin etc.
(33 kinds in total)
Yes030
pH adjusterCitric acidNo0.8% (fixed ratio)
BaseWaterNo100-Σ (other material amount)
* Water content was excluded in the weight% expression.
Table 5. Prediction accuracy of each model.
Table 5. Prediction accuracy of each model.
Mol/Weight FractionDescriptorsHansen
Solubility
Index
Feature
Extraction
Extra Tree
Regressor
Random
Forest
Regressor
SVRLassoPLS
1WeightNot usedNot usedNot used0.6610.6520.4880.5260.503
2MolNot usedNot usedNot used0.6680.6430.4690.5030.472
3WeightUsedNot usedNot used0.7510.7150.5670.5650.541
4MolUsedNot usedNot used0.7530.7510.5090.5260.519
5WeightNot usedUsedNot used0.7250.6570.3200.5820.535
6MolNot usedUsedNot used0.7190.6540.3050.5560.505
7WeightUsedUsedNot used0.7480.7110.5680.5820.543
8MolUsedUsedNot used0.7510.7490.5240.5370.526
9WeightUsedUsedUsed0.7700.7380.6710.5640.532
10MolUsedUsedUsed0.7680.7530.6200.5230.504
Table 6. Formulations for comparison of predicted and actual cleansing capabilities.
Table 6. Formulations for comparison of predicted and actual cleansing capabilities.
CategoryMaterial Name(A)(B)(C)(D)(E)(F)
Anionic surfactantPotassium cocoyl glutamate6713887
Amphoteric surfactantLauramidopropyl
Hydroxysultaine
594977
Nonionic surfactantsEicosaglycerol hexacaprylate1010 10
Decaglycerol isostearate 10
PEG-20 glyceryl
triisostearate
10 10
PolyolsPPG-9 diglyceryl ether 11 11
Cyclohexylglycerin9 14
Glycerin 1113
pH adjusterCitric acid0.80.80.80.80.80.8
BaseWater69.262.258.261.263.262.2
Cleansing capability testPrediction/%81.278.371.942.17.414.4
Actual/%85.885.939.846.38.33.6
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Hamaguchi, M.; Miwake, H.; Nakatake, R.; Arai, N. Predicting the Performance of Functional Materials Composed of Polymeric Multicomponent Systems Using Artificial Intelligence—Formulations of Cleansing Foams as an Example. Polymers 2023, 15, 4216. https://doi.org/10.3390/polym15214216

AMA Style

Hamaguchi M, Miwake H, Nakatake R, Arai N. Predicting the Performance of Functional Materials Composed of Polymeric Multicomponent Systems Using Artificial Intelligence—Formulations of Cleansing Foams as an Example. Polymers. 2023; 15(21):4216. https://doi.org/10.3390/polym15214216

Chicago/Turabian Style

Hamaguchi, Masugu, Hideki Miwake, Ryoichi Nakatake, and Noriyoshi Arai. 2023. "Predicting the Performance of Functional Materials Composed of Polymeric Multicomponent Systems Using Artificial Intelligence—Formulations of Cleansing Foams as an Example" Polymers 15, no. 21: 4216. https://doi.org/10.3390/polym15214216

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop