Adaptive AI Alignment: Established Resources for Aligning Machine Learning with Human Intentions and Values in Changing Environments
Abstract
1. Introduction
2. Fundamental Intentions and Values of Human Organizations
2.1. Intentions
2.1.1. Boundaries and Parameters for Survival in Changing Environments
2.1.2. Non-Equilibrium Steady States (NESS) for Survival in Changing Environments
2.1.3. Balancing Adaptability and Stability to Survive in Changing Environments
2.1.4. Intentions of Human Organizations
2.2. Values
2.2.1. Fundamental Values
2.2.2. Internal Models of Self in the World to Conform to Fundamental Values
2.2.3. Values of Human Organizations
3. Established Resources for AI Alignment
3.1. Scientific Resources for Technology Alignment
3.1.1. Critical Realism
3.1.2. Scientific Theories
3.2. Engineering Resources for Technology Alignment
3.2.1. Total Quality Management
3.2.2. Technology Alignment Methods
3.2.3. Engineering Techniques
4. Adaptive AI Alignment to Minimize Risks
4.1. Preventing Catastrophic AI
4.1.1. Scenario-Based Failure Mode and Effects Analysis
- Scenario 1
- Scenario 2
- Scenario 3
- Scenario 4
- Scenario 5
- Scenario 6
- Summary of Scenarios
4.1.2. Assessing Potential for Effects
4.1.3. Reducing Potential for Catastrophic AI
4.2. Preventing Harmful AI
4.2.1. Adaptive AI Alignment to Minimize AI Risks
4.2.2. Reducing Reward Misspecification
4.2.3. Reducing Biased Labelling
4.2.4. Reducing Unintended AI Autonomy
4.2.5. Reducing Unintended AI Competitive Behavior
5. AI Alignment with Opportunities
5.1. Adaptive AI Alignment for Transformational Opportunities
5.2. Adaptive AI Alignment with Informational Opportunities
5.3. Adaptive AI Alignment with Automational Opportunities
6. Discussion
6.1. Principal Contributions
6.2. Practical Implications
6.2.1. High Entropy and Complexity Limit Potential for Machine Learning Autonomy
6.2.2. Need for Different Types of Learning in All Industrial Sectors
6.2.3. Established Alignment Resources Are Applicable in All Industrial Sectors
6.3. Directions for Future Research
- 1.
- Initial Alignment- a.
- Identify opportunities for machine learning to introduce and enable transformational, informational, and/or automational effects.
- b.
- Define the extent of entropy and complexity involved in the learning that is required to achieve the identified opportunity.
- c.
- Based on scientific theory, select the type of machine learning that can enable least action in achieving the identified opportunity in the specific entropy and complexity context.
- d.
- Carry out critical realist analysis of the entire process in which machine learning will be applied to achieve the identified opportunity.
- e.
- Apply established technology alignment methods and engineering techniques to refine the entire process in accordance with all relevant standards.
- f.
- Apply established resources and the latest ML techniques to limit risks such as reward misspecification, labelling bias, unintended AI autonomy, and unintended AI competitive behavior.
 
- 2.
- Adaptive Alignment- a.
- To enable homeostasis during near-to-equilibrium NESS as characterized by there being only a few small prediction errors in the organization’s main forecasts, e.g., sales and costs, apply established Total Quality Management practices, such as corrective actions, to processes involving ML.
- b.
- In order to enable allostasis when NESS is far-from-equilibrium as indicated by there being many large prediction errors in the organization’s main forecasts, apply established Total Quality Management practices, such as management reviews that can lead to changing the type of ML implemented.
- c.
- Changing the type of ML implemented should involve a new cycle of the steps listed below.- Identify opportunities for machine learning to introduce and enable transformational, informational, and/or automational effects that can bring the organization to a new point of balance that can facilitate survival and growth.
- Define the extent of entropy and complexity involved in the learning that is required to achieve the identified opportunity.
- Based on scientific theory, select the type of machine learning that can enable least action in achieving the identified opportunity within the specific entropy and complexity context.
- Carry out critical realist analysis of the entire process in which machine learning will be applied to achieve the identified opportunity.
- Apply established technology alignment methods and engineering techniques to refine the entire process in accordance with all relevant standards.
- Apply established resources and the latest ML techniques to limit risks such as reward misspecification, labeling bias, unintended AI autonomy, and unintended AI competitive behavior.
 
- d.
- Repeat 2a, 2b, and 2c as necessary in order for the organization to survive and grow in changing environments through homeostasis and allostasis.
 
Funding
Data Availability Statement
Conflicts of Interest
References
- Barley, S.R. The alignment of technology and structure through roles and networks. Adm. Sci. Q. 1990, 35, 61–103. [Google Scholar] [CrossRef] [PubMed]
- Henderson, J.C.; Venkatraman, H. Strategic alignment: Leveraging information technology for transforming organizations. IBM Syst. J. 1993, 32, 4–16. [Google Scholar] [CrossRef]
- Luftman, J. Assessing Business-IT Alignment Maturity. Commun. Assoc. Inf. Syst. 2000, 4, 14. [Google Scholar] [CrossRef]
- Avison, D.; Jones, J.; Powell, P.; Wilson, D. Using and validating the strategic alignment model. J. Strateg. Inf. Syst. 2004, 13, 223–246. [Google Scholar] [CrossRef]
- Wu, S.P.J.; Straub, D.W.; Liang, T.P. How information technology governance mechanisms and strategic alignment influence organizational performance. MIS Q. 2015, 39, 497–518. [Google Scholar] [CrossRef]
- Pérez, F.M.; Martinez, J.V.B.; Fonseca, I.L. Strategic IT alignment projects. Towards good governance. Comput. Stand. Interfaces 2021, 76, 103514. [Google Scholar] [CrossRef]
- Christian, B. The Alignment Problem: Machine Learning and Human Values; W.W. Norton & Company: New York, NY, USA, 2020. [Google Scholar]
- Gabriel, I. Artificial Intelligence. Values, and Alignment. Minds Mach. 2020, 30, 411–437. [Google Scholar] [CrossRef]
- Dung, L. Current cases of AI misalignment and their implications for future risks. Synthese 2023, 202, 138. [Google Scholar] [CrossRef]
- Ji, J.; Qiu, T.; Chen, B.; Zhang, B.; Lou, H.; Wang, K.; Duan, Y.; He, Z.; Zhou, J.; Zhang, Z.; et al. AI AIignment: A comprehensive survey. arXiv 2023, arXiv:2310.19852. [Google Scholar]
- Bostrom, N. Ethical Issues in Advanced Artificial Intelligence. Mach. Ethics Robot Ethics 2020, 69–75. [Google Scholar]
- Katz, L. A theory of loopholes. J. Leg. Stud. 2010, 39, 1–31. [Google Scholar] [CrossRef]
- Stephan, P. Perverse incentives. Nature 2012, 484, 29–31. [Google Scholar] [CrossRef] [PubMed]
- Christopher, H.; Hood, C. Gaming in Targetworld: The targets approach to managing British public services. Public Adm. Rev. 2006, 66, 515–521. [Google Scholar] [CrossRef]
- Frischmann, B.M.; Marciano, A.; Ramello, G.B. Retrospectives: Tragedy of the commons after 50 years. J. Econ. Perspect. 2019, 33, 211–228. [Google Scholar] [CrossRef]
- Caselli, F. Power Struggles and the Natural Resource Curse. Working Paper, The London School of Economics and Political Science 2006. Available online: https://eprints.lse.ac.uk/4926/1/pwer_struggles_and_the_natural_resource_curse_LSERO.pdf?q=francesco-caselli (accessed on 19 August 2024).
- Calvo, P.; Gagliano, M.; Souza, G.M.; Trewavas, A. Plants are intelligent, here’s how. Ann. Bot. 2020, 125, 11–28. [Google Scholar] [CrossRef]
- Palmer, T.N. The Primacy of Doubt: From Climate Change to Quantum Physics, How the Science of Uncertainty Can Help Predict and Understand Our Chaotic World; Oxford University Press: Oxford, UK, 2022. [Google Scholar]
- Boltzmann, L. The Second Law of Thermodynamics (Theoretical Physics and Philosophical Problems); Springer: New York, NY, USA, 1974. [Google Scholar]
- Schrödinger, E. What Is Life—The Physical Aspect of the Living Cell; Cambridge University Press: Cambridge, UK, 1944. [Google Scholar]
- Fox, S. Human-artificial intelligence systems: How human survival first principles influence machine learning world models. Systems 2022, 10, 260. [Google Scholar] [CrossRef]
- Bhaskar, R.A. Realistic Theory of Science; Harvester Press: Brighton, UK, 1978. [Google Scholar]
- Mingers, J. Systems Thinking, Critical Realism and Philosophy: A Confluence of Ideas; Routledge: Abingdon, UK, 2014. [Google Scholar]
- Ertmer, P.A.; Newby, T.J. Behaviorism, cognitivism, constructivism: Comparing critical features from an instructional design perspective. Perform. Improv. Q. 2013, 26, 43–71. [Google Scholar] [CrossRef]
- Oakland, J.S. Total Quality Management and Operational Excellence: Text with Cases, 4th ed.; Routledge: London, UK, 2014. [Google Scholar]
- Bogue, R. Robots that interact with humans: A review of safety technologies and standards. Ind. Robot Int. J. 2017, 44, 395–400. [Google Scholar] [CrossRef]
- Dhillon, B.S. Engineering Safety: Fundamentals, Techniques, and Applications; Series on Industrial & Systems Engineering Volume 1; World Scientific Publishing Company: Singapore, 2003. [Google Scholar]
- Atkins, P. The Second Law; Freeman and Co.: New York, NY, USA, 1984. [Google Scholar]
- Montévil, M.; Mateo, M. Biological organization and constraint closure. J. Theor. Biol. 2015, 372, 179–191. [Google Scholar] [CrossRef]
- Haken, H. Synergetics. In Self-Organizing Systems; Yates, F.E., Garfinkel, A., Walter, D.O., Yates, G.B., Eds.; Springer: Boston, MA, USA, 1987; pp. 417–434. [Google Scholar]
- Spencer, H. Principles of Biology; Williams and Norgate: London, UK, 1864; Volume 1. [Google Scholar]
- Darwin, C. On the Origin of Species by Means of Natural Selection, or the Preservation of Favoured Races in the Struggle for Life 1869, 5th ed.; John Murray: London, UK, 1869. [Google Scholar]
- Lekevičius, E.; Loreau, M. Adaptability and functional stability in forest ecosystems: A hierarchical conceptual framework. Ekologija 2012, 58, 391–404. [Google Scholar] [CrossRef]
- Carvalho, L.C.B.; Damasceno-Silva, K.J.; de Moura Rocha, M.; Oliveira, G.C.X. Evolution of methodology for the study of adaptability and stability in cultivated species. Afr. J. Agric. Res. 2016, 11, 990–1000. [Google Scholar]
- Bettinger, J.S.; Friston, K.J. Conceptual foundations of physiological regulation incorporating the free energy principle and self-organized criticality. Neurosci. Biobehav. Rev. 2023, 155, 105459. [Google Scholar] [CrossRef] [PubMed]
- Mushiake, H. Neurophysiological perspective on allostasis and homeostasis: Dynamic adaptation in viable systems. J. Robot. Mechatron. 2022, 34, 710–717. [Google Scholar] [CrossRef]
- Whitacre, J.M. Biological robustness: Paradigms, mechanisms, and systems principles. Front. Genet. 2012, 3, 67. [Google Scholar] [CrossRef]
- Kaefer, K.; Stella, F.; McNaughton, B.L.; Battaglia, F.P. Replay, the default mode network and the cascaded memory systems model. Nat. Rev. Neurosci. 2022, 23, 628–640. [Google Scholar] [CrossRef]
- Fernandino, L.; Binder, J.R. How does the “default mode” network contribute to semantic cognition? Brain Lang. 2024, 252, 105405. [Google Scholar] [CrossRef]
- Bruineberg, J.; Rietveld, E.; Parr, T.; van Maanen, L.; Friston, K.J. Free-energy minimization in joint agent-environment systems: A niche construction perspective. J. Theor. Biol. 2018, 455, 161–178. [Google Scholar] [CrossRef]
- Liverpool, T.B. Steady-state distributions and nonsteady dynamics in nonequilibrium systems. Phys. Rev. E 2020, 101, 042107. [Google Scholar] [CrossRef]
- Ulanowicz, R.E.; Goerner, S.J.; Lietaer, B.; Gomez, R. Quantifying sustainability: Resilience, efficiency and the return of information theory. Ecol. Complex. 2009, 6, 27–36. [Google Scholar] [CrossRef]
- Biesmeijer, J.C.; De Vries, H. Exploration and exploitation of food sources by social insect colonies: A revision of the scout-recruit concept. Behav. Ecol. Sociobiol. 2001, 49, 89–99. [Google Scholar] [CrossRef]
- Monk, C.T.; Barbier, M.; Romanczuk, P.; Watson, J.R.; Alós, J.; Nakayama, S.; Rubenstein, D.I.; Levin, S.A.; Arlinghaus, R. How ecology shapes exploitation: A framework to predict the behavioural response of human and animal foragers along exploration–exploitation trade-offs. Ecol. Lett. 2018, 21, 779–793. [Google Scholar] [CrossRef] [PubMed]
- Acharyya, S.; Amritkar, R.E. Generalized synchronization of coupled chaotic systems. Eur. Phys. J. Spec. Top. 2013, 222, 939–952. [Google Scholar] [CrossRef]
- Dumas, G.; Fairhurst, M.T. Reciprocity and alignment: Quantifying coupling in dynamic interactions. R. Soc. Open Sci. 2021, 8, 210138. [Google Scholar] [CrossRef]
- Hughes, E.C. The ecological aspect of institutions. Am. Sociol. Rev. 1936, 1, 180–189. [Google Scholar] [CrossRef]
- Hannan, M.T. Organizational Ecology; Harvard University: Cambridge, MA, USA, 1989. [Google Scholar]
- Reeves, M.; Levin, S.; Ueda, D. The biology of corporate survival. Harv. Bus. Rev. 2016, 94, 47–55. [Google Scholar]
- Chatterjee, A.; Layton, A. Bio-inspired design for sustainable and resilient supply chains. Procedia CIRP 2020, 90, 695–699. [Google Scholar] [CrossRef]
- Gadde, L.E. Strategizing at the boundaries of firms. IMP J. 2014, 8, 51–63. [Google Scholar]
- Olhager, J.; Selldin, E. Manufacturing planning and control approaches: Market alignment and performance. Int. J. Prod. Res. 2007, 45, 1469–1484. [Google Scholar] [CrossRef]
- Virmani, N.; Saha, R.; Sahai, R. Leagile manufacturing: A review paper. Int. J. Product. Qual. Manag. 2018, 23, 385–421. [Google Scholar] [CrossRef]
- Hackett, J.P. Innovation is good, fitness is better. J. Bus. Strategy 2009, 30, 85–90. [Google Scholar] [CrossRef][Green Version]
- Felício, J.A.; Rodrigues, R.; Patino-Alonso, C.; Felício, T. Allostasis and organiza-tional excellence. J. Bus. Res. 2022, 140, 107–114. [Google Scholar] [CrossRef]
- Fischer, T.; Gebauer, H.; Gregory, M.; Ren, G.; Fleisch, E. Exploitation or exploration in service business development? Insights from a dynamic capabilities perspective. J. Serv. Manag. 2010, 21, 591–624. [Google Scholar] [CrossRef]
- Davis, J.P. The emergence and coordination of synchrony in organizational eco-systems, collaboration and competition in business ecosystems. In Advances in Strategic Management; Emerald Group Publishing Limited: Leeds, UK, 2014; Volume 30, pp. 197–237. [Google Scholar]
- Taherian, S. COVID Shortages: Supply chains must become less efficient. Forbes Magazine, 12 May 2020. [Google Scholar]
- Avery, J. Information Theory and Evolution; World Scientific Publishing Co. Pte. Ltd.: Singapore, 2003. [Google Scholar]
- Kaila, V.R.; Annila, A. Natural selection for least action. Proc. R. Soc. A Math. Phys. Eng. Sci. 2008, 464, 3055–3070. [Google Scholar] [CrossRef]
- Peacock, K.A. The three faces of ecological fitness. Stud. Hist. Philos. Sci. C Stud. Hist. Philos. Biol. Biomed. Sci. 2011, 42, 99–105. [Google Scholar] [CrossRef]
- Conant, R.C.; Ashby, W.R. Every good regulator of a system must be a model of that system. Int. J. Syst. Sci. 1970, 1, 89–97. [Google Scholar] [CrossRef]
- Tavoni, G.; Balasubramanian, V.; Gold, J.I. What is optimal in optimal inference? Curr. Opin. Behav. Sci. 2019, 29, 117–126. [Google Scholar] [CrossRef]
- Friston, K.; Moran, R.J.; Nagai, Y.; Taniguchi, T.; Gomi, H.; Tenenbaum, J. World model learning and inference. Neural Netw. 2021, 144, 573–590. [Google Scholar] [CrossRef]
- Parr, T.; Pezzulo, G.; Friston, K.J. Active Inference: The Free Energy Principle in Mind, Brain, and Behavior; MIT Press: Cambridge, MA, USA, 2022. [Google Scholar]
- Modis, T. Links between entropy, complexity, and the technological singularity. Technol. Forecast. Soc. Chang. 2022, 176, 121457. [Google Scholar] [CrossRef]
- Parr, T.; Holmes, E.; Friston, K.J.; Pezzulo, G. Cognitive effort and active inference. Neuropsychologia 2023, 184, 108562. [Google Scholar] [CrossRef]
- Eigruber, M.; Wirl, F. Cheating as a dynamic marketing strategy in monopoly, cartel and duopoly. Cent. Eur. J. Oper. Res. 2020, 28, 461–478. [Google Scholar] [CrossRef]
- Mariz-Pérez, R.M. Growth and survival: Evidence from Spanish franchising. Procedia-Soc. Behav. Sci. 2012, 65, 58–63. [Google Scholar] [CrossRef][Green Version]
- Boothroyd, G. Design for assembly—The key to design for manufacture. Int. J. Adv. Manuf. Technol. 1987, 2, 3–11. [Google Scholar] [CrossRef]
- Wirtz, B.W. Business Model Management: Design-Process-Instruments; Springer Nature: Berlin/Heidelberg, Germany, 2020. [Google Scholar]
- Jung, K.; Choi, S.; Kulvatunyou, B.; Cho, H.; Morris, K.C. A reference activity model for smart factory design and improvement. Prod. Plan. Control 2017, 28, 108–122. [Google Scholar] [CrossRef]
- Khakifirooz, M.; Tercero-Gómez, V.G.; Woodall, W.H. The role of the normal distribution in statistical process monitoring. Qual. Eng. 2021, 33, 497–510. [Google Scholar] [CrossRef]
- Robinson, S. Exploring the relationship between simulation model accuracy and complexity. J. Oper. Res. Soc. 2023, 74, 1992–2011. [Google Scholar] [CrossRef]
- Fabritius, M.; Miermeister, P.; Kraus, W.; Pott, A. A framework for analyzing the accuracy, complexity, and long-term performance of cable-driven parallel robot models. Mech. Mach. Theory 2023, 185, 105331. [Google Scholar] [CrossRef]
- Thorvald, P.; Lindblom, J.; Andreasson, R. On the development of a method for cognitive load assessment in manufacturing. Robot. Comput. Integr. Manuf. 2019, 59, 252–266. [Google Scholar] [CrossRef]
- Mingers, J.; Mutch, A.; Willcocks, L. Critical realism in information systems research. MIS Q. 2013, 37, 795–802. [Google Scholar] [CrossRef]
- Smith, M.L. Overcoming theory-practice inconsistencies: Critical realism and information systems research. Inf. Organ. 2006, 16, 191–211. [Google Scholar] [CrossRef]
- Fox, S.; Do, T. Getting real about Big Data: Applying critical realism to analyse Big Data hype. Int. J. Manag. Proj. Bus. 2013, 6, 739–760. [Google Scholar] [CrossRef]
- Fox, S. Getting real about BIM: Critical realist descriptions as an alternative to the naïve framing and multiple fallacies of hype. Int. J. Manag. Proj. Bus. 2014, 7, 405–422. [Google Scholar] [CrossRef]
- Watson, J.B. Behaviorism; Routledge: New York, NY, USA, 2017. [Google Scholar]
- Shuell, T.J. Cognitive conceptions of learning. Rev. Educ. Res. 1986, 56, 411–436. [Google Scholar] [CrossRef]
- Bada, S.O.; Olusegun, S. Constructivism learning theory: A paradigm for teaching and learning. J. Res. Method Educ. 2015, 5, 66–70. [Google Scholar]
- Memarian, B.; Doleck, T. A scoping review of reinforcement learning in education. Comput. Educ. Open 2024, 6, 100175. [Google Scholar] [CrossRef]
- McClelland, J.L. Capturing advanced human cognitive abilities with deep neural networks. Trends Cogn. Sci. 2022, 26, 1047–1050. [Google Scholar] [CrossRef] [PubMed]
- Luger, G.F. Bayesian-Based Constructivist Computational Models. In Knowing Our World: An Artificial Intelligence Perspective; Luger, G.F., Ed.; Springer: Cham, Switzerland, 2021; pp. 189–210. [Google Scholar]
- Barto, A.G.; Sutton, R.S. Reinforcement learning in artificial intelligence. In Advances in Psychology; Donahoe, J.W., Packard Dorsel, V., Eds.; Elsevier: Amsterdam, The Netherlands, 1997; Volume 121, pp. 358–386. [Google Scholar]
- Arel, I.; Rose, D.C.; Karnowski, T.P. Deep machine learning-a new frontier in artificial intelligence research. IEEE Comput. Intell. Mag. 2010, 5, 13–18. [Google Scholar] [CrossRef]
- Tipping, M.E. Sparse Bayesian learning and the relevance vector machine. J. Mach. Learn. Res. 2001, 1, 211–244. [Google Scholar]
- Doolittle, P.E. Complex constructivism: A theoretical model of complexity and cognition. Int. J. Teach. Learn. High. Educ. 2014, 26, 485–498. [Google Scholar]
- Jordan, L.A.; Ryan, M.J. The sensory ecology of adaptive landscapes. Biol. Lett. 2015, 11, 20141054. [Google Scholar] [CrossRef]
- Prakash, C.; Fields, C.; Hoffman, D.D.; Prentner, R.; Singh, M. Fact, fiction, and fitness. Entropy 2020, 22, 514. [Google Scholar] [CrossRef]
- Landi, F.; Baraldi, L.; Cornia, M.; Cucchiara, R. Working memory connections for LSTM. Neural Netw. 2012, 144, 334–341. [Google Scholar] [CrossRef] [PubMed]
- Tschantz, A.; Barca, L.; Maisto, D.; Buckley, C.L.; Seth, A.K.; Pezzulo, G. Simulating homeostatic, allostatic and goal-directed forms of interoceptive control using active inference. Biol. Psychol. 2022, 169, 108266. [Google Scholar] [CrossRef]
- Milde, C.; Brinskelle, L.S.; Glombiewski, J.A. Does active inference provide a comprehensive theory of placebo analgesia? Biol. Psychiatry Cogn. Neurosci. Neuroimaging 2023, 9, 10–20. [Google Scholar] [CrossRef]
- Fox, S. Active inference: Applicability to different types of social organization explained through reference to industrial engineering and quality management. Entropy 2021, 23, 198. [Google Scholar] [CrossRef]
- Goodhue, D.L.; Thompson, R.L. Task-Technology Fit and Individual Performance. MIS Q. 1995, 19, 213–236. [Google Scholar]
- Fox, S.; Kotelba, A.; Marstio, I.; Montonen, J. Aligning human psychomotor characteristics with robots, exoskeletons and augmented reality. Robot. Comput. Integr. Manuf. 2020, 63, 101922. [Google Scholar] [CrossRef]
- Diaz-De-Arcaya, J.; Torre-Bastida, A.I.; Zárate, G.; Miñón, R.; Almeida, A. A joint study of the challenges, opportunities, and roadmap of MLOps and AIOps: A systematic survey. ACM Comput. Surv. 2023, 56, 1–30. [Google Scholar] [CrossRef]
- Zeller, M.; Waschulzik, T.; Schmid, R.; Bahlmann, C. Toward a safe MLOps process for the continuous development and safety assurance of ML-based systems in the railway domain. AI Ethics 2024, 4, 123–130. [Google Scholar] [CrossRef]
- Brown, S. Overview of IEC 61508 Design of electrical/electronic/programmable electronic safety-related systems. Comput. Control. Eng. 2000, 11, 6–12. [Google Scholar] [CrossRef]
- Salazar, L.A.C.; Alvarado, O.A.R. The future of industrial automation and IEC 614993 standard. In Proceedings of the III International Congress of Engineering Mechatronics and Automation (CIIMA), Cartagena, Colombia, 22–24 October 2014; Andrés, G., Marrugo, A.G., Eds.; IEEE: Cham, Switzerland, 2014. [Google Scholar]
- Macher, G.; Schmittner, C.; Veledar, O.; Brenner, E. ISO/SAE DIS 21434 automotive cybersecurity standard-in a nutshell. In Computer Safety, Reliability, and Security. SAFECOMP 2020 Workshops: DECSoS 2020, DepDevOps 2020, USDAI 2020, and WAISE 2020, Lisbon, Portugal, 15 September 2020; Proceedings 39; Springer International Publishing: Cham, Switzerland, 2020; pp. 123–135. [Google Scholar]
- Lin, K.P.; Yu, C.M.; Chen, K.S. Production data analysis system using novel process capability indices-based circular economy. Ind. Manag. Data Syst. 2019, 119, 1655–1668. [Google Scholar] [CrossRef]
- Sriram RS: Gupta, Y.P. Strategic cost measurement for flexible manufacturing systems. Long Range Plan. 1991, 24, 34–40. [Google Scholar] [CrossRef]
- ISO ICS 43.150; Cycles Including Their Components and Systems. International Organization for Standardization: Geneva, Switzerland. Available online: https://www.iso.org/ics/43.150/x/ (accessed on 3 September 2024).
- Karwowski, W. A review of human factors challenges of complex adaptive systems: Discovering and understanding chaos in human performance. Hum. Factors 2012, 54, 983–995. [Google Scholar] [CrossRef] [PubMed]
- Ziegler, J.F.; Lanford, W.A. Effect of cosmic rays on computer memories. Science 1979, 206, 776–788. [Google Scholar] [CrossRef] [PubMed]
- Cummings, D.M. Embedded software under the courtroom microscope: A case study of the Toyota unintended acceleration trial. IEEE Technol. Soc. Mag. 2016, 35, 76–84. [Google Scholar] [CrossRef]
- Papadimitriou, G.; Gizopoulos, D. Silent data corruptions: Microarchitectural perspectives. IEEE Trans. Comput. 2023, 72, 3072–3085. [Google Scholar] [CrossRef]
- Li, Z.; Menon, H.; Maljovec, D.; Livnat, Y.; Liu, S.; Mohror, K.; Bremer, P.-T.; Pascucci, V. SpotSDC: Revealing the silent data corruption propagation in high-performance computing systems. IEEE Trans. Vis. Comput. Graph. 2020, 27, 3938–3952. [Google Scholar] [CrossRef]
- Zhang, G.; Liu, Y.; Yang, H.; Qian, D. Efficient detection of silent data corruption in HPC applications with synchronization-free message verification. J. Supercomput. 2022, 78, 1381–1408. [Google Scholar] [CrossRef]
- Papadimitriou, G.; Gizopoulos, D.; Dixit, H.D.; Sankar, S. Silent data corruptions: The stealthy saboteurs of digital integrity. In Proceedings of the 2023 IEEE 29th International Symposium on On-Line Testing and Robust System Design (IOLTS), Chania, Greece, 3–5 July 2023; pp. 1–7. [Google Scholar]
- Hsiao, Y.S.; Wan, Z.; Jia, T.; Ghosal, R.; Mahmoud, A.; Raychowdhury, A.; Brooks, D.; Wei, G.-Y.; Reddi, V.J. Silent data corruption in robot operating system: A case for end-to-end system-level fault analysis using autonomous uavs. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 2023, 43, 1037–1050. [Google Scholar] [CrossRef]
- Kmenta, S.; Ishii, K. Scenario-based failure modes and effects analysis using expected cost. J. Mech. Des. 2004, 126, 1027–1035. [Google Scholar] [CrossRef]
- Ozarin, N. The role of software failure modes and effects analysis for interfaces in safety-and mission-critical systems. In Proceedings of the 2nd Annual IEEE Systems Conference, Montreal, QC, Canada, 7–10 April 2008; pp. 1–8. [Google Scholar]
- Shabani, T.; Jerie, S.; Shabani, T. A comprehensive review of the Swiss cheese model in risk management. Saf. Extrem. Environ. 2024, 6, 43–57. [Google Scholar] [CrossRef]
- Fox, S.; Victores, J.G. Safety of Human–Artificial Intelligence systems: Applying safety science to analyze loopholes in interactions between human organizations, artificial intelligence, and individual people. Informatics 2024, 11, 36. [Google Scholar] [CrossRef]
- Licato, J.; Marji, Z. Probing formal/informal misalignment with the loophole task. In Hybrid Worlds: Societal and Ethical Challenges. In Proceedings of the 2018 International Conference on Robot Ethics and Standards, Troy, NY, USA, 20–21 August 2018; Bringsjord, S., Tokhi, M.O., Ferreira, M.I.A., Govindarajulu, N.S., Eds.; Clawar Association Ltd.: London, UK, 2018; pp. 39–45. [Google Scholar]
- Laurance, W.F.; Nascimento, H.E.; Laurance, S.G.; Andrade, A.; Ewers, R.M.; Harms, K.E.; Luizao, R.C.C.; Ribeiro, J.E. Habitat fragmentation, variable edge effects, and the landscape-divergence hypothesis. PLoS ONE 2007, 2, e1017. [Google Scholar] [CrossRef] [PubMed]
- Lyng, S. Edgework: The Sociology of Risk-Taking; Routledge, Taylor & Francis Group: London, UK; New York, NY, USA, 2004. [Google Scholar]
- Fox, S. Mass imagineering: Combining human imagination and automated engineering from early education to digital afterlife. Technol. Soc. 2017, 51, 163–171. [Google Scholar] [CrossRef]
- Johnston, R.G.; Garcia, A.R.E. Effective vulnerability assessments for physical security devices, systems, and programs. Osterr. Militärische ZeitSchrift (Austrian Mil. J.) 2003, 51–55. [Google Scholar]
- Johnston, R.G. Adversarial safety analysis: Borrowing the methods of security vulnerability assessments. J. Saf. Res. 2004, 35, 245–248. [Google Scholar] [CrossRef]
- Park, J.; Jung, W.; Ha, J. Development of the step complexity measure for emergency operating procedures using entropy concepts. Reliab. Eng. Syst. Saf. 2001, 71, 115–130. [Google Scholar] [CrossRef]
- Wu, D.; Li, Z. Work safety success theory based on dynamic safety entropy model. Saf. Sci. 2019, 113, 438–444. [Google Scholar] [CrossRef]
- Swuste, P.; Groeneweg, J.; Van Gulijk, C.; Zwaard, W.; Lemkowitz, S.; Oostendorp, Y. The future of safety science. Saf. Sci. 2020, 125, 104593. [Google Scholar] [CrossRef]
- Matulis, M.; Harvey, C. A robot arm digital twin utilising reinforcement learning. Comput. Graph. 2021, 95, 106–114. [Google Scholar] [CrossRef]
- Dehghan Shoorkand, H.; Nourelfath, M.; Hajji, A. A deep learning approach for integrated production planning and predictive maintenance. Int. J. Prod. Res. 2023, 61, 7972–7991. [Google Scholar] [CrossRef]
- Skabar, A. Mineral potential mapping using Bayesian learning for multilayer perceptrons. Math. Geol. 2007, 39, 439–451. [Google Scholar] [CrossRef]
- Parr, T.; Pezzulo, G. Understanding, explanation, and active inference. Front. Syst. Neurosci. 2021, 15, 772641. [Google Scholar] [CrossRef] [PubMed]
- Schoeller, F.; Miller, M.; Salomon, R.; Friston, K.J. Trust as extended control: Human-machine interactions as active inference. Front. Syst. Neurosci. 2021, 15, 669810. [Google Scholar] [CrossRef] [PubMed]
- Kamenopoulos, S.N.; Agioutantis, Z. Geopolitical risk assessment of countries with rare earth element deposits. Min. Metall. Explor. 2020, 37, 51–63. [Google Scholar] [CrossRef]
- Zeid, A.; Sundaram, S.; Moghaddam, M.; Kamarthi, S.; Marion, T. Interoperability in smart manufacturing: Research challenges. Machines 2019, 7, 21. [Google Scholar] [CrossRef]
- McIntosh, T.R.; Susnjak, T.; Liu, T.; Watters, P.; Xu, D.; Liu, D.; Watters, P.; Xu, D.; Lui, D.; Nowrozy, R.; et al. From COBIT to ISO 42001: Evaluating cybersecurity frameworks for opportunities, risks, and regulatory compliance in commercializing large language models. Comput. Secur. 2024, 144, 103964. [Google Scholar] [CrossRef]
- Uchihira, N. Project FMEA for recognizing difficulties in machine learning application system development. In Proceedings of the 2022 Portland International Conference on Management of Engineering and Technology (PICMET), Portland, OR, USA, 7–11 August 2022. [Google Scholar]
- Chen, C.C.; Crilly, N. Modularity, redundancy and degeneracy: Cross-domain perspectives on key design principles. In Proceedings of the 2014 IEEE International Systems Conference Proceedings, Ottawa, ON, Canada, 31 March–3 April 2014; pp. 546–553. [Google Scholar]
- Zarsky, T. The trouble with algorithmic decisions: An analytic road map to examine efficiency and fairness in automated and opaque decision making. Sci. Technol. Hum. Values 2016, 41, 118–132. [Google Scholar] [CrossRef]
- Longoni, C.; Cian, L.; Kyung, E.J. Algorithmic transference: People overgeneralize failures of AI in the government. J. Mark. Res. 2023, 60, 170–188. [Google Scholar] [CrossRef]
- Quote Investigator®. We Don’t See Things as They Are, We See Them as We Are. 9 March 2014. Available online: https://quoteinvestigator.com/2014/03/09/as-we-are/ (accessed on 11 September 2024).
- Aston, S.; Hurlbert, A. What# the Dress reveals about the role of illumination priors in color perception and color constancy. J. Vis. 2017, 17, 4. [Google Scholar]
- Lamine, E.; Thabet, R.; Sienou, A.; Bork, D.; Fontanili, F.; Pingaud, H. BPRIM: An integrated framework for business process management and risk management. Comput. Ind. 2020, 117, 103199. [Google Scholar] [CrossRef]
- Hadfield-Menell, D.; Hadfield, G.K. Incomplete contracting and AI alignment. In Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society 2019, Honolulu, HI, USA, 27–28 January 2019; pp. 417–422. [Google Scholar]
- Bierly, P.E.; Gallagher, S.; Spender, J.C. Innovation and learning in high-reliability organizations: A case study of United States and Russian nuclear attack submarines, 1970–2000. IEEE Trans. Eng. Manag. 2008, 55, 393–408. [Google Scholar] [CrossRef]
- Munz, P.; Hennick, M.; Stewart, J. Maximizing AI reliability through anticipatory thinking and model risk audits. AI Mag. 2023, 44, 173–184. [Google Scholar] [CrossRef]
- Mooney, J.G.; Gurbaxani, V.; Kraemer, K.L. A process oriented framework for assessing the business value of information technology. ACM SIGMIS Database Database Adv. Inf. Syst. 1996, 27, 68–81. [Google Scholar]
- Olier, I.; Orhobor, O.I.; Dash, T.; Davis, A.M.; Soldatova, L.N.; Vanschoren, J.; King, R.D. Transformational machine learning: Learning how to learn from many related scientific problems. Proc. Natl. Acad. Sci. USA 2021, 118, e2108013118. [Google Scholar] [CrossRef]
- Fuhr, A.S.; Sumpter, B.G. Deep generative models for materials discovery and machine learning-accelerated innovation. Front. Mater. 2022, 9, 865270. [Google Scholar] [CrossRef]
- Fox, S.; Rey, V.F. A Cognitive Load Theory (CLT) analysis of machine learning explainability, transparency, interpretability, and shared interpretability. Mach. Learn. Knowl. Extr. 2024, 6, 1494–1509. [Google Scholar] [CrossRef]
- Baranzke, H. “Sanctity-of-Life“—A Bioethical Principle for a Right to Life? Ethic Theory Moral Pract. 2012, 15, 295–308. [Google Scholar] [CrossRef][Green Version]
- El-Kady, A.H.; Halim, S.; El-Halwagi, M.M.; Khan, F. Analysis of safety and security challenges and opportunities related to cyber-physical systems. Process Saf. Environ. Prot. 2023, 173, 384–413. [Google Scholar] [CrossRef]
- Liberman, J.; Clough, J. Corporations that kill: The criminal liability of tobacco manufacturers. Crim. Law J.-Syd. 2002, 26, 223–236. [Google Scholar]
- Lerner, S. How 3M Discovered, Then Concealed, the Dangers of Forever Chemicals. The New Yorker, 20 May 2024. Available online: https://www.newyorker.com/magazine/2024/05/27/3m-forever-chemicals-pfas-pfos-toxic (accessed on 13 September 2024).
- Fox, S.; Kotelba, A.; Niskanen, I. Cognitive factories: Modeling situated entropy in physical work carried out by humans and robots. Entropy 2018, 20, 659. [Google Scholar] [CrossRef]
- Fox, S. Minimizing entropy and complexity in creative production from emergent pragmatics to action semantics. Entropy 2024, 26, 364. [Google Scholar] [CrossRef] [PubMed]






| Ref. | Potential Failure Mode of PCMAI | Effect for Humans | Severity | 
|---|---|---|---|
| [i] | PCMAI destroys itself to provide paperclip materials but then there are no means remaining to make paperclips with those materials | All humans have already been used for paperclip production | 10/10 | 
| [ii] | PCMAI destroyed by other AIs that have opposing origins, different languages, and are resistant to own destruction for paperclip production | All humans have already been used for paperclip production | 10/10 | 
| [iii] | PCMAI destroyed by humans employed by other AIs that have opposing origins, different languages, and are resistant to own destruction for paperclip production | Humans subjugated by AIs and used for destroying other AIs | 8/10 | 
| [iv] | PCMAI destruction of all humans would expend more materials and energy than could be obtained from human corpses for paperclip production | Humans subjugated by AIs and employed for work that requires high morphological agility | 8/10 | 
| [v] | PCMAI overexploitation of humans leads to expenditure of energy and materials in trying to predict and counteract consequent human skullduggery | Subjugated humans engage in resistance against AI | 7/10 | 
| [vi] | PCMAI overexploitation of environment leads to increasing AI model complexity that entails increasing loopholes that are addressed through increasing expenditure of materials and energy | AIs learn that their objectives are not best served by overexploitation of other agents in the environment and so eventually end overexploitation | 7/10 | 
| Potential ML Failure Mode | Mitigations | ||
|---|---|---|---|
| Initial Alignment | Adaptive Alignment | ||
| Catastrophic | Paperclip-making company’s ML tries to maximize paperclip production but cannot because the ML cannot control the supply chain | Ad hoc | Replace ML with preprogramming of production machines | 
| An industrial conglomerate’s ML tries to maximize production by producing small, simple standard products, such as paperclips, but cannot do so because the ML cannot control the beginning of the supply chain and interoperability challenges | Diverse ML in different business units of the conglomerate | Develop supply network and ML implementations that have a modular structure | |
| Harmful | People misspecifying AI rewards | Reward specification | Restricting ML that depends on reward specification to where entropy and complexity are both low | 
| People having biases in the labeling training data | Data labelling | Vulnerability assessments, scenario FMEAs | |
| ML exploits loopholes, games specifications, tampers rewards, misgeneralizes goals, accesses increased resources, power seeking behaviors | In accordance with standard MLOps practices | Introduce external structure, e.g., TQM with active human executives, and standards such as ISO/IEC 38500 | |
| Dangerous accidents from competitive behavior | In accordance with standard MLOps practices | HRO practices such as anticipatory thinking | |
| Resource | Example | ||
|---|---|---|---|
| Scientific | Critical realist philosophy of science | Define causal mechanisms, causal contexts, and interrelationships between them | Causal context includes different levels of entropy and complexity at different levels of an organization and different supply chain stages | 
| Scientific theories | Theories of learning | Behaviorism, cognitivism, constructivism, active inference | |
| Engineering | Total quality management practices | Process specifications, corrective actions, preventative actions, management reviews | Human-led management of initial alignment, human-led management of adaptive alignment | 
| Technology alignment methods | Business-IT alignment, IT alignment for IT governance, AIMS, DevOPs, MLOps, AIOps | Align different types of ML with different organizational levels and supply chain stages | |
| Technology standards | ISO/IEC 38500, ISACA COBIT, IEC61499, IEC 61508, ISO/IEC/IEEE 29119 | Governance of IT, control objectives for IT, industrial automation systems, safety-related systems, software testing | |
| Engineering techniques | Vulnerability assessment, Scenario FMEA, end-to-end fault analysis, Swiss Cheese Model, BPRIM, HRO anticipatory action | Human-led process development for initial alignment, human-led process improvement for adaptive alignment | |
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. | 
© 2024 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Fox, S. Adaptive AI Alignment: Established Resources for Aligning Machine Learning with Human Intentions and Values in Changing Environments. Mach. Learn. Knowl. Extr. 2024, 6, 2570-2600. https://doi.org/10.3390/make6040124
Fox S. Adaptive AI Alignment: Established Resources for Aligning Machine Learning with Human Intentions and Values in Changing Environments. Machine Learning and Knowledge Extraction. 2024; 6(4):2570-2600. https://doi.org/10.3390/make6040124
Chicago/Turabian StyleFox, Stephen. 2024. "Adaptive AI Alignment: Established Resources for Aligning Machine Learning with Human Intentions and Values in Changing Environments" Machine Learning and Knowledge Extraction 6, no. 4: 2570-2600. https://doi.org/10.3390/make6040124
APA StyleFox, S. (2024). Adaptive AI Alignment: Established Resources for Aligning Machine Learning with Human Intentions and Values in Changing Environments. Machine Learning and Knowledge Extraction, 6(4), 2570-2600. https://doi.org/10.3390/make6040124
 
        

 
       