Experimental Diagnostics of the Emotional State of Individuals Using External Stimuli and a Model of Neurocognitive Brain Activity

In this paper, we study ways and methods to diagnose the emotional state of individuals using external audiovisual stimuli and heart telemetry tools. We apply a mathematical model of neurocognitive brain activity developed specifically for this study to interpret the experimental scheme and its results. This experimental technique is based on monitoring and analyzing the dynamics of heart rate variability (HRV), taking into account the particular context and events occurring around the subject of the study. In addition, we provide a brief description of the theory of information images/representations used for the paradigm and interpretation of the experiment. For this study, we viewed the human mind as a one-dimensional potential hole with finite walls of different sizes and an internal potential barrier modeling the border between consciousness and subconsciousness. We also provided the foundations of the mathematical apparatus for this particular view. This experiment allowed us to identify the characteristic markers of influencing external stimuli, which form a foundation for diagnosing the emotional state of an individual.


Introduction
Heightened stress states and emotional negativization can significantly affect the psychological and physiological state of individuals. This problem becomes exponentially more relevant in today's networked world. The reasons for these adverse effects can be both external (pandemic, wars, migration crises, etc.) and internal, such as from increased social pressure. The latter occurs because modern individuals become increasingly more integrated into a variety of different information processes, which, in turn, produce a growing number of corresponding external stimuli. This is why it is important to have the necessary tools for diagnosing the emotional state (and related phenomena, such as emotional maladjustment, increased anxiety, etc.) of the individual. Given the scale and prevalence of the phenomenon, such a diagnosis has to be large-scale and simple to implement without significant loss of accuracy.
Numerous methods and approaches, for example, the SAM (Self-Assessment Manikin-) questionnaire [1,2], identifies emotions by linking physiological and self-reported indicators.
There are also several works devoted to the use of IAPS in multimodal neuroimaging, for example- [3].
All of these methods often rely on the use of images or, in some cases, sounds (music, sounds of nature) as a stimulus material [4][5][6][7][8]. In addition, scholars actively use modern neurofeedback technologies that utilize resonance stimulation of the endogenous rhythmic activity of the brain. These methods can induce certain functional states in an individual, Accordingly, emotions can be used as a particular tool/marker for the development of an appropriate model and scheme for initiating external stimuli and diagnosing characteristic states of an individual, including the emotional states.
Moreover, one of the aspects of human cognitive activity is the fact that individuals do not think in codes (similar to a computer); our mind operates with multiple interacting images/representations. These images have a very specific physical structure at their functional core (namely, electrical and chemical activity in the human brain). However, their description from the point of view of conventional mathematical models is difficult for a number of reasons [15,16].
Therefore, to develop an appropriate experimental scheme, we need to propose a model capable of correctly displaying the dynamics of information images/representations during the cognitive activity (expressed in the neural activity of a set of synapses and patterns) and give this model a conceptual and mathematical embodiment. This model should qualitatively explain the induced processes in the context of the experiment and predict/interpret the final result to deliver the diagnosis. This article proposes new methods for describing the activity of information images (which, in turn, model the neurocognitive activity of an individual) based on the mathematical apparatus of quantum physics (potential holes and virtual particles used in physics to describe fundamental interactions [17,18]).

Experimental Approaches to Diagnosing Emotions
The following basic methods are used for diagnosing emotions: 1.
Analysis of vegetative and somatic signs of emotional states (electromyography of the face-stimulation of facial muscles, electrical activity of the skin, body temperature, pupil dilation, eye-tracking-analysis of oculomotor movement, blinking on startle, posterior reflex, analysis of heart rate variability).
An important parameter in psychological diagnostics is ambivalence, which allows researchers to keep the subject focused on the experiment without any distractions and, at the same time, obtain reliable data. One of these methods is the analysis of heart rate variability. A heart rate variability study includes tracking the heart rate, the average duration of the RR interval, the number of RR-interval-pairs, and the power of the highfrequency band [2,11,19].
HRV is the time interval from the start of a cycle of one heartbeat to the start of another heartbeat. This interval is constantly changing and can be observed even during rest in a supine position. HRV indices are reliable data for identifying the tone of the ANS (autonomic system, its sympathetic and parasympathetic divisions) and for analyzing the controllability of physiological functions of the body. Scholars use HRV to assess the changes and dynamics of the psycho-emotional state of an individual [20]. Analyzing emotions using heart rate variability has several advantages. It is one of the most convenient indicators-it is based on the performance of the cardiovascular system of the body. Another advantage of this method is that the necessary data can be obtained via a small sensor. This sensor is easy to put on and it does not hinder the movement of a subject. The subject can even forget about a sensor, which will further minimize the impact of the experiment on their natural state, thereby, making the obtained data more accurate [21][22][23].
The novelty of the proposed experimental approach lies in the fact that we use methods suitable for ambient (background) monitoring (ambient registration of physiological signals), which do not involve human cognitive resources at the time of registration [24].
To study the effects of auditory and visual analyzers, most researchers use the IAPS (developed by P. Lang and consists of 1182 images) and IADS (developed by M. Bradley and in the second edition consists of 168 sound compositions [25][26][27][28]) databases.
These photos and sounds have a wide variety of tones and scenarios, and represent numerous contexts. The sounds and photos, therefore, are recognizable by subjects from different countries and can trigger similar emotional reactions. To assess the perception of the stimuli by individuals, the authors of the abovementioned databases used the SAM method (Cook III EW, 1987). The advantage of this technique is that it is simple, can be easily used by all subjects, and can even be applied to children. In the original SAM method, emotions are assessed on three scales: (1) by valence (sign),from extremely negative (1-despair, depression) to extremely positive (9-joy, euphoria), (2) by arousal (somatic)-from relaxed (1) to active (9), and (3) by the ability to control emotion-from complete lack of control (1) to complete control (9). The data were labeled on a large sample based on the assessments of the subjects.
The effectiveness of the application of stimuli from this standard database has been shown in numerous laboratories around the world [27,28]. The set of photographs and images presented in the database covers a variety of aspects of the human emotional sphere and can trigger complex subjective and autonomic emotional reactions that vary depending on the valence and activating content of the stimulus.
The key methodological problem of the study of emotions using subjective assessments boils down to the fact that these assessments can be influenced by many external factors, such as gender, cultural affiliation, and other personal characteristics of the individual [29].
In addition, the validity of subjective reports is strictly limited in time: the longer the time between the assessment of the state and the emotional event, the less its reliability. In this regard, the study of the emotional sphere of a person should rely on the objective methods for assessing emotions obtained both from the central (EEG) and from the autonomic nervous system (ECG, GSR, respiration). With an integrated approach to the study of emotions, special attention should be paid to the assessment of the behavioral reactions of the subject.
The cognitive component of the assessment of emotions is based on assessing the stimulus and individual's state both before and after exposure to a stimulus. These databases proved to be effective for studying HRV and emotional manifestations triggered by stimuli of different valences. However, the specifics of the sample should not be disregarded. Studies show that stimuli of different valence modify the initial state of the individual differently. Therefore, we can assume that the study of heart rate variability can help track the occurrence of emotions and possibly even their valence.
The Self-Assessment Manikin (SAM) Questionnaire (Cook III EW, 1987) is a model for self-assessment of emotions. It involves a non-verbal assessment of three emotional states: the sign of emotion or valence (valence), the strength of the emotion (arousal), and the impact on self-esteem or dominance (dominance). The technique includes three rows of pictures (one row for each of the emotions), each containing five degrees of manifestation of emotion from "low" to "high".
Sometimes subjects find it difficult to describe their emotions. The reasons for that could be semantic (incomplete understanding of meaning) or lexical (does not know how to describe the manifested emotion), etc. Therefore, the preferred way to determine the actual genesis of emotions of individuals in the sample is to use standardized express methods based on pictographic design. This will simplify the correlation of the actual state of the subject and the parameters of the method (scale).
Let us move on to our model representation of the neurocognitive activity of the brain and the way it interprets an individual's emotional response to external stimuli.

The Foundations of the Theory of Information Images/Representations and Its Model
The proposed theory is based on the idea of a universal cognitive unit [15] of information in the human mind-an information representation (or image), the space in which it exists, its topology, and properties. Information representations (hereinafter referred to as IRs) can be defined as displays of objects and events in a given space.
Correspondingly, the theory of information representations (hereinafter referred to as TIR) is a way to describe information interactions of individuals, as well as several cognitive functions of a person.
TIR views the human mind and neurocognitive activity of the brain as a large structured number of interacting images which are constantly affected by external factors.
Images with higher energy (we introduce the notion of energy E to describe the communicative activity of images) are located "above" and closer to the edge of the individual's information image space, which is why they interact with external elements much more often. Meanwhile, images with lower energy and longer response rate are located closer to the center of the space and relatively rarely interact with external stimuli.
TIR gives an alternative perspective on some characteristic regularities in the human mind, allowing for a correct interpretation and explanation of some of them.
The information image cannot be communicated between individuals without changes. Each IR is unique because each individual has specific individual experiences.
Besides, an individual is unable to convey the image that exists in their mind, in their IR space to another individual directly. For this purpose, they use various communication apparatuses formed within the social superstructure of communication or the communication field (CF). A communication field (CF) is an information-based community of individual experiences and collective unconsciousness formed as a result of an individual's presence in society. The communicative apparatus includes speech, visual, tactile, symbolic, and other ways of information transmission.
In previous works, the mathematical model of interaction of information representations was based on diffusion equations (first of all, on the Langevin equation). This approach allowed us to describe several special cases of cognitive processes but also had some limitations related to the specificity of IRs. More details about TIR and its application to real cognitive processes and experimental testing are available in [16].
Additionally, the interaction of individuals is a process of emission and absorption (processing and perception) of IRs. Thus, we can describe any information interaction as a result of virtual processes related to IR.
The classical method for describing a virtual process (a process involving virtual particles) is the Feynman diagram method [17]. However, the direct application of such a device for IRs would hardly be possible.
Within the model, the interaction of two individuals is represented as the interaction of two systems with the help of communication fields in the information environment ( Figure 1). Also, of interest is the impact of a given information environment (e.g., media, Internet resources, social environment, etc.) on an individual.
The interaction function for these two individuals or an individual and the external action should be written accordingly.
Since we are talking about the IR space, it is obvious that this space will have certain coordinates, and the particles simulating the IR movement in this space will also have a certain momentum, energy. At the same time, we should keep in mind the presence of local minimum potential particle energy in this space. Thus, a mind can be represented as a potential well, inside of which information representations make vibrational movements. In physics, we commonly solve the problem of quantization of the field inside the potential well to determine energy levels (by solving the stationary Schrödinger equation [18]). However, in this case, we know about the presence of particles in advance. The interaction function for these two individuals or an individual and the external action should be written accordingly.
Since we are talking about the IR space, it is obvious that this space will have certain coordinates, and the particles simulating the IR movement in this space will also have a certain momentum, energy. At the same time, we should keep in mind the presence of local minimum potential particle energy in this space. Thus, a mind can be represented as a potential well, inside of which information representations make vibrational movements. In physics, we commonly solve the problem of quantization of the field inside the potential well to determine energy levels (by solving the stationary Schrödinger equation [18]). However, in this case, we know about the presence of particles in advance.
The potential well can be represented in the simplest form as- Figure 2.  The potential well can be represented in the simplest form as- Figure 2. The interaction function for these two individuals or an individual and the external action should be written accordingly.
Since we are talking about the IR space, it is obvious that this space will have certain coordinates, and the particles simulating the IR movement in this space will also have a certain momentum, energy. At the same time, we should keep in mind the presence of local minimum potential particle energy in this space. Thus, a mind can be represented as a potential well, inside of which information representations make vibrational movements. In physics, we commonly solve the problem of quantization of the field inside the potential well to determine energy levels (by solving the stationary Schrödinger equation [18]). However, in this case, we know about the presence of particles in advance.
The potential well can be represented in the simplest form as- Figure 2. The value-U(x) describes the potential energy profile in the well. In this model, the indicator of potential energy reflects the levels of activity of information images. This is directly related to the movement of images. The higher levels correspond to IR, which are now in an activated state due to cognitive activity, while low levels represent images/representations that do exist in individual's mind, but have not been activated for a long time, which means they remain in the subconscious zone. That is, IR with high energy are determined by the set of thoughts of a person at the moment, and with IRs with low energy are responsible for information processes in the subconscious.
When a particle crosses the potential barrier in the form of a wall, it simulates the information interaction of an individual (i.e., particle disturbance, the transition from one energy level to a higher one).
A more interesting case of a potential well is the well with uneven walls and an additional internal barrier. Such barrier and heterogeneity of walls simulate certain specific properties of the human mind, in particular, the conditional division into consciousnesssubconsciousness, complexity of interaction with the external environment, etc. Theoretically, depending on the mental state of the modeled individual (or their individual cognitive function), such barriers may be more complex and repeated multiple times- Figure 3.
The value-U(x) describes the potential energy profile in the well. In this model, the indicator of potential energy reflects the levels of activity of information images. This is directly related to the movement of images. The higher levels correspond to IR, which are now in an activated state due to cognitive activity, while low levels represent images/representations that do exist in individual's mind, but have not been activated for a long time, which means they remain in the subconscious zone. That is, IR with high energy are determined by the set of thoughts of a person at the moment, and with IRs with low energy are responsible for information processes in the subconscious.
When a particle crosses the potential barrier in the form of a wall, it simulates the information interaction of an individual (i.e., particle disturbance, the transition from one energy level to a higher one).
A more interesting case of a potential well is the well with uneven walls and an additional internal barrier. Such barrier and heterogeneity of walls simulate certain specific properties of the human mind, in particular, the conditional division into consciousnesssubconsciousness, complexity of interaction with the external environment, etc. Theoretically, depending on the mental state of the modeled individual (or their individual cognitive function), such barriers may be more complex and repeated multiple times- Figure 3.
Thus, the impact of the external information environment on the mind of an individual can be described as the impact on particles (modeling information images) in a potential well. Let us record the equations for the case in Figure 2. Thus, the impact of the external information environment on the mind of an individual can be described as the impact on particles (modeling information images) in a potential well. Let us record the equations for the case in Figure 2.
The Schrödinger equation outside the potential well for the information image (IR) will look as follows: where, ϕ out is a wave function outside the potential well. E is the total energy of an IR. m is mass (complexity) of IR. is the Planck constant. By introducing the interaction coefficient k 1 , We obtain, The external impact will be recorded by disturbance V(x), which is added to the general form of the Schrödinger equation: Disturbance can be set in different ways, depending on its type, for example, by normal distribution or other methods.
More details about the model and TIR are available in [16,18]. In this interpretation, individual's response to the informational impact will be presented in the form of activation (i.e., moving the image/images) from a relatively low level to a higher one. From the point of view of the TIR, images used unconsciously have significantly greater inertia, which leads to a relatively large amount of time for their use in conscious communication. At the same time, the human brain, as a rule, tends to choose the closest images for interaction.
The impact in terms of TIR is as follows ( Figure 4): We obtain, The external impact will be recorded by disturbance V(x), which is added general form of the Schrödinger equation: Disturbance can be set in different ways, depending on its type, for example, b mal distribution or other methods.
More details about the model and TIR are available in [16,18]. In this interpretation, individual's response to the informational impact will b sented in the form of activation (i.e., moving the image/images) from a relatively low to a higher one. From the point of view of the TIR, images used unconsciously ha nificantly greater inertia, which leads to a relatively large amount of time for their conscious communication. At the same time, the human brain, as a rule, tends to c the closest images for interaction.
The impact in terms of TIR is as follows ( Figure 4): CF disturbance -any image / sound, etc.  The figure shows an external stimulus (one or many). As a result of the excitation of the IR, it (IR) moves to a higher energy level. Th of its activation corresponds to the time of decision-making by the individual and psychophysiological reaction of the individual, changes in their functional state, inc the emotional reaction corresponding to their state (including stress). Therefore, d the experiment, it is necessary to create a stable and statistically reliable relatio The figure shows an external stimulus (one or many). As a result of the excitation of the IR, it (IR) moves to a higher energy level. The time of its activation corresponds to the time of decision-making by the individual and/or the psychophysiological reaction of the individual, changes in their functional state, including the emotional reaction corresponding to their state (including stress). Therefore, during the experiment, it is necessary to create a stable and statistically reliable relationship between the psychophysiological response to a provoking stimulus and the presence/absence of changes in the emotional state. Now, it is necessary to discuss the features of our experimental approach to identifying such markers.

Experimental Methods
The method of event-related telemetry of the heart rate [24]. The technology provides monitoring and analysis of the dynamics of heart rate variability (HRV) taking into account the event context. The sequence of time intervals between heartbeats (rhythmogram) is recorded with chest plastic electrodes. Primary signal processing and data transmission to a smartphone is performed by the Zephyr HxM ™ Smart Heart Rate Monitor (HxM, Zephyr Technology, New Zealand) via Bluetooth. The specialized Android (4.4 and above) application "Stress monitor" performs the real-time monitoring and transfers data to the cloud server. Rhythmographic visualization, spectral analysis, and detection of stress episodes are implemented in the specialized online-service "Stress Monitor" (cogninn.ru) [5]. The architecture of the event-based telemetry technology is shown in Figure 5. mission to a smartphone is performed by the Zephyr HxM ™ Smart Heart Rate Monitor (HxM, Zephyr Technology, New Zealand) via Bluetooth. The specialized Android (4.4 and above) application "Stress monitor" performs the real-time monitoring and transfers data to the cloud server. Rhythmographic visualization, spectral analysis, and detection of stress episodes are implemented in the specialized online-service "Stress Monitor" (cogninn.ru) [5]. The architecture of the event-based telemetry technology is shown in Fig  Event-related telemetry uses specialized hardware and software for detecting early biomarkers of extreme conditions in real-time. This complex neither limits the mobility of a test subject, nor attracts their attention to the process. To collect telemetry data and detect stress, the technology uses the dedicated StressMonitor application based on cogninn.ru. The controlled activation of primary cognitive functions in the contexts of sensorimotor activity is done by a Web-platform at platform.apway.ru.
This technique enables continuous long-term collection, transmission, accumulation, and preprocessing of time-synchronized cardiac rhythmographic records, spacial data on the trajectory of a person's movement in a room and open space, video, and audio surveillance data, and the results of psychophysiological tests.
In our case, this technique helps us determine the functional states of the subjects. It analyzes the way subjects function and adapt to changing environmental conditions and determines their functional level. This analysis is based on the following parameters: (1) Average heart rate (HR); (2) Average RR-intervals; Event-related telemetry uses specialized hardware and software for detecting early biomarkers of extreme conditions in real-time. This complex neither limits the mobility of a test subject, nor attracts their attention to the process. To collect telemetry data and detect stress, the technology uses the dedicated StressMonitor application based on cogni-nn.ru. The controlled activation of primary cognitive functions in the contexts of sensorimotor activity is done by a Web-platform at platform.apway.ru.
This technique enables continuous long-term collection, transmission, accumulation, and preprocessing of time-synchronized cardiac rhythmographic records, spacial data on the trajectory of a person's movement in a room and open space, video, and audio surveillance data, and the results of psychophysiological tests.
In our case, this technique helps us determine the functional states of the subjects. It analyzes the way subjects function and adapt to changing environmental conditions and determines their functional level. This analysis is based on the following parameters: (1) Average heart rate (HR); (2) Average RR-intervals; (3) Average power of the low-frequency band of the total HRV spectrum (LF); (4) Average power of the high-frequency band of the total HRV spectrum (HF); (5) Average ratio of slow and fast bands of the general HRV spectrum (a measure of sympathovagal balance) (LF / HF); (6) Average power of the HRV spectrum (TP); (7) Stress Index (SI); (8) Functional reserve (FR); (9) Stress Index (SN).
Our study focuses on their combination in response to audiovisual stimulus material. It is important to determine those individual physiological markers recorded during day-to-day activity which could serve as indicators of emotional maladjustment, i.e., emotional stress and emotional exhaustion. Information about these markers can give individuals timely feedback and demonstrate the level of their current mental stress. Subsequently, the individual can temporarily reduce this stress by switching to physical or other types of activity. In this case, we used the method of assessing the level of emotional maladjustment for diagnostics of individual states [30].

Experiment and Sampling
The samples used in this study are presented in the Appendix B. The study was divided into two stages ( Figure 6): the preparatory, where we standardized stimulus sets, and the experimental, where we studied connections between the autonomic and cognitive assessment of emotions.
day-to-day activity which could serve as indicators of emotional maladjustment, i.e., emotional stress and emotional exhaustion. Information about these markers can give individuals timely feedback and demonstrate the level of their current mental stress. Subsequently, the individual can temporarily reduce this stress by switching to physical or other types of activity. In this case, we used the method of assessing the level of emotional maladjustment for diagnostics of individual states [30].

Experiment and Sampling
The samples used in this study are presented in the Appendix B. The study was divided into two stages ( Figure 6): the preparatory, where we standardized stimulus sets, and the experimental, where we studied connections between the autonomic and cognitive assessment of emotions.

Preparatory Stage:
The stimulus material for the study was selected from the IADS and IAPS databases. The average indicators according to the SAM method were taken based on the research of R.A. Stevenson conducted on American subjects [28].
The study involved 83 people aged 17 to 24, nine of whom were men, and 74 women. All participants had no problems associated with cardiovascular, psychiatric, respiratory

Preparatory Stage:
The stimulus material for the study was selected from the IADS and IAPS databases.  [28].
The study involved 83 people aged 17 to 24, nine of whom were men, and 74 women. All participants had no problems associated with cardiovascular, psychiatric, respiratory diseases, and they did not take any medications. All subjects were informed and signed informed consent to participate in the study.
The study was conducted in a room isolated from excessive noise and with a comfortable workplace. The study conditions met the basic requirements for psychological testing.
At the very beginning, the subjects were asked to familiarize themselves with the informed consent to participate in the study. After that, the subjects were asked to randomly listen to 30 recordings and give each a score on the SAM scales.
2. Development of an experimental model for cognitive manifestations (assessment) of affective images included 84 photographs from IAPS. The photos were selected according to their compliance with the criteria of the Pleasure scale of the SAM method. The average SAM values were taken based on a study by P. Lang conducted on American subjects [26,27].
The study involved 21 people aged 20 to 24, 12 women and nine men. No participants had any cardiovascular, psychiatric or respiratory diseases, and they did not take any medications. All subjects were informed and signed informed consent to participate in the study.
The study was conducted in a room isolated from excessive noise and with a comfortable workplace. The study conditions met the basic requirements for psychological testing.
At the very beginning, the subjects were asked to familiarize themselves with the informed consent for participation in the study. Subsequently, the subjects assessed 28 positive, 28 neutral, and 28 negative images using the SAM rating scales.
Before each photograph was displayed for the assessment, there was a break of 5 s, the photograph was displayed for 6 s, then SAM scales were provided for evaluating the photograph. These time intervals were selected as a result of the analysis of several scientific articles on this topic [1].
This experimental model showed suboptimal efficiency. The exposure time of the pictures turned out to be very short. As a result, we decided to increase the duration of the presentation of affective images and use images with extreme degrees of valence.

Principal Stage:
For the main research model, we selected the sets that simulate the impact of an external information image/representation: These affective stimuli, both auditory and visual, were randomly formed into separate blocks: • Negative sound-with 12 negative sounds; • Positive sound-with 12 positive sounds; • Negative video sequence-with 12 negative images; • Positive video sequence-with 12 positive images.
Next, we used the SAM method to evaluate the parameters of the sets. For this, we invited 30 people aged 20 to 23 (24 women and six men). Fifteen people were invited for the experiment with sound and 15 for the video sequence. Like the previous group, these individuals did not have any pronounced health problems, and they also signed informed consent to participate in the experiment. The experiment took place under similar conditions. The subjects were asked to listen/watch the set and then evaluate it using the SAM method. Thus, the valence of the resulting stimulus materials was confirmed.
For the next and principal research part, we decided to use the following research procedure. Four audiovisual stimulus materials were compiled based on the sets obtained during the previous stage. We decided to run two separate tests using the aforementioned sets: 1. Basic, with consistently lined up negative and positive videos 2.
Dissonance, With dissonant videos 1 and 2 sequentially lined up for presentation.
The main stage of the study involved 37 people aged 17 to 56. Fifteen individuals were in the Dissonance set and 22 in the Basic set. Ten of the individuals were women and 15 were men. Twenty individuals gave standard SAM responses.
Like the previous subjects, the subjects did not have any pronounced health problems, and they also signed an informed consent to participate. The experiment took place under similar conditions. During the study, the subjects were wearing event-communication telemetry sensors. After the placing the sensor, a calibration orthopedic test was carried out to register eventrelated telemetry of the heart rate. The telemetry was registered while sitting and standing. Each recording lasted 3 min. Three minutes was given to track the state of the parasympathetic and sympathetic systems, because it takes 100 s to register their values, and then the equipment tracked the dynamics of these systems in segments of 10 s. The subjects were then asked to take a test to monitor the level of their emotional maladjustment (EML).
After that, a basic or dissonant complex of audiovisual sets was presented to the subjects. After each set, the subjects were asked to rate their impression of the images using the SAM method, as well as EML. Each block consisted of 12 images and 12 sounds presented in parallel. Before each presentation there was a break of 1 s. The audiovisual set was shown for 15 s; for 10 of them, the image flickered, and the sound decreased its volume to attract the subject's attention and move on to the next set. During the breaks between images, the screen turned red for negative images and green for positive images in order to enhance the stimulus effect. Break color was based on scientific literature data. Red is an active color; green is a calming color. The research procedure lasted approximately 24 min.

Results
The selected IAPS material allowed us to determine the mean values for the valence scale in the American and Russian samples (Appendix C). A special symbol (*) marks stimuli with significant differences (p < 0.05) according to the Wilcoxon signed-rank test (Figure 7). We can reliably say that positive images in the Russian sample were assessed less positively, negative images less negatively, and neutral ones more positively than in the original sample.
When comparing unimodal heterovalent stimuli combined into sets of 12 images and 12 sounds, significant differences were found in the comparison between the assessment of the valence of negative and positive video sequences, as well as negative and positive audio sequences (Figures 8-10). We can reliably say that positive images in the Russian sample were assessed less positively, negative images less negatively, and neutral ones more positively than in the original sample.
When comparing unimodal heterovalent stimuli combined into sets of 12 images and 12 sounds, significant differences were found in the comparison between the assessment of the valence of negative and positive video sequences, as well as negative and positive audio sequences (Figures 8-10).
We can reliably say that positive images in the Russian sample were assessed less positively, negative images less negatively, and neutral ones more positively than in the original sample.
When comparing unimodal heterovalent stimuli combined into sets of 12 images and 12 sounds, significant differences were found in the comparison between the assessment of the valence of negative and positive video sequences, as well as negative and positive audio sequences (Figures 8-10).     The mean values and standard errors of the mean estimates of Valence for stimuli from the Base and Dissonance complexes are as follows: We compared the mean values of assessments on the Valence scale for the sets with different modalities and parts of the Basic Complex (negative and positive audiovisual series)- Table 1. The mean values and standard errors of the mean estimates of Valence for stimuli from the Base and Dissonance complexes are as follows: The mean values and standard errors of the mean estimates of Valence for stimuli from the Base and Dissonance complexes are as follows: Upon presentation of the Dissonance set, the absolute value of the assessment on the "valence" scale decreased, and a reduction of emotions was observed. At the same time, the study revealed a tendency of the subjects to a negative assessment of these stimulus sets. Regardless of the type of presentation of multimodal stimuli in this complex, equal negative values were obtained in the mean values for the sample (−1.2). It turns out that the addition of any negative stimulus, whether a visual informational image or an auditory one, inclines subjects (in a subjective assessment) to a negative attitude towards the environment.

Influence of the Interaction of Different-Modal Affective Stimuli on the Level of Emotional Maladjustment
Let us consider the interaction of different-modal affective stimuli on the level of emotional maladjustment as one of the important elements accompanying the stressful state of an individual.
For the basic scenario, we have (Figures 11 and 12): Upon presentation of the Dissonance set, the absolute value of the assessment on the "valence" scale decreased, and a reduction of emotions was observed. At the same time, the study revealed a tendency of the subjects to a negative assessment of these stimulus sets. Regardless of the type of presentation of multimodal stimuli in this complex, equal negative values were obtained in the mean values for the sample (−1.2). It turns out that the addition of any negative stimulus, whether a visual informational image or an auditory one, inclines subjects (in a subjective assessment) to a negative attitude towards the environment.  When compared using the Wilcoxon W-test, significant differences were found in Safety and Independence scales in comparison with pre-stimulation and after negat stimulation, as well as in Safety and Unity after negative and positive stimulation. I noteworthy that in this comparison, no significant differences were found between B and Good sets.
The level of emotional maladjustment in the Basic Set.
• During the experiment, the maladjustment was mild; • After negative stimulation, there was an increase in EML; • After the positive stimulation, the levels returned to their original state; • People with high initial maladjustment are characterized by deviant reactions monovalent audiovisual stimuli.  When compared using the Wilcoxon W-test, significant differences were found in the Safety and Independence scales in comparison with pre-stimulation and after negative stimulation, as well as in Safety and Unity after negative and positive stimulation. It is noteworthy that in this comparison, no significant differences were found between Bad and Good sets.
The level of emotional maladjustment in the Basic Set.
• During the experiment, the maladjustment was mild; • After negative stimulation, there was an increase in EML; • After the positive stimulation, the levels returned to their original state; • People with high initial maladjustment are characterized by deviant reactions to monovalent audiovisual stimuli. When compared using the Wilcoxon W-test, significant differences were found in the Safety and Independence scales in comparison with pre-stimulation and after negative stimulation, as well as in Safety and Unity after negative and positive stimulation. It is noteworthy that in this comparison, no significant differences were found between Bad and Good sets.
The level of emotional maladjustment in the Basic Set.
• During the experiment, the maladjustment was mild; • After negative stimulation, there was an increase in EML; • After the positive stimulation, the levels returned to their original state; • People with high initial maladjustment are characterized by deviant reactions to monovalent audiovisual stimuli.
This provides certain opportunities for identifying markers for diagnosing conditions in the future. No significant differences were found between 'before' and 'after' stimulation. During the experiment, the maladjustment was mild; The results of the "Dissonance" set ( Figure 13): This provides certain opportunities for identifying markers for diagnosing conditions in the future. No significant differences were found between 'before' and 'after' stimulation. During the experiment, the maladjustment was mild; The results of the "Dissonance" set ( Figure 13): Figure 13. The mean values of the emotional disadaptation level (EDL Test) for the Dissonance complex, compared by context.
When compared using the Wilcoxon W-test, one significant difference was found on the Unity scale before stimulation and after combining a positive image with a negative one (Figures 14 and 15).  When compared using the Wilcoxon W-test, one significant difference was found on the Unity scale before stimulation and after combining a positive image with a negative one (Figures 14 and 15). This provides certain opportunities for identifying markers for diagnosing conditions in the future. No significant differences were found between 'before' and 'after' stimulation. During the experiment, the maladjustment was mild; The results of the "Dissonance" set ( Figure 13): Figure 13. The mean values of the emotional disadaptation level (EDL Test) for the Dissonance complex, compared by context.
When compared using the Wilcoxon W-test, one significant difference was found on the Unity scale before stimulation and after combining a positive image with a negative one (Figures 14 and 15).  People with high initial maladjustment are characterized by deviant reactions to monovalent audiovisual stimuli.

Influence of the Interaction of Multi-Modal Affective Stimuli on the Functional State
Now let us consider the effect of the stimuli on the functional state of individuals. At the end of data collection, we performed an alignment of data. The subjects with a high proportion of artifacts were excluded from further analysis. However, the percentage of excluded test subjects did not exceed 5%, which indicates the good quality of the data obtained. People with high initial maladjustment are characterized by deviant reactions to monovalent audiovisual stimuli.

Influence of the Interaction of Multi-Modal Affective Stimuli on the Functional State
Now let us consider the effect of the stimuli on the functional state of individuals. At the end of data collection, we performed an alignment of data. The subjects with a high proportion of artifacts were excluded from further analysis. However, the percentage of excluded test subjects did not exceed 5%, which indicates the good quality of the data obtained.
The next step was to select subjects with standard responses according to the SAM method. We selected the subjects who rated the sets of negative images on the Valence scale from −2 to −4 and the sets of positive images from 2 to 4. The result was 30 subjects. The ration of subjects with standard answers was 67%, with non-standard-33%. Nonstandard variants of perception of affective images included assessing positive and negative images as neutral, and positive images as negative. However, no one assessed negative images as positive.
For each subject with standard responses according to SAM scales, we calculated the indicators associated with blocks of negative, positive images, and resting-state ( Figures  16-18): (1) Average heart rate (HR); (2) Average RR-intervals; (3) Average power of the low-frequency band of the total HRV spectrum (LF); (4) Average power of the high-frequency band of the total HRV spectrum (HF); (5) Average ratio of the slow and fast band of the general HRV spectrum (a measure of sympathovagal balance) (LF / HF); (6) Average power of the HRV spectrum (TP); (7) Stress Index (SI); (8) Functional reserve (FR); (9) Stress Index (SN).
We attempted to simplify the spectral analysis by introducing the State Optimization Index compiled by taking the value of the spectral characteristics and indices associated with the optimization of the functional state as 1 or −1 in the presence of significant differences in the ANOVA analysis with data standardization (Figures 19-25). The next step was to select subjects with standard responses according to the SAM method. We selected the subjects who rated the sets of negative images on the Valence scale from −2 to −4 and the sets of positive images from 2 to 4. The result was 30 subjects. The ration of subjects with standard answers was 67%, with non-standard-33%. Non-standard variants of perception of affective images included assessing positive and negative images as neutral, and positive images as negative. However, no one assessed negative images as positive.
For each subject with standard responses according to SAM scales, we calculated the indicators associated with blocks of negative, positive images, and resting-state (Figures 16-18): (1) Average heart rate (HR); (2) Average RR-intervals; (3) Average power of the low-frequency band of the total HRV spectrum (LF); (4) Average power of the high-frequency band of the total HRV spectrum (HF); (5) Average ratio of the slow and fast band of the general HRV spectrum (a measure of sympathovagal balance) (LF / HF); (6) Average power of the HRV spectrum (TP); (7) Stress Index (SI); (8) Functional reserve (FR); (9) Stress Index (SN).            We attempted to simplify the spectral analysis by introducing the State Optimization Index compiled by taking the value of the spectral characteristics and indices associated with the optimization of the functional state as 1 or −1 in the presence of significant differences in the ANOVA analysis with data standardization (Figures 19-25).             This graph shows that the sympathetic system dominates in response to the presen-  This graph shows that the sympathetic system dominates in response to the presen tation of each block (Figure 26). This graph shows that the sympathetic system dominates in response to the presentation of each block ( Figure 26).
This graph indicates that the sympathetic system dominates in response to the presentation of both positive and negative sets.
In the majority of subjects, during the playback of both sets, the sympathetic system was activated for each of the sets. There were no significant differences according to the Student's t−test between the state of the vegetative balance index during the first and second sets in both complexes. This graph indicates that the sympathetic system dominates in response to the presentation of both positive and negative sets.
In the majority of subjects, during the playback of both sets, the sympathetic system was activated for each of the sets. There were no significant differences according to the Student's t−test between the state of the vegetative balance index during the first and second sets in both complexes.

Discussion and Conclusions
It was revealed that the addition of positive visual information images/representations (in the form of photos) significantly affects the emotional perception of audio IR. Mixing blocks in a dissonant complex produces average results, but does not affect the perception of an image. In the first case, because the image is attractive, we improve the perception of emotions associated with negative sounds, and in the second, because of the image, we spoil the impression of sounds.
Possible reasons for this effect are: 1. The module of one emotional system dampens the effect of another. The pattern affects one emotional component, freeing it from another modality. The components interact with each other and, at some point, the modality sign becomes irrelevant. Thus, their interaction most likely becomes a consequence of the work of the limbic system.
2. Noise (for example, the noise of cars outside the window) makes it difficult to recognize stimuli, which in turn makes it difficult to isolate an emotional image.
Thus, there exists an interaction of information (emotional) images, their interference, including for multi-modal images: This is manifested in the reduction of emotions under the influence of asymmetric affective visual and sound images, and their invariance. The reduction of emotions was registered during the interaction of affective stimuli of different valences.
At the same time, the combination of monovalent stimuli with negative ones leads to an increase in the reaction, yet the same was not observed with positive stimuli.

Discussion and Conclusions
It was revealed that the addition of positive visual information images/representations (in the form of photos) significantly affects the emotional perception of audio IR. Mixing blocks in a dissonant complex produces average results, but does not affect the perception of an image. In the first case, because the image is attractive, we improve the perception of emotions associated with negative sounds, and in the second, because of the image, we spoil the impression of sounds.
Possible reasons for this effect are: 1. The module of one emotional system dampens the effect of another. The pattern affects one emotional component, freeing it from another modality. The components interact with each other and, at some point, the modality sign becomes irrelevant. Thus, their interaction most likely becomes a consequence of the work of the limbic system.
2. Noise (for example, the noise of cars outside the window) makes it difficult to recognize stimuli, which in turn makes it difficult to isolate an emotional image.
Thus, there exists an interaction of information (emotional) images, their interference, including for multi-modal images: This is manifested in the reduction of emotions under the influence of asymmetric affective visual and sound images, and their invariance. The reduction of emotions was registered during the interaction of affective stimuli of different valences.
At the same time, the combination of monovalent stimuli with negative ones leads to an increase in the reaction, yet the same was not observed with positive stimuli.
People with high initial maladjustment are characterized by deviant reactions to monovalent audiovisual stimuli. This makes it possible to diagnose the state of maladjustment based on an individual's response to monovalent stimuli.
When using the Dissonant audiovisual complex, a negative auditory stimulus causes an increase in the level of maladjustment. However, the video sequence does not have the same effect.
Also, the IAPS and IADS stimulus databases represent a vast corpora of affective images and sounds. These databases can be used to study various effects of affective content on the state and behavior of subjects and help in new works devoted to the topic of identifying emotional markers. However, it is necessary to note one shortcoming of these databases -the images and sounds are assessed only using the SAM method. They do not take into account the sequence in which the stimuli are presented or the way they change. Neither do they consider the operation of the visual and auditory systems, as well as the initial state of the subjects. All this makes it impossible to display an individual response to stimuli in a complex without additional means.
Taking into account the problems listed above, the authors developed a methodological complex, which includes methods for: (1) self-diagnosis of the emotional state of the subject (SAM); (2) the level of emotional maladjustment (EML); (3) telemetry of the heart rate.
This helped to identify markers of affective stimuli based on the emotional image they induce, the degree of stress before and during the experiment (developed foundations of the logic of the theory of information images/representations), as well as the functional state throughout the entire duration of the experiment.
An interesting effect was found according to which Americans, on average, experience stronger emotions from different types of stimuli compared to residents of Russia. This can be attributed to a number of reasons: (a) This could be due to the specific mental characteristics of the population and the corresponding consequences in the perception of various information stimuli. (b) Also, the conditions of the experiment cannot be ideally reproduced, and the difference in the result can be explained by a slightly different (possibly more constraining) situation. (c) This could also be due to the peculiarities of the sample, its insufficient representativeness.
In addition, we proposed a model of neurocognitive activity of the brain based on the theory of information images/representations. This model is instrumental for interpreting ambiguous emotions and could be developed further to simulate and predict some special cases. At this stage, however, it serves as a tool to represent the human mind and the foundation of an experimental paradigm that assists with the interpretation of the results.
Author Contributions: Conceptualization, A.Y.P. and S.A.P.; methodology, A.Y.P. and S.A.P.; experiments, S.A.P. and A.V.P.; validation, A.Y.P. and S.A.P.; formal analysis, A.Y.P.; data curation, A.V.P.; writing-original draft preparation, A.Y.P.; writing-A.Y.P. and S.A.P., funding acquisition, A.Y.P. All authors have read and agreed to the published version of the manuscript. Informed Consent Statement: Informed consent was obtained from all subjects involved in the study. This research was carried out in accordance with the Declaration of Helsinki (2013) and approved by the Ethics Committee of the Lobachevsky University following the meeting on 11 February 2021 (protocol No46).

Conflicts of Interest:
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.