Visual Sequential Search Test Analysis: An Algorithmic Approach

D’Inverno, Giuseppe Alessio; Brunetti, Sara; Sampoli, Maria Lucia; Muresanu, Dafin Fior; Rufa, Alessandra; Bianchini, Monica

doi:10.3390/math9222952

Open AccessArticle

Visual Sequential Search Test Analysis: An Algorithmic Approach

by

Giuseppe Alessio D’Inverno

^1,*

,

Sara Brunetti

¹,

Maria Lucia Sampoli

¹

,

Dafin Fior Muresanu

^2,3,

Alessandra Rufa

⁴ and

Monica Bianchini

¹

Department of Information Engineering and Mathematics, University of Siena, 53100 Siena, Italy

²

Department of Neurosciences, “Iuliu Hațieganu” University of Medicine and Pharmacy, 400023 Cluj-Napoca, Romania

³

RoNeuro Institute for Neurological Research and Diagnostic, 400364 Cluj-Napoca, Romania

⁴

Department of Medicine, Surgery and Neuroscience, University of Siena, 53100 Siena, Italy

^*

Author to whom correspondence should be addressed.

Mathematics 2021, 9(22), 2952; https://doi.org/10.3390/math9222952

Submission received: 17 October 2021 / Revised: 15 November 2021 / Accepted: 17 November 2021 / Published: 18 November 2021

(This article belongs to the Special Issue Mathematical Modelling and Machine Learning Methods for Bioinformatics and Data Science Applications)

Download

Browse Figures

Versions Notes

Abstract

:

In this work we present an algorithmic approach to the analysis of the Visual Sequential Search Test (VSST) based on the episode matching method. The data set included two groups of patients, one with Parkinson’s disease, and another with chronic pain syndrome, along with a control group. The VSST is an eye-tracking modified version of the Trail Making Test (TMT) which evaluates high order cognitive functions. The episode matching method is traditionally used in bioinformatics applications. Here it is used in a different context which helps us to assign a score to a set of patients, under a specific VSST task to perform. Experimental results provide statistical evidence of the different behaviour among different classes of patients, according to different pathologies.

Keywords:

visual sequential search test; episode matching; trail making test; sequence alignment; alignment score

1. Introduction

The Trail Making Test (TMT) is a popular neuropsychological test, commonly used in clinical settings as a diagnostic tool for the evaluation of some frontal functions. It provides qualitative information on high order mental activities, including speed of processing, mental flexibility, visual spatial orientation, working memory and executive functions. Originally, it was part of the Army Individual Test Battery (1944) and subsequently was incorporated into the Halstead–Reitan Battery [1]. In general terms, the test consists of a visual search of objects in an image, where the objects are arranged in sequences of loci called regions of interest (ROIs). While classical TMT requires an individual to draw lines sequentially connecting an assigned sequence of letters and/or numbers (the ROIs) with a pencil or mouse, the same task can be performed by using the eye-tracking technology and asking the subject to fixate the sequence of ROIs in the prescribed order [2]. Poor performance is known to be associated with many types of brain impairment, in particular frontal lobe lesions. For instance, eye-tracking studies have proved their efficacy in the diagnosis of many common neurological pathologies, such as Parkinson’s disease, brain trauma and neglect phenomena [3,4,5,6].

The Visual Sequential Search Test (VSST) adopts the same principle as the TMT but is based on a precise geometry of the ROIs, being designed for the study of top-down visual search. Visual search can be quantified in terms of the analysis of the scan-path, which is a sequence of saccades and fixations. Thus, the identification of precise scores of the VSST may provide a measure of the subject’s visual spatial ability and high order mental activity. Specifically, we know that visual input from the external world is actively sampled for promoting appropriate actions during everyday life; this mechanism is dynamic and involves a continuous re-sampling of spatial elements of a visual scene. The VSST is a repeated search task, in which patients are asked to connect by gaze a logical sequence of numbers and letters.

The main objective of the research is to identify a reliable method for the analysis of the VSST and to investigate common/different characteristics inside and outside three different subjects’ classes. The first group includes patients with extrapyramidal disease who have well known difficulties in visual spatial exploration and executive functions. Thus, we predict low VSS performance in this group of patients. The second group is composed of patients suffering from chronic pain syndrome. This syndrome is not classically associated with cognitive deficits, but rather with mood changes. However, a possible deficit of attention has been suggested in patients suffering from chronic pain. If the prediction is true, the VSS performance in this group should be normal or less altered than extrapyramidal patients. The third, is a control group. The identification of a robust method of analysis and the detection of a reliable indicator for the VSST performance would allow to give a measure of executive functions in a clinical setting for diagnostic and prognostic purposes and eventually in clinical trials. Moreover, scoring the performance of such a VSST may have implications in the rehabilitation of cognitive functions and in general may be used for upgrading mental activity by exercise.

In the vast majority of the literature on eye movements, saccade amplitude or duration, number of fixations, fixation durations, or other close derivatives have been used as the main measures (see for instance the recent contribution in [7]). Although saccades and fixations are fundamentally sequential, very few methods are available for treating their sequential properties. Among those taking into account the fixation order, the most widely applied method is based on the edit distance, i.e., minimum number of “edit” operations transforming a sequence into another [8]. More advanced versions assign different weights to each operation. Such methods have been successfully used by a number of researchers to study saccade sequences (e.g., [9,10,11,12,13]). These methods define a number of spatial ROIs in the scene being scanned and the fixation sequence is coded as a series of letters representing the fixated locations. Although the string edit method has proven to be a useful tool and is relatively fast to compute, one of its main drawbacks is that it does not take the relationship between ROIs into consideration, so that the algorithm cannot differentiate between close and distant ROIs. A second drawback of this kind of method is that they do not take the fixation duration into account; all fixations, however short or long, are treated equally. Instead, it is clear that the fixation duration is an important indicator of processing during a fixation [14]. In [15], Cristino et al. describe a new method, for quantitatively scoring two eye movement sequences: they show how the methodology of global sequence alignment (Needleman–Wunsch algorithm [16]) can be applied to eye movements and then present three experiments in which the method is used.

In this paper, we follow the approach to take the fixation order, the fixation duration and the spatial distance from the ROIs into account. First, we pre-process the data recording the fixation sequence as a series of symbols (possible repeated) representing the fixated locations. Since the observed sequences (scan-paths) have a length quite different from each other, a global alignment is not suitable to evidence their similarity (if any) [17]. Therefore, we at first propose to compare the expected scan-path with the observed scan-path using dot-plots. This provides a visual and hence a qualitative comparison between them but does not permit to evaluate it quantitatively.

From a different point of view, the problem we want to tackle is also related to the so-called episode matching [18]. An episode is a collection of events that occur within a short time interval. In our case, an event corresponds to a fixation, and an episode to a scan-path. Usually, in the episode matching problem, given a long sequence of events, it can be useful to know what episodes occur frequently in the sequence.

A simplified version of this problem can be restated as an approximate string matching problem [19]: Given a text T, find its substrings containing the string P as a subsequence. Conditions on the number of occurrences and/or on the length of the substrings of T can be considered. Here, we investigate the problem in which T is the obtained scan-path, and P is the task scan-path.

In particular, for every obtained scan-path, we determine the first occurrence of P in T, and we score it. A novel scoring scheme is presented that takes into account the spatial relationship between ROIs (differentiate between close and far regions—distance matrix) and the fixation duration (repetitions of the letter corresponding to the ROI in a way that is proportional to the fixation). It also includes the guess that fixations outside the ROIs may be part of the exploration strategy.

The proposed score is validated by comparing the performance of the three different groups: the group of patients with extrapyramidal disease, the second one of patients suffering from chronic pain syndrome and the control group. Our results, as expected, confirm the worst performance of extrapyramidal patients than the chronic pain and control groups, in general. In particular, the medians of the three classes are significantly different from each other, so suggesting that our method can be employed as a measure of the performance in the VSST.

Summarizing, the main contributions of this paper are:

A new way to preprocess the VSST data, so as to represent them as sequences to which classical alignment methods can be applied;
A novel scoring scheme to evaluate the observed scan-path with respect to the target scan–path;
A preliminary experimental analysis on an original VSST dataset which highlights different pathological behaviours validated by human experts.

The method we propose is illustrated in the flowchart of Figure 1. The paper is organized as follows. In the next section, the task that we want to pursue is described, together with the data pre-processing and the proposed alignment approach, based on a new ad hoc definition of the similarity score. Section 3 collects experimental results that are discussed in the following Section 4. Finally, in Section 5, some conclusions are drawn and also open questions and future perspectives are described.

2. Material and Methods

2.1. Task Design

There exists several different TMT settings that can be adopted. For instance, a patient could be supposed to link ordered series of numbers or letters (which we will generally call symbols in the following paragraphs) drawing with paper and pencil [20] or onto an electronic device [21,22]. In other settings, tested people are required to sit in front of a monitor and interact with screen-based content, through an eye-tracker device [23,24]. Our study is carried out based on this last setting, which allows us to perform a Visual Sequential Search Test (VSST). In particular, the stimulus images submitted (in this order) to the patient are illustrated in Figure 2, and the required task is to make the sequence 1-A-2-B-3-C-4-D-5-E at least once during the whole test time.

2.2. Data Preprocessing

2.2.1. Dataset

The data obtained by the eye-tracking experiments, for each person, provide the following information:

average gaze position (x) (pixels)
average gaze position (y) (pixels)
fixation ID (integer) (NaN = saccade)
pupil size (left eye)
pupil size (right eye)
timestamp (every 4 milliseconds)
stimulus (code of the image shown in the screen): the coding is described in the caption of Figure 2.

Regular eye movements alternate between saccades and visual fixations. A fixation is the maintaining of the visual gaze on a single location. A saccade is a quick, simultaneous movement of both eyes between what happens among two or more phases of fixation in the same direction.

In case of blinking, the device loses the signal and it results in “NaN” (Not a Number) values either for the position

(x, y)

on the screen and for the pupil sizes. Pupil sizes were not taken into account for the data processing described in the following.

The results of the eye-tracking experiments for 376 subjects were divided into three classes: 46 patients with extrapyramidal syndrome, 284 affected by chronic pain and 46 controls.

It is worth noting that the collected dataset is significantly unbalanced, a problem naturally attributable to the type of pathologies to be prognosed. In particular:

For extrapyramidal patients the diagnosis is based on objectivable clinical factors (i.e., a movement disorder), while a disability scale exists on which the severity of the disease can be objectively established;
Chronic pain represents a very variable pathology whose prognosis deeply depends on the personal judgement of human experts and that cannot objectified except through a subjective evaluation scale.

Therefore, an alteration in the scan-path for an extrapyramidal patient is invariably pathological, while a similar alteration evidenced in a patient affected by chronic pain must be treated with caution.

2.2.2. Generated Scan-Path Sequences

Starting from the data previously shown, we dealt with the generation of the scan-path sequences as follows.

The goal here is to use the information of the data to reconstruct the scan-path of an individual during the test as a sequence of symbols, associating a letter or a number for fixations on the ROIs accordingly, and the special character “!” for fixations outside the ROIs (black area). In other words, we generated a string

T = t_{1} \dots t_{n}

over the alphabet

A = {1, 2, 3, 4, 5, A, B, C, D, E,!}

. After having determined the centroids of each symbol in the TMT stimulus image, we have calculated the minimum distance between any pair of centroids, and we set a threshold equal to its half. Then, for every fixation ID, we computed the distance from the fixation area to the closer centroid, and we selected it as the associated symbol if the distance was less than the threshold, or “!” otherwise.

For instance, a generated sequence can have the following form:

!5311AA22!DB3533ACC4!AB!C25DD!!!…

Finally, we stacked subsequent repetitions of symbols in a vector of “weights”, associated to the non redundant sequence. Formally speaking, in a string

t_{1} \dots t_{n}

where i s.t.

t_{i} = t_{i + 1} = \dots = t_{i + k}

, we replace

t_{i}, t_{i + 1}, \dots, t_{i + k}

with

t_{i^{'}} = t_{i}

associating the corresponding weight

w_{i^{'}} = k + 1

, with

1 \leq i^{'} \leq n

. This can be easily done by scanning the string and counting the number of consecutive occurrences of the same symbol, in linear time with respect to the length of the string.

For the above sequence, we obtain as the result of the preprocessing of the data:

!531A2!DB353AC4!AB!C25D!

with the corresponding vector of weights:

(1, 1, 1, 2, 2, 2, 1, 1, 1, 1, 1, 2, 1, 2, 1, 1, 1, 1, 1, 1, 1, 1, 2, 3)

2.3. VSST Data Analysis Method

We are going to formulate the VSST problem in terms of a pairwise sequence alignment, where both the target scan-path and the obtained scan-path are strings.

Let

T = t_{1} \dots t_{n}

be any string of length n over the alphabet

A = {1, 2, 3, 4, 5, A, B, C, D, E,!}

, and let

P = 1 A 2 B 3 C 4 D 5 E

. Given T and P, we look for the matches of P in T, that is the occurrences of symbols of P in T. Regions of identity (matches) can be visualized by the so-called dot-plot. A dot-plot is a

10 \times n

binary matrix M such that the entry

m_{i j} = 1

if and only if

p_{i} = t_{j}

, otherwise

m_{i j} = 0

. Some toy examples are shown in Figure 3 where the identity is visualized by a dot.

It is easy to see that “diagonals” of dots correspond to consecutive matches of P in T. This can be formalized as follows. A substring of T is a finite sequence of consecutive symbols of T, while in a subsequence symbols are not necessarily consecutive. Thus, P is a subsequence of T if there exist indices

i_{1} < \dots < i_{m}

such that

p_{1} = t_{i_{1}}

,

p_{2} = t_{i_{2}}

,

p_{m} = t_{i_{m}}

and

T^{'} = t_{i_{1}} t_{i_{1} + 1} \dots t_{i_{m}}

is the substring of T containing P.

Let us define the VSST problem as an approximate string matching problem. The approximate string matching problem looks for those substrings of the text T that can be transformed into pattern P with at most h edit operations: a deletion of a symbol x of T changes the substring

u x v

into

u v

; an insertion of a symbol x changes the substring

u v

of T into

u x v

; a substitution of a symbol x of T with a symbol y changes the substring

u x v

into

u y v

. When deletion is the only edit operation allowed and we choose

h = k - m

, the problem is equivalent to finding all substrings of T of length at most k that contain P of length m as a subsequence.

In the VSST problem we search for the first occurrence of P in T, i.e., we find the substring of T starting in the leftmost symbol in T containing P as a subsequence. This can be done in linear time in the size n of T with the naïve algorithm.

2.4. The Score Scheme

Let

T^{'} = t_{1}^{'} \dots t_{k}^{'}

be the substring of T containing P. Next step consists of scoring the approximate matching between

T^{'}

and P. Actually,

h = k - 10

provides a first evaluation of the distance between

T^{'}

and P since they differ by h symbols. Note that this corresponds to defining a scoring system that assigns value 1 to each deletion and sums up each value. However, this measure is oversimple to provide a meaningful evaluation, and moreover we prefer to measure the complementary information, to calculate a “similarity score” between

T^{'}

and P. Indeed our goal is to assign a final score assessing the performance of the patient in the VSS test. The first step in the definition of the scoring function is to assign a positive value (a reward) to each match, i.e., to each occurrence of a symbol of P in

T^{'}

. On the contrary, each deletion of symbols of

T^{'}

must be assigned a negative value (a penalty). We decided to weakly penalize a deletion of the symbol ! with respect to the deletion of any other symbol, since we consider a fixation of the background as an intermediate pause in the process, but not a true selection of an ROI. We refer to these three values as penalty scale constants.

In addition, in the latter case (deletion of a symbol not ! in

T^{'}

), we compute the distance of the centroid of the ROI corresponding to the deleted symbol to the centroid of the ROI of the next expected symbol of P, to take the spatial relation between the two ROIs. The set of the distances for each pair, normalized by the maximum distance, is then collected in a distance matrix (Figure 4).

Another factor included in the score is the duration of the fixation. We store the information in a parallel array as explained in Section 2.2.2. We assume that the fixation duration is associated to hesitation in the VSST. Since duration corresponds to consecutive repetitions of any symbol, we define a function decreasing in the number of repetitions for scoring the match and increasing in the number of repetitions for scoring the deletion. We refer to it as the duration function.

Finally, since the fixations outside the ROIs may be part of the exploration strategy, we compute the frequency of each symbol in the prefix ending there, to amplify the penalty: the frequency corresponds to the number of times that the symbol has been already fixed in the exploration so that it reflects the number of times needed to learn its position.

To summarize, the final score of

T^{'}

is the sum of the contributions to the score for each symbol in

T^{'}

where each score is obtained by the product of the following factors: the penalty scale constant v, the duration function f, and, in case of deletion of a symbol non !, an item of the distance matrix,

d i s t

, and the frequency

f r e q

of the symbol. The computation of the score is sketched in Algorithm 1.

Algorithm 1 Similarity score evaluation

Require:

T^{'}

, w,

a l i g n

, v, P,

f (w)

Ensure:

s c o r e

j \leftarrow 0

▹ index for P

i \leftarrow 0

▹ index for T’

s c o r e \leftarrow 0

f r e q (k) \leftarrow 0 \forall k i n P

while

j \neq l e n g t h (P) A N D i \neq l e n g t h (T^{'})

do

if

i = a l i g n (j)

then ▹ match

p_s c o r e \leftarrow v (0) \cdot f (w (i))

f r e q (P (j)) \leftarrow f r e q (P (j)) + 1

j \leftarrow j + 1

else if

T^{'} (i) =!

then ▹ deletion

p_s c o r e \leftarrow - v (1) \cdot [1.1 - f (w (i))]

else

f r e q (T^{'} (i)) \leftarrow f r e q (T^{'} (i)) + 1

p_s c o r e \leftarrow - v (2) \cdot f r e q (T^{'} (i)) \cdot d i s t (T^{'} (i), P (j)) \cdot [1.1 - f (w (i))]

end if

s c o r e \leftarrow s c o r e + p_s c o r e

i \leftarrow i + 1

end while

We remark that this algorithm uses three vectors: the substring

T^{'}

, the vector w of the weights of size k and a vector

a l i g n

of size

m = 10

, which stores the indices of the items of P such that

a l i g n (j) = i

iff

t_{i} = p_{j}

, else

a l i g n (j) = - 1

. The algorithm scans

T^{'}

based on the index i and P based on j. Initially

i = j = 0

. Then, it checks if i is equal to

a l i g n (j)

: if true, it scores the match (

t_{i}

is equal to

p_{j}

) and both indices are increased, otherwise it scores the deletion of

t_{i}

and then increases i. In case of deletion, it checks if

t_{i}

is equal to ! and, consequently, computes the appropriate score. Each access to the vectors takes

O (1)

and the algorithm scans the whole vector

T^{'}

so that it runs in

O (k)

time.

3. Experimental Results

After the pre-processing phase described in Section 2.2.2, the data consist of strings with their weights divided into three classes, depending on the individuals performing the test: 46 strings from patients with extrapyramidal syndrome, 284 from patients affected by chronic pain and 46 healthy participants. From now on, we refer to them as the Extrapyramidal (E), the Chronic (C) and the Healthy (H) classes.

For each member of the classes, we computed the score using the algorithm described in Section 2.4. In particular we used

v = [1, 0.25, 0.5]

for the penalty constant vector, and the inverse of the weight of the symbol for the duration function f.

Figure 5 and Figure 6 illustrate the dot-plots and the scores computed for a member of each class, respectively. We are going to show that these members are good “representatives” of their classes. At a glance, the dot-plots suggest that the first image corresponds to a performance better than the second, which in turn, looks better than the third.

In the images of Figure 6, we illustrate the score as the bar graph obtained by visualizing each value assigned to each symbol of the sequences as a bar. Let us notice that bars of positive height correspond to the score of matches, whereas bars of negative height correspond to the score of deletions. Matches can be scored with values lower than 1, when repeated; deletions are scored differently depending on repetition, frequency, and distance from the next symbol objective.

Before the analysis, we dropped some outliers for each class, according to the Chebischev Theorem. Setting

γ = 2

, we were sure to retain at least 75% for each class; such a dropping resulted in retaining 43 sequences out of 46 for the Healthy class, 265 out of 284 for the Chronic class and 44 out of 46 for the Extrapyramidal class.

Therefore, we analysed the results using the R language for computing the basic statistics and graphics. A summary divided by the groups is shown in Figure 7, while in Figure 8 we report the box plots.

Based on the obtained results, we can notice that the data seems not to follow a normal distribution, as we can see from Figure 9, at least for two of the three classes (the Healthy class and the Chronic one). Indeed we run the Kolgomorov–Smirnov test for comparison with the normal distribution on each class, obtaining p-values, respectively, equal to

6.56388017310154 \cdot 10^{- 30}

,

9.711223032427313 \cdot 10^{- 105}

,

1.5674138951676932 \cdot 10^{- 12}

.

Thus, we used the non-parametric Kruskal–Wallis test by rank which extends the two-sample Wilcoxon test in the situation where there are more than two groups. It turns out that at 0.05 significance level, the medians of the data of the three groups are different. In particular, the p-value for the Kruskal–Wallis test is p-value =

6.553 \cdot 10^{- 8}

. In order to know which pairs of groups are significantly different we used the function pairwise.wilcox.test() to calculate pairwise comparisons between group levels with corrections for multiple testing and Bonferroni correction. The results confirm that the pair exhibiting the most significant difference is the Healthy–Extrapyramidal as expected (see Table 1). Indeed, patients with extrapyramidal disease have well known difficulties in visual spatial exploration and executive functions that result in difficulties from the subject to maintain a top-down (human intention) internal representation of the visual scene during task execution. This is reflected in a bad performance in the VSST. Differently, patients in the Chronic class are affected by several kinds of chronic pain syndromes so that they may have different behaviours in performing the task.

Nevertheless, note that, actually, all the pairs have p-values less than

0.05

so that they are significantly different.

4. Discussion

In this study, we propose a method for the analysis of gaze in a top-down visual search task and find a score for the VSST performance. The whole pipeline for the process is illustrated in Figure 1. The considered method and score have been validated by comparing the performance of three different subjects’ groups. The first group includes 46 patients with extrapyramidal disease, who have well known difficulties in visual spatial exploration and executive functions. The second group is composed of 238 patients suffering from chronic pain syndrome and the third, collecting 46 patients, is a control group.

The identification of a robust method of analysis and the detection of a reliable indicator for the VSST performance, would allow to give a measure of executive functions in a clinical setting for diagnostic and prognostic purposes and eventually in clinical trials. Moreover, scoring the performance of such a VSST may have implications in the rehabilitation of cognitive functions and in general may be used for upgrading mental activity by exercise.

Indeed, cognitive rehabilitation is an effective non–pharmacological treatment that consists of learning compensatory strategies and exploiting residual skills in order to counteract, for instance, cognitive impairments and degenerative diseases. In fact, as for dementia, unfortunately, there is no specific pharmacological treatment, being existing drugs able to counteract the symptoms of the disease, but do not change its course. Consequently, the disease progresses: there is a continuous and constant progressive decline of cognitive functions for the patient, which negatively affects the various daily skills. Instead, changing the course of the disease, “pushing forward” the degenerative progression allows the patient to maintain their autonomy for a longer time and reduces the disinterest, anxiety and depression that degenerative diseases entail. Finally, cognitive rehabilitation is also fundamental for maintaining cognitive functions in efficiency and to combat the consequences of normal aging. Similarly, it is possible to implement intellectual stimulation with a preventive purpose.

The main characteristic of VSST is that it forces the subject to perform a default and logic path using high level cognitive resources. In this task the target of the next fixation changes continuously and, thus, in order to perform an adequate eye movement, each fixation must contain the information on the current target position and the next target location [25]. Previously, Veneri et al. [26] suggested that the re-sampling of the spatial element in such a visual search task requires a ranking of each element of the sequence during fixations. To be effective, this process requires a maximization of the discrimination abilities of the peripheral vision. The comparison of the expected scan-path with the observed scan-path provides a valuable method to investigate how a task forces the subject to maintain a top-down (human intention) internal representation of the visual scene during task execution. The proposed method has proved to be really effective in distinguishing between healthy people and patients affected by extrapyramidal pathologies, and less sensitive to the differences among the other cohort combinations. Actually, patients with chronic pain syndrome may be affected by very different pathologies—from severe neoplasms to chronic migraines—not all equally disabling from a neurocognitive point of view, which makes this group of patients extremely heterogeneous and difficult to distinguish, for example, from healthy people. Anyway, the main advantage of the proposed VSS test, equipped with the automatic procedure to score its outcomes, lies in the possibility of standardizing the test—making the obtained results repeatable—as well as memorizing them permanently. In this way, for each patient, a historical series of their performance can reliably be collected and analysed, a suitable procedure for evaluating both the course of a disease or the recovery based on a cognitive rehabilitation process.

Some issues concerning the proposed method naturally arise:

we are aware of the approximation errors that could occur by converting gaze positions into symbols through the ROIs processing, as a threshold has been set to discriminate from region to region. Nevertheless, comparing the retrieved sequences to video analysis, our method seems to catch efficiently the sequence of symbols scanned during the test;
the proposed score algorithm takes into account some factors, such as the duration of the fixation, or the distance from the target symbol. Surely they are not the only factors linked to cognitive tasks involved in the test: improvements in building up the algorithm could be considered by taking into account other minor cognitive factors;
the weight function, applied on the stacked subsequent repetitions of single symbols, has been arbitrarily chosen as the inverse function. Future works could focus on designing a more appropriate weight function, which could take into account cognitive features associated with the repetitive behaviour.

5. Conclusions

In this manuscript, we have described a new method for the analysis of the Visual Sequential Search Test, a neurocognitive task commonly used in clinical settings as a diagnostic tool for the evaluation of frontal functions. The VSS test is an eye-tracking version of the Trail Making Test to discover how selection (fixations) guides next exploration (saccades), and how human top-down factors interact with bottom-up saliency. The problem of analysing the VSST outcomes is faced as an episode–matching problem, where an event corresponds to a fixation, and an episode to a scan–path. In this way, a score can be devised able to quantify how much a particular outcome diverges from the expected one. Based on this score, we are able to predict, with a high statistical confidence, if a particular scan-path corresponds to a patient with an extrapyramidal disease or suffering from the chronic pain syndrome or if it describes a “normal” cognitive behaviour. Having a standardized way to evaluate the VSST can help for monitoring the evolution of a disease, for neurological rehabilitation and for intellectual stimulation with a preventive purpose. In particular, the preventive aspect is taking on an increasingly important role, both in terms of the physical and intellectual well–being of the population, and in a more general process of optimizing economic resources for healthcare. We are perfectly aware of the small size of the used dataset. This is a common problem with medical data. Apart from collecting new data, another possible way to overcome this problem could consist in applying data augmentation techniques in order to both balance and enlarge its size. This will be an issue to discuss in future investigations.

Author Contributions

Conceptualization, G.A.D., S.B. and M.L.S.; Data curation, S.B.; Formal analysis, G.A.D., S.B. and M.L.S.; Investigation, G.A.D., S.B., M.L.S., A.R. and M.B.; Methodology, G.A.D., S.B., A.R. and M.B.; Project administration, M.B.; Resources, D.F.M.; Software, G.A.D., S.B. and M.L.S.; Supervision, M.L.S., A.R. and M.B.; Validation, A.R.; Visualization, G.A.D., S.B. and M.L.S.; Writing—original draft, G.A.D., S.B., M.L.S., A.R. and M.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Patient consent was waived due to the anonymous nature of analyzed data.

Data Availability Statement

Not applicable.

Acknowledgments

The authors wish to thank RoNeuro Institute, part of the Romanian Foundation for the Study of Nanoneurosciences and Neuroregeneration, Cluj-Napoca, Romania, represented by Dafin Fior Muresanu, for providing the datasets used here for the experiments.

Conflicts of Interest

The authors declare no conflict of interest.

References

Reitan, R.; Wolfson, D. The Halstead-Reitan Neuropsychological Test Battery; Horton, A.M., Jr., Webster, J.S., Eds.; John Wiley & Sons: Tuscon, AZ, USA, 1985. [Google Scholar]
Veneri, G.; Pretegiani, E.; Rosini, F.; Federighi, P.; Federico, A.; Rufa, A. Evaluating the human ongoing visual search performance by eye tracking application and sequencing tests. Comput. Methods Programs Biomed. 2012, 107, 468–477. [Google Scholar] [CrossRef]
Hochstadt, J. Set-shifting and the on-line processing of relative clauses in Parkinson’s disease: Results from a novel eye-tracking method. Cortex 2009, 45, 991–1011. [Google Scholar] [CrossRef] [PubMed]
Marx, S.; Respondek, G.; Stamelou, M.; Dowiasch, S.; Stoll, J.; Bremmer, F.; Oertel, W.H.; Höglinger, G.U.; Einhauser, W. Validation of mobile eye-tracking as novel and efficient means for differentiating progressive supranuclear palsy from Parkinson’s disease. Front. Behav. Neurosci. 2012, 6, 88. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Kaufmann, B.C.; Cazzoli, D.; Pflugshaupt, T.; Bohlhalter, S.; Vanbellingen, T.; Müri, R.M.; Nef, T.; Nyffeler, T. Eyetracking during free visual exploration detects neglect more reliably than paper-pencil tests. Cortex 2020, 129, 223–235. [Google Scholar] [CrossRef] [PubMed]
Trepagnier, C. Tracking gaze of patients with visuospatial neglect. Top. Stroke Rehabil. 2002, 8, 79–88. [Google Scholar] [CrossRef] [PubMed]
Pancino, N.; Graziani, G.; Lachi, V.; Sampoli, M.; Stefanescu, E.; Bianchini, M.; Dimitri, G.M. A Mixed Statistical/Machine Learning Approach for the Analysis of Multimodal Trail Making Test Data. Preprint 2021. under review. [Google Scholar]
Crochemore, M.; Rytter, W. Jewels of Stringology; World Scientific: Hackensack, NJ, USA, 2003. [Google Scholar]
Brandt, S.A.; Stark, L.W. Spontaneous eye movements during visual imagery reflect the content of the visual scene. J. Cogn. Neurosci. 1997, 9, 27–38. [Google Scholar] [CrossRef]
Choi, Y.S.; Mosley, A.D.; Stark, L.W. String editing analysis of human visual search. Optom. Vis. Sci. 1995, 72, 439–451. [Google Scholar] [CrossRef] [PubMed]
Foulsham, T.; Underwood, G. What can saliency models predict about eye movements? Spatial and sequential aspects of fixations during encoding and recognition. J. Vis. 2008, 8, 1–17. [Google Scholar] [CrossRef]
Hacisalihzade, S.S.; Stark, L.W.; Allen, J.S. Visual perception and sequences of eye movement fixations: A stochastic modeling approach. IEEE Trans. Syst. Man, Cybern. 1992, 22, 474–481. [Google Scholar] [CrossRef]
Noton, D.; Stark, L. Scanpaths in eye movements during pattern perception. Science 1971, 171, 308–311. [Google Scholar] [CrossRef] [PubMed]
Henderson, J.M.; Pierce, G.L. Eye movements during scene viewing: Evidence for mixed control of fixation durations. Psychon. Bull. Rev. 2008, 15, 566–573. [Google Scholar] [CrossRef] [Green Version]
Cristino, F.; Mathôt, S.; Theeuwes, J.; Gilchrist, I.D. ScanMatch: A novel method for comparing fixation sequences. Behav. Res. Methods 2010, 42, 692–700. [Google Scholar] [CrossRef] [Green Version]
Needleman, S.B.; Wunsch, C.D. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J. Mol. Biol. 1970, 48, 443–453. [Google Scholar] [CrossRef]
Durbin, R.; Eddy, S.R.; Krogh, A.; Mitchison, G. Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids; Cambridge University Press: Cambridge, UK, 1998. [Google Scholar] [CrossRef]
Das, G.; Fleischer, R.; Gasieniec, L.; Gunopulos, D.; Kärkkäinen, J. Episode matching. In Annual Symposium on Combinatorial Pattern Matching; Springer: Berlin/Heidelberg, Germany, 1997; pp. 12–27. [Google Scholar] [CrossRef]
Apostolico, A.; Atallah, M.J. Compact recognizers of episode sequences. Inf. Comput. 2002, 174, 180–192. [Google Scholar] [CrossRef] [Green Version]
Zakzanis, K.K.; Mraz, R.; Graham, S.J. An fMRI study of the trail making test. Neuropsychologia 2005, 43, 1878–1886. [Google Scholar] [CrossRef] [PubMed]
Bracken, M.R.; Mazur-Mosiewicz, A.; Glazek, K. Trail Making Test: Comparison of paper-and-pencil and electronic versions. Appl. Neuropsychol. Adult 2018, 26, 522–532. [Google Scholar] [CrossRef] [PubMed]
Drapeau, C.E.; Bastien-Toniazzo, M.; Rous, C.; Carlier, M. Nonequivalence of computerized and paper-and-pencil versions of Trail Making Test. Percept. Mot. Skills 2007, 104, 785–791. [Google Scholar] [CrossRef]
Hicks, S.L.; Sharma, R.; Khan, A.N.; Berna, C.M.; Waldecker, A.; Talbot, K.; Kennard, C.; Turner, M.R. An eye-tracking version of the trail-making test. PLoS ONE 2013, 8, e84061. [Google Scholar] [CrossRef]
Jyotsna, C.; Amudha, J.; Rao, R.; Nayar, R. Intelligent gaze tracking approach for trail making test. J. Intell. Fuzzy Syst. 2020, 38, 6299–6310. [Google Scholar] [CrossRef]
Veneri, G.; Rufa, A. Extrafoveal Vision Maximizes the Likelihood to Grab Information in Visual-sequential Search. In Computer Communication & Collaboration; Academic Research Centre of Canada: Ottawa, ON, Canada, 2017; Volume 5. [Google Scholar]
Veneri, G.; Rosini, F.; Federighi, P.; Federico, A.; Rufa, A. Evaluating gaze control on a multi–target sequencing task: The distribution of fixations is evidence of exploration optimization. Comput. Biol. Med. 2012, 42, 235–244. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Processing pipeline: average gaze position per timestamp in fixation rows are processed to get the scan-path observed by the patient (the columns related to the pupils’ size are not taken into account); after creating the weight vector, we run the similarity score (with respect to the target sequence) to get the final patient score. Healthy patients and patients with extrapyramidal nervous system disease are distinguished with high statistical confidence.

Figure 2. Stimuli timing: the instruction slide is codified with “NaN”; the central dot target is codified with “0”; the TMT stimulus is codified with “1”.

Figure 3. Dot-plots for the toy-sequences: (a) T = 1A2B3C4D5E; (b) T = E1!A2B3CA4D54E and (c) T = 4C1BA2!3C4E5DA.

Figure 4. Distance matrix.

Figure 5. A dot-plot of the strings: (a) !5511DAAB3223BBD!53DECCE44ADD55E, member of the Healthy class; (b) !!!2CEB1!52!AA55!!24EBBB!!!334!ECC4!B, member of the Chronic class; (c) !!!C!3354CCAA!B!A!!E5!!!2!!C!2!5A23122!2!EC!25D!EEE!1353131ACCA!5A!525!2!, member of the Extrapyramidal class.

Figure 6. A bar graph of the scores of the strings: (a) !5511DAAB3223BBD!53DECCE44ADD55E, member of the Healthy class; (b) !!!2CEB1!52!AA55!!24EBBB!!!334!ECC4!B, member of the Chronic class; (c) !!!C!3354CCAA!B!A!!E5!!!2!!C!2!5A23122!2!EC!25D!EEE!1353131ACCA!5A!525!2!, member of the Extrapyramidal class.

Figure 7. Main statistics for the three classes of patients.

Figure 8. Box plots of the score distribution for each class: Healthy (left), Chronic (center), and Extrapyramidal (right).

Figure 9. Q-Q plots for each patients’ class: (a) Healthy, (b) Chronic, and (c) Extrapyramidal.

Table 1. p-values of pairwise comparisons using Wilcoxon rank sum test with continuity correction.

	Chronic	Extrapyramidal
Extrapyramidal	$2 \cdot 10^{- 6}$	-
Healthy	0.04	$8 \cdot 10^{- 7}$

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

D’Inverno, G.A.; Brunetti, S.; Sampoli, M.L.; Muresanu, D.F.; Rufa, A.; Bianchini, M. Visual Sequential Search Test Analysis: An Algorithmic Approach. Mathematics 2021, 9, 2952. https://doi.org/10.3390/math9222952

AMA Style

D’Inverno GA, Brunetti S, Sampoli ML, Muresanu DF, Rufa A, Bianchini M. Visual Sequential Search Test Analysis: An Algorithmic Approach. Mathematics. 2021; 9(22):2952. https://doi.org/10.3390/math9222952

Chicago/Turabian Style

D’Inverno, Giuseppe Alessio, Sara Brunetti, Maria Lucia Sampoli, Dafin Fior Muresanu, Alessandra Rufa, and Monica Bianchini. 2021. "Visual Sequential Search Test Analysis: An Algorithmic Approach" Mathematics 9, no. 22: 2952. https://doi.org/10.3390/math9222952

APA Style

D’Inverno, G. A., Brunetti, S., Sampoli, M. L., Muresanu, D. F., Rufa, A., & Bianchini, M. (2021). Visual Sequential Search Test Analysis: An Algorithmic Approach. Mathematics, 9(22), 2952. https://doi.org/10.3390/math9222952

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Visual Sequential Search Test Analysis: An Algorithmic Approach

Abstract

1. Introduction

2. Material and Methods

2.1. Task Design

2.2. Data Preprocessing

2.2.1. Dataset

2.2.2. Generated Scan-Path Sequences

2.3. VSST Data Analysis Method

2.4. The Score Scheme

3. Experimental Results

4. Discussion

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI