Abstract
An extended four-dimensional version of the traditional Petitot–Citti–Sarti model on contour completion in the visual cortex is examined. The neural configuration space is considered as the group of similarity transformations, denoted as . The left-invariant subbundle of the tangent bundle models possible directions for establishing neural communication. The sub-Riemannian distance is proportional to the energy expended in interneuron activation between two excited border neurons. According to the model, the damaged image contours are restored via sub-Riemannian geodesics in the space M of positions, orientations and thicknesses (scales). We study the geodesic problem in M using geometric control theory techniques. We prove the existence of a minimal geodesic between arbitrary specified boundary conditions. We apply the Pontryagin maximum principle and derive the geodesic equations. In the special cases, we find explicit solutions. In the general case, we provide a qualitative analysis. Finally, we support our model with a simulation of the association field.
1. Introduction
A mathematical description of the functioning of the human body is a pressing problem in the modern world. Particularly, the specification of cerebration and neuron operation is of particular interest. In this paper, we model the visual information processing by the visual cortex. The complete mechanism of the visual signal processing is not fully studied; however, there is a profound understanding [1] of how image processing is carried out via the information accumulated in light-sensitive receptors, bipolar and ganglion cells of the eye retina. Such information includes the spatial coordinates of the image.
After the retina, the visual signal passes through LGN cells of the thalamus and arrives in the visual cortex. The visual cortex has a multi-layered organization and consists of billions of neural cells. Neurons are connected in a complex network, which is extremely difficult to analyze due to the huge number of elements and even more connections between them. The direct simulation approach to modeling such systems faces inevitable obstacles. However, there are some fundamental principles that are used in network configuration, e.g., the principle of minimum energy spent on establishing communication between two excited neurons of the network. A promising direction for studying such complex systems is to understand such principles and propose simple mathematical models based on these principles. Further mathematical analysis of such models can deepen the understanding of the original systems.
In [2], a mathematical model based on scale space theory was proposed to describe the primary processing mechanism. The model is based on the properties of Gaussian kernels and their derivatives as regularized differential operators, as well as solutions to the linear diffusion equation. The model is supported by experimental study [3], where the shape of the receptive fields (RFs) of bipolar and ganglion cells was established. The authors concluded that RFs are mathematically well approximated by filter profiles based on the Gaussian kernel and Gaussian derivatives. Then, their functioning can be represented as the action of a filter on the input signal (convolution of two functions).
In [4], Marr put forward the idea that retinal ganglion cells perform a convolution capable of extracting qualitative breaks encoded in the signal. Marr also posed that higher levels of visual processing are rooted in this first level of morphological organization of the retinal image, which he called the 2D-primal sketch. He discovered that the convolution of a signal with a receptive profile is a wavelet analysis of the signal, i.e., a spatially localized multiscale Fourier analysis capable of detecting discontinuities. An alternative transformation tool can be an internal compression of the image in accordance with its geometric structure. There is Marr’s hypothesis that the image can be reconstructed from its different-scale edges. This contour reconstruction is extremely accurate as only finer details such as textures are smoothed out. In this case, a problem arises with the origin of the breaks. Discontinuities that are robust to large-scale changes can be mistaken for the edges of external objects. Therefore, compression of visual information, i.e., information limitation, is identified with morphological analysis, i.e., geometric constraint.
Further research required a more comprehensive approach. Hubel and Wiesel [5] understood the principles of the primary visual cortex V1 processing. They showed that some parts of the brain react not only to the spatial position of the visible image, but also to its orientation in space. They found that the receptive fields of V1 neurons were elongated rather than rounded. This indicated the ability of V1 cells to detect contour segments with different orientations throughout the image. Mathematically, the operation of V1 cells can be modeled as lifting a two-dimensional input image into an expanded space of positions and orientations.
Even though V1 is physically a two-dimensional neural layer, it implements more than two degrees of freedom. Hubel [1] called the difference between physical and abstract dimensions the grafting of variables. In his pioneering work [6], Hoffmann introduced differential fiber bundles to describe the visual cortex. Here, the base of the fiber bundle represents the retinal plane and the fibers represent the engrafted variables. Further development of this model was performed by Petitot and Tondut [7] (see also the honorable work [8] by Petitot). They described V1 cells as a fiber bundle equipped with a contact structure, and the neural long-range connections were identified as integral curves in the Heisenberg group. In their model, contour completion is carried out along integral curves by minimizing a suitable length functional.
Petitot [8] described the biological functioning of the visual cortex V1 as a sub-Riemannian (SR) structure on the Heisenberg group. Citti and Sarti [9] took into account the nature of the orientation angle and proposed the SR structure on the Lie group . The roto-translation group SE(2) models the configuration space of V1 neurons as the space of positions and SO(2) orientations. In this model, the reconstruction of a hidden contour occurs by minimizing the excitation energy of neurons that perceive visual information. Such a process is interpreted as the action of the hypoelliptic diffusion operator studied in [10,11,12]. Then, the reconstructed parts of the contour are the SR length minimizers on SE(2). The exact expression of the minimizers was found in [13]. Such curves are used to render images [14] and to explain some visual illusions [15,16].
Modern computer vision is actively developing based on the principles of biological systems. They are used for image analysis tasks such as enhancement, segmentation, shading, and feature detection. In [17,18], the authors provided a mathematical background for image processing in an extended space of positions and orientations. Salient lines are tracked by SR length minimizers (see [19,20]) or by optimal trajectories in modified models [21,22,23,24], where the spatial propagation along the minimizers is restricted to avoid cusp points and to normalize the curvature of the detected salient lines.
The classic Petitot–Citti–Sarti model has been widely developed. It is the cornerstone of an entire scientific direction—neuromathematics of vision [25,26]. This model was modified in subsequent works by many authors (e.g., [27,28,29,30,31,32,33,34,35]), taking into account some aspects of the physiology of vision and based on the needs in the field of image processing. A recent work [36] showed an overview of different models.
The work [27] bridged the classic Petitot–Citti–Sarti model with the edge co-occurrence statistics in natural images. The model’s applicability for the association field construction was studied in [28], where the authors showed that the boundary conditions connected by the SR geodesic with cuspless planar projection match the criteria of good continuation. In [29], the authors modified the model to account for the spherical nature of the retina. Later, in [30], this spherical model was extended by considering the non-uniform distribution of photoreceptors on the retina. A link to the well-known Bresloff–Cowan spherical model [31] of V1 hypercolumns was studied in [32]. A link to the widely used Wilson–Cowan equations of neural dynamics was studied in [37]. In [33], a four-dimensional model accounting for the contour curvature was studied. In [38], the authors proposed a five-dimensional model considering the duration and velocity of visual stimuli. Another five-dimensional extension was proposed in [39], where orientation, frequency, and phase selective behavior of the V1 simple cells are analyzed. Based on this model, a contour completion method was developed in [40]. A semidiscrete modification and its application to image processing was studied in [34].
In [35], the authors expanded their classic model by adding a scale parameter and introducing a symplectic structure to describe the structure of neural connectivity. In the current work, we take this model as a basis and develop it by explicitly introducing the length functional and considering the problem of finding length minimizers between arbitrarily specified boundary conditions.
Neurophysiological studies show that spatial hypercolumns also accumulate secondary information about the visible image, such as ocular dominance [41], contour curvature [42,43], contour thickness (scale) [44], and other features. It is known that neurons (simple cells) have different sizes in different areas of the visual cortex. Hence, from V1 to V2, we find simple cells sensitive to objects of different scales (see, e.g., Figure 37 in [25]). In this paper, inspired by [35], we consider a four-dimensional model, which is an extension of the classic Petitot–Citti–Sarti model by adding the contour thickness parameter. The configuration space of neurons is interpreted as a group of similarity transformations . The left-invariant distribution [45] of the tangent subspaces models the possible directions of establishing a neural connection. The sub-Riemannian distance is proportional to the energy expended in interneuron activation between two excited border neurons. According to the model, the contours of the damaged image are restored using sub-Riemannian geodesics in the space M of positions, orientations and thicknesses (scales). This extension is also intended for image processing tasks to find salient lines (see Figure 1), and to restore damaged image contours (see Figure 2). Image enhancement via left-invariant evolution in the SIM(2) group is studied in [46].
Figure 1.
Finding blood vessels (salient lines) in the fundus photography of the human retina. Specifications: x, y are spatial coordinates, is the orientation, and is the thickness of lines.
Figure 2.
Restoration of image contours. From left to right: original image; corrupted image (the damaged area is a white disc); recovering contours via the classical model (sub-Riemannian geodesics in SE(2)); restoration via geodesics in SIM(2), taking into account the thickness of contours.
In this paper, we consider the problem of SR geodesics in SIM(2). In Section 2, we formulate the model and state the sub-Riemannian problem. In Section 3, we prove the complete controllability of the system and the existence of optimal controls. Then, in Section 4, we apply a necessary optimality condition, the Pontryagin maximum principle (PMP), and study the Hamiltonian PMP system. We obtain an explicit expression for the abnormal geodesics and provide a qualitative analysis of the Hamiltonian system for normal geodesics. In Section 5, we discuss the boundary value problem. In Section 6, we provide a simulation to construct the association field using the sub-Riemannian geodesics in our problem.
The main contributions of our research are the following:
- A proof of well-posedness of the geodesic problem in M; see Theorem 1.
- An explicit expression of minimal abnormal geodesics; see Theorem 2.
- An explicit expression of a special case of normal geodesics; see Theorem 3.
- Asymptotic behavior of normal geodesics in the general case; see Theorem 4.
2. Problem Formulation
The classic works of Petitot, Citti, and Sarti [8,9] present a model of V1 as a three-dimensional Lie group SE(2) of positions and orientations. In their model, horizontal long-range connections between cells of V1 are represented by smooth curves adhering to a nonholonomic constraint: the curves must be tangent to the distribution , where is a given left-invariant differential one-form. Through this approach, a horizontal connection between two neurons is established based on the principle of minimum energy spent on its creation. This leads to the natural modeling of the space V1 by the sub-Riemannian manifold , where the metric specifies the distance encoding the expended energy. According to the model, the visual system performs contour completion (restoration of a corrupted or partially hidden from observation contour) by finding a sub-Riemannian length minimizer between two configurations on the boundary of the damaged area (see Figure 2).
In the consequent work [44], the same authors introduced a new variable and defined a symplectic structure in the extended space SIM(2) of positions, orientations and scales. The symplectic structure generates nonholonomic constraints for establishing a long-range neural connection. Note that the explicit form of a metric encoding the energy spent to create the connection was not considered in [44]. In our work, we explicitly present this sub-Riemannian metric and formulate an optimal control problem for finding length minimizers. In [44], the authors presented special types of integral curves with a fixed parameter of scale or orientation, which correspond to constant controls in our model. We are also motivated by image analysis applications, where the thickness of contours varies during tracking (see Figure 1).
According to the model, the contour completion mechanism by V1 is invariant under parallel translations, rotations, and scaling of the image on the retina. Such transformations constitute the group of orientation-preserving similarity transformations on the plane
The retinal plane is a homogeneous space of the Lie group SIM(2), which acts transitively on it. Thus, SIM(2) models the configuration space of simple cells V1.
Now, we explain the lifting of an observable image from the retinal plane to the group SIM(2), which represents V1. The set of receptive profiles of V1 simple cells over a retinal point is formed from the “mother” Gabor function (see [44]),
by rotations on an angle , and dilations on :
where
The lifted image is obtained by probing the observable image on the retinal plane with a family of two parametric Gabor filters,
A selection from all different cells in a fiber is performed by maximum response selection
The values are actual values of engrafted variables and associated with a retinal point . Such values are used as boundary conditions for contour completion problem that we formulate at the end of this paragraph as an optimal control problem.
Beforehand, we explain a nonholonomic constraint on a long-range (horizontal) neural connection. A horizontal connection is modeled by a smooth curve tangent to the distribution , where the one-form is given by (see [44])
Notice that the distribution is given by
where are basis left-invariant vector fields on SIM(2)
where Id is a unit element, and is the left translation on SIM(2) (see Appendix A).
By a horizontal curve, we call a Lipschitz curve tangent to at almost every point
where .
We construct the sub-Riemannian metric by requiring that , , and be orthogonal. Thus, the length of a horizontal curve is given by
where the parameters , are coefficients of the sub-Riemannian metric that encode the balance between penalties for motion in a plane along the contour and changing its orientation and thickness. Further, for simplicity, we consider the model case .
Any horizontal curve of positive length can be reparameterized by arc length (see Lemma 3.15 in [47]). Thus, the problem of length minimization is equivalent to time-optimal problem .
Finally, we formulate a contour completion problem as the optimal control problem. Consider the following control system:
For given boundary conditions , , we aim to find the controls , , , such that the corresponding trajectory transfers the system from the initial configuration to the final configuration by minimum time:
Remark 1.
The problem is invariant under the left action of SIM(2) since the vector fields , , and are left-invariant. Due to this property, without loss of generality, we set .
3. Existence of Solutions
When studying Problems (15) and (16), the natural question arises about the existence of an admissible trajectory connecting boundary conditions (16). The control system is called completely controllable if an admissible trajectory exists for any , . We study the complete controllability of System (15) using the technique of geometric control theory [48].
We have the following nonzero Lie brackets of the controlled vector fields:
System (15) is symmetric with respect to the controls and it satisfies Hormander condition (i.e., the Lie algebra of the controlled vector fields spans at every point the entire tangent space):
By Chow–Rashevsky theorem [48], these two conditions plus connectedness of SIM(2) guarantee complete controllability.
Existence of an optimal admissible trajectory that satisfies conditions (16) is ensured by Filippov’s theorem [48,49]. In such a way, we proved the following theorem.
4. Pontryagin Maximum Principle
The necessary condition for optimality is given by the Pontryagin maximum principle (PMP). In this section, we apply PMP to our problem (15) and (16).
Let . Define the Pontryagin function
where are coordinates in corresponding to in SIM(2).
PMP states the following. Let , , be the optimal control and the corresponding optimal trajectory. Then, there exists a Lipschitz curve such that for (the non-triviality condition), and the following conditions hold for almost every :
- The Hamiltonian system
- The maximum condition
The function being maximized is called the Hamiltonian. This is the first integral of the Hamiltonian system. The case is called abnormal, and the case is called normal. The normal case is reduced to by time reparameterization.
Natural coordinates for left-invariant systems [45] are given by :
The inverse transformation from h to p is given by
The Hamiltonian system (22) takes the following form in the coordinates (23):
with the boundary conditions
The subsystem for the configuration variables x, y, , is called the horizontal part, and the subsystem for adjoint variables is called the vertical part of the Hamiltonian system.
Remark 2.
The vertical part of the Hamiltonian system (26) can be alternatively derived by computing the Poisson brackets: (see [47]).
In PMP formulation for Problems (15) and (16) of searching for a non-trivial (of non-zero length) optimal curve without loss of generality, we choose the arc length parameterization
Next, we consider two cases: (the abnormal case), and (the normal case).
4.1. Abnormal Case
The Pontryagin function is given by . Since the set of admissible controls is symmetric, the maximum condition is satisfied if and only if . Then, the vertical part of (26) is reduced to
The horizontal part of (26) is reduced to , . Taking into account the boundary condition leads to the following theorem.
Theorem 2.
Abnormal extremal trajectories have the following form:
where is an integrable function with values .
Abnormal optimal trajectories parameterized by arc length have the following form:
4.2. Normal Case
The Pontryagin function can be considered as a scalar product . It reaches maximum on the control set when the vector is collinear to and has unit length. Note that, due to the choice of , this implies the following relation for extremal controls:
The Hamiltonian system takes the form
This system has a first integral: the Hamiltonian H. We can find more first integrals by considering right-invariant vector fields , , and associated linear on the fibers of the cotangent bundle Hamiltonians , . Among these four right-invariant Hamiltonians, only two (, ) are in involution (i.e., the Poisson bracket ) and functionally independent. Thus, we found the following set of first integrals:
To prove Liouville integrability, it is necessary to find four functionally independent first integrals in involution. Three of them we found above. We could not find the remaining first integral. Thus, the question of Liouville integrability remains open.
Now, we focus on the vertical part and describe the coadjoint orbits [50]. Consider the Poisson bivector, which is given by a matrix with the components . The structure of Poisson brackets coincides with the structure of Lie brackets (17). Thus, we have only the following nonzero Poisson brackets:
We have ; thus if , and otherwise.
In the case , the coadjoint orbit is zero dimensional. The explicit expression for the extremals are given by the following theorem.
Theorem 3.
For the initial covector values
normal extremal trajectories have the following form:
They are optimal on a time interval , when ; and up to infinity, when .
Proof.
The expression (37) is obtained by integration of the Hamiltonian system (33) with the boundary conditions (27) that, due to the condition (36), is reduced to
Optimality for holds, since the corresponding extremal trajectory is a straight line passing at maximum speed. In the time-optimal problem, this trajectory is optimal, since any other trajectory of the control system requires more time to reach a point on this line.
Optimality for for holds for the same reason.
The trajectory for is not optimal for , since it has a Maxwell point [45] at . A Maxwell point is a point where two distinct geodesics meet with the same time. After a Maxwell point, an extremal loses its optimality. A reason for the Maxwell point in our case is periodicity of the angle . Indeed, consider two trajectories with the initial values , , . They reach the same configuration , , at . □
Remark 3.
In the general case , the coadjoint orbit is four dimensional. We performed a qualitative analysis of the Hamitonian system leading to the following theorem.
Theorem 4.
Any solution to the vertical part corresponding to the initial covector , has the following asymptotic behavior:
Proof.
Let . Due to system (33), we have
Now, we estimate the above integral. Due to (33), we have . Thus, the function is non-increasing. Together with , this implies for all . Then, the integral of decreases indefinitely: . Then, the exponential of the integral tends to zero: . Thus, we proved . It remains to prove the asymptotic behavior of and . The Hamiltonian implies that is bounded from below: . The boundness and non-increasing of implies . The remaining statement follows from the Hamiltonian, since . □
Note that the condition is technical, and we use it in the proof. Based on the numerical experiments, we formulate the following conjecture.
Hypothesis 1.
Any solution to the vertical part corresponding to the initial covector has the following asymptotic behavior:
5. Approach to the Boundary Value Problem
A geodesic on a sub-Riemannian manifold is a horizontal curve whose sufficiently short arcs are length minimizers. In the optimal control formulation, SR geodesics are the Pontryagin extremal trajectories. Note that PMP is only a necessary, but not sufficient, condition for optimality. It is an infinite-dimensional analogy of the zero derivative condition when minimizing a smooth function in . One needs higher-order conditions to find the minimum among all the critical points. The Pontryagin extremals are first-order candidates for being optimal among all admissible trajectories of a control system. Sufficiently short arcs of SR geodesics are optimal, since they satisfy the Legendre condition [48]. An extremal trajectory loses optimality at a so-called cut point [47].
There are two types of Pontryagin extremals, called abnormal and normal. In the previous section, we found an explicit expression for the abnormal extremals and derived the Hamiltonian system (33) for the normal extremals in SIM(2). Note that the set of reachable end conditions for arbitrary time (the attainable set) by the abnormal extremals is a one-dimensional subspace in the four-dimensional space SIM(2). In contrast, the attainable set by the normal extremals is the entire group SIM(2), as we proved in Theorem 1.
By varying the initial value of covector over the set , we obtain a three-parameter family of the normal extremals. Consider a so-called exponential map , which maps the initial covector and the instance of time to the point of the corresponding geodesic. It holds for general sub-Riemannian manifolds that the exponential map is not injective. For example, consider the initial covector , then the corresponding extremal trajectory is given by (see (33)), which is periodic with a period .
Cut points are singularities of the exponential map. There are typically two reasons for a cut point [47]: a conjugate point (a point where the exponential map is degenerate), and a Maxwell point (a point where two distinct geodesics meet at the same time). Finding cut points is a hard mathematical problem, and its solution relies on explicit formulas for the geodesics and analysis of symmetries of the exponential map. Thus, to solve the boundary value problems (15) and (16) of finding a length minimizer between two given configurations, one needs to restrict the preimage of the exponential map to the domain corresponding to only optimal geodesics. Further, the shooting method can be applied.
6. Modeling of Association Field
The problem of contour completion (integration) by human visual system was investigated by psychophysicists. Gestalt laws have been proposed for several phenomena of visual perception. Among them, the law of good continuation plays the central role for perceptual completion. The principle of good continuation is found in the experiments of Field, Hayes and Hess [51]. Those experiments have resulted in the notion of association field, which describes the set of possible subjective contours starting from a given initial configuration. The role of the scale in contour integration process was stressed in [52].
Inspired by Figure 16 in [51], we provide a simulation of association field by sub-Riemannian geodesics in SIM(2) (see Figure 3). A remarkable property of this model is that the further spatial propagation of the present geodesics does not appear with growing time, which corresponds to Hypothesis 1. This gives a natural bound for the spacial distance between given boundary configurations, which corresponds to the Field model.
Figure 3.
Modeling of association field by sub-Riemannian geodesics in SIM(2). The spatial projections of the geodesics are depicted for the following initial covectors: .
In Figure 4, we provide a simulation showing that the sub-Riemannian distance in SIM(2) can be used as a criterion for perceptual grouping of the patterns with different positions, orientations, and sizes. In this experiment, we show the points in SIM(2) that are equidistantly (with the distance ) distributed along the given three segments of geodesics. They are plotted over the background consisting of points in SIM(2) on a regular spatial grid and with randomly chosen orientation and the scale . The grid is constructed in a way to guarantee that the distance between the background elements is greater than . One can see that the equidistantly distributed points are grouped in contrast to the points in the background.
Figure 4.
Perceptual grouping of the elements in SIM(2) having small SR distance between them. They are plotted over the background consisting of the elements located from each other on a bigger distance.
We performed the above simulations in Wolfram Mathematica by numerical integration of the normal Hamiltonian system (33). In the experiments, we relied on the local properties of the exponential map. A further detailed study of the feasibility of SIM(2) model for contour completion on real images requires software to solve a boundary value problem, as discussed in Section 5. This will be a topic of our future research.
7. Conclusions
In this paper, we considered the sub-Riemannian problem in the Lie group SIM(2) of orientation-preserving similarity transformations of the plane. This problem arises when modeling the mechanism of completing contours by the visual cortex. We considered an extended Petitot–Citti–Sarti model, where the thickness of the contours is taken into account. Based on the principle of minimum energy in the contour completion process, we proposed a sub-Riemannian metric encoding the energy. We stated the contour completion problem as the problem of finding a length-minimizer with given boundary conditions. We reformulated the problem as a time-optimal problem. We proved a solution’s existence and applied a necessary optimality condition: PMP. The Hamiltonian PMP system for the geodesics was derived. We found explicit parameterization of abnormal trajectories and provided a qualitative analysis of the normal Hamiltonian system. Finally, we presented simulations on constructing the association field by sub-Riemannian geodesics in SIM(2).
Author Contributions
Conceptualization, A.M.; methodology, A.M.; software, I.G. and A.M.; validation, I.G. and A.M.; formal analysis, I.G. and A.M.; investigation, I.G. and A.M.; resources, I.G. and A.M.; data curation, A.M.; writing original draft preparation, I.G. and A.M.; writing review and editing, I.G. and A.M.; visualization, I.G. and A.M.; supervision, A.M.; project administration, A.M.; funding acquisition, I.G. and A.M. All authors have read and agreed to the published version of the manuscript.
Funding
The work by Alexey Mashtakov was supported by the Russian Science Foundation under grant 22-11-00140 (https://rscf.ru/project/22-11-00140/, accessed on 29 July 2024) and performed in Ailamazyan Program Systems Institute of Russian Academy of Sciences.
Institutional Review Board Statement
Informed Consent Statement
Not applicable.
Data Availability Statement
Dataset available on request from the authors.
Acknowledgments
We gratefully acknowledge the anonymous reviewer whose fruitful comments helped us to refine the model and considerably improve the exposition of the work. We are grateful to Y.L. Sachkov for many fruitful suggestions and valuable comments, especially for pointing out the structure of coadjoint orbits. We thank D.V. Alekseevsky for discussions on the physiology of visual systems and their models. We thank A.A. Ardentov for pointing out the structure of abnormal extremals and A.Y. Popov for discussing the asymptotic behavior of normal extremals. Finally, we thank all control process research center seminar participants for their valuable comments.
Conflicts of Interest
The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.
Nomenclature
| Set of real numbers. | |
| Set of integer numbers. | |
| One dimensional sphere (a circle): . | |
| SO(2) | Lie group of rotations of the plane . This group is parameterized by angle . |
| SE(2) | Lie group of proper motions of the plane . This group is topologically equivalent to the manifold and parameterized by a vector of parallel translation and an angle . |
| SIM(2) | Lie group of orientation-preserving similarity transformations of the plane . Its matrix representation is given by (1). |
| M | Short notation for the Lie group SIM(2). |
| Differential one form in SE(2) in the classic Petitot–Citti–Sarti model. | |
| Ker | Kernel of a differential form. It is a linear space spanned by the vectors vanishing the form. |
| Distribution in in the classic Petitot–Citti–Sarti model. | |
| Metric in SE(2) in the classic Petitot–Citti–Sarti model. | |
| q | Element of SIM(2). |
| The parameters of SIM(2): is a vector of parallel translation. In our model, are coordinates in image plane. | |
| The parameter of SIM(2): is an angle of rotation. In our model, is the orientation angle between the abscissa and the tangent vector to the contour. | |
| The parameter of SIM(2): is a scaling parameter. In our model, is the thickness of the contour. | |
| Id | Unit element of the group SIM(2). It is given unit matrix and corresponds to the parameters values . |
| G | The Gabor function, see (3). |
| I | Image on the retinal plane. |
| O | Lifted image in the extended space SIM(2), see (5). |
| Left translation on element , see (A3). | |
| Left-invariant vector field on SIM(2), see (9)–(12). | |
| Differential one form in SIM(2), see (7). | |
| Distribution in , see (8) | |
| Horizontal curve in SIM(2), see (13). | |
| Sub-Riemannian length of a horizontal curve , see (14). | |
| T | Terminal time in the optimal control problem. |
| u | The control vector. |
| i-th component of the control vector u. | |
| U | Set of admissible controls, see (28). |
| Commutator (also known as Lie bracket). Commutator of two matrices and is defined by . Commutator of two vector fields at a point is defined by , where denotes a flow generated by the vector field X, see [47]. | |
| sim(2) | Lie algebra of the Lie group SIM(2). This is a vector space spanned by the basis vectors , see (A4) for their matrix representation. |
| Lie | Lie algebra generated by the given vector fields and all their commutators, see (18). |
| p | Adjoint covector in the Darboux coordinates. |
| i-th component of the covector p. | |
| h | Adjoint covector in the left-invariant coordinates. |
| Left-invariant Hamiltonian corresponding to the basis vector field , see (19). is the i-th component of the covector h. | |
| Initial (for ) value of . | |
| Terminal (for ) value of . | |
| Pontryagin function, see (19). | |
| Right-invariant vector field. | |
| Right-invariant Hamiltonian, see (34). | |
| P | Poisson bivector. |
| Poison bracket. | |
| Scalar product of a covector and a vector, . |
Abbreviations
The following abbreviations are used in this manuscript:
| LGN | lateral geniculate nucleus |
| RF | receptive fields |
| V1 | primary visual cortex |
| PMP | Pontryagin maximum principle |
| SR | sub-Riemannian |
Appendix A. Construction of the Left-Invariant Distribution
The Lie group of orientation-preserving similarity transformations on the plane is
The basis vector fields associated with the coordinates are given by
Denote by Id the unit element, which is given by identity matrix and corresponds to the origin, and denote by the left translation on element , which is given by matrix multiplication
The canonical basis for the Lie algebra is given by
Every left-invariant vector field is obtained by push-forward of a Lie algebra element under the left translation. The basis left-invariant vector fields are defined as (see Figure A1). Explicitly, this construction is the following: Let be a smooth curve such that , . Then, is given by , or in matrix representation
Thus, we have
Figure A1.
Construction of left-invariant vector fields. Here, is a smooth curve passing through the unit element Id, is the left translation on element , and A is the tangent vector of .
References
- Hubel, D. Eye, Brain, and Vision; Scientific American Library: New York, NY, USA, 1988. [Google Scholar]
- Ter Haar Romeny, B.M. Front-End Vision and Multi-Scale Image Analysis. In Multi-Scale Computer Vision Theory and Applications, Written in Mathematica; Computational Imaging and Vision; Springer: Dordrecht, The Netherlands, 2003; Volume 27. [Google Scholar]
- Tootell, R.B.H.; Switkes, E.; Silverman, M.S.; Hamilton, S.J. Functional anatomy of macaque striate cortex. II. Retinotopic organization. J. Neurosci. 1988, 8, 1531–1568. [Google Scholar] [CrossRef]
- Marr, D. Vision: A Computational Investigation into the Human Representation and Processing of Visual Information; W.H. Freeman: San Francisco, CA, USA, 1982. [Google Scholar]
- Hubel, D.H.; Wiesel, T.N. Receptive fields, binocular interaction and functional architecture in the cat’s visual cortex. J. Physiol. 1962, 160, 106–154. [Google Scholar] [CrossRef]
- Hoffman, W.C. The visual cortex is a contact bundle. Appl. Math. Comput. 1989, 32, 137–167. [Google Scholar] [CrossRef]
- Petitot, J.; Tondut, Y. Vers une neurogeometrie. Fibrations corticales, structures de contact et contours subjectifs modaux. Math. Inform. Sci. Hum. 1999, 145, 5–102. (In French) [Google Scholar] [CrossRef]
- Petitot, J. The neurogeometry of pinwheels as a sub-Riemannian contact structure. J. Physiol. 2003, 97, 265–309. [Google Scholar] [CrossRef]
- Citti, G.; Sarti, A. Cortical based model of perceptual completion in the roto-translation space. J. Math. Imaging Vis. 2006, 24, 307–326. [Google Scholar] [CrossRef]
- Boscain, U.; Gauthier, J.-P.; Chertovskih, R.; Remizov, A. Hypoelliptic diffusion and human vision: A semidiscrete new twist. Siam J. Imaging Sci. 2014, 7, 669–695. [Google Scholar] [CrossRef]
- Boscain, U.V.; Chertovskih, R.; Gauthier, J.-P.; Prandi, D.; Remizov, A. Highly corrupted image inpainting through hypoelliptic diffusion. J. Math. Imaging Vis. 2018, 60, 1231–1245. [Google Scholar] [CrossRef]
- Duits, R.; Franken, E. Left-invariant parabolic evolutions on SE(2) and contour enhancement via invertible orientation scores Part I: Linear left-invariant diffusion equations on SE(2). Q. Appl. Math. 2010, 68, 255–292. [Google Scholar] [CrossRef]
- Sachkov, Y.L. Cut locus and optimal synthesis in the sub-Riemannian problem on the group of motions of a plane. ESAIM Control Optim. Calc. Var. 2010, 17, 293–321. [Google Scholar] [CrossRef][Green Version]
- Mashtakov, A.P.; Ardentov, A.A.; Sachkov, Y.L. Parallel Algorithm and Software for Image Inpainting via Sub-Riemannian Minimizers on the Group of Rototranslations. Numer. Math. Theory Methods Appl. 2013, 6, 95–115. [Google Scholar] [CrossRef]
- Franceschiello, B.; Mashtakov, A.; Citti, G.; Sarti, A. Geometrical optical illusion via sub-Riemannian geodesics in the roto-translation group. Differ. Geom. Appl. 2019, 65, 55–77. [Google Scholar] [CrossRef]
- Baspinar, E.; Calatroni, L.; Franceschi, V.; Prandi, D. A Cortical-Inspired Sub-Riemannian Model for Poggendorff-Type Visual Illusions. J. Imaging 2021, 7, 41. [Google Scholar] [CrossRef]
- Duits, R.; Felsberg, M.; Granlund, G. Romeny, B. Image Analysis and Reconstruction using a Wavelet Transform Constructed from a Reducible Representation of the Euclidean Motion Group. Int. J. Comput. Vis. 2007, 72, 79–102. [Google Scholar] [CrossRef]
- Duits, R.; Smets, B.M.N.; Wemmenhove, A.J.; Portegies, J.W.; Bekkers, E.J. Recent Geometric Flows in Multi-orientation Image Processing via a Cartan Connection. In Handbook of Mathematical Models and Algorithms in Computer Vision and Imaging; Chen, K., Schnlieb, C.B., Tai, X.C., Younces, L., Eds.; Springer: Cham, Switzerland, 2021. [Google Scholar]
- Bekkers, E.J.; Duits, R.; Mashtakov, A.; Sanguinetti, G.R. A PDE Approach to Data-driven Sub-Riemannian Geodesics in SE(2). Siam J. Imaging Sci. 2015, 8, 2740–2770. [Google Scholar] [CrossRef]
- Mashtakov, A.; Duits, R.; Sachkov, Y.; Bekkers, E.J.; Beschastnyi, I. Tracking of Lines in Spherical Images via Sub-Riemannian Geodesics in SO(3). J. Math. Imaging Vis. 2017, 58, 239–264. [Google Scholar] [CrossRef]
- Duits, R.; Ghosh, A.; Dela Haije, T.; Mashtakov, A. On sub-Riemannian geodesics in SE(3) whose spatial projections do not have cusps. J. Dyn. Control Syst. 2016, 22, 771–805. [Google Scholar] [CrossRef]
- Duits, R.; Meesters, S.P.L.; Mirebeau, J.-M.; Portegies, J.M. Optimal Paths for Variants of the 2D and 3D Reeds-Shepp Car with Applications in Image Analysis. J. Math. Imaging Vis. 2018, 60, 816–848. [Google Scholar] [CrossRef]
- Mashtakov, A.P. Time minimization problem on the group of motions of a plane with admissible control in a half-disc. Sb. Math. 2022, 213, 534–555. [Google Scholar] [CrossRef]
- Mashtakov, A.; Sachkov, Y. Time-Optimal Problem in the Roto-Translation Group with Admissible Control in a Circular Sector. Mathematics 2023, 11, 3931. [Google Scholar] [CrossRef]
- Petitot, J. Neurogeometrie de la vision. In Modeles Mathematiques et Physiques des Architectures Fonctionnelles; Editions Ecole Polytechnique: Palaiseau, France, 2008; 419p. (In French) [Google Scholar]
- Citti, G.; Sarti, A. (Eds.) Neuromathematics of Vision; Lecture Notes in Morphogenesis; Springer: Berlin/Heidelberg, Germany, 2014; 367p. [Google Scholar]
- Sanguinetti, G.; Citti, G.; Sarti, A. A model of natural image edge cooccurrence in the rototranslation group. J. Vis. 2010, 10, 37. [Google Scholar] [CrossRef]
- Duits, R.; Boscain, U.; Rossi, F.; Sachkov, Y. Association Fields via Cuspless Sub-Riemannian Geodesics in SE(2). J. Math. Imaging Vis. 2014, 49, 384–417. [Google Scholar] [CrossRef] [PubMed]
- Boscain, U.; Rossi, F. Projective Reeds-Shepp car on S2 with quadratic cost. ESAIM Control Optim. Calc. Var. 2010, 16, 275–297. [Google Scholar] [CrossRef]
- Mashtakov, A.; Duits, R. A cortical based model for contour completion on the retinal sphere. Program Syst. Theory Appl. 2016, 7, 231–247. [Google Scholar] [CrossRef]
- Bressloff, P.C.; Cowan, J.D. A spherical model for orientation as spatial-frequency tuning in a cortical hypercolumn. Philos. Trans. R. Soc. Lond. 2002, 358, 1–22. [Google Scholar] [CrossRef] [PubMed]
- Alekseevsky, D. Conformal Model of Hypercolumns in V1 Cortex and the Mobius Group. Application to the Visual Stability Problem. In Geometric Science of Information—GSI 2021; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2021; Volume 12829, pp. 65–72. [Google Scholar]
- Galyaev, I.; Mashtakov, A. Liouville Integrability in a Four-Dimensional Model of the Visual Cortex. J. Imaging 2021, 7, 277. [Google Scholar] [CrossRef] [PubMed]
- Prandi, D.; Gauthier, J.P. A Semidiscrete Version of the Citti-Petitot-Sarti Model as a Plausible Model for Anthropomorphic Image Reconstruction and Pattern Recognition; Springer: Cham, Switzerland, 2018; 113p. [Google Scholar]
- Sarti, A.; Citti, G.; Petitot, J. The symplectic structure of the primary visual cortex. Biol. Cybern. 2008, 98, 33–48. [Google Scholar] [CrossRef] [PubMed]
- Citti, G.; Sarti, A. Cortical Functional Architectures as Contact and Sub-riemannian Geometry. In Morphology, Neurogeometry, Semiotics; Lecture Notes in Morphogenesis; Sarti, A., Ed.; Springer: Cham, Swizterland, 2024. [Google Scholar]
- Bertalmio, M.; Calatroni, L.; Franceschi, V.; Franceschiello, B.; Prandi, D. Cortical-Inspired Wilson–Cowan-Type Equations for Orientation-Dependent Contrast Perception Modelling. J. Math. Imaging Vis. 2021, 63, 263–281. [Google Scholar] [CrossRef]
- Barbieri, D.; Citti, G.; Cocci, G.; Sarti, A. A Cortical-Inspired Geometry for Contour Perception and Motion Integration. J. Math. Imaging Vis. 2013, 49, 511–529. [Google Scholar] [CrossRef]
- Baspinar, E.; Sarti, A.; Citti, G. A sub-Riemannian model of the visual cortex with frequency and phase. J. Math. Neurosci. 2020, 10, 11. [Google Scholar] [CrossRef]
- Baspinar, E. Multi-Frequency Image Completion via a Biologically-Inspired Sub-Riemannian Model with Frequency and Phase. J. Imaging 2021, 7, 271. [Google Scholar] [CrossRef] [PubMed]
- Shatz, C.J.; Stryker, M.P. Ocular dominance in layer iv of the cat’s visual cortex and the effects of monocular deprivation. J. Physiol. 1987, 281, 267–283. [Google Scholar] [CrossRef]
- Yue, X.; Robert, S.; Ungerleider, L.G. Curvature processing in human visual cortical areas. NeuroImage 2020, 222, 117295. [Google Scholar] [CrossRef] [PubMed]
- Zucker, S.W. The computational connection in vision: Early orientation selection. Behav. Res. Methods Instrum. Comput. 1986, 18, 608–617. [Google Scholar] [CrossRef]
- Blakemore, C.T.; Campbell, F. On the existence of neurones in the human visual system selectively sensitive to the orientation and size of retinal images. J. Physiol. 1969, 203, 237–260. [Google Scholar] [CrossRef] [PubMed]
- Sachkov, Y.L. Left-invariant optimal control problems on Lie groups: Classification and problems integrable by elementary functions. Russ. Math. Surv. 2022, 77, 99–163. [Google Scholar] [CrossRef]
- Sharma, U.; Duits, R. Left-invariant evolutions of wavelet transforms on the similitude group. Appl. Comput. Harmon. Anal. 2015, 39, 110–137. [Google Scholar] [CrossRef]
- Agrachev, A.; Barilari, D.; Boscain, U. A Comprehensive Introduction to Sub-Riemannian Geometry; Cambridge University Press: Cambridge, UK, 2019; 745p. [Google Scholar]
- Agrachev, A.A.; Sachkov, Y.L. Control Theory from the Geometric Viewpoint; Springer: Berlin/Heidelberg, Germany, 2004. [Google Scholar]
- Zelikin, M.I. Optimal Control and Variational Calculus; Editorial URSS: Moscow, Russia, 2004. (In Russian) [Google Scholar]
- Kirillov, A.A. Lectures on the Orbit Method; AMS: Providence, RI, USA, 2004; 408p. [Google Scholar]
- Field, D.J.; Hayes, A.; Hess, R. Contour integration by the human visual system: Evidence for a local “association field”. Vis. Res. 1993, 33, 173–193. [Google Scholar] [CrossRef]
- Dakin, S.C.; Hess, R.F. Contour integration and scale combination processes in visual edge detection. Spat. Vis. 1999, 12, 309–327. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).