Article

A Cortical-Inspired Sub-Riemannian Model for Poggendorff-Type Visual Illusions

by Emre Baspinar 1,*,†, Luca Calatroni 2,†, Valentina Franceschi 3,† and Dario Prandi 4,†

1 INRIA Sophia Antipolis Méditerranée, MathNeuro, 06902 Sophia Antipolis, France
2 CNRS, UCA, INRIA Sophia Antipolis Méditerranée, Morpheme, I3S, 06902 Sophia Antipolis, France
3 Dipartimento di Matematica Tullio Levi-Civita, Università di Padova, 35131 Padova, Italy
4 CNRS, CentraleSupélec, Laboratoire des Signaux et des Systèmes, Université Paris-Saclay, 91190 Gif-sur-Yvette, France
* Author to whom correspondence should be addressed.
† These authors contributed equally to this work.

J. Imaging 2021, 7(3), 41; https://doi.org/10.3390/jimaging7030041
Submission received: 24 December 2020 / Revised: 27 January 2021 / Accepted: 11 February 2021 / Published: 24 February 2021

Abstract: We consider Wilson-Cowan-type models for the mathematical description of orientation-dependent Poggendorff-like illusions. Our modelling improves on two previously proposed cortical-inspired approaches by embedding the sub-Riemannian heat kernel into the neuronal interaction term, in agreement with the intrinsically anisotropic functional architecture of V1, which is based on both local and lateral connections. For the numerical realisation of both models, we consider standard gradient descent algorithms combined with Fourier-based approaches for the efficient computation of the sub-Laplacian evolution. Our numerical results show that the use of the sub-Riemannian kernel allows us to reproduce numerically visual misperceptions and inpainting-type biases more markedly than the previous approaches.

1. Introduction

The question of how we perceive the world around us has been an intriguing topic since ancient times. One may think, for example, of the philosophical debate around the concept of entelechy, which started with the early studies of the Aristotelian school, or, on the side of phenomenology and its relation to the natural sciences, of the theory initiated by Husserl. A well-known and accepted theory of perception is the one formulated within Gestalt psychology [1,2].
Gestalt psychology is a theory for understanding the principles underlying the configuration of local forms giving rise to a meaningful global perception. Its main idea is that the mind constructs the whole by grouping similar fragments rather than simply summing the fragments as if they were all different. In terms of visual perception, such similar fragments correspond to point stimuli with the same (or very close) values of the same type of feature. As an enlightening example from vision science, we tend to group objects of the same colour in an image and to perceive them as an ensemble rather than as objects with different colours. Many psychophysical studies have attempted to provide quantitative parameters describing the tendencies of the mind in visual perception based on Gestalt psychology. A particularly important one is the pioneering work of Field et al. [3], where the authors proposed a representation, called the association field, that models specific Gestalt principles. Furthermore, they showed that the brain is more likely to perceive together fragments that are similarly oriented and aligned along a curvilinear path than fragments whose orientations change rapidly.
The model for neural activity considered here is a geometrical abstraction of the orientation-sensitive V1 hypercolumnar architecture observed by Hubel and Wiesel [4,5,6]. This abstraction provides a good phenomenological approximation of the V1 neuronal connections existing in the hypercolumnar architecture, as reported by Bosking et al. [7]. In this framework, the projections onto the 2D image plane of the neuronal connections in V1 are identified with the association fields described above, and the neuronal connections are modelled as the horizontal integral curves generated by the model geometry. The projections of such horizontal integral curves were shown to produce a close approximation of the association fields, see Figure 1. For this reason, the approach considered by Citti, Petitot and Sarti and used in this work is referred to as cortical-inspired.
We remark that the presented model for neural activity is a phenomenological model that provides a mathematical understanding of early perceptual mechanisms at the cortical level, starting from the very structure of the receptive profiles. Nevertheless, it has proven very useful for many image-processing applications, see, for example, [9,10].
In this work, we follow this approach for a better understanding of the perceptual biases due to visual distortions, often referred to as visual illusions. Visual illusions are mismatches between reality and its visual perception. They result either from neural conditioning introduced by external agents such as drugs, microorganisms and tumours [11,12], or from self-inducing mechanisms that evoke visual distortions through normal neural functionality applied to a specific stimulus [13,14]. The latter type of illusion is due to the effects of neurological and biological constraints on the visual system [15].
In this work, we focus on illusions induced by contrast induction and orientation misalignments, with a particular focus on the well-known Poggendorff illusion and its variations, see Figure 2. This is a geometrical optical illusion [16,17] in which the presence of a central bar induces a misperceived misalignment of an oblique line [18].

1.1. The Functional Architecture of the Primary Visual Cortex

It has been known since the celebrated experiments of Hubel and Wiesel [4,5,6] that neurons (simple cells) in the primary visual cortex (V1) perform boundary (hence orientation) detection and propagate their activations through cortical connectivity, in accordance with the psychophysical results of Field et al. [3]. Hubel and Wiesel showed that simple cells have a spatial arrangement based on so-called hypercolumns in V1. In this arrangement, simple cells that are sensitive to different orientations at the same retinal location are found in the same vertical column constructed on the cortical surface, while adjacent columns contain simple cells that are sensitive to nearby retinal positions.
Several models have been proposed to describe the functional architecture of V1 and the neural connectivity within it. Koenderink et al. [19,20] focused on differential-geometric approaches to the study of visual space, modelling the invariance of simple cells with respect to suitable symmetries in terms of a family of Gaussian functions. Hoffman [21,22] provided the basic framework of vision models by interpreting the hypercolumn architecture of V1 as a fibre bundle. Following a similar reasoning, Petitot and Tondut [23] further developed this modelling, providing a new model coherent both with the structure of orientation-sensitive simple cells and with the long-range neural connectivity between V1 simple cells. In their model, they first observed that the orientation selectivity of simple cells induces a contact geometry (associated with the first Heisenberg group) rendered by the fibres of orientations. Moreover, they showed that a specific family of curves, found via a constrained minimisation approach in this contact geometry, fits the aforementioned association fields reported by Field et al. [3]. In [8,24], Citti and Sarti further developed the model of Petitot and Tondut by introducing a group-based approach, which was then refined by Boscain, Gauthier et al. [25,26], see also the monograph [27]. The so-called Citti-Petitot-Sarti (CPS) model exploits the natural sub-Riemannian (sR) structure of the group of rotations and translations $SE(2)$ as the V1 model geometry.
In this framework, simple cells are modelled as points of the three-dimensional space $\mathbb{M} = \mathbb{R}^2 \times P^1$. Here, $P^1$ is the projective line, obtained by identifying antipodal points of $S^1$. The response of simple cells to two-dimensional visual stimuli is encoded by lifting the stimuli to $\mathbb{M}$ via a Gabor wavelet transform. Neural connectivity is then modelled in terms of the horizontal integral curves given by the natural sub-Riemannian structure of $\mathbb{M}$. Activity propagation along neural connections can further be modelled in terms of diffusion and transport processes along these horizontal integral curves.
In recent years, the CPS model has been exploited as a framework for several cortical-inspired image processing problems by various researchers. We mention the large corpus of literature by Duits et al., see, for example, [28,29,30] and the state-of-the-art image inpainting and image recognition algorithms developed by Boscain, Gauthier, et al. [9,31]. Some extensions of the CPS model geometry and its applications to other image processing problems can be found in [32,33,34,35,36,37,38,39].

1.2. Mean-Field Neural Dynamics & Visual Illusions

Understanding neural behaviour is in general a very challenging task. Reliable responses to stimuli are typically measured at the level of population assemblies comprising a large number of coupled cells. This motivates the reduction, whenever possible, of the dynamics of a neuronal population to a neuronal mean-field model, which describes the large-scale dynamics of the population as the number of neurons goes to infinity. These mean-field models, inspired by the pioneering works of Wilson and Cowan [40,41] and Amari [42], are low-dimensional in comparison with their counterparts based on large-scale population networks, yet they capture the same dynamics underlying the population behaviour.
In the framework of the CPS model for V1 discussed above, several mathematical models have been proposed to describe the neural activity propagation favouring the creation of visual illusions, including Poggendorff-type illusions. In [37], for instance, illusions are identified with suitable strain tensors, responsible for the perceived displacement from the grey levels of the original image. In [43], illusory patterns are identified with a suitable modulation of the geometry of $SE(2) = \mathbb{R}^2 \times S^1$ and are computed as the associated geodesics via the fast-marching algorithm.
In [44,45,46], a variant of the Wilson-Cowan (WC) model based on a variational principle and adapted to the geometry of $\mathbb{M}$ was employed to model the neuronal activity and generate illusory patterns for different illusion types. The modelling considered in these works is strongly inspired by the integro-differential model first studied in [47] for perception-inspired Local Histogram Equalisation (LHE) techniques and later applied in a series of works, see, for example, [48,49], to the study of contrast and assimilation phenomena. By further incorporating cortical-inspired modelling, the authors showed in [44,45,46] that cortical LHE models are able to replicate visual misperceptions induced not only by local contrast changes, but also by orientation-induced biases similar to the ones in Figure 2. Interestingly, the cortical LHE model [44,45,46] was further shown to outperform both standard and cortical-inspired WC models and was rigorously shown to correspond to the minimisation of a variational energy, which suggests more efficient representation properties [50,51]. One major limitation of the modelling considered in these works is the use of neuronal interaction kernels (essentially, isotropic 3D Gaussians), which are not compatible with the natural sub-Riemannian structure of V1 proposed in the CPS model.

1.3. Main Contributions

In this work, we encode the sub-Riemannian structure of V1 into both the WC and the LHE models by using a sub-Laplacian evolution associated with the geometry of the space $\mathbb{M}$ described in Section 1.1. Similarly to [44,45,46], the lifting procedure associating a cortical response in $\mathbb{M}$ with a given two-dimensional image is performed by means of all-scale cake wavelets, introduced in [52,53]. A suitable gradient-descent algorithm is applied to compute the stationary states of the neural models.
Within this framework, we study the family of Poggendorff visual illusions induced by the local contrast and orientation alignment of the objects in the input image. In particular, we aim to reproduce such illusions with the proposed models in a way that is qualitatively consistent with the psychophysical experience.
Our findings show that it is possible to reproduce Poggendorff-type illusions with both the sR cortical-inspired WC model and the sR cortical-inspired LHE model. Compared with the results in [44,45], where the cortical WC model endowed with a Riemannian (isotropic) 3D kernel was shown to fail to reproduce Poggendorff-type illusions, this shows that incorporating the natural sub-Laplacian evolution into the computation of the flows improves the capability of these cortical-inspired models to reproduce orientation-dependent visual illusions.

2. Cortical-Inspired Modelling

In this section, we recall the fundamental features of the CPS model. The theoretical criterion underpinning the model relies on the so-called neurogeometrical approach introduced in [8,23,54]. According to this approach, the functional architecture of V1 is described by a geometrical structure inspired by the neural connectivity in V1.

2.1. Receptive Profiles

A simple cell is characterised by its receptive field, which is defined as the domain of the retina to which the simple cell is sensitive. Once a receptive field is stimulated, the corresponding retinal cells generate spikes which are transmitted to V1 simple cells via retino-geniculo-cortical paths.
The response function of each simple cell to a spike is called the receptive profile (RP) and is denoted by $\psi_{(\zeta,\theta)}: Q \to \mathbb{C}$. It is basically the impulse response function of a V1 simple cell. Conceptually, it is the measurement of the response of the corresponding V1 simple cell to a stimulus at a point $\zeta = (x,y) \in Q$. (Note that we omit the coordinate maps between the image plane and the retinal surface, and the retinocortical map from the retinal surface to the cortical surface. In other words, we assume that the image plane and the retinal surface are identical and denote both by $Q \subset \mathbb{R}^2$.)
In this study, we assume the response of simple cells to be linear. That is, for a given visual stimulus $f: Q \to \mathbb{R}$, we assume the response of the simple cell at V1 coordinates $(\zeta,\theta)$ to be
$$a_0(\zeta,\theta) = \langle f, \psi_{(\zeta,\theta)} \rangle_{L^2(Q)} = \int_Q \psi_{(\zeta,\theta)}(u)\, f(u)\, du. \qquad (1)$$
This procedure defines the cortical stimulus $a_0: \mathbb{M} \to \mathbb{C}$ associated with the image f. We note that receptive field models consisting of cascades of linear filters and static non-linearities, although not perfect, may be more adequate to account for responses to stimuli [20,55,56]. Several mechanisms, such as response normalisation, gain control, cross-orientation suppression or intra-cortical modulation, might intervene to radically change the shape of the profile. Therefore, the above static and linear model for the receptive profiles should be considered as a first approximation of the complex behaviour of a real dynamic receptive profile, which cannot be perfectly described by static wavelet frames.
Regarding the form of the RP, in [8] a simplified basis of Gabor functions was proposed as a good candidate for modelling the position-orientation sensitive receptive profiles, for neuro-physiological reasons [57,58]. This basis has since been extended to take into account additional features such as scale [54], velocity [33] and frequency-phase [39]. On the other hand, Duits et al. [53] proposed so-called cake kernels as a good alternative to Gabor functions, and showed that cake kernels are adequate for obtaining the simple cell output responses used to perform certain image processing tasks, such as image enhancement and completion based on sR diffusion processes.
In this study, we employ cake kernels as models of the position-orientation RPs providing the initial simple cell output responses to an input image, and we use the V1 model geometry $\mathbb{M}$ to represent these responses. We model the activity propagation along the neural connectivity by combining a diffusion process based on the natural sub-Laplacian with a Wilson-Cowan-type integro-differential system.
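To fix ideas, the following minimal sketch implements a Gabor-type receptive profile of the kind discussed above [8,57,58] together with the linear response (1). It is only illustrative: the experiments below use the cake kernels of [53], whose Fourier-based construction is not reproduced here, and the parameter values (`sigma`, `freq`, patch size) are arbitrary choices, not taken from the paper.

```python
import numpy as np

def gabor_rp(center, theta, size=32, sigma=4.0, freq=0.15):
    """Gabor-type receptive profile psi_{(zeta,theta)} on a size x size patch.

    `sigma` (envelope width) and `freq` (spatial frequency) are illustrative
    values; the experiments in the paper use cake kernels [53] instead.
    """
    x0, y0 = center
    y, x = np.mgrid[0:size, 0:size].astype(float)
    # Coordinates rotated by theta: the filter oscillates across the preferred direction.
    xr = (x - x0) * np.cos(theta) + (y - y0) * np.sin(theta)
    yr = -(x - x0) * np.sin(theta) + (y - y0) * np.cos(theta)
    envelope = np.exp(-(xr ** 2 + yr ** 2) / (2 * sigma ** 2))
    carrier = np.exp(1j * 2 * np.pi * freq * yr)  # complex-valued, as psi : Q -> C
    return envelope * carrier

def linear_response(f, psi):
    """Discrete counterpart of the linear simple-cell response (1)."""
    return np.sum(psi * f)

# Example: response of a vertically tuned cell to a random test patch.
rng = np.random.default_rng(0)
patch = rng.random((32, 32))
psi = gabor_rp(center=(16, 16), theta=np.pi / 2)
print(linear_response(patch, psi))
```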

2.2. Horizontal Connectivity and Sub-Riemannian Diffusion

Neurons in V1 present two types of connections: local and lateral. Local connections connect neurons belonging to the same hypercolumn. On the other hand, lateral connections account for the connectivity between neurons belonging to different hypercolumns, but aligned along a specific direction. In the CPS model these are represented (This expression does not yield smooth vector fields on $\mathbb{M}$: indeed, $X_1(\zeta,0) = -X_1(\zeta,\pi)$, even though 0 and $\pi$ are identified in $P^1$. Although in the present application this difference is inconsequential, since we are only interested in the direction (which is smooth) and not in the orientation, the problem can be solved by defining $X_1$ in an appropriate atlas for $\mathbb{M}$ [25].) by the vector fields
$$X_1 = \cos\theta\, \partial_x + \sin\theta\, \partial_y, \qquad X_2 = \partial_\theta. \qquad (2)$$
This observation leads to modelling the dynamics of the neuronal excitation $\{Z_t\}_{t \geq 0}$ starting from a neuron $(\zeta,\theta)$ via the following stochastic differential equation
$$dZ_t = X_1\, du_t + X_2\, dv_t, \qquad Z_0 = (\zeta,\theta), \qquad (3)$$
where $u_t$ and $v_t$ are two independent one-dimensional Wiener processes. As a consequence, in [25] the cortical stimulus $a_0$ induced by a visual stimulus $f_0$ is assumed to evolve according to the Fokker-Planck equation
$$\partial_t \psi = L\psi, \qquad L = X_1^2 + \beta^2 X_2^2. \qquad (4)$$
Here, β > 0 is a constant encoding the unit coherency between the spatial and orientation dimensions.
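For illustration, the stochastic dynamics (3) can be simulated with a simple Euler-Maruyama scheme, as in the sketch below. The step size, the horizon and the way the weight β of (4) is attached to the angular noise are illustrative assumptions; constant factors relating (3) and (4) are ignored here.

```python
import numpy as np

def simulate_horizontal_paths(zeta0=(0.0, 0.0), theta0=0.0, beta=1.0,
                              n_paths=1000, n_steps=500, dt=1e-3, seed=0):
    """Euler-Maruyama sketch of (3): dZ_t = X_1 du_t + X_2 dv_t, Z_0 = (zeta, theta).

    X_1 moves the spatial point along (cos theta, sin theta); X_2 perturbs the
    orientation. The angular noise is scaled by `beta` to mirror the weighting
    in (4); all numerical values are illustrative.
    """
    rng = np.random.default_rng(seed)
    x = np.full(n_paths, zeta0[0])
    y = np.full(n_paths, zeta0[1])
    th = np.full(n_paths, theta0)
    sqdt = np.sqrt(dt)
    for _ in range(n_steps):
        du = sqdt * rng.standard_normal(n_paths)  # increments of the Wiener process u_t
        dv = sqdt * rng.standard_normal(n_paths)  # increments of the Wiener process v_t
        x += np.cos(th) * du
        y += np.sin(th) * du
        th = (th + beta * dv) % np.pi             # orientations live in P^1 = [0, pi)
    return x, y, th
```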
The operator L is the sub-Laplacian associated with the sub-Riemannian structure on $\mathbb{M}$ with orthonormal frame $\{X_1, X_2\}$, as presented in [8,25]. It is worth mentioning that this operator is not elliptic, since $\{X_1, X_2\}$ is not a basis for $T\mathbb{M}$. However, $\mathrm{span}\{X_1, X_2, [X_1,X_2]\} = T\mathbb{M}$. Hence, $\{X_1, X_2\}$ satisfies the Hörmander condition and L is a hypoelliptic operator [59], which models the activity propagation between neurons in V1 as a diffusion concentrated in a neighbourhood of the (horizontal) integral curves of $X_1$ and $X_2$.
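Indeed, a direct computation of the commutator gives
$$[X_1, X_2] = X_1 X_2 - X_2 X_1 = \sin\theta\, \partial_x - \cos\theta\, \partial_y,$$
so that, at every point of $\mathbb{M}$, the vectors $X_1$, $X_2$ and $[X_1,X_2]$ are linearly independent and span the whole tangent space.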
A direct consequence of hypoellipticity is the existence of a smooth kernel for (4). That is, there exists a function $(t,\xi,\nu) \in \mathbb{R}_+ \times \mathbb{M} \times \mathbb{M} \mapsto k_t(\xi,\nu)$ such that the solution of (4) with initial datum $a_0$ reads
$$\psi(t,\xi) = e^{tL} a_0(\xi) = \int_{\mathbb{M}} k_t(\xi,\nu)\, a_0(\nu)\, d\nu. \qquad (5)$$
An analytic expression for k t can be derived in terms of Mathieu functions [10,29]. This expression is however cumbersome to manipulate, and it is usually more efficient to resort to different schemes for the numerical implementation of (4), see, for example, Section 4.

2.3. Reconstruction on the Retinal Plane

Activity propagation evolves the lifted visual stimulus in time. In order to obtain a meaningful result, represented on the two-dimensional image plane, we have to project the evolved lifted image back onto the image plane. We achieve this by using the projection given by
$$f(\zeta,T) = \int_0^{\pi} a(\zeta,\theta,T)\, d\theta, \qquad (6)$$
where $f: \mathbb{R}^2 \times (0,T] \to \mathbb{R}$ and $0 < T < \infty$ denote the processed image and the final time of the evolution, respectively. One easily checks that this formula yields $f(\cdot,0) = f_0$ under the assumption
$$\int_0^{\pi} \psi_{(\zeta,\theta)}(u)\, d\theta = 1. \qquad (7)$$

3. Describing Neuronal Activity via Wilson-Cowan-Type Models

In neurophysiological experiments, reliable neural responses to visual stimuli are generally observed at the level of neuronal populations: the information processing and the response produced are obtained by integrating the individual dynamics of the neurons interacting within the population. Modelling neuronal populations can be done via coupled differential systems (networks) consisting of a large number of equations, whose average behaviour can in principle be used to represent the population behaviour. This, however, requires high computational power and challenging analytical approaches due to the high dimension of the network. A different, mesoscopic approach consists in considering the average network behaviour as the number of neurons in the network tends to infinity. The asymptotic limit of the network can then be written in terms of the probability distribution (density) of the state variables. This asymptotic limit is the so-called mean-field limit. It has been successfully used as a reference framework in several papers, see, for example, [60,61,62], and it is also the approach considered in this work.

3.1. Wilson-Cowan (WC) Model

Let $a(\zeta,\theta,t)$ denote the evolving activity of the neuronal population located at $\zeta \in \mathbb{R}^2$ and sensitive to the orientation $\theta \in P^1$ at time $t \in (0,T]$. Using the shorthand notation $\xi = (\zeta,\theta),\ \eta = (\nu,\phi) \in \mathbb{M}$, the Wilson-Cowan (WC) model on $Q \subset \mathbb{R}^2$ can be written as follows:
$$\partial_t a(\xi,t) = -(1+\lambda)\, a(\xi,t) + \frac{1}{2M} \int_{Q \times [0,\pi)} \omega_\xi(\eta)\, \sigma\big(a(\eta,t)\big)\, d\eta + \lambda\, a_0(\xi) + \mu(\xi). \qquad (8)$$
Here, $\mu: Q \times [0,\pi) \to \mathbb{R}$ is a smoothed version of the simple cell output response $a_0$, obtained via a Gaussian filtering, while $\lambda > 0$ and $M > 0$ are fixed positive constants. Following the standard formulation of WC models studied, for example, in [60,63], the role of the time-independent external stimulus $h: Q \times [0,\pi) \to \mathbb{R}$ is played here by $h(\xi) := \lambda\, a_0(\xi) + \mu(\xi)$, while the model parameters can be set as $\beta := 1+\lambda$ and $\nu := 1/(2M)$. The function $\sigma: \mathbb{R} \to [-1,1]$ is a nonlinear saturation function, which we choose as the sigmoid:
$$\sigma(r) := \min\big\{1,\, \max\big(\alpha\,(r - 1/2),\, -1\big)\big\}, \qquad \alpha > 1. \qquad (9)$$
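For reference, (9) and the shifted non-linearity $\hat\sigma$ used later in Section 3.2 take the following simple form in code (a minimal sketch; the slope α is set experiment by experiment in Section 5, and the default value below is arbitrary):

```python
import numpy as np

def sigma(r, alpha=5.0):
    """Saturation (9): sigma(r) = min(1, max(alpha * (r - 1/2), -1))."""
    return np.clip(alpha * (r - 0.5), -1.0, 1.0)

def sigma_hat(r, alpha=5.0):
    """Shifted non-linearity of Section 3.2: sigma_hat(r) = sigma(r + 1/2)."""
    return np.clip(alpha * r, -1.0, 1.0)
```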
The connectivity kernel $\omega_\xi$ models the interaction between neurons in $\mathbb{M}$. Its definition should thus take into account the different types of interactions happening between connected neurons in V1; for example, it should model at the same time both local and lateral connections via the sub-Riemannian diffusion described in Section 2.2.
In [44,45] the authors showed that (8) does not arise from a variational principle. That is, there exists no energy functional $E: L^2(\mathbb{M}) \to \mathbb{R}$ such that (8) can be recast as the problem
$$\partial_t a(\xi,t) = -\nabla E\big(a(\xi,t)\big), \qquad a(\xi,0) = a_0 = L f_0. \qquad (10)$$
Under such a formulation, stationary states $a^*$ would correspond to (local) minima of E.
The interest of considering an evolution model that follows a variational principle in the sense of (10) lies in its connection with the optimisation-based approaches considered in [64], where the efficient coding problem is described as an energy minimisation problem involving natural image statistics and biological constraints that force the final solution to show the least possible redundancy. Under this interpretation, the non-variational model (8) is suboptimal in reducing redundant information in visual stimuli, see Section 2.1 in [44] for more details.

3.2. Local Histogram Equalisation (LHE) Model

In order to build a model which complies with the efficient neural coding described above, the authors of [44,45] showed that (8) can be transformed into a variational problem by replacing the term $\sigma(a(\eta,t))$ with $\hat\sigma(a(\xi,t) - a(\eta,t))$ for a suitable choice of the nonlinear sigmoid function $\hat\sigma$, thus enforcing non-linear activations on local contrast rather than on local activity. The corresponding model reads:
$$\partial_t a(\xi,t) = -(1+\lambda)\, a(\xi,t) + \frac{1}{2M} \int_{Q \times [0,\pi)} \omega_\xi(\eta)\, \hat\sigma\big(a(\xi,t) - a(\eta,t)\big)\, d\eta + \lambda\, a_0(\xi) + \mu(\xi), \qquad (11)$$
where $\hat\sigma(r) := \sigma(r + 1/2)$, with $\sigma$ as in (9). This model was first introduced in [47] as a variational reformulation of the Local Histogram Equalisation (LHE) procedure for RGB images. The corresponding energy $E: L^2(\mathbb{M}) \to \mathbb{R}$ for which (10) holds is:
$$E(a) = \frac{\lambda}{2} \int_{Q \times [0,\pi)} |a(\xi) - a_0(\xi)|^2\, d\xi + \frac{1}{2} \int_{Q \times [0,\pi)} |a(\xi) - \mu(\xi)|^2\, d\xi - \frac{1}{2M} \int_{Q \times [0,\pi)} \int_{Q \times [0,\pi)} \omega_\xi(\eta)\, \Sigma\big(a(\xi) - a(\eta)\big)\, d\xi\, d\eta, \qquad (12)$$
where $\Sigma: \mathbb{R} \to \mathbb{R}$ is any (even) primitive function of $\hat\sigma$.
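As a concrete example, for the piecewise-linear choice (9) one admissible even primitive of $\hat\sigma$ is
$$\Sigma(r) = \begin{cases} \dfrac{\alpha}{2}\, r^2, & |r| \le 1/\alpha, \\[4pt] |r| - \dfrac{1}{2\alpha}, & |r| > 1/\alpha, \end{cases}$$
which indeed satisfies $\Sigma' = \hat\sigma$ and $\Sigma(-r) = \Sigma(r)$.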
As is clear from (12), the local histogram equalisation properties of the model are due to the activation averaging, which is localised by the kernel $\omega_\xi$; the kernel should thus be adapted to the natural geometry of $\mathbb{M}$ (see Section 3.3 for a more detailed discussion).

3.3. A Sub-Riemannian Choice of the Interaction Kernel $\omega_\xi$

In (8) and (11), the geometric structure of the underlying space $\mathbb{M}$ is captured by the connectivity kernel $\omega_\xi$, which characterises the activity propagation along neural connections in V1. In [44,45], simple 3-dimensional Gaussian-type kernels were considered. This choice was shown to be good enough to reproduce a large number of contrast- and orientation-dependent Poggendorff-like illusions via the LHE model (11), but not via the WC model (8).
Here, motivated by the discussion in Section 2.2, we study the effect of a more natural choice for the interaction kernel $\omega_\xi$, which we set as $\omega_\xi(\eta) = k_\tau(\xi,\eta)$, where $k_\tau: \mathbb{M} \times \mathbb{M} \to \mathbb{R}$ is the sub-Riemannian heat kernel evaluated at time $\tau > 0$. Indeed, 3-dimensional isotropic Gaussian kernels, which are obtained via the Euclidean heat equation, are not coherent with the intrinsically anisotropic neuronal connectivity structure of V1. Recalling (5), this choice of $\omega_\xi$ allows us to rewrite the WC Equation (8) as
$$\partial_t a(\xi,t) = -(1+\lambda)\, a(\xi,t) + \frac{1}{2M}\, e^{\tau L}\big[\sigma\big(a(\cdot,t)\big)\big](\xi) + \lambda\, a_0(\xi) + \mu(\xi). \qquad (13)$$
From now on, we will refer to (13) as model (sR-WC).
Using this formulation, the evaluation of the interaction term at a point $(\xi,t) \in \mathbb{M} \times (0,T]$ can be done by solving the sub-Riemannian heat equation and letting it evolve for a certain inner time $\tau > 0$. This avoids dealing directly with the explicit expression of $k_\tau$, whose numerical implementation is very delicate, as explained, for example, in [10].
A similar simplification is not readily available for the LHE Equation (11), due to the dependence on $\xi$ of the integrand. In this setting, we follow the discussion in [47] and replace the non-linearity $\hat\sigma$ by a polynomial approximation of sufficiently large order n. Namely, we look for a polynomial approximation of $\hat\sigma$ of the form $\hat\sigma(r) \approx c_0 + c_1 r + \ldots + c_n r^n$, which allows us to write
$$\hat\sigma\big(a(\xi,t) - a(\eta,t)\big) \approx \sum_{i=0}^{n} \underbrace{\Big( \sum_{j=i}^{n} (-1)^{i}\, c_j \binom{j}{i}\, a^{j-i}(\xi,t) \Big)}_{=:\, C_i(\xi,t)}\, a^{i}(\eta,t) = \sum_{i=0}^{n} C_i(\xi,t)\, a^{i}(\eta,t). \qquad (14)$$
This allows us to approximate the interaction term in (11) as
$$\int_{Q \times [0,\pi)} k_\tau(\xi,\eta)\, \hat\sigma\big(a(\xi,t) - a(\eta,t)\big)\, d\eta \approx \sum_{i=0}^{n} C_i(\xi,t) \int_{Q \times [0,\pi)} k_\tau(\xi,\eta)\, a^{i}(\eta,t)\, d\eta = \sum_{i=0}^{n} C_i(\xi,t)\, e^{\tau L}\big[a^{i}(\cdot,t)\big](\xi). \qquad (15)$$
Finally, the resulting (approximated) sub-Riemannian LHE equation reads:
$$\partial_t a(\xi,t) = -(1+\lambda)\, a(\xi,t) + \frac{1}{2M} \sum_{i=0}^{n} C_i(\xi,t)\, e^{\tau L}\big[a^{i}(\cdot,t)\big](\xi) + \lambda\, a_0(\xi) + \mu(\xi). \qquad (16)$$
From now on, we will refer to (16) as model (sR-LHE).
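As a sanity check of the regrouping in (14), the following sketch computes the coefficients $C_i$ for a given polynomial approximation $c_0,\ldots,c_n$ of $\hat\sigma$ and verifies that the regrouped sum coincides with the direct evaluation of the polynomial in $a(\xi)-a(\eta)$. The numerical values of the $c_j$ below are arbitrary placeholders; in practice they would come, for instance, from a least-squares fit of $\hat\sigma$, which is not shown here.

```python
import numpy as np
from math import comb

def regroup_coefficients(c, a_xi):
    """Coefficients of (14): C_i = sum_{j=i}^{n} (-1)^i c_j binom(j, i) a(xi)^(j-i)."""
    n = len(c) - 1
    return np.array([sum((-1) ** i * c[j] * comb(j, i) * a_xi ** (j - i)
                         for j in range(i, n + 1)) for i in range(n + 1)])

# Verify (14) on arbitrary values: both sides evaluate the same polynomial.
c = [0.0, 1.0, 0.0, -0.3]          # placeholder polynomial coefficients for sigma_hat
a_xi, a_eta = 0.7, 0.2
C = regroup_coefficients(c, a_xi)
direct = sum(c[j] * (a_xi - a_eta) ** j for j in range(len(c)))
regrouped = sum(C[i] * a_eta ** i for i in range(len(c)))
assert np.isclose(direct, regrouped)
```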

4. Discrete Modelling and Numerical Realisation

In this section, we report a detailed description of how models (sR-WC) and (sR-LHE) can be formulated in a fully discrete setting, providing, in particular, some insights on how the sub-Riemannian evolution can be realised. We further add a self-contained subsection on the gradient-descent algorithm used to perform the numerical experiments reported in Section 5; for more details, see [44,46].

4.1. Discrete Modelling and Lifting Procedure via Cake Wavelets

First, the sub-Riemannian diffusion $e^{\tau L}$ is discretised up to a final time $\tau = m\,\Delta\tau$, where m and $\Delta\tau$ denote the number of iterations and the time-step, respectively. For $N \in \mathbb{N}_+$ and $\Delta x, \Delta y \in \mathbb{R}_+$ denoting the spatial sampling sizes, we then discretise the given grey-scale image $f_0$ associated with the retinal stimulus on a uniform square spatial grid $Q := \{(x_i,y_j) = (i\Delta x, j\Delta y):\ i,j = 1,2,\ldots,N\} \subset \mathbb{R}^2$ and denote, for each $i,j = 1,2,\ldots,N$, the brightness value at the point $\zeta_{i,j} := (x_i,y_j) \in Q$ by
$$F_0[i,j] = f_0(x_i, y_j) = f_0(\zeta_{i,j}). \qquad (17)$$
As far as the orientation sampling is concerned, we use a uniform orientation grid with points $\Theta := \{\theta_k := k\Delta\theta,\ k = 1,\ldots,K\}$, $K \in \mathbb{N}_+$ and $\Delta\theta = \pi/K$. We can then define the discrete version of the simple cell response $a_0(x_i,y_j,\theta_k)$ to the visual stimulus located at $\zeta_{i,j} \in Q$ with local orientation $\theta_k \in \Theta$ at time $t = 0$ of the evolution as
$$A_0[i,j,k] = a(x_i,y_j,\theta_k,0) = a(\zeta_{i,j},\theta_k,0) = (L f_0)_{i,j,k}, \qquad (18)$$
where L denotes the lifting operator, mapping images defined on Q to functions defined on $Q \times \Theta$, to be defined below.
To do so, we consider in the following the image lifting procedure based on cake kernels introduced in [53] and used, for example, in [32,44,46]. We write the cake kernel centered at ζ i , j and rotated by θ k as
$$\Psi[i,j,k][\ell,m] = \psi_{(\zeta_{i,j},\theta_k)}(x_\ell, y_m), \qquad (19)$$
where $\ell, m \in \{1,2,\ldots,N\}$. We can then write the lifting operation applied to the initial image $f_0$, for all $\zeta_{i,j} \in Q$ and $\theta_k \in \Theta$, as:
$$(L f_0)_{i,j,k} = A_0[i,j,k] = \sum_{\ell,m} \Psi[i,j,k][\ell,m]\, F_0[\ell,m]. \qquad (20)$$
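A minimal sketch of the discrete lifting (20) is given below, where each of the K correlations is computed in the Fourier domain. The list `kernels` stands for any orientation-selective filter bank of the same size as the image (the cake wavelets actually used here are built as in [53], a construction not reproduced in this sketch), and the centring convention of the kernels is left implicit.

```python
import numpy as np

def lift(f0, kernels):
    """Discrete lifting (20): A_0[i, j, k] = sum_{l, m} Psi[i, j, k][l, m] F_0[l, m].

    `kernels` is a list of K filters of the same shape as `f0`; each
    correlation is computed via the FFT, which evaluates the sum in (20)
    for every spatial location at once (up to the usual fftshift centring).
    """
    F0 = np.fft.fft2(f0)
    A0 = np.empty(f0.shape + (len(kernels),), dtype=complex)
    for k, psi in enumerate(kernels):
        A0[:, :, k] = np.fft.ifft2(F0 * np.conj(np.fft.fft2(psi)))
    return A0
```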
Finally, for $P \in \mathbb{N}_+$ we consider a time discretisation of the interval (0,T] at the time nodes $\mathcal{T} := \{t_p := p\Delta t,\ p = 1,\ldots,P\}$, with $\Delta t := T/P$.
The resulting fully discretised neuronal activation at $\zeta_{i,j} = (x_i,y_j) \in Q$, $\theta_k \in \Theta$ and $t_p \in \mathcal{T}$ will thus be denoted by:
$$A_p[i,j,k] = a(\zeta_{i,j},\theta_k,t_p). \qquad (21)$$

4.2. Sub-Riemannian Heat Diffusion

Let $g: \mathbb{M} \to \mathbb{R}$ be a given cortical stimulus, and set $G[i,j,k] = g(\zeta_{i,j},\theta_k)$. In this section we describe how to compute
$$\exp_\tau G[i,j,k] \simeq e^{\tau L} g(\zeta_{i,j},\theta_k). \qquad (22)$$
The main difficulty here is due to the degeneracy arising from the anisotropy of the sub-Laplacian. Indeed, expanding the computations in (4), we have
$$L = \nabla^{T} A\, \nabla, \qquad \nabla = \begin{pmatrix} \partial_x \\ \partial_y \\ \partial_\theta \end{pmatrix}, \qquad A = \begin{pmatrix} \cos^2\theta & \cos\theta\sin\theta & 0 \\ \cos\theta\sin\theta & \sin^2\theta & 0 \\ 0 & 0 & \beta^2 \end{pmatrix}. \qquad (23)$$
In particular, it is straightforward to deduce that the eigenvalues of A are $(0, \beta^2, 1)$.
The discretisation of such anisotropic operators can be done in several ways, see, for example, [29,30,39,65]. In our implementation, we follow the method presented in [26], which is tailored around the group structure of SE(2), the double cover of $\mathbb{M}$, and is based on the non-commutative Fourier transform, see also [9].
For the following discussion, it is convenient to assume $\Delta x = \Delta y = 1/N$ and $\Delta\theta = \pi/K$. The "semi-discretised" sub-Laplacian $L_K$ can then be defined by
$$Lg \approx L_K G := D^2 G + \Lambda_K G, \qquad (24)$$
where $\Lambda_K$ denotes the central-difference operator discretising the second derivative along the $\theta$ direction, that is, the operator
$$\partial_\theta^2 G[i,j,k] \approx \Lambda_K G[i,j,k] = \frac{g(\zeta_{i,j},\theta_{k-1}) - 2\, g(\zeta_{i,j},\theta_k) + g(\zeta_{i,j},\theta_{k+1})}{(\Delta\theta)^2}. \qquad (25)$$
The operator D is the diagonal operator defined by
$$D\, G[i,j,k] = \big(\cos(k\Delta\theta)\, \partial_x + \sin(k\Delta\theta)\, \partial_y\big)\, g(\zeta_{i,j},\theta_k). \qquad (26)$$
The full discretisation is then achieved by discretising the spatial derivatives as
$$\partial_x G[i,j,k] \approx \frac{N}{2}\big(g(\zeta_{i+1,j},\theta_k) - g(\zeta_{i-1,j},\theta_k)\big), \qquad (27)$$
$$\partial_y G[i,j,k] \approx \frac{N}{2}\big(g(\zeta_{i,j+1},\theta_k) - g(\zeta_{i,j-1},\theta_k)\big). \qquad (28)$$
Using the discretisation $L_K$ of L defined in (24), we now resort to Fourier methods to compute efficiently the solution of the sub-Riemannian heat equation
$$\partial_t \psi = L_K \psi, \qquad \psi|_{t=0} = G. \qquad (29)$$
In particular, let $\hat G[r,s,k]$ be the discrete Fourier transform (DFT) of G with respect to the variables i, j, i.e.,
$$\hat G[r,s,k] = \frac{1}{N} \sum_{i,j=1}^{N} G[i,j,k]\, e^{-\iota \frac{2\pi}{N}\left((r-1)(i-1) + (s-1)(j-1)\right)}. \qquad (30)$$
A straightforward computation shows that
$$\widehat{D G}[r,s,k] = \iota N\, d[r,s,k]\, \hat G[r,s,k], \qquad d[r,s,k] := \cos(k\Delta\theta)\, \sin\Big(\frac{2\pi r}{N}\Big) + \sin(k\Delta\theta)\, \sin\Big(\frac{2\pi s}{N}\Big). \qquad (31)$$
Hence, (29) is mapped by the discrete Fourier transform (DFT) to the following completely decoupled system of $N^2$ ordinary linear differential equations on $\mathbb{C}^K$:
$$\frac{d}{dt}\, \Psi_t[r,s,\cdot] = \Big(\Lambda_K - N^2\, \mathrm{diag}_k\big(d[r,s,k]^2\big)\Big)\, \Psi_t[r,s,\cdot], \qquad \Psi_0[r,s,k] = \hat G[r,s,k], \qquad r,s \in \{1,\ldots,N\}, \qquad (32)$$
which can be solved efficiently through a variety of standard numerical schemes. We chose the semi-implicit Crank-Nicolson method [66] for its good stability properties. Let us remark that the operators on the right-hand side of the above equations are periodic tridiagonal matrices, that is, tridiagonal matrices with additional non-zero entries at positions (1,K) and (K,1). Thus, the linear system appearing at each step of the Crank-Nicolson method can be solved in linear time with respect to K via a variation of the Thomas algorithm.
The desired solution $\exp_\tau G$ can then be recovered by applying the inverse DFT to the solution of (32) at time $\tau$.
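The following sketch assembles the steps above into a routine computing $\exp_\tau G$: a 2D DFT in the spatial variables, a Crank-Nicolson integration of the K × K systems (32), and an inverse DFT. It is only a simple illustration of the scheme, not the released implementation: the K × K systems are solved with a dense solver instead of the periodic Thomas algorithm mentioned above, and the way the coherency weight β of (4) enters the angular operator is an assumption of this sketch.

```python
import numpy as np

def sr_heat(G, tau, beta, dtau=0.01):
    """Fourier-based sketch of exp_tau G, i.e. of e^{tau L} g, following Section 4.2."""
    N, _, K = G.shape
    dth = np.pi / K
    # Periodic second-difference operator along theta, cf. (25), weighted by beta^2.
    Lam = (np.diag(-2.0 * np.ones(K)) + np.diag(np.ones(K - 1), 1)
           + np.diag(np.ones(K - 1), -1))
    Lam[0, -1] = Lam[-1, 0] = 1.0
    Lam *= beta ** 2 / dth ** 2

    Ghat = np.fft.fft2(G, axes=(0, 1))            # spatial DFT, cf. (30)
    r = np.arange(N)[:, None, None]
    s = np.arange(N)[None, :, None]
    k = np.arange(K)[None, None, :]
    # Fourier symbol d[r, s, k] of the spatial operator D, cf. (31).
    d = (np.cos(k * dth) * np.sin(2 * np.pi * r / N)
         + np.sin(k * dth) * np.sin(2 * np.pi * s / N))

    n_steps = max(1, int(round(tau / dtau)))
    I = np.eye(K)
    for rr in range(N):
        for ss in range(N):
            A = Lam - N ** 2 * np.diag(d[rr, ss, :] ** 2)   # operator of (32)
            M_left = I - 0.5 * dtau * A                     # Crank-Nicolson matrices
            M_right = I + 0.5 * dtau * A
            v = Ghat[rr, ss, :]
            for _ in range(n_steps):
                v = np.linalg.solve(M_left, M_right @ v)
            Ghat[rr, ss, :] = v
    return np.real(np.fft.ifft2(Ghat, axes=(0, 1)))
```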

4.3. Discretisation via Gradient Descent

We follow [44,45,47] and discretise both models (sR-WC) and (sR-LHE) via a simple explicit gradient descent scheme. Denoting by $U[i,j,k] = \mu(i\Delta x, j\Delta y, k\Delta\theta)$ the discretised version of the local mean average $\mu(\xi)$ appearing in the models, the time stepping reads, for all $p \geq 1$,
$$A_p[i,j,k] = A_{p-1}[i,j,k] + \Delta t \Big( -(1+\lambda)\, A_{p-1}[i,j,k] + \lambda\, A_0[i,j,k] + U[i,j,k] + \frac{1}{2M}\, S[A_{p-1}][i,j,k] \Big), \qquad (33)$$
where $S[A_{p-1}]$ is defined, depending on the model, by:
$$S[A_{p-1}][i,j,k] = \exp_\tau\big[\sigma(A_{p-1})\big][i,j,k] \qquad \text{or} \qquad S[A_{p-1}][i,j,k] = \sum_{\ell=0}^{n} C_{\ell,p-1}[i,j,k]\, \exp_\tau\big[A_{p-1}^{\ell}\big][i,j,k], \qquad (34)$$
with $C_{\ell,p-1}$ being the discretised version of the coefficient $C_\ell$ in (14) at time $t_{p-1}$.
A sufficient condition on the time-step $\Delta t$ guaranteeing the convergence of the numerical scheme (33) is $\Delta t \leq 1/(1+\lambda)$ (see [47]).
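A single update (33) for the (sR-WC) model then takes the following form, reusing the `sr_heat` and `sigma` sketches above; the placement of the fidelity terms $\lambda A_0 + U$ and of the factor $1/(2M)$ follows the continuous model (13). For (sR-LHE), the interaction term S is replaced by the second expression in (34), i.e. by the sum of the diffused powers of $A_{p-1}$ weighted by the coefficients $C_{\ell,p-1}$.

```python
def sr_wc_step(A_prev, A0, U, lam, M, dt, tau, beta, alpha):
    """One explicit gradient-descent update (33), (sR-WC) variant of (34)."""
    S = sr_heat(sigma(A_prev, alpha), tau, beta)   # interaction term, cf. (34)
    return A_prev + dt * (-(1.0 + lam) * A_prev + lam * A0 + U + S / (2.0 * M))
```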

4.4. Pseudocode

Our algorithmic procedure consists of three main numerical sub-steps. The first one is the lifting of the two-dimensional input image $f_0$ to the space $\mathbb{M}$ via (20). The second one is the Fourier-based procedure described in Section 4.2 to compute the sub-Riemannian diffusion (22), which is used as the kernel describing the neuronal interactions along the horizontal connections. This step is intrinsically linked to the last, iterative, procedure, based on computing the gradient descent updates (33)-(34) describing the evolution of neuronal activity in the cortical framework for both (sR-WC) and (sR-LHE).
We report the simplified pseudo-code in Algorithm 1 below. The Julia package used to produce the following examples is freely available at https://github.com/dprn/srLHE (accessible since 28 December 2020).
Algorithm 1: sR-WC and sR-LHE pseudocode.
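Since the pseudocode figure is not reproduced here, the following hedged Python outline, assembled from the sketches of the previous sections (`lift`, `sr_heat`, `sigma`, `sr_wc_step`), summarises the three sub-steps for the (sR-WC) variant. It is an illustration only, not the released Julia implementation linked above; the parameter defaults loosely echo the values reported for Figure 4b, while `n_iter` and `beta` are arbitrary choices of this sketch.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def run_sr_wc(f0, kernels, lam=0.01, M=1.0, dt=0.1, n_iter=50,
              tau=5.0, beta=1.0, alpha=20.0, sigma_mu=6.5):
    """Outline of Algorithm 1, (sR-WC) variant."""
    A0 = np.real(lift(f0, kernels))                        # 1. lifting (20)
    # Local mean mu: 2D Gaussian filtering of A_0, applied slice-wise in orientation.
    U = gaussian_filter(A0, sigma=(sigma_mu, sigma_mu, 0))
    A = A0.copy()
    for _ in range(n_iter):                                # 2.-3. diffusion + descent (33)
        A = sr_wc_step(A, A0, U, lam, M, dt, tau, beta, alpha)
    return A.sum(axis=2) * (np.pi / A.shape[2])            # projection to the image plane (6)
```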

5. Numerical Experiments

In this section we present the results obtained by applying models (sR-WC) and (sR-LHE) via Algorithm 1 to the two Poggendorff-type illusions reported in Figure 3. Our results are compared with the ones obtained by applying the corresponding 3-dimensional WC and LHE models with a 3D Gaussian kernel, as described in [44,45]. The objective of the following experiments is to understand whether the output produced by applying (sR-WC) and (sR-LHE) to the images in Figure 3 agrees with the illusory effects perceived. Since the quantitative assessment of the strength of these effects is a challenging problem, the outputs of Algorithm 1 have to be evaluated by visual inspection. Namely, for each output, we consider whether the continuation of a fixed black stripe on one side of the central bar connects with a segment on the other side. Differently from inpainting-type problems, we stress that the objective here is to replicate the perceived wrong alignment due to contrast and orientation effects, rather than the collinear continuation, and to investigate when both types of completion can be reproduced.
(a) Testing data: Poggendorff-type illusions. We test the (sR-WC) and (sR-LHE) models on a greyscale version of the Poggendorff illusion in Figure 2 and on its modification reported in Figure 3b, where the background is constituted by a grating pattern; in this case, the perceived bias also depends on the contrast between the central surface and the background lines.
(b) Parameters. The images in Figure 3 have size N × N pixels, with N = 200. The lifting procedure to the space of positions and orientations is obtained by discretising [0,π) into K = 16 orientations (this is in agreement with the standard range of 12-18 orientations typically considered to be relevant in the literature [67,68]). The cake wavelets are then computed following [32], setting the frequency band bw = 5 for all experiments. The scaling parameter β appearing in (4) is set to $\beta = K/(2\sqrt{2}\,N)$ (this parameter adjusts the different spatial and orientation samplings: a single spatial unit corresponds to $\sqrt{2}$ pixel edges, whereas a single orientation unit corresponds to 1 pixel edge), and the parameter M appearing in (13) and (16) is set to M = 1.
The parameters varying from test to test are: the slope α > 0 of the sigmoid functions σ in (9) and $\hat\sigma$, the fidelity weight λ > 0, the variance $\sigma_\mu$ of the 2D Gaussian filtering used to compute the local mean average μ in (sR-WC) and (sR-LHE), the gradient descent time-step Δt, the time step Δτ and the final time τ used to compute the sub-Riemannian heat diffusion $e^{\tau L}$.

5.1. Poggendorff Gratings

In Figure 4, we report the results obtained by applying (sR-WC) to the Poggendorff grating image in Figure 3b. We compare them with the ones obtained by the cortical-inspired WC model considered in [44,45], where the interaction kernel is an isotropic 3D Gaussian, reported in Figure 4a. In Figure 4b, we observe that the sR diffusion encoded in (sR-WC) favours the propagation of the grating throughout the central grey bar, so that the resulting image agrees with our perception of misalignment. We stress that this illusion could not be reproduced via the cortical-inspired isotropic WC model proposed in [44,45]. The use of the appropriate sub-Laplacian diffusion is thus crucial in this example to replicate the illusion.
We further report in Figure 5 the result obtained by applying (sR-LHE) to the same image. We observe that in this case both the (sR-LHE) model and the cortical LHE model introduced in [44,45] reproduce the illusion.
Note that both (sR-WC) and (sR-LHE) further preserve fidelity with respect to the given image outside the target region, which is not the case for the cortical LHE model presented in [44,45].

5.2. Dependence on Parameters: Inpainting vs. Perceptual Completion

The capability of the (sR-LHE) model to reproduce visual misperceptions depends on the chosen parameters. This fact was already observed in [45] for the cortical-inspired LHE model proposed therein, endowed with a standard Gaussian filtering. There, LHE was shown to reproduce illusory phenomena only when the chosen standard deviation of the Gaussian filter was large enough (with respect to the overall size of the image). On the contrary, the LHE model was shown to perform geometrical completion (inpainting) for small values of the standard deviation. Roughly speaking, this corresponds to the fact that perceptual phenomena, such as geometrical optical illusions, can be modelled only when the interaction kernel is wide enough for the information to cross the central grey line. This is in agreement with the psychophysical evidence in [17], where the width of the central missing part of the Poggendorff illusion is shown to be directly correlated with the intensity of the illusion.
In the case under consideration here, the parameter encoding the width of the interaction kernel is the final time τ of the sub-Riemannian diffusion used to model the activity propagation along neural connections. To support this observation, we show in Figure 6 that the completion obtained via (sR-LHE) shifts from a geometrical one (inpainting), when τ is small, to a perceptual one, when τ is sufficiently large.
As far as the (sR-WC) model is concerned, we observed that, despite its improved capability of replicating the Poggendorff gratings, the transition from perceptual completion to inpainting could not be reproduced. In agreement with the efficient representation principle, this supports the idea that visual perceptual phenomena are better encoded by variational models such as (sR-LHE) than by non-variational ones such as (sR-WC).

5.3. Poggendorff Illusion

In Figure 7 we report the results obtained by applying the LHE methods to the standard Poggendorff illusion in Figure 3a. In particular, in Figure 7a we show the result obtained via the LHE method of [44,45], while in Figure 7b we show the result obtained via (sR-LHE), with two close-ups in Figure 7c,d showing the central region normalised to the range [0,1]. As shown by these preliminary examples, the continuations computed by both LHE models agree with our perception, as the reconstructed connection in the target region links the two misaligned segments, while somehow 'stopping' the connection of the collinear one.
This phenomenon, as well as a more detailed study of how the parameters used to generate Figure 3a (such as the incidence angle, the width of the central grey bar and the distance between the lines) affect the reproduced illusion, in a spirit similar to [69], where psychophysical experiments were performed on analogous images, is an interesting topic for future research.

6. Conclusions

In this work we presented the sub-Riemannian version (16) of the Local Histogram Equalisation mean-field model previously studied in [44,45], here denoted by (sR-LHE). The model considered is a natural extension of existing ones, where the kernel used to model neural interactions was simply chosen to be a 3D Gaussian kernel, while in (sR-LHE) it is chosen as the sub-Riemannian heat kernel formulated in the space of positions and orientations underlying the primary visual cortex (V1). A numerical procedure based on Fourier expansions is described to compute such an evolution efficiently and in a stable way, and a gradient-descent algorithm is used for the numerical discretisation of the model.
We tested the (sR-LHE) model on orientation-dependent Poggendorff-type illusions and showed that (i) in the presence of a sufficiently wide interaction kernel, model (sR-LHE) is capable of reproducing the perceived misalignments, in agreement with previous work (see Figure 5 and Figure 7); (ii) when the interaction kernel is too narrow, (sR-LHE) favours a geometric-type completion (inpainting) of the illusion (see Figure 6), due to the limited amount of diffusion considered.
We also considered the sub-Riemannian version (13) of the standard orientation-dependent Wilson-Cowan equations previously studied in [44,45], denoted here by (sR-WC). We obtained (sR-WC) by using the sub-Riemannian interaction kernel in the standard orientation-dependent Wilson-Cowan equations. We showed that the introduction of such a cortical-based kernel improves the capability of WC-type models to reproduce Poggendorff-type illusions, in comparison with the analogous results reported in [44,45], where the cortical version of WC with a standard 3D Gaussian kernel was shown to fail to replicate the illusion.
Finally, we stress that, in agreement with the standard range of 12-18 orientations typically considered to be relevant in the literature [67,68], all the aforementioned results have been obtained by considering K = 16 orientations. The LHE and WC models previously proposed were unable to obtain meaningful results with fewer than K = 30 orientations.

Author Contributions

All authors contributed equally. All authors have read and agreed to the published version of the manuscript.

Funding

L. Calatroni, V. Franceschi and D. Prandi acknowledge the support of a public grant overseen by the French National Research Agency (ANR) as part of the Investissement d'avenir program, through the iCODE project funded by the IDEX Paris-Saclay, ANR-11-IDEX-0003-02. V. Franceschiello acknowledges the support received from the European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No. 794592. E. Baspinar acknowledges the support of the Human Brain Project (HBP), funded from the European Union's Horizon 2020 Framework Programme for Research and Innovation under the Specific Grant Agreement No. 785907 (Human Brain Project SGA2).

Data Availability Statement

Publicly available data sets were analyzed in this study. This data can be found in the following link: https://github.com/dprn/srLHE accessed on 28 December 2020.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Wertheimer, M. Laws of Organization in Perceptual Forms. In A Source Book of Gestalt Psychology; Kegan Paul, Trench, Trübner & Co.: London, UK, 1938. [Google Scholar]
  2. Kohler, W. Gestalt Psychology: An Introduction to New Concepts in Modern Psychology; W. W. Norton & Company: New York, NY, USA, 1992. [Google Scholar]
  3. Field, D.J.; Hayes, A.; Hess, R.F. Contour integration by the human visual system: Evidence for a local “association field”. Vis. Res. 1993, 33, 173–193. [Google Scholar] [CrossRef]
  4. Hubel, D.H.; Wiesel, T.N. Receptive fields of single neurones in the cat’s striate cortex. J. Physiol. 1959, 148, 574. [Google Scholar] [CrossRef] [PubMed]
  5. Hubel, D.H.; Wiesel, T.N. Receptive fields, binocular interaction and functional architecture in the cat’s visual cortex. J. Physiol. 1962, 160, 106. [Google Scholar] [CrossRef] [PubMed]
  6. Hubel, D.H.; Wiesel, T. Shape and arrangement of columns in cat’s striate cortex. J. Physiol. 1963, 165, 559. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Bosking, W.H.; Zhang, Y.; Schofield, B.; Fitzpatrick, D. Orientation selectivity and the arrangement of horizontal connections in tree shrew striate cortex. J. Neurosci. 1997, 17, 2112–2127. [Google Scholar] [CrossRef] [PubMed]
  8. Citti, G.; Sarti, A. A cortical based model of perceptual completion in the roto-translation space. J. Math. Imaging Vis. 2006, 24, 307–326. [Google Scholar] [CrossRef]
  9. Boscain, U.; Chertovskih, R.; Gauthier, J.P.; Prandi, D.; Remizov, A. Cortical-inspired image reconstruction via sub-Riemannian geometry and hypoelliptic diffusion. ESAIM Proc. Surv. 2018, 64, 37–53. [Google Scholar] [CrossRef]
  10. Zhang, J.; Duits, R.; Sanguinetti, G.; Ter Haar Romeny, B.M. Numerical Approaches for Linear Left-Invariant Diffusions on SE(2), Their Comparison to Exact Solutions, and Their Applications in Retinal Imaging. Numer. Math. Theory Methods Appl. 2016, 9, 1–50. [Google Scholar] [CrossRef] [Green Version]
  11. Gaillard, M.C.; Borruat, F.X. Persisting visual hallucinations and illusions in previously drug-addicted patients. Klin. Monatsbl. Augenheilkd. 2003, 220, 176–178. [Google Scholar] [CrossRef] [PubMed]
  12. Levi, L.; Miller, N.R. Visual illusions associated with previous drug abuse. J. Neuro-Ophthalmol. 1990, 10, 103–110. [Google Scholar]
  13. Hine, T.J.; Cook, M.; Rogers, G.T. An illusion of relative motion dependent upon spatial frequency and orientation. Vis. Res. 1995, 35, 3093–3102. [Google Scholar] [CrossRef] [Green Version]
  14. Prinzmetal, W.; Shimamura, A.P.; Mikolinski, M. The Ponzo illusion and the perception of orientation. Percept. Psychophys. 2001, 63, 99–114. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Purves, D.; Wojtach, W.T.; Howe, C. Visual illusions: An empirical explanation. Scholarpedia 2008, 3, 3706. [Google Scholar] [CrossRef]
  16. Westheimer, G. Illusions in the spatial sense of the eye: Geometrical–optical illusions and the neural representation of space. Vis. Res. 2008, 48, 2128–2142. [Google Scholar] [CrossRef] [Green Version]
  17. Weintraub, D.J.; Krantz, D.H. The Poggendorff illusion: Amputations, rotations, and other perturbations. Atten. Percept. Psychophys. 1971, 10, 257–264. [Google Scholar] [CrossRef]
  18. Day, R.; Dickinson, R. The components of the Poggendorff illusion. Br. J. Psychol. 1976, 67, 537–552. [Google Scholar] [CrossRef] [PubMed]
  19. Koenderink, J.J. The structure of images. Biol. Cybern. 1984, 50, 363–370. [Google Scholar] [CrossRef] [PubMed]
  20. Koenderink, J.J.; van Doorn, A.J. Representation of local geometry in the visual system. Biol. Cybern. 1987, 55, 367–375. [Google Scholar] [CrossRef]
  21. Hoffman, W.C. Higher visual perception as prolongation of the basic Lie transformation group. Math. Biosci. 1970, 6, 437–471. [Google Scholar] [CrossRef]
  22. Hoffman, W.C. The visual cortex is a contact bundle. Appl. Math. Comput. 1989, 32, 137–167. [Google Scholar] [CrossRef]
  23. Petitot, J.; Tondut, Y. Vers une neurogéométrie. Fibrations corticales, structures de contact et contours subjectifs modaux. Mathématiques Sci. Hum. 1999, 145, 5–101. [Google Scholar] [CrossRef] [Green Version]
  24. Citti, G.; Sarti, A. Neuromathematics of Vision; Springer: New York, NY, USA, 2014; Volume 32. [Google Scholar]
  25. Boscain, U.; Duplaix, J.; Gauthier, J.P.; Rossi, F. Anthropomorphic image reconstruction via hypoelliptic diffusion. SIAM J. Control Optim. 2012, 50, 1309–1336. [Google Scholar] [CrossRef]
  26. Boscain, U.; Chertovskih, R.; Gauthier, J.P.; Remizov, A. Hypoelliptic diffusion and human vision: A semi-discrete new twist on the Petitot theory. SIAM J. Imaging Sci. 2014, 7, 669–695. [Google Scholar] [CrossRef]
  27. Prandi, D.; Gauthier, J.P. A Semidiscrete Version of the Citti-Petitot-Sarti Model as a Plausible Model for Anthropomorphic Image Reconstruction and Pattern Recognition; Springer Briefs in Mathematics; Springer: Cham, Switzerland, 2018. [Google Scholar]
  28. Duits, R.; Franken, E. Line Enhancement and Completion via Linear Left Invariant Scale Spaces on SE(2). In Proceedings of the International Conference on Scale Space and Variational Methods in Computer Vision, Voss, Norway, 1–5 June 2009; pp. 795–807. [Google Scholar]
  29. Duits, R.; Franken, E. Left-invariant parabolic evolutions on SE(2) and contour enhancement via invertible orientation scores Part I: Linear left-invariant diffusion equations on SE(2). Q. Appl. Math. 2010, 68, 255–292. [Google Scholar] [CrossRef] [Green Version]
  30. Duits, R.; Franken, E. Left-invariant parabolic evolutions on SE(2) and contour enhancement via invertible orientation scores Part II: Nonlinear left-invariant diffusions on invertible orientation scores. Q. Appl. Math. 2010, 68, 293–331. [Google Scholar] [CrossRef] [Green Version]
  31. Bohi, A.; Prandi, D.; Guis, V.; Bouchara, F.; Gauthier, J.P. Fourier descriptors based on the structure of the human primary visual cortex with applications to object recognition. J. Math. Imaging Vis. 2017, 57, 117–133. [Google Scholar] [CrossRef] [Green Version]
  32. Bekkers, E.; Duits, R.; Berendschot, T.; ter Haar Romeny, B. A multi-orientation analysis approach to retinal vessel tracking. J. Math. Imaging Vis. 2014, 49, 583–610. [Google Scholar] [CrossRef] [Green Version]
  33. Barbieri, D.; Citti, G.; Cocci, G.; Sarti, A. A cortical-inspired geometry for contour perception and motion integration. J. Math. Imaging Vis. 2014, 49, 511–529. [Google Scholar] [CrossRef] [Green Version]
  34. Citti, G.; Franceschiello, B.; Sanguinetti, G.; Sarti, A. Sub-Riemannian mean curvature flow for image processing. SIAM J. Imaging Sci. 2016, 9, 212–237. [Google Scholar] [CrossRef] [Green Version]
  35. Baspinar, E.; Citti, G.; Sarti, A. A geometric model of multi-scale orientation preference maps via Gabor functions. J. Math. Imaging Vis. 2018, 60, 900–912. [Google Scholar] [CrossRef] [Green Version]
  36. Janssen, M.H.; Janssen, A.J.; Bekkers, E.J.; Bescós, J.O.; Duits, R. Design and processing of invertible orientation scores of 3d images. J. Math. Imaging Vis. 2018, 60, 1427–1458. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Franceschiello, B.; Sarti, A.; Citti, G. A neuromathematical model for geometrical optical illusions. J. Math. Imaging Vis. 2018, 60, 94–108. [Google Scholar] [CrossRef] [Green Version]
  38. Lafarge, M.W.; Bekkers, E.J.; Pluim, J.P.; Duits, R.; Veta, M. Roto-translation equivariant convolutional networks: Application to histopathology image analysis. arXiv 2020, arXiv:2002.08725. [Google Scholar] [CrossRef] [PubMed]
  39. Baspinar, E.; Sarti, A.; Citti, G. A sub-Riemannian model of the visual cortex with frequency and phase. J. Math. Neurosci. 2020, 10, 1–31. [Google Scholar] [CrossRef] [PubMed]
  40. Wilson, H.R.; Cowan, J.D. Excitatory and inhibitory interactions in localized populations of model neurons. Biophys. J. 1972, 12, 1–24. [Google Scholar] [CrossRef] [Green Version]
  41. Wilson, H.R.; Cowan, J.D. A mathematical theory of the functional dynamics of cortical and thalamic nervous tissue. Kybernetik 1973, 13, 55–80. [Google Scholar] [CrossRef] [PubMed]
  42. Amari, S.I. Dynamics of pattern formation in lateral-inhibition type neural fields. Biol. Cybern. 1977, 27, 77–87. [Google Scholar] [CrossRef]
  43. Franceschiello, B.; Mashtakov, A.; Citti, G.; Sarti, A. Geometrical optical illusion via sub-Riemannian geodesics in the roto-translation group. Differ. Geom. Its Appl. 2019, 65, 55–77. [Google Scholar] [CrossRef] [Green Version]
  44. Bertalmío, M.; Calatroni, L.; Franceschi, V.; Franceschiello, B.; Prandi, D. Cortical-inspired Wilson–Cowan-type equations for orientation-dependent contrast perception modelling. J. Math. Imaging Vis. 2020, 63, 263–281. [Google Scholar] [CrossRef]
  45. Bertalmío, M.; Calatroni, L.; Franceschi, V.; Franceschiello, B.; Gomez Villa, A.; Prandi, D. Visual illusions via neural dynamics: Wilson–Cowan-type models and the efficient representation principle. J. Neurophysiol. 2020, 123, 1606–1618. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Bertalmío, M.; Calatroni, L.; Franceschi, V.; Franceschiello, B.; Prandi, D. A Cortical-inspired Model for Orientation-dependent Contrast Perception: A Link with Wilson-Cowan Equations. In Scale Space and Variational Methods in Computer Vision; Lellmann, J., Burger, M., Modersitzki, J., Eds.; Springer International Publishing: Cham, Switzerland, 2019; pp. 472–484. [Google Scholar]
  47. Bertalmío, M.; Caselles, V.; Provenzi, E.; Rizzi, A. Perceptual color correction through variational techniques. IEEE Trans. Image Process. 2007, 16, 1058–1072. [Google Scholar] [CrossRef] [PubMed]
  48. Bertalmío, M.; Cowan, J.D. Implementing the Retinex algorithm with Wilson–Cowan equations. J. Physiol. Paris 2009, 103, 69–72. [Google Scholar] [CrossRef] [PubMed]
  49. Bertalmío, M. From image processing to computational neuroscience: A neural model based on histogram equalization. Front. Comput. Neurosci. 2014, 8, 71. [Google Scholar] [CrossRef] [Green Version]
  50. Attneave, F. Some informational aspects of visual perception. Psychol. Rev. 1954, 61, 183. [Google Scholar] [CrossRef]
  51. Barlow, H.B. Possible principles underlying the transformation of sensory messages. Sens. Commun. 1961, 1, 217–234. [Google Scholar]
  52. Duits, R. Perceptual Organization in Image Analysis: A Mathematical Approach Based on Scale, Orientation and Curvature; Technische Universiteit Eindhoven: Eindhoven, The Netherlands, 2005. [Google Scholar]
  53. Duits, R.; Duits, M.; van Almsick, M.; ter Haar Romeny, B. Invertible orientation scores as an application of generalized wavelet theory. Pattern Recognit. Image Anal. 2007, 17, 42–75. [Google Scholar] [CrossRef] [Green Version]
  54. Sarti, A.; Citti, G.; Petitot, J. The symplectic structure of the primary visual cortex. Biol. Cybern. 2008, 98, 33–48. [Google Scholar] [CrossRef]
  55. Bekkers, E.J.; Lafarge, M.W.; Veta, M.; Eppenhof, K.A.; Pluim, J.P.; Duits, R. Roto-translation Covariant Convolutional Networks for Medical Image Analysis. In Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Granada, Spain, 16–20 September 2018; pp. 440–448. [Google Scholar]
  56. Lindeberg, T. A computational theory of visual receptive fields. Biol. Cybern. 2013, 107, 589–635. [Google Scholar] [CrossRef]
  57. Daugman, J.G. Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. JOSA A 1985, 2, 1160–1169. [Google Scholar] [CrossRef]
  58. Barbieri, D.; Citti, G.; Sanguinetti, G.; Sarti, A. An uncertainty principle underlying the functional architecture of V1. J. Physiol. Paris 2012, 106, 183–193. [Google Scholar] [CrossRef]
  59. Hörmander, L. Hypoelliptic second order differential equations. Acta Math. 1967, 119, 147–171. [Google Scholar] [CrossRef]
  60. Faugeras, O. A constructive mean-field analysis of multi population neural networks with random synaptic weights and stochastic inputs. Front. Comput. Neurosci. 2009, 3. [Google Scholar] [CrossRef] [Green Version]
  61. Bressloff, P.C.; Cowan, J.D. An amplitude equation approach to contextual effects in visual cortex. Neural Comput. 2002, 14, 493–525. [Google Scholar] [CrossRef] [Green Version]
  62. Destexhe, A.; Sejnowski, T.J. The Wilson–Cowan model, 36 years later. Biol. Cybern. 2009, 101, 1–2. [Google Scholar] [CrossRef] [PubMed]
  63. Sarti, A.; Citti, G. The constitution of visual perceptual units in the functional architecture of V1. J. Comput. Neurosci. 2015, 38, 285–300. [Google Scholar] [CrossRef] [Green Version]
  64. Olshausen, B.A.; Field, D.J. Vision and the Coding of Natural Images: The human brain may hold the secrets to the best image-compression algorithms. Am. Sci. 2000, 88, 238–245. [Google Scholar] [CrossRef]
  65. Mirebeau, J.M. Anisotropic fast-marching on cartesian grids using lattice basis reduction. SIAM J. Numer. Anal. 2014, 52, 1573–1599. [Google Scholar] [CrossRef] [Green Version]
  66. Crank, J.; Nicolson, P. A Practical Method for Numerical Evaluation of Solutions of Partial Differential Equations of the Heat-conduction Type. In Mathematical Proceedings of the Cambridge Philosophical Society; Cambridge University Press: Cambridge, UK, 1947; Volume 43, pp. 50–67. [Google Scholar]
  67. Chariker, L.; Shapley, R.; Young, L.S. Orientation selectivity from very sparse LGN inputs in a comprehensive model of macaque V1 cortex. J. Neurosci. 2016, 36, 12368–12384. [Google Scholar] [CrossRef] [Green Version]
  68. Pattadkal, J.J.; Mato, G.; van Vreeswijk, C.; Priebe, N.J.; Hansel, D. Emergent orientation selectivity from random networks in mouse visual cortex. Cell Rep. 2018, 24, 2042–2050. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  69. Retsa, C.; Ariza, A.H.; Noordanus, N.W.; Ruffoni, L.; Murray, M.M.; Franceschiello, B. A psychophysically-tuned computational model of human primary visual cortex produces geometric optical illusions. bioRxiv 2020. [Google Scholar] [CrossRef]
Figure 1. Projections of horizontal integral curves approximate the association fields from the experiment of Field, Hayes and Hess [3]. They are generated by the sub-Riemannian model geometry proposed by Citti and Sarti [8]. Figures are adapted from [3,8].
Figure 2. The original Poggendorff illusion: the red colored line is aligned with the black line although the blue one is falsely perceived as its continuation. Source: Wikipedia.
Figure 3. Greyscale Poggendorff-type illusions. (a) is the standard 200 × 200 Poggendorff illusion with a 30 pixel-wide central bar and an incidence angle of π/3 between the black lines and the central bar. (b) is a variation of the classical Poggendorff illusion where a further background grating is present.
Figure 4. Model output for Poggendorff gratings in Figure 3b via WC models. (a) result of the WC model proposed in [44,45]. (b) result of (sR-WC) with parameters λ = 0.01, α = 20, σμ = 6.5, Δt = 0.1, Δτ = 0.01, τ = 5.
Figure 5. Model output for Poggendorff gratings in Figure 3b via LHE models. (a) result of the LHE model proposed in [44,45]. (b) result of (sR-LHE) with parameters α = 8, τ = 5, λ = 2, σμ = 1, Δt = 0.15, Δτ = 0.01.
Figure 6. Sensitivity of the (sR-LHE) model to the parameter τ for the visual perception of Figure 3b. The completion inside the central grey bar changes from geometrical (inpainting type) to illusory (perception type). Parameters: τ varies from 0.1 to 5, α = 6, σμ = 1, Δt = 0.15, Δτ = 0.01.
Figure 7. Model output for the Poggendorff illusion in Figure 3a via LHE models. (a) result of the LHE model proposed in [44,45] (with parameters σμ = 2, σω = 12, λ = 0.7, α = 5). (b) result of (sR-LHE) with parameters α = 8, τ = 2.5, λ = 0.5, σμ = 2.5, Δt = 0.15, Δτ = 0.1. (d) (resp. Figure 7c): zoom and renormalisation on [0,1] of the central region of the result in (b) (resp. Figure 7a).
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
