Open Access This article is
- freely available
Vision 2018, 2(3), 32; doi:10.3390/vision2030032
Apparent Motion Perception in the Praying Mantis: Psychophysics and Modelling
Institute of Neuroscience, Newcastle University, Newcastle upon Tyne NE2 4HH, UK
Author to whom correspondence should be addressed.
Received: 19 June 2018 / Accepted: 8 August 2018 / Published: 10 August 2018
Computer monitors, smart phone screens, and other forms of digital displays present a series of still images (frames) in which objects are displaced in small steps, tricking us into perceiving smooth motion. This illusion is referred to as “apparent motion”. For motion to be perceived, the magnitude of each displacement step must be smaller than a certain limit, referred to as . Previous studies have investigated the relationship between this limit and object size in humans and found that the maximum displacement is larger for larger objects than for smaller ones. In this work, we investigated the same relationship in the praying mantis Sphodromantis lineola by presenting them with moving random chequerboard patterns on a computer monitor. Even though motion perception in humans and insects are believed to be explained equally well by the same underlying model, we found that scales differently with object size in mantids. These results suggest that there may be qualitative differences in how mantids perceive apparent motion compared to humans.
Apparent motion is the perception of motion created by rapidly presenting still frames in which objects are displaced in space. Observers can reliably discriminate the direction of apparent motion when inter-frame object displacement is below a certain limit, . Earlier studies of motion perception in humans found that is lower-bounded at around 15 arcmin, and thereafter scales with the size of the spatial elements in the images. Here, we run corresponding experiments in the praying mantis Sphodromantis lineola to investigate how scales with the element size. We use random moving chequerboard patterns of varying element and displacement step sizes to elicit the optomotor response, a postural stabilization mechanism that causes mantids to lean in the direction of large-field motion. Subsequently, we calculate as the displacement step size corresponding to a 50% probability of detecting an optomotor response in the same direction as the stimulus. Our main findings are that the mantis scales roughly as a square-root of element size and that, in contrast to humans, it is not lower-bounded. We present two models to explain these observations: a simple high-level model based on motion energy in the Fourier domain and a more-detailed one based on the Reichardt Detector. The models present complementary intuitive and physiologically-realistic accounts of how scales with the element size in insects. We conclude that insect motion perception is limited by only a single stage of spatial filtering, reflecting the optics of the compound eye. In contrast, human motion perception reflects a second stage of spatial filtering, at coarser scales than imposed by human optics, likely corresponding to the magnocellular pathway. After this spatial filtering, mantis and human motion perception and Dmax are qualitatively very similar.
Keywords:apparent motion; insect vision; Hassenstein-Reichardt detector; Dmax
The detection of motion is an important visual function in all animals. Movement can signal the presence of prey, a predator, or a conspecific in the animal’s surrounding environment and is therefore an important cue for many forms of behavior. To understand how motion detection in animal visual systems works, it is helpful to first examine and understand their responses to basic forms of visual input such as apparent motion stimuli. Apparent motion is an illusion of motion created by the rapid presentation of successive still frames in which objects are gradually displaced in space. The ability of an observer to perceive motion in such stimuli depends on a number of factors including the frame rate of presentation and the amount of spatial displacement between individual frames. When inter-frame displacement is made large, for example, objects no longer appear to be smoothly moving and are instead perceived to be jumping in discrete steps. Increasing inter-frame displacement further, beyond a certain limit, makes observers unable to determine the direction of motion. This limit is known as  and is a characteristic feature of a visual motion detection system. Measuring and examining its relationship with other stimuli features can reveal details about the computational mechanisms of motion detection in the visual system. It can also improve our understanding of how apparent motion is perceived and offer ways to improve video display technologies such as TVs and computer monitors that rely on this phenomenon.
In an early study of apparent motion perception in humans, subjects were presented with random dot patterns and their ability to identify a correlated region of dots that was uniformly displaced on each frame was tested . The individual stimulus frames contained no cues about the region’s location, and so subjects could only distinguish it based on the motion of its constituent dots. The study found that the subjects could not detect motion when the region was displaced by more than 15 arcmins between frames, and that varying dot size in the range 2 to 11 arcmins had no effect on this result. The author postulated that humans have a constant of 15 arcmins that is imposed by the dimensions of elementary motion detectors in the human visual system. Subsequent studies, however, have shown that, while appears to be constant for element sizes of up to 15 arcmins (consistent with ), increasing the element size further caused a proportional increase in [2,3]. Absolute and relative values were presumed to be in favor of different explanations of how humans perceive motion [1,4,5] but were later shown to be accounted for by a simple model consisting of an initial spatial filter stage followed by a motion detection mechanism that is dependent on element size . Although many studies have investigated and its implications in humans over the course of four decades [1,3,6], no corresponding studies have been reported in insects. Measuring the properties of would not only give an insight into the operation of visual motion detection in insects but would also enable comparison with humans, potentially shedding light on the evolutionary origins of motion perception.
In this study, we tested the apparent motion perception in the praying mantis Sphodromantis lineola using moving random chequerboard patterns rendered on a computer monitor. We measured for a range of element sizes and found that insect scales with element size. This scaling was observed in humans for large sizes, but whereas the human asymptoted to a fixed minimum at fine scales, the insect continued to decrease as the element size decreased, down to the smallest scales detectable by insects.
We present two computational models for the relationship between and element size in the mantis: (1) a simplified energy-based model  that assumes uniform spatiotemporal sensitivity but still accounts for the qualitative features of our results; and (2) a more-detailed model based on the Hassenstein Reichardt correlator or “Reichardt Detector” (RD) that is widely used to account for large field motion perception in insects [8,9] and is in good agreement with our observations, both qualitatively and quantitatively. Taken together, the models offer complementary insights into insect . The first offers an intuitive explanation for scaling based on motion energy distribution in the Fourier domain and the second demonstrates consistency with standard motion detection models.
2.1. Experimental Findings
Mantids were placed in front of a Cathode Ray Tube (CRT) monitor and viewed a full screen random chequerboard stimulus moving horizontally in a series of steps. The size of the individual chequers (the element size) and the displacement magnitude in each step (the step size) were varied in different trials (Figure 1). When the step size was small, the stimulus appeared to humans as a smoothly moving pattern and elicited a postural stabilization mechanism (the optomotor response) in the mantis, causing them to lean in the direction of motion. As the step size was made larger, humans perceived the motion as jerky instead of smooth, and then at still larger step sizes, the motion percept was lost altogether, and the stimulus appeared as a sequence of random chequerboards with no relationship between them. In mantids, as the step size increased, stimuli ceased to elicit the optomotor response and mantids responded by either peering (moving the head and body from side to side, simultaneously counter-rotating the head so as to keep the eyes looking straight) or remaining still. In all trials, the time between steps was proportional to the step size, so the apparent motion speed was kept constant at 12.5 cm/s. Figure 2 shows stimulus space–time plots and corresponding spatiotemporal Fourier spectra for different step sizes.
An observer viewed the mantis through a web camera (while being blind to the stimulus) and coded its response in each trial as “moved left”, “moved right” or “other” (peering in both directions equally or no response) (). The motion probability was subsequently calculated as the proportion of trials in which motion was detected (mantis leaned in the same direction as the stimulus) under each condition. This is the closest feasible analogy to human psychophysics experiments, where humans classify their own perceptual experience into discrete classes. We have shown previously that human observers very rarely code the mantis as moving in the opposite direction to the stimulus [10,11,12]. Consistent with our earlier work, we found that observers reported mantids to be moving in the opposite direction to the stimulus in only 4% of trials (this proportion was similar across all conditions).
Figure 3 shows the collected data and fitted psychometric curves of motion probability versus the step size for different element sizes. From comparing the different panels in Figure 3, it is clear that increasing the element size shifts the psychometric curve in the positive direction of the step size axis. To quantify this change, we defined for each element size as the step size corresponding to a 50% probability of detecting motion, as estimated by a cumulative Gaussian psychometric function fit. Fifty percent corresponds to a 75% threshold on a two-alternative forced choice task where performance is at chance (50%) in the absence of a signal, and thus aligns with the human Dmax literature. At this point, thresholds are most accurate and least influenced by the shape of the psychometric function . Figure 4 shows pooled and individual plots of versus the element size. We found that the relationship between and the element size is in good agreement (standard error of the regression S = 0.318, over a range of 2–15 in Dmax) with the following power law (whose parameters were determined using maximum likelihood fitting):where x is element size in degrees.
2.2. Modeling and Simulation
2.2.1. Model 1: Fourier Energy
Modifying a pattern moving smoothly at a given speed (v) by introducing a fixed step size () is equivalent to passing it through a “sample and hold” circuit with a sampling interval of . The transformation introduces an aliasing artifact whereby temporal frequencies higher than are cast into lower frequencies, as illustrated in Figure 2. A chequerboard pattern moving smoothly at speed v consist of a series of components, each with a spatial frequency () and a temporal frequency () where that lie in quadrants 1 and 3 of the spatiotemporal Fourier domain (for motion in the positive direction). When the pattern is moved in discrete steps, a fraction of its component energy is transferred to quadrants 2 and 4 (i.e., motion in the negative direction). Increasing the step size (and correspondingly , since speed is constant) causes a larger portion of pattern motion energy to be distributed across the four quadrants, making it more difficult to identify the direction of motion. This aliasing effect can be observed in the space–time plots of moving patterns. When motion is smooth, the space–time plot shows clear rightward-pointing structures, as in the left column of Figure 2. If the pattern is moved by a step size that is larger than the element size (right column), leftward-pointing structures emerge as a result of the false matches between non-corresponding blocks across different frames.
As a first order approximation, the motion energy of a stepped chequerboard pattern can be taken as the sum of its unaliased components. The rationale behind this is that components of any temporal frequency range (where N is a non-zero natural number) are cast by aliasing to the range and so, assuming symmetry around , would be split into two groups with equal energy in the positive and negative directions. The net motion energy of aliased components in the pattern can therefore be approximated as zero (Figure 5). We ran numerical simulations in which we calculated the motion energy of stimuli in our experiment using this model (shown as a block diagram in Figure 6). Figure 7 (left) shows a plot of approximated pattern energy versus step size for element sizes in the range 1–8. The plot shows that increasing element size causes a proportional shift in the curve over a region of energy levels (up to ∼ units). Assuming the energy threshold (T) corresponding to Dmax is within this range (e.g., ), we calculated Dmax as the step size at which motion energy crosses the threshold (for each element size). Figure 7 (right) shows a plot of the calculated Dmax versus the element size based on this model and compares this with our experimental findings. The predictions and experimental results are in good agreement, showing that this simple high-level model with a single parameter, the threshold (T), can account well for the values that we observed in the mantis.
2.2.2. Model 2: Reichardt Detector
Model 1 is a simple and intuitive model which provides an excellent account of our data, but it does not explain how Fourier energy is computed in the mantis brain. More importantly, it fails to take account of the mantis spatiotemporal contrast sensitivity function , instead assuming that all Fourier components are equally visible. To verify that the intuitive account of Model 1 can survive a more realistic implementation, we built and simulated a more detailed, lower-level motion detection model to investigate whether our experimental findings could be explained by a system of narrow-band detectors. This second model is based on the well known Hassenstein Reichardt correlator or “Reichardt Detector” (RD) that is widely used to account for the optomotor response and similar large-field visually-guided forms of behavior in insects [8,15,16,17]. The detector (shown in Figure 8) consists of two subunits that compute motion in opposite directions, typically assumed to be neighboring ommatidia and their neural cartridges . The spatial input from each subunit is temporally delayed and then compared with the other subunit’s input to detect luminance changes that have “travelled” in the same direction with a similar delay. The two subunit outputs are then summed to produce a direction-sensitive measure of motion.
Although the preferred spatial frequency of the detector can be varied by adjusting the spatial separation of its subunits and the extent of their input spatial filters, its spatial bandwidth is considerably constrained . To enable the model to respond to stimuli covering a spatial bandwidth larger than that of the detector’s (such as the range of element sizes we tested), we combined multiple detectors tuned to two spatial frequencies (i.e., two detector classes). This is consistent with the existence of regional variations in ommatidia size and spacing across mantis eyes .
Our model is shown in Figure 9 and is essentially a slight variation of a published model of the mantis optomotor response . The (free) parameters that we fitted were the separation and extent of spatial filters for each class (, , , ), the two post-detector thresholds and (each for one of the two classes, normalized to maximum detector output), the mantis noise level () and the observer noise level () (both normalized to the maximum signal level).
The spatial parameters and determine the spatial frequency tuning of each detector class (intuitively, larger and values facilitate the detection of longer spatial periods and hence shift detector sensitivity towards lower spatial frequencies). Leaky integrator blocks calculate the average of each RD over the last 300 ms and pass this through hard two-sided thresholds to obtain “ternary votes” on the motion direction (−1, +1 or 0, corresponding to “moved left”, “moved right” or “other”). There are 25 detectors in each class (50 in total) and each class has its own threshold level (, ). The votes are subsequently summed and passed through a noisy threshold block (using Gaussian noise) to calculate an optomotor response signal in the range [−1, 1]. Finally, is integrated over stimulus presentation and passed through a noisy ternary threshold to model the decision process of the human observer in the experiment.
The fitted parameter values were = 1.07, = 3.42, = 1.0, = 2.5, = 0.022, = 0.022, = 0.0112 and = 0.0588. Assuming the detector spatial filters correspond to individual ommatidia, the interommatidial angle () and acceptance angle () of each class were = 1.0, = 2.5, = 2.5, = 8. These values are consistent with measurements of interommatidial parameters in the comparable mantis species Tenodera australasiae  and indicate that classes 1 and 2 are near-foveal and peripheral detectors, respectively. In addition to the fitted parameters, there were two other (non-free) parameters: the time constants of the RD’s low and high-pass temporal filters. Their values were selected as = 13 ms and = 40 ms based on physiologically plausible data reported in literature ([10,11,20]).
Predictions were obtained by simulating the model using the same stimuli used in experiments. To reduce the simulation time, both RD spatial filters and stimuli had a single spatial dimension instead of two. Simulated trials lasted for 1 second, after which the model produced a ternary judgment: “moved left”, “moved right” or “other” (peering in both directions equally or no response), corresponding to the experiment. Figure 10 shows the predictions of this model versus the experiment data. The predictions are in good agreement with our experimental observations showing that, by combining at least two classes of detectors, mantis values can be explained by a system of narrow-band motion detection units.
2.2.3. Model Comparison
Model 1, based on the Fourier spectra of stimuli, provides a good qualitative account of but assumes an underlying (ideal) motion detection system that is equally sensitive at all spatial and temporal frequencies. Published data of the mantis contrast sensitivity function, however, show maximum sensitivity in the spatial frequency range of 0.01 to 0.1 cycles per degree (cpd) and peak temporal sensitivity at 8 Hz . In addition, there are significant regional variations in the size and spacing of ommatidia , the presumed structural correlates of motion detection spatial filters , making it likely that spatial sensitivity is also non-uniform across the visual field of the mantis. Since Model 1 did not take these details into account, it was important to cross-check the intuitive account of in terms of Fourier energy (derived from Model 1) against a more detailed model. In Model 2, we used a variation of the RD-based model from  to account for the spatial and temporal sensitivity of the mantis. We found it necessary to include two classes of RDs with distinct spatial tuning, since the bandwidth of any single detector class was too narrow to account for our experimental observations. Our results show that both Models 1 and 2 provide good accounts of experimental observations (Figure 11), consistent with the interpretation that (1) the mantis optomotor response depends on the total Fourier energy present within the stimulus; (2) mantis is imposed by aliasing artefacts which limit this energy and (3) spatiotemporal sensitivity does not significantly affect mantis provided that the stimulus is visible to the animal.
The Motion Energy model  is another prominent model that has been very successful in explaining a range of electro-physiological observations in mammals [21,22]. Even though the energy model relies on calculating Fourier energy, similar to Model 1, it includes spatial and temporal filters and can therefore account for spatiotemporal tuning, similar to Model 2 (it has in fact been proven to be formally equivalent to the RD under the assumption of separable filters ).
Our results show similarities and differences between in humans and mantids. First, while the mantis appears to scale with element size similar to humans (for elements larger than 15 arcmin), we did not observe a region of element sizes with a constant (up to 15 arcmin in humans). Humans can perceive static dot patterns with much smaller element sizes, and so the lower bound on is evidence for the existence of a spatial filtration stage that precedes motion detection . Consistent with this, applying spatial low pass filtration to random dot stimuli was found to increase the lowest bound . Although it is possible that element sizes smaller than the ones tested may have a constant , the smallest element size in our experiment (0.24 deg) was smaller than the foveal interommatidial angle in the comparable mantis species Tenodera australasiae . Mantids are also most sensitive to motion with spatial frequencies of 0.01∼0.1 cpd , a range much lower than the fundamental spatial frequency component of a chequerboard pattern with 1 deg elements (2 cpd). Patterns with elements as small as 0.24 deg can still be detected because of their lower frequency components, but as the element size decreases further, a larger portion of their energy shifts towards the highest spatial frequencies and outside the mantis sensitivity range. It is therefore likely that the smallest element size tested in this study was near the limit of mantis visual acuity. Consistent with this, our data (Figure 3) show that the response rate for 0.24 deg elements is significantly lower, even at the smallest step size, compared to larger elements. Mantids may, therefore, have no range of element sizes over which is constant. In other words, the highest spatial frequency experienced by the mantis visual system in these stimuli was already limited by their optics rather than by the element size. Thus, reducing the pattern size further would simply abolish the response rather than revealing a shift to constant Dmax. We conclude that mantids, unlike humans, do not show a range of element sizes over which is constant before it begins to rise.
Another apparent difference between mantis and human data concerns the relationship between and element size. In humans, is assumed to scale linearly with the element size (i.e., for humans) [2,3,23,24] where the coefficient of proportionality (k) depends on a number of stimulus parameters, such as the spatial extent [5,25,26,27] and duration of presentation [26,28,29,30]. Our results in the mantis, however, appear to be best approximated by a power law fit with an exponent of of 0.462 (i.e., ). Even though the coefficient of proportionality (k) is also likely to depend on the stimulus parameters, the different exponent may reflect a qualitative difference between motion processing in humans and mantids. However, a comparison between mantis and published human data does not make a compelling case for a qualitative difference (Figure 12). Fitting k to all the human data while constraining the exponent to be 0.462, as for mantids, produces a reasonable fit, even neglecting the lower bound on at small element sizes. In a more complex model accounting for this limit, it seems far from clear that an exponent of 1 would be the best fit. In fact, in both humans and insects may show a sublinear dependence on scale.
The ability of both our Fourier and RD-based models to reproduce the relationship between and element size in the mantis suggests further similarities. Our simple model based on stimulus energy in the Fourier domain is similar to the Motion Energy model used to account for apparent motion perception in humans . Both assume that motion perception is computed by a cross-correlation operation in the space–time domain (or, equivalently, a summation of stimulus energy in the spatio-temporal Fourier domain). A direct implication of our model is that corresponds to a motion energy detection threshold that is reliably exceeded by smoothly moving stimuli but not by those with aliasing artifacts caused by large motion steps. According to this model, scales with the element size, because stimuli composed of larger elements have a greater portion of their energy within lower frequencies and can therefore be more severely aliased by larger step sizes before their unaliased component energy falls below the threshold. This simple account of is sufficient to explain the main characteristics of the relationship between and the element size but does not take into account the limited spatial sensitivity of local correlation-type detectors. The response properties predicted by these detectors were found to be in excellent agreement with a series of behavioral and physiological studies [31,32,33,34,35,36], and much is known about their neural substrates . Reproducing the relationship between and the element size using the RD-based model was therefore important to verify that the intuition behind the Fourier model still holds when motion detection is mediated by units with narrow-band sensitivity and other physiologically-realistic details.
Because the spatial bandwidth of the Reichardt detector is considerably constrained, we found it necessary to use two detector classes with different levels of spatial tuning to replicate our experimental observations across the element size range tested (0.24–9.55). Our stimulus subtended 142 at the viewing distance of the mantis, an extent that spans significant variations in ommatidia size and spacing. It is therefore likely that the two detector classes in Model 2 correspond to different eye regions where ommatidia parameters (and hence detector spatial frequency tuning) are different. Reported measurements of ommatidial parameters in comparable mantis species Tenodera australasiae range from = 0.6, = 2 in the fovea to = 2.5, = 6 in peripheral eye regions, under dark-adapted conditions that match our experimental setup . The fitted spatial filter parameters of Model 2 ( = 1.0, = 2.5, = 2.5, = 8) are in reasonable agreement with these ranges and are consistent with the interpretation that classes 1 and 2 represent near-foveal and peripheral detectors, respectively. This interpretation makes the prediction that if stimulus presentation were limited to a subregion of the eye with little variation between ommatidia, then a model with a single detector class would explain the results. This prediction could be tested by, for example, painting over peripheral eye regions. Ref.  showed that the mantis optomotor response is driven predominantly by the central 50 or so of the visual field, so even after this manipulation, the optomotor response should still be elicitable. This experiment would be a valuable test of our interpretation.
In conclusion, we observed that mantis scales sublinearly with the element size (in fact roughly as the square-root of element size, specifically ). Although human Dmax has been summarised as being proportional to element size, the human results were also consistent with a sublinear increase. We also found that two models based on the RD and motion energy (that are predominantly used in insect and human motion detection, respectively) provided consistent accounts of these experimental observations. The major qualitative difference between humans and insects is that human Dmax remains roughly constant for small element sizes before it begins its sublinear increase. We agree with  that this may reflect the distinct parvo- and magno-cellular pathways in human vision. In this account, human sensitivity to static patterns is mediated by the parvocellular pathway and has a high spatial-frequency limit matching the excellent optics of human eyes. Human motion perception, however, largely relies on magnocellular processing with a much lower spatial-frequency limit. Thus, in humans, Dmax initially reflects a coarse, magnocellular filter, even though humans can perceive much finer patterns using the parvocellular pathway. Only when the element size is coarser than the magnocellular filter does human Dmax begin to scale with element size. In insects, the spatial “sampling base” of the optomotor response fits the interommatidial angle  and so subsequent motion processsing stages, e.g., the pooling of Reichardt detectors into wide-field motion sensors apparently do not impose additional, coarser spatial filtering of the stimulus, at least not in the pathways mediating the optomotor response.
A total of 13 female mantids of the species Sphodromantis lineola were used in the experiment. Each individual was kept in a plastic box of dimensions cm with a porous lid for ventilation and fed a live cricket twice per week. The boxes were kept at a temperature of 25 and were cleaned and misted with water twice per week.
4.2. Experimental Setup
In each experiment, a mantis was hung upside down from a Perspex base and viewed full-screen stimuli on a CRT monitor (HP P1130). The base was held in place by a clamp such that the mantis was facing the middle point of the screen and the viewing distance (distance between mantis and monitor) was 7 cm in all trials. A web camera (Kinobo USB B3 HD Webcam) was placed under the base, providing a view of the mantis (but not the screen). The monitor, Perspex base and camera were all placed inside a wooden enclosure to isolate the mantis from distractions and to maintain consistent dark ambient lighting during experiments. The screen had physical dimensions of cm and pixel dimensions of pixels. At the viewing distance of the mantis, the horizontal extent of the monitor subtended a visual angle of 142. The mean luminance of the monitor was 27.1 cd/m, and its refresh rate was 85 Hz.
Experiments were programmed in Matlab 2012b (Mathworks, Inc., Natick, MA, USA) and stimuli developed using Psychophysics Toolbox Version 3 (PTB-3) [37,38,39] were used. The experiment computer was a Dell Optiplex 9010 (Dell Corporation Limited, Bracknell, UK) with an Nvidia Quadro K600 graphics card running Microsoft Windows 7.
4.3. Experimental Procedure
Each experiment consisted of a number of randomly-interleaved trials in which a mantis was presented with moving chequerboard patterns. An experimenter observed the mantis through the camera underneath and blindly coded the direction of the elicited optomotor response (if any). The response code for each trial was “moved left”, “moved right” or “other” (peering in both directions equally or no response). There were equal repeats of left-moving and right-moving patterns of each condition in all experiments. At the beginning of each trial, an alignment stimulus was used to steer the mantis back such that it was facing the middle point of the screen. The alignment stimulus was a smoothly moving chequerboard pattern controlled by the experimenter using a keyboard.
4.4. Visual Stimulus
The stimulus was a full-screen chequerboard pattern moving either leftwards or rightwards in each trial. The pattern was composed of black and white chequers of 0.161 and 54.1 cd/m luminance levels, respectively, chosen randomly with equal probability. The whole pattern was displaced horizontally by a step () at intervals where was kept constant across the different conditions. The stimulus speed was 12.5 cm/sec, corresponding to an optic flow of 44.4 deg/sec (averaged over screen width) at the viewing distance of the mantis (7 cm).
4.5. Calculating Dmax
We used likelihood fitting to calculate as the step size corresponding to a 50% response rate for each element size. The fitted psychometric functions were of the formwhere p is the probability that motion is detected, is the step size, and is a parameter that specifies the width of the function’s transition period. The parameters and were calculated for each element size using maximum likelihood estimation in Matlab, assuming mantids’ responses had a simple binomial distribution. Therefore,where k is the number of step sizes, is the number of trials in which mantids detected motion, is the total number of trials, and is motion detection probability predicted by given and values, as per Figure 2. We calculated and for each insect’s individual data and for all insects pooled together.
4.6. Simulation and Model Fitting
4.6.1. Model 1
Model 1 was simulated by calculating the Fourier transform of sample visual stimuli identical to those used in the experiment and summing their energy in the Fourier domain. This was done using the functions fft and trapz in Matlab (Mathworks Inc, Natick, MA, USA). The threshold level (T) was fitted in Matlab using the function fminsearch.
4.6.2. Model 2
In RD simulations, we used Gaussian spatial filters and low/high pass temporal filters of the formwhere = 13 ms, = 40 ms, , depended on the detector class ( = 1.0, = 2.5, = 1.07, = 3.42), and was the detector position on the simulated retina. The models were normalized such that all gave a mean response of 1 to a drifting grating at the optimal spatial and temporal frequencies.
Model 2 was fitted using the optimization function fminsearch in Matlab (Mathworks, Inc., Natick, MA, USA). Repeated fitting attempts from randomly selecting initial solutions resulted in the same final solution (within numerical error) in the overwhelming majority of cases, confirming that the algorithm had located the global optimum rather than being stuck at a local minimum.
5. Data Availability
The data collected in this study are available on https://github.com/m3project/mantis-dmax.
Conceptualization, J.C.A.R., G.T., V.N., R.R., L.J. and C.R.; Methodology, G.T., V.N., L.J. and C.R.; Software, G.T.; Validation, G.T.; Formal Analysis, G.T.; Investigation, L.J.; Data Curation, L.J.; Resources, C.R.; Writing—Original Draft Preparation, G.T.; Writing—Review & Editing, J.C.A.R.; Visualization, G.T.; Supervision, J.C.A.R.; Project Administration, J.C.A.R. and C.R.; Funding Acquisition, J.C.A.R. and C.R.
This research was funded by The Leverhulme Trust with Research Leadership Award RL-2012-019 to J.C.A.R., and by a BBSRC (Biotechnology and Biological Sciences Research Council) PhD studentship (1219068) supporting L.J.
Conflicts of Interest
The authors declare no conflict of interest.
- Braddick, O. A short-range process in apparent motion. Vis. Res. 1974, 14, 519–527. [Google Scholar] [CrossRef]
- Cavanagh, P.; Boeglin, J.; Favreau, O.E. Perception of motion in equiluminous kinematograms. Perception 1985, 14, 151–162. [Google Scholar] [CrossRef] [PubMed]
- Morgan, M. Spatial filtering precedes motion detection. Nature 1992, 355, 344–346. [Google Scholar] [CrossRef] [PubMed]
- Sato, T. Reversed apparent motion with random dot patterns. Vis. Res. 1989, 29, 1749–1758. [Google Scholar] [CrossRef]
- Chang, J.J.; Julesz, B. Displacement limits for spatial frequency filtered random-dot cinematograms in apparent motion. Vis. Res. 1983, 23, 1379–1385. [Google Scholar] [CrossRef]
- Tripathy, S.P.; Shafiullah, S.N.; Cox, M.J. Influence of correspondence noise and spatial scaling on the upper limit for spatial displacement in fully-coherent random-dot kinematogram stimuli. PLoS ONE 2012, 7, e42995. [Google Scholar] [CrossRef] [PubMed]
- Adelson, E.H.; Bergen, J.R. Spatiotemporal energy models for the perception of motion. JOSA A 1985, 2, 284–299. [Google Scholar] [CrossRef]
- Hassenstein, B.; Reichardt, W. Systemtheoretische analyse der zeit-, reihenfolgen-und vorzeichenauswertung bei der bewegungsperzeption des rüsselkäfers chlorophanus. Z. Naturforsch. B 1956, 11, 513–524. [Google Scholar] [CrossRef]
- Borst, A. Fly visual course control: Behaviour, algorithms and circuits. Nat. Rev. Neurosci. 2014, 15, 590–599. [Google Scholar] [CrossRef] [PubMed]
- Nityananda, V.; Tarawneh, G.; Jones, L.; Busby, N.; Herbert, W.; Davies, R.; Read, J.C. The contrast sensitivity function of the praying mantis Sphodromantis lineola. J. Comp. Physiol. A 2015, 201, 741–750. [Google Scholar] [CrossRef] [PubMed]
- Tarawneh, G.; Nityananda, V.; Rosner, R.; Errington, S.; Herbert, W.; Cumming, B.G.; Read, J.C.; Serrano-Pedraza, I. Invisible noise obscures visible signal in insect motion detection. Sci. Rep. 2017, 7, 3496. [Google Scholar] [CrossRef] [PubMed]
- Tarawneh, G.; Nityananda, V.; Rosner, R.; Errington, S.; Herbert, W.; Arranz-Paraíso, S.; Busby, N.; Tampin, J.; Read, J.; Serrano-Pedraza, I. Contrast thresholds reveal different visual masking functions in humans and praying mantises. Biol. Open 2018, 7, 1–9. [Google Scholar] [CrossRef] [PubMed]
- Wichmann, F.A.; Hill, N.J. The psychometric function: I. Fitting, sampling, and goodness of fit. Percept. Psychophys. 2001, 63, 1293–1313. [Google Scholar] [CrossRef] [PubMed]
- Nityananda, V.; Tarawneh, G.; Errington, S.; Serrano-Pedraza, I.; Read, J. The optomotor response of the praying mantis is driven predominantly by the central visual field. J. Comp. Physiol. A 2017, 203, 77–87. [Google Scholar] [CrossRef] [PubMed]
- Bahl, A.; Ammer, G.; Schilling, T.; Borst, A. Object tracking in motion-blind flies. Nat. Neurosci. 2013, 16, 730–738. [Google Scholar] [CrossRef] [PubMed]
- Srinivasan, M.; Lehrer, M.; Kirchner, W.; Zhang, S. Range perception through apparent image speed in freely flying honeybees. Vis. Neurosci. 1991, 6, 519–535. [Google Scholar] [CrossRef] [PubMed]
- Borst, A.; Bahde, S. Visual information processing in the fly’s landing system. J. Comp. Physiol. A 1988, 163, 167–173. [Google Scholar] [CrossRef]
- Borst, A.; Haag, J.; Reiff, D.F. Fly motion vision. Ann. Rev. Neurosci. 2010, 33, 49–70. [Google Scholar] [CrossRef] [PubMed]
- Rossel, S. Regional differences in photoreceptor performance in the eye of the praying mantis. J. Comp. Physiol. 1979, 131, 95–112. [Google Scholar] [CrossRef]
- Borst, A. Neural circuits for elementary motion detection. J. Neurogenet. 2014, 28, 361–373. [Google Scholar] [CrossRef] [PubMed]
- Emerson, R.C.; Bergen, J.R.; Adelson, E.H. Directionally selective complex cells and the computation of motion energy in cat visual cortex. Vis. Res. 1992, 32, 203–218. [Google Scholar] [CrossRef]
- Movshon, J.A.; Thompson, I.D.; Tolhurst, D.J. Spatial summation in the receptive fields of simple cells in the cat’s striate cortex. J. Physiol. 1978, 283, 53–77. [Google Scholar] [CrossRef] [PubMed]
- Watanabe, T. High-Level Motion Processing: Computational, Neurobiological, and Psychophysical Perspectives; MIT Press: Cambridge, MA, USA, 1998. [Google Scholar]
- Smith, A.; Ledgeway, T. Motion detection in human vision: a unifying approach based on energy and features. R. Soc. Lond. B Biolog. Sci. 2001, 268, 1889–1899. [Google Scholar] [CrossRef] [PubMed]
- Baker, C.L.; Braddick, O.J. The basis of area and dot number effects in random dot motion perception. Vis. Res. 1982, 22, 1253–1259. [Google Scholar] [CrossRef]
- Nakayama, K.; Silverman, G.H. Temporal and spatial characteristics of the upper displacement limit for motion in random dots. Vis. Res. 1984, 24, 293–299. [Google Scholar] [CrossRef]
- Todd, J.T.; Norman, J.F. The effects of spatiotemporal integration on maximum displacement thresholds for the detection of coherent motion. Vis. Res. 1995, 35, 2287–2302. [Google Scholar] [CrossRef]
- Nishida, S.; Sato, T. Positive motion after-effect induced by bandpass-filtered random-dot kinematograms. Vis. Res. 1992, 32, 1635–1646. [Google Scholar] [CrossRef]
- Snowden, R.; Braddick, O. The combination of motion signals over time. Vis. Res. 1989, 29, 1621–1630. [Google Scholar] [CrossRef]
- Snowden, R.; Braddick, O. Extension of displacement limits in multiple-exposure sequences of apparent motion. Vis. Res. 1989, 29, 1777–1787. [Google Scholar] [CrossRef]
- Von Fermi, G.; Richardt, W. Optomotorische reaktionen der fliege Musca domestica. Kybernetik 1963, 2, 15–28. [Google Scholar] [CrossRef]
- Eckert, H. Optomotorische Untersuchungen am visuellen System der Stubenfliege Musca domestica L. Kybernetik 1973, 14, 1–23. [Google Scholar] [CrossRef] [PubMed]
- Joesch, M.; Plett, J.; Borst, A.; Reiff, D.F. Response properties of motion-sensitive visual interneurons in the lobula plate of Drosophila melanogaster. Curr. Biol. 2008, 18, 368–374. [Google Scholar] [CrossRef] [PubMed]
- Schnell, B.; Joesch, M.; Forstner, F.; Raghu, S.V.; Otsuna, H.; Ito, K.; Borst, A.; Reiff, D.F. Processing of horizontal optic flow in three visual interneurons of the Drosophila brain. J. Neurophysiol. 2010, 103, 1646–1657. [Google Scholar] [CrossRef] [PubMed]
- Rister, J.; Pauls, D.; Schnell, B.; Ting, C.Y.; Lee, C.H.; Sinakevitch, I.; Morante, J.; Strausfeld, N.J.; Ito, K.; Heisenberg, M. Dissection of the peripheral motion channel in the visual system of Drosophila melanogaster. Neuron 2007, 56, 155–170. [Google Scholar] [CrossRef] [PubMed]
- Joesch, M.; Schnell, B.; Raghu, S.V.; Reiff, D.F.; Borst, A. ON and OFF pathways in Drosophila motion vision. Nature 2010, 468, 300–304. [Google Scholar] [CrossRef] [PubMed]
- Brainard, D.H.; Vision, S. The psychophysics toolbox. Spat. Vis. 1997, 10, 433–436. [Google Scholar] [CrossRef] [PubMed]
- Pelli, D.G. The VideoToolbox software for visual psychophysics: Transforming numbers into movies. Spat. Vis. 1997, 10, 437–442. [Google Scholar] [CrossRef] [PubMed]
- Kleiner, M.; Brainard, D.; Pelli, D. What’s new in Psychophysics Toolbox-3? [ECVP Abstract Supplement]. Perception 2007, 36, 1–16. [Google Scholar]
Figure 1. Examples of the random chequerboard pattern stimuli used in the experiment. Each column of panels shows still frames of a random chequerboard pattern that moves across a screen. The pattern is displaced by a fixed step () on intervals where the speed is constant (the red arrow at the top of each panel points to a reference point for easier visualization). The perception of motion in these patterns is strongly dependent on the displacement step () relative to the pattern’s chequer/element size. When is comparable to the element size, humans perceive the pattern as moving smoothly and can identify its direction reliably (column 1). Increasing causes the pattern motion to appear more “jerky” and makes its direction harder to identify (column 2). Because the perception of motion is dependent on the ratio between the element and step size, increasing the element size to match the step size makes the patterns appear to be moving smoothly again (column 3).
Figure 2. Spacetime plots and Fourier spectra for stimuli with different step sizes (10 deg elements). The top panels show space–time plots and the bottom panels show corresponding spatiotemporal Fourier spectra for a number of step sizes (the element size is 10 deg in all plots). When the step size is much less than the element size (left column), the stimulus is perceived by humans as moving smoothly and its motion energy is situated in quadrants 1 and 3 in the Fourier spectra (i.e., motion in the rightwards direction). In the middle column (step size equals element size), humans see the stimulus as moving less smoothly and part of its motion energy is now distributed in quadrants 2 and 4 in the Fourier domain (leftward motion). Finally, when the step size is much larger than the element size (right column), motion energy is more evenly distributed across the four Fourier quadrants, and humans find it significantly more difficult to identify the motion direction.
Figure 3. Optomotor response rates against step size for various element sizes. Error bars are 95% confidence intervals calculated using simple binomial statistics. Each panel shows the pooled responses of 13 mantids to a moving chequerboard stimulus of a given element size. The data and corresponding psychometric fits show that motion detection becomes increasingly more difficult (i.e., the response rate decreases) as the step size increases and the step size corresponding to a 50% response rate () increases with element size.
Figure 4. Relationship between Dmax and element size. Calculated values and power law fits for individual mantids (left) and pooled data (right). The points appear to lie on a straight line on this log-log plot indicating that can be described accurately as a power law function of element size x, determined via fitting as (standard error of the regression ) for pooled data. Individual power law fits (left) had a mean S of 1.12 with 0.714.
Figure 5. Effect of aliasing on the distribution of motion energy in the Fourier domain. A pattern moving coherently at a constant speed (v) has spatial and temporal frequency components along the line , and its motion energy content is in quadrants 1 and 3 that signify positive motion (left panel). When the pattern is moved in steps of at the same speed, components with temporal frequencies higher than get aliased and cast towards lower frequencies (right panel). Aliased components are distributed in the four quadrants, and their net motion energy is therefore close to zero, leaving the energy of unaliased components as a close approximation of what remains in quadrants 1 and 3.
Figure 6. Apparent motion detection model based on Fourier Energy (Model 1). In this model, the spatiotemporal Fourier transform of the stimulus is filtered by a aliasing window (whose extent is cpd, where is presentation step size) and multiplied with the spatiotemporal sensitivity of an (ideal) motion detector. The net energy is then summed and passed through a ternary threshold to obtain a decision on the motion direction. The red and blue rectangles in filter blocks represent the spatiotemporal regions with positive and negative sensitivities, respectively.
Figure 7. Simple motion energy-based model of in the praying mantis (Model 1). (Left) the total motion energy of unaliased pattern components contained within a frequency window () decreases as the step size () is increased. Patterns of larger elements have a larger portion of their motion energy within lower frequencies and therefore contain more energy at large step sizes. According to this model, is the step size corresponding to a given motion energy threshold (chosen here to be ∼ units); (Right) This simple model is in good agreement with our experimental results for the mantis (standard error of the regression, ).
Figure 8. The Reichardt Detector. The spatial input from two identical Gaussian filters (standard deviation ) separated by is passed through high and low pass temporal filters (HP and LP, respectively). The LP output in each subunit is cross-correlated with the HP output from the other subunit using a multiplication stage (M), and the two products are then subtracted to produce a direction-sensitive measure of motion. In the RD-based Model 2 (Figure 9), the spatial filter parameters are = 1.07, = 1.0 for detector class 1, and = 3.42, = 2.5 for detector class 2, while the low and high pass filter time constants are = 13 ms, = 40 ms, respectively.
Figure 9. Optomotor response model based on the Reichardt Detector (Model 2). The model has two arrays of detectors that are positioned in random locations across a simulated 1D retina. Detectors within each array share the same subunit separation and spatial filter extent (and are thus all tuned to the same spatial frequency). Having two detector classes enables the model to respond to stimuli across a range of spatial frequencies broader than that of a single detector. The output of each detector is temporally integrated and then converted to a “vote” for leftward, rightward, or no motion by a hard threshold. The votes are subsequently combined, thresholded, and then passed through another temporal integrator and hard threshold blocks that model the decision process of the human observer in the experiment.
Figure 10. Predictions of Model 2 against experimental results. The mean (solid line) and interquartile range of Model 2 (dashed lines, based on 50 simulated trials per data point), showed good agreement with our experimental observations (symbols, with error-bars showing 95% confidence intervals).
Figure 11. Comparison between Models 1 and 2. Plots of vs. element size for Models 1 and 2 show that both are in good agreement with experimental results (Model 1 has a standard error of the regression of and Model 2 has ).
Figure 12. Comparison between in humans and mantids. Mantis increases with the element size, similar to humans, but does not appear to have a lower limit below which it remains constant (in humans, this limit is about 15 arcmin = 0.25 deg). The relationship between mantis and element size is well accounted for by a power law function with an exponent of 0.462 (blue line). Human data from  is plotted for comparison. Morgan interpreted the values he observed as scaling proportionately for element sizes larger than 0.25 deg (i.e., assumes a power law exponent of 1, green line) although his data appears to be accounted for equally well by a power law with an exponent of 0.462 (red line), similar to mantids. It is, therefore, unclear as to whether a true difference in power law exponents exists between humans and mantids.
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).