HYPOTHESIS AND THEORY article

Front. Psychol., 10 May 2013

Sec. Consciousness Research

Volume 4 - 2013 | https://doi.org/10.3389/fpsyg.2013.00221

This article is part of the Research TopicAwareness shaping or shaped by prediction and postdictionView all 15 articles

Prediction, postdiction, and perceptual length contraction: a Bayesian low-speed prior captures the cutaneous rabbit and related illusions

Daniel Goldreich*

Jonathan Tong

Department of Psychology, Neuroscience & Behaviour, McMaster University, Hamilton, ON, Canada

Illusions provide a window into the brain’s perceptual strategies. In certain illusions, an ostensibly task-irrelevant variable influences perception. For example, in touch as in audition and vision, the perceived distance between successive punctate stimuli reflects not only the actual distance but curiously the inter-stimulus time. Stimuli presented at different positions in rapid succession are drawn perceptually toward one another. This effect manifests in several illusions, among them the startling cutaneous rabbit, in which taps delivered to as few as two skin positions appear to hop progressively from one position to the next, landing in the process on intervening areas that were never stimulated. Here we provide an accessible step-by-step exposition of a Bayesian perceptual model that replicates the rabbit and related illusions. The Bayesian observer optimally joins uncertain estimates of spatial location with the expectation that stimuli tend to move slowly. We speculate that this expectation – a Bayesian prior – represents the statistics of naturally occurring stimuli, learned by humans through sensory experience. In its simplest form, the model contains a single free parameter, tau: a time constant for space perception. We show that the Bayesian observer incorporates both pre- and post-dictive inference. Directed spatial attention affects the prediction-postdiction balance, shifting the model’s percept toward the attended location, as observed experimentally in humans. Applying the model to the perception of multi-tap sequences, we show that the low-speed prior fits perception better than an alternative, low-acceleration prior. We discuss the applicability of our model to related tactile, visual, and auditory illusions. To facilitate future model-driven experimental studies, we present a convenient freeware computer program that implements the Bayesian observer; we invite investigators to use this program to create their own testable predictions.

Introduction

Illusions provide investigators a window into the brain’s unconscious perceptual strategies. In a particularly interesting category of illusions, an ostensibly task-irrelevant stimulus feature strongly influences the perception of a target feature. Here we consider one group of such illusions, characterized by the curious influence of time on the tactile perception of space (Figure1).

FIGURE 1

Figure 1. Perceptual length contraction. Perception underestimates the distance between successive taps to the skin. Stimuli on the forearm are illustrated in the upper panels, along with their perception (forearm sketches). Corresponding human data and Bayesian model fits are plotted in the lower panels. In this and subsequent figures, we illustrate stimulus sequences that progress distally on the arm; the illusions occur also for stimuli in the opposite direction.(A)Top: at short ISI (t), the perceived length (l*) between two taps to the forearm is less than the actual length (l).Bottom: perceived length grows linearly with actual length, but with a slope less than 1. Filled circles: human perceptual data fromMarks et al. (1982) for electrocutaneous stimuli delivered att = 0.24 s. Solid line: fit of the Bayesian model. Dashed line:l =l*.(B)Top: a pair of taps delivered to the right forearm at short ISI (t₂) is perceived to have the same spacing as a more closely spaced pair of taps (l₁ <l₂) delivered to the left forearm at longer ISI (t₁ >t₂).Bottom: the spacing ratio,l₂-to-l₁, resulting in perceived equality of spacing on the two arms, as a function of the ISI ratio,t₁-to-t₂. Filled circles: human perceptual data fromLechelt and Borchert (1977). Curve: fit of the Bayesian model. Data points from left to right hadt₁ = 0.2, 0.35, 0.5, 0.65, and 0.8 s, witht₂ = 1.0 s −t₁, andl₁ = 10 cm.(C)Top: 4 taps delivered to two skin sites are perceived as hopping sequentially along the arm, because the short ISI (t) between taps 2 and 3 results in contraction of the perceived distance between them (l* <l).Bottom: the perceived length from taps 2–3 asymptotically approaches the actual length (l = 10 cm, dashed line) as ISI is increased. Filled circles: human perceptual data fromKilgard and Merzenich (1995). Curve: fit of the Bayesian model.

When humans are asked to judge the distance between two brief taps delivered in rapid succession to the skin, they consistently underestimate the true distance. Indeed, the perceived distance between taps shortens systematically as the time between taps is reduced. Thisperceptual length contraction occurs even when the participant is explicitly instructed to attend only to the distance between stimuli, and to ignore the time. The phenomenon is particularly pronounced on the forearm and other body areas that have poor spatial acuity. Several striking illusions result from this puzzling compressive effect of time on space perception (Figures1A–C). For instance, a stimulus sequence consisting of two-taps delivered at one position followed by two taps at another, with a short inter-stimulus interval (ISI) separating the second and third taps, is perceived as four taps hopping progressively along the arm: the second and third taps are perceptually displaced from their true positions, as if attracted toward one another (Figure1C). This phenomenon is known as sensory saltation, or more famously, the cutaneous rabbit illusion (Geldard and Sherrick, 1972;Geldard, 1982). Analogous phenomena occur in vision (Geldard, 1976;Lockhead et al., 1980;Khuu et al., 2011) and audition (Bremer et al., 1977;Shore et al., 1998;Getzmann, 2009).

Why does time influence space perception in this manner? Much research supports the view that perception works out a probabilistic best guess. An optimal probabilistic (i.e., Bayesian) observer interprets the current sensory input, not in isolation, but rather within the context of the structure and statistics of the natural world (Knill and Pouget, 2004;Vilares and Kording, 2011). By exploiting its knowledge of the world, the observer achieves a more accurate perceptual inference. Following the Bayesian model ofGoldreich (2007), we hypothesize that perception interprets successive taps to the skin as arising from a moving object that touches down intermittently, and that perception expects slowly moving objects to occur more often than rapidly moving ones. We speculate that the expectation for slow movement results from a lifetime of experience with tactile stimuli that are primarily stationary (e.g., the pressure of clothing against the skin) or – somewhat less frequently – slowly moving (e.g., grooming, movement of clothing during walking, etc.). Thus, in the observer’s experience, stimuli separated by large distances at short ISI are uncommon. Faced with such a stimulus sequence, and somewhat uncertain as to the true locations of the taps, the brain concludes that the sensory measurements were caused by a stimulus sequence that was more probablea priori: one that moved at a slower speed (i.e., shorter distance) on the skin. Under this view, the influence of time over space perception, far from reflecting a design flaw in our perceptual machinery, is a consequence of optimal probabilistic inference under conditions of sensory uncertainty.

Here, we present and elaborate on the Bayesian observer model introduced byGoldreich (2007). We show that our model is compatible with the view that the rabbit illusion – and perceptual length contraction generally – involves concomitant pre- and postdiction. By prediction, we mean an inference process in which earlier sensory events influence the perception of later ones. By postdiction, we mean an inference process in which later sensory events influence the perception of earlier ones (Eagleman and Sejnowski, 2000). We show interestingly that pre- and postdiction emerge naturally from our model, even though the model does not explicitly represent these processes. We show further that directed spatial attention shifts the Bayesian observer’s percept by modulating the prediction-postdiction balance. Finally, we apply our Bayesian model to the perception of spatiotemporal stimulus patterns that are more complex than those depicted in Figure1.

The Fundamentals of the Bayesian Observer

Stochastic variability in stimulus-evoked neural activity presents one of many challenges to perception. An identical repeated stimulus – such as a tap to a particular location on the skin – will evoke a different neural response on each trial (Sripati et al., 2006). Consequently, a given response could have been caused by a stimulus at any one of many locations. The spatial uncertainty caused by stochastic variability is lessened, but not eliminated, when a stimulus activates a larger number of neurons. On the forearm, where receptor density is relatively low, humans can localize a stimulus to within about ±1 cm of its true location; on the fingertip, where receptor density is much higher, localization improves to about ±1 mm (Weinstein, 1968).

To model stochastic neural variability, we assume that a single tap to the skin evokes an internal positionmeasurement that is randomly sampled from a Gaussian distribution centered at the true tap position, with a standard deviation, σ_s, that depends on the receptor density (the subscripts signifies “spatial”)¹. On repeated trials with an identical tap position, the measurement will vary stochastically, but on average will equal the true position. In the absence of any other perceptual influence, the measurement is the location the observer perceives. Consequently, on average the perception of an isolated single tap to the skin is veridical. However, unlike an isolated single tap, a rapid spatiotemporal tap sequence is not veridically perceived (Figure1). To understand why, we explore a probabilistic model – a Bayesian observer that makes a perceptual best guess.

We begin by considering sequences of two taps, which result in two uncertain spatial measurements (x_1m,x_2m) and a detected time,t, between them². The Bayesian observer (Figure2) attempts to infer the actual tap positions (x₁,x₂) that produced the measurements (x_1m,x_2m). We refer to each possible (x₁,x₂) pair as acandidatetrajectory, and to the measured positions (x_1m,x_2m) as themeasured trajectory. The Bayesian observer considers both thelikelihood and theprior probability of every candidate trajectory. A trajectory’s likelihood is the probability that the trajectory would give rise to the measured trajectory. The plot of trajectory likelihoods – the likelihood function – is a cloud of uncertainty centered on the measured trajectory (Figure2A, top). We analogize the likelihood function to a (typically unconscious)sensation – a precursor to the conscious percept.

FIGURE 2

Figure 2. Bayesian model.(A) The observer’s likelihood function, prior probability density, and posterior probability density in response to taps sensed (i.e., measured by the observer) at positions (x_1m,x_2m) = (3, 7 cm) (open red circles in all plots). Each pixel in the intensity plots represents a candidate trajectory: a possible tap 1 position and tap 2 position pair (x₁,x₂). Lighter color indicates higher probability (each plot is individually auto-scaled to take advantage of the full brightness range). The measured trajectory length isl_m =x_2m −x_1m = 4 cm.Top: the observer’s likelihood function plots the probability of the measured trajectory given each candidate trajectory. The observer understands that a single tap at any location produces a measurement drawn from a Gaussian distribution centered at that location, with standard deviation σ_s; thus, the likelihood function is a two-dimensional Gaussian density centered on the measured trajectory.Middle: the observer expects slow movement to occur more commonly; we model this expectation as a Gaussian distribution over trajectory speed, with mean zero and standard deviation, σ_v. Consequently, the observer expects closely spaced taps, and its prior is maximal along thex₁ =x₂ diagonal.Bottom: the posterior probability of each trajectory is proportional to the product of its likelihood and prior. The mode of the posterior (filled red circle) is the percept.(B) Space-time plots equivalently illustrate the inference process.Top: open red circles show measured tap positions (vertical-axis) and times of occurrence (horizontal-axis). Error bars (±1σ_s) represent the spatial imprecision of the measurements. The slope of the line connecting the taps is the measured trajectory speed:l_m /t = 4 cm/0.15 s = 27 cm/s.Middle: the observer’s low-speed expectation is represented by the line of slope zero and diagonal lines of slopes ±1σ_v = ±10 cm/s. The distance traversed at speed σ_v in timet istσ_v = 1.5 cm. The ascending diagonal line is shallower than the measured velocity: 10 cm/s < 27 cm/s. Equivalently, tσ_v = 1.5 cm <l_m = 4 cm. Thus, the measured trajectory violates the observer’s low-speed expectation.Bottom: the perceived trajectory (filled red circles and red line) is a compromise between the measured trajectory (open circles, reproduced from top panel) and expectation (middle panel). Each tap has migrated perceptually by 1 cm toward the other, resulting in perceptual length contraction:l* = 2cm <l_m = 4 cm. The perceived trajectory speed isl*/t = 2 cm/0.15 s = 13 cm/s. In both panels, σ_s = 1 cm, σ_v = 10 cm/s,t = 0.15 s,x_1m = 3 cm,x_2m = 7 cm.

A trajectory’s prior probability is the frequency with which the observer expects the trajectory to occur; this may be the prevalence of the trajectory in nature, which the observer has learned from experience. The plot of prior probabilities – the prior density – represents the observer’sexpectation regarding trajectory occurrence. Crucially, our Bayesian observer believes that slow trajectories are more common than fast ones. We model this low-speed prior as a Gaussian density over trajectory speed, with mean zero and standard deviation σ_v (the subscriptv signifies “velocity”). Thus, trajectories in which the two taps are spaced closer together (i.e., lower-speed trajectories) have greater prior probability than those in which the taps are spaced farther apart (Figure2A, middle).

Using Bayes’ rule, the observer multiplies each trajectory’s likelihood by its prior probability to obtain its posterior (final) probability. In essence, the observer combinessensation withexpectation to achieveperception. The mode of the posterior distribution – the most probable trajectory – is the observer’s percept (Figure2A, bottom). Because of the low-speed prior, the percept underestimates the distance between rapidly presented stimuli. In the example illustrated, whereas the measured tap positions were (3, 7 cm), the percept was (4, 6 cm). The perceived distance between taps (l* = 2 cm) was thus half the measured distance (l_m = 4 cm) (Figures2A,B).

How, exactly, does the time between taps influence perceptual length contraction? This question is answered in Figure3. Because speed is distance divided by time, the prior probability falls off more sharply with distance when the time between taps is short. While always maximal along thex₁ =x₂ diagonal, the prior widens as ISI increases (Figure3A, left to right). As a consequence, perceptual length contraction is most pronounced at shorter ISIs; as ISI increases, the perceived distance between taps asymptotically approaches the measured distance (Figure3B).

FIGURE 3

Figure 3. Time affects space perception.(A) The columns display the observer’s likelihood function, prior probability density, and posterior probability density on four trials in which the measured trajectory (open red circle in all plots) wasx_1m = 3 cm,x_2m = 7 cm, and the time,t, between taps was (left to right) 0.05, 0.15, 0.25, and 0.35 s. Because the observer has a low-speed expectation, it most strongly expects the taps to fall close together when the time between them is short; thus, the narrowest prior distribution is found in the left column, and the prior distribution widens ast increases. The perceived trajectory (mode of the posterior, filled red circle) is pulled closer to thex₁ =x₂ diagonal when the prior is sharper. Therefore, the observer experiences more pronounced length contraction ast decreases. Conversely, ast increases, length contraction diminishes, and the perceived trajectory asymptotically approaches the measured trajectory (note diminishing distance between filled and open circles in the posterior plots ast increases). For all columns, σ_s = 1 cm, σ_v = 10 cm/s.(B) The perceived first and second tap positions (filled red circles), corresponding to the mode of each of the posterior plots above, are graphed along with the measured tap positions (dashed lines). The perceived distance between taps asymptotically approaches the measured distance ast increases (compare to Figure1C, lower).(C) The amount of perceptual length contraction depends not only ont and σ_v but also on σ_s. Here we simulate a trial att = 0.1 s for an observer whose spatial acuity is worse (σ_s = 2 cm) than the observer in(A). Although its posterior density is broader, this observer has the same percept (mode of the posterior) as the observer in(A) witht = 0.05 s (leftmost column inA). Note that the ratio of σ_s to σ_vt is identical (=2) in the two cases. It is this ratio that determines the amount of perceptual length contraction.

We have explained the influence of time on the Bayesian observer’s perception of space, but what of the influence of space itself on space perception? In Figure4, we find reassuringly thatl* varies linearly withl_m, although length contraction ensures that the slope of the relationship is less than one.

FIGURE 4

Figure 4. Perceived distance grows linearly with measured distance.(A) The columns display the observer’s likelihood function, prior probability density, and posterior probability density on five trials, in which the measured distance was progressively increased from 2 to 6 cm whilet was held constant at 0.1 s. The mode of the posterior (filled red circle) tracks but lags the measured trajectory (open red circle). To facilitate comparison, yellow crosshairs in all posterior plots mark the posterior mode in the leftmost column.(B) The measurements,x_1m andx_2m, are plotted as open circles; the observer’s percept (mode of the posterior), as filled circles.l* grows linearly with, but consistently underestimates,l_m (compare to Figure1A, lower). The measurements (x_1m,x_2m) were, from left to right: (4, 6 cm), (3.5, 6.5 cm), (3, 7 cm), (2.5, 7.5 cm), and (2, 8 cm). In all panels, σ_s = 1 cm, σ_v = 10 cm/s.

The Perceptual Length Contraction Formula

In the Section “The Bayesian model” in Appendix, we show that the Bayesian observer’s posterior density is a two-dimensional Gaussian distribution. The mode of the posterior reveals a relationship betweenl* andl_m:

l^{*} = \frac{l_{m}}{1 + 2 {(\frac{σ_{s}}{σ_{v} t})}^{2}}

(1)

Equation 1 is the perceptual length contraction formula, first reported byGoldreich (2007). Notice that, as we have seen, this formula predicts thatl* asymptotically approachesl_m in the limit thatt approaches infinity (Figures3A,B), that the degree of length contraction is determined by the ratio of σ_s to σ_vt (Figure3C), and that, at fixedt,l* relates linearly to, but underestimates,l_m (Figure4).

Because σ_s and σ_v occur only as a ratio in the length contraction formula, it is convenient to rewrite the formula as:

l * = \frac{l_{m}}{1 + 2 {(\frac{τ}{t})}^{2}}

(2)

where tau (τ), defined as σ_s/σ_v, has units of time, and is the model’s single free parameter³. From Eq. 2 we see that tau is a time constant for space perception. The smaller the value of tau, the more the perceived length increases toward the measured length as inter-stimulus time increases:l* = (1/3)l_m whent =τ, andl* = (2/3)l_m whent = 2τ (Figure5A). Thus, the larger the value of τ, the more susceptible the observer is to perceptual length contraction: for a givent andl_m, an observer with a larger τ will perceive a shorter trajectory (Figures5A,B).

FIGURE 5

Figure 5. Exploring the perceptual length contraction formula.(A) Perceived length,l*, plotted against ISI (t), for a trajectory of measured lengthl_m = 10 cm, at five values of the parameter τ (Eq. 2). Perceived length asymptotically approaches measured length ast increases. Each curve reachesl* = (1/3)l_m (lower dashed line) whent =τ, andl* = (2/3)l_m (upper dashed line) whent = 2τ.(B) Perceived length,l*, plotted against measured length,l_m, for a trajectory oft = 0.1 s, at five values of τ [color code as in(A)]. Perceived length grows linearly with, but underestimates, measured length. Observers with larger τ experience more pronounced length contraction. Dashed diagonal line:l* =l_m.

To develop an intuition for these effects of tau, consider that the parameter can be rewritten:

τ = \frac{σ_{s}}{σ_{v}} = \frac{1 / σ_{v}}{1 / σ_{s}} = \frac{strength of low-speed expectation}{spatial acuity}

(3)

Thus, tau reflects the strength of the observer’s low-speed expectation relative to the observer’s spatial acuity. Tau is large in an observer with poor spatial acuity (large σ_s) and a strong expectation for slow movement (small σ_v). This observer places trust in the low-speed expectation; the observer’s perception is considerably length contracted. Tau is small in an observer with excellent spatial acuity (small σ_s) and little expectation regarding movement speed (large σ_v). This observer places trust in the measurement; the observer’s perception is only modestly length contracted.

The perceptual length contraction formula closely fits human data from a variety of experiments (Figure1; see alsoGoldreich, 2007 for additional data fits). The fit is particularly satisfying given that the formula has just a single free parameter. The best-fitτ-values for the data displayed in Figures1A–C were 0.21, 0.11, and 0.08 s. The largerτ for the Figure1A fit may reflect the use of electrocutaneous stimuli byMarks et al. (1982), the source of the data plotted in Figure1A. Electrical pulses tend to be more difficult to localize (larger σ_s) than mechanical taps (Higashiyama and Hayashi, 1993), which were used to generate the data in Figure1B (Lechelt and Borchert, 1977) and Figure1C (Kilgard and Merzenich, 1995). Measures of point localization suggest that σ_s is on the order of 1 cm in response to light mechanical stimuli on the forearm (Weinstein, 1968;Martikainen and Pertovaara, 2002;Cody et al., 2008); thus, taking τ = 0.1 s as a nominal value for mechanosensory perception on the forearm, we infer that σ_v is on the order of 10 cm/s.

Bayesian Perception is Optimal because It is Beneficially Biased

Before developing our model further, we pause to consider an important conceptual question: we have described the Bayesian observer as achieving an optimal perceptual inference, but we have also shown that the observer consistently underestimates the measured distance between taps. How can an observer be both biased and optimal? This important question applies to any Bayesian observer with a non-uniform prior distribution.

The short answer to the question is that bias is optimal when it accurately reflects the stimulus statistics. In a world in which slow trajectories are more common than fast ones (and, therefore, among trajectories with any given inter-stimulus time,t, short lengths are more common than long ones), an observer is justified in perceiving trajectories as shorter than measured. Paradoxically, then, the Bayesian observer is optimal precisely because it is biased.

To understand this thoroughly, we must appreciate the consequences of both measurement and stimulus variability. In Figures2–5 we artificially specified (x_1m,x_2m). In a laboratory experiment, however, the investigator can control only the stimulus, not the measurements. As explained, we conceive of each measured tap location as sampled from a Gaussian distribution of standard deviation σ_s, centered on the actual tap location. Thus, if the skin is stimulated repeatedly with the identical trajectory, the measurement and consequently the percept will vary stochastically from trial to trial (Figure6).

FIGURE 6

Figure 6. Measurement noise causes stochastic perception.(A) The columns display the observer’s likelihood function, prior probability distribution, and posterior probability distribution on five trials with the identical stimulus trajectory:x₁ = 3 cm,x₂ = 7 cm,t = 0.15 s. Each measured stimulus position was randomly sampled from the true location; thus, the measured trajectory (x_1m,x_2m; open red circle) bounces randomly from trial to trial around the fixed true value (3, 7 cm; red cross). Because the likelihood function is centered on the measurement, it too bounces. Consequently, the observer’s percept (mode of the posterior, filled red circle) varies stochastically from trial to trial.(B) The measured tap positions (open circles) and perceived tap positions (mode of posterior, filled red circles) on each trial, compared to the actual tap positions (dashed lines). On every trial, the perceived trajectory length (l*, distance between filled circles) underestimates the measured length (l_m, distance between open circles); the perceived trajectory length therefore on average underestimates the actual trajectory length (l).

By incorporating measurement variability, the simulation shown in Figure6 is a more realistic representation of a laboratory experiment than are the simulations shown in the earlier Figures. Crucially for our understanding of the paradox of bias and optimality, however, Figure6 would be an unrealistic portrayal of the Bayesian observer’s experience in the real-world. In the real-world, not only the measurements but also the trajectories themselves are drawn from a distribution. In Figure7, we more closely simulate what we envision to be real-world tactile experience. The figure plots the lengths of one million trajectories sampled from a zero-mean velocity distribution (for clarity of illustration, all witht = 0.15 s), from each of which spatial measurements were sampled and processed into a percept.

FIGURE 7

Figure 7. Bayesian perception is optimal because it is biased. On each of 1 million trials, a first tap position (x₁) was drawn from a uniform distribution, and a second tap position (x₂) was drawn from a Gaussian distribution centered on the first tap position, with standard deviationtσ_v = 1.5 cm (i.e., σ_v = 10 cm/s,t = 0.15 s; see Eq. A8 in Appendix). Measured positions,x_1m andx_2m, were then drawn independently from Gaussian distributions of standard deviation σ_s = 1 cm, centered on the corresponding tap positions (x₁ andx₂).(A)Left: scatterplot of measured trajectory length (l_m =x_2m −x_1m) against actual trajectory length (l =x₂ −x₁) for each of the trials (dots); negative lengths indicate trajectories in whichx₂ <x₁. Dashed vertical and horizontal lines:l = 0 andl_m = 0. Diagonal dashed line:l_m =l. Vertical blue line:l = 3 cm. Horizontal red line:l_m = 3 cm.Center: histogram (h) ofl_m values that occurred whenl was between 2.95 and 3.05 cm (i.e.,l_m samples that fell along the blue vertical line in the scatterplot). The histogram is a Gaussian distribution centered atl_m = 3 cm (asterisk).Right: histogram ofl values of trajectories that gave rise tol_m between 2.95 and 3.05 cm (i.e.,l samples that fell along the red horizontal line in the scatterplot). The histogram represents the observer’s posterior density overl. It is a Gaussian distribution centered atl = 1.6 cm, not 3 cm (asterisk).(B) Left, center, and right panels as in(A), but forl* rather thanl_m.Center:l* is a biased estimator.Right: on trials in which the observer perceivedl* = 3 cm, the true trajectory length averaged 3 cm. Because the perceived length is a deterministic function of the measurement, this histogram has the same variance as the posterior density overl. Inset formulas in(A)center and(B)right show the variances of these histograms (See “One-dimensional reductions” in Appendix). These are equal to the mean-squared error between each estimator and the true length.

A comparison of the statistics of the measured length,l_m (Figure7A) with those of the perceived length,l* (Figure7B) reveals that, although the observer’s perception is biased, it is more accurate than the measurement. In fact, the observer’s perception is optimal precisely because it is biased. To understand why, consider that the majority of these real-world trajectories have very short lengths (l close to zero). Because short trajectories are more common, any measured length,l_m, most often originates from a trajectory of shorter true length,l. The Bayesian observer’s percept is biased by the prior to take this crucial knowledge into account; consequently, over the course of many trials, the percept more closely reflects the true stimulus than the measurement does. This is indicated by the smaller vertical scatter of the percept (Figure7B, left) than of the measurement (Figure7A, left) around the diagonal line.

Further inspection of the scatterplot in Figure7A reveals that, for any true trajectory length,l, the measurement,l_m, occurs with equal frequency above and below the diagonal line. Thus, the histogram ofl_m samples is centered onl (Figure7A, center). For this reason, the measured length is termed an “unbiased estimator” of the true length. Despite this lofty denomination, however, it is clear from the same scatterplot that for any magnitudel_m other than 0, the distribution of true lengths has a smaller average magnitude (whenl_m > 0,l tends to lie to the left of the diagonal line; whenl_m < 0,l tends to lie to the right of the diagonal line). Thus,l_m is an inaccurate estimator in the sense that the stimuli that result in a particularl_m are on average offset from thatl_m (Figure7A, right). If an observer were to reportl_m as the estimate of trajectory length, the observer would be found to systematically report trajectories as being longer than they actually are.

Figure7B shows that the statistics of the perceived length,l*, are opposite in character to those of the measured length. For any true trajectory length,l, the perceived length,l*, systematically underestimates the magnitude ofl (Figure7B, left and center). Thus, the perceived length is termed a “biased estimator.” This bias is beneficial, however: because of it, at anyl*, the distribution of true lengths is centered on a mean ofl* (the values ofl are symmetrically distributed around the diagonal line in the scatterplot). Thus,l* is an accurate estimator in the sense that the stimuli that result in a particularl* indeed on average have length equal to thatl* (Figure7B, right). The observer’s report ofl* can be trusted as accurately reflecting, on average, the true trajectory length. Importantly, the variance ofl givenl* (Figure7B, right) is smaller than the variance ofl_m givenl (Figure7A, center). This again reveals that the percept is more accurate than the measurement.

Selective Spatial Attention Shifts the Perceived Trajectory

Up to this point, we have assumed that the observer’s spatial uncertainty, σ_s, is uniform within the tested area (σ_s will, of course, differ between body areas, such as forearm and finger). However, spatial attention is associated with cortical receptive field recruitment and sharpening within the attended area (Anton-Erxleben and Carrasco, 2013). Thus, if an observer were to focus attention preferentially on one location, we might expect σ_s to decrease there while plausibly increasing at unattended locations. Indeed, on the arm, the spatial error of localization decreases by as much as 30% when attention is directed to the stimulated skin region (Moore et al., 1999;O’Boyle et al., 2001).

If spatial acuity is modulated by selective attention, how might length contraction percepts be affected? In a cutaneous rabbit experiment,Kilgard and Merzenich (1995) found that when participants were not asked to focus their attention to any particular area of the arm, the midpoints of the perceived and actual trajectories tended to coincide (Figure8A, left). In contrast, when participants were instructed to direct their attention either distally or proximally, the midpoint of the perceived trajectory shifted toward the attended location (Figure8A, center, right). This occurred because the tap within the attended skin area migrated less perceptually than did the tap within the unattended area, an effect confirmed byFlach and Haggard (2006).

FIGURE 8

Figure 8. Modeling the effects of spatial attention.(A) Depiction of a cutaneous rabbit illusion experiment reported byKilgard and Merzenich (1995). Participants either received no specific instruction or were instructed to direct their attention (yellow highlight) toward the proximal or distal forearm. The investigators found that in the directed attention conditions, the perceived positions of tap 2 (green) and tap 3 (blue) were shifted toward the attended location (forearm sketches).(B) In the Bayesian observer, a reduction in σ_s at the attended relative to the unattended location reproduces the perceptual shift reported byKilgard and Merzenich (1995).Left panel: the Bayesian observer’s likelihood function, prior and posterior density when σ_s does not vary with location, simulating the no-instruction condition in(A). In this case, the perceived and measured trajectory midpoints coincide.Center two panels: effect of σ_sp < σ_sd, where the subscriptsp andd refer to the proximal and distal arm areas. The greater the reduction of σ_sp relative to σ_sd, the more the perceived trajectory migrates proximally toward the tap 2 measurement.Right two panels: effect of σ_sd < σ_sp. The greater the reduction of σ_sd relative to σ_sp, the more the perceived trajectory migrates distally toward the tap 3 measurement. For all plots in(B), the measurements (x_2m,x_3m) were (3, 7 cm), the time between taps 2 and 3 was 0.06 s, and σ_v was 10 cm/s.(C) The perceived (mode of posterior) tap 2 and 3 positions (green and blue circles) for each of the five conditions in(B) directly above, compared to the measured tap positions (dashed lines).

The Bayesian observer replicates this attention effect: when σ_s decreases in one skin area relative to the other, the perceived trajectory midpoint shifts toward the attended location (Figures8B,C). The relatively precise measurement of the “attended tap” impedes its perceptual migration, while the relatively imprecise measurement of the “unattended tap” facilitates its perceptual migration. In this situation, length contraction is accomplished primarily by the perceptual displacement of the unattended tap.

In the Section “Generalization to inhomogeneous spatial uncertainty” in Appendix, we derive a generalization of the length contraction formula that incorporates separate σ_s1 and σ_s2 values representing spatial uncertainty around the two tap locations. In the general equation, the single spatial uncertainty, σ_s, of Eq. 1 is replaced by the root-mean-square uncertainty at the two locations, σ_rms:

l^{*} = \frac{l_{m}}{1 + 2 {(\frac{σ_{s (rms)}}{σ_{v} t})}^{2}} = \frac{l_{m}}{1 + \frac{σ_{s 1}^{2} + σ_{s 2}^{2}}{{(σ_{v} t)}^{2}}}

(4)

We show further that the shift, Δ_midpt, in the perceived trajectory midpoint away from the measured trajectory midpoint is:

Δ_{midpt} = \frac{l_{m}}{2} (\frac{σ_{s 1}^{2} - σ_{s 2}^{2}}{{(σ_{v} t)}^{2} + σ_{s 1}^{2} + σ_{s 2}^{2}})

(5)

The Predictive-Postdictive Formulation

The rabbit illusion is often described as providing compelling evidence for perceptual postdiction, a process whereby the perception of an earlier event is modified by the occurrence of a later one. Postdiction is indeed an attractive explanation for the perceptual migration of tap 2 toward the location of tap 3 in the rabbit illusion (Figure1C). As shown byKilgard and Merzenich (1995), tap 3 also migrates perceptually toward the location of tap 2 (Figure1C). Therefore, prediction apparently is also at play: the perception of a later event (tap 3) depends upon an earlier one (tap 2).

In light of these considerations, it may seem surprising that our Bayesian observer replicates length contraction illusions without explicitly representing either pre- or postdictive inference. How is this possible? The answer is that pre- and postdiction are implicitly embedded in the model via the action of the low-speed prior. The low-speed prior transforms the observer’s likelihood function into a posterior density by pulling the observer’s perception of each tap position toward the measured position of the other (Figure2).

We can reveal the pre- and postdiction hidden in the Bayesian observer by decomposing the model’s two-dimensional (x₁,x₂) calculations (Figure9A) into a series of one-dimensional inferences regarding each tap’s position individually (Figure9B). Using its low-speed expectation, the observer can from the first tap’s likelihood function predict a probability distribution over the position of the subsequent, second, tap, and from the second tap’s likelihood function postdict a probability distribution over the position of the previous, first, tap (arrows in Figure9B). We call these two distributions thepredicted prior andpostdicted prior densities⁴.

FIGURE 9

Figure 9. Prediction-postdiction formulation.(A) The observer’s two-dimensional joint (x₁,x₂) likelihood function, prior and posterior densities. The measured trajectory wasx_1m = 3 cm,x_2m = 7 cm, witht = 0.15 s. The observer settings were σ_s = 1 cm, σ_v = 10 cm/s.(B) The inference process in(A) reformulated as a series of one-dimensional inferences regardingx₁ andx₂ individually.Top left: the tap 1 likelihood function (red),p(x_1m |x₁), is centered onx_1m. Because of its low-speed expectation, the observer predicts (red arrow) that the most probable position for a future tap 2 will also be 3 cm.Middle right: the observer’spredicted prior over tap 2 (light red) represents its belief concerning the position of tap 2, projected 150 ms forward in time from the occurrence of tap 1.Top right: the observer’s tap 2 likelihood function (blue),p(x_2m |x₂), is centered onx_2m. Because of its low-speed expectation, the observer postdicts (blue arrow) that the most probable position for the preceding tap 1 was also 7 cm.Middle left: the observer’spostdicted prior over tap 1 (light blue) represents its belief concerning the position of tap 1, projected 150 ms backward in time from the occurrence of tap 2.Left column: using Bayes’ theorem, the observer multiplies the tap 1 likelihood function (red) by the tap 1 postdicted prior (light blue) to obtain the tap 1 posterior (purple).Right column: similarly, the observer multiplies the tap 2 likelihood function (blue) by the tap 2 predicted prior (light red) to obtain the tap 2 posterior (purple).(C) Individual tap likelihoods, priors, and posteriors graphed with the same color scheme as in(B), for three trajectories of progressively increasing ISI. Att = 0.05 s, pre- and postdiction both result in relatively sharp priors that exert a strong influence over the percept (mode of the posterior). Ast is increased, the pre- and postdicted priors become lower and broader: pre- and postdiction become increasingly uncertain with the passage of time. The priors thus exert diminishing influence, and the percept approaches the measurement (compare to Figure3A). For all panels in(C), σ_s = 1 cm, σ_v = 10 cm/s.(D) Effect of directed spatial attention, as in Figure8.Top: a reduction in σ_s1 sharpens the tap 1 likelihood function, increasing the strength of prediction (note sharp predicted prior over tap 2), while an increase in σ_s2 broadens the tap 2 likelihood function, decreasing the strength of postdiction (note broad postdicted prior over tap 1).Middle: when σ_s1 = σ_s2, pre- and postdiction have equal strength.Bottom: reduction in σ_s2 relative to σ_s1 results in effects opposite those seen in the top panel. For all panels in(D),t = 0.06 s, σ_v = 10 cm/s.

Next, the observer simply multiplies each tap’s likelihood function by that tap’s prior to obtain the posterior density over the tap’s position. We show in the Sections “One-dimensional reductions” and “The prediction-postdiction formulation” in Appendix that the posteriors so obtained are identical to those that would result from extracting one-dimensional distributions from the joint (x₁,x₂) posterior: if the joint posterior (Figure9A, bottom) were marginalized (i.e., integrated) vertically, it would yield the posterior overx₁ shown in Figure9B, bottom left; if integrated horizontally, it would yield the posterior overx₂ shown in Figure9B, bottom right.

In the Section “The prediction-postdiction formulation” in Appendix, we show that the predicted and postdicted priors are Gaussian densities, and that their means and variances are:

\begin{matrix} μ_{pre} & = x_{1 m} & μ_{post} & = x_{2 m} \\ σ_{pre}^{2} & = σ_{s 1}^{2} + {(σ_{v} t)}^{2} & σ_{post}^{2} & = σ_{s 2}^{2} + {(σ_{v} t)}^{2} \end{matrix}

(6)

Equations 6 show that the prior density over each tap’s position is centered on the measurement of the other tap, reflecting the observer’s low-speed expectation (the most probable speed being zero). The variance of each prior density reflects the observer’s uncertainty regarding the other tap’s measurement (σ_s1 or σ_s2) and the observer’s prior uncertainty regarding trajectory speed (σ_v), which translates into an increasing uncertainty regarding the distance traversed as the elapsed time,t, increases (σ_vt). Thus, perceptual length contraction diminishes with increasingt (Figure9C), as shown previously (Figures3 and5A).

Figure9D shows that the predictive-postdictive formulation accurately reproduces the effects of directed spatial attention, previously explored in Figure8. When attention is directed around the location of the first tap (σ_s1 < σ_s2), the predicted prior is sharper than the postdicted prior (σ²_pre < σ²_post). Consequently, prediction exerts a dominant influence, perceptually displacing the second tap asymmetrically toward the first (Figure9D, top). When attention is directed around the location of the second tap (σ_s2 < σ_s1), the postdicted prior is sharper (σ²_post < σ²_pre). In this case, postdiction dominates, perceptually displacing the first tap asymmetrically toward the second (Figure9D, bottom).

The Perception of Multi-Tap Sequences

Up to this point, we have modeled the perception of two-tap trajectories⁵. How might a Bayesian observer handle multi-tap sequences, delivered conceivably to any number of skin sites? An observer could apply a low-speed prior independently to the movement between each tap and the next one. Alternatively, an observer might apply a low-speed prior to the first tap pair of the sequence, but thereafter incorporate an expectation that the velocity of each pair be similar to that of the preceding pair: a low-acceleration prior (See “Multi-tap perception” in Appendix).

Here, we test each of these Bayesian observers with multi-tap sequences that produce illusions in humans. We consider two well-known illusions. The first is the tau effect, so-named byHelson (1930) and subsequently described in elegant detail byHelson and King (1931). The second is a multi-tap rabbit, characterized in a delightful paper byGeldard (1982). In Figures10 and11, we show that the observer with a low-speed prior produces good fits to the human perceptual data; in Figure12, we show that the observer with a low-acceleration prior does not.

FIGURE 10

Figure 10. The tau effect.(A) Three taps to the arm, at positionsx₁ = 0 cm,x₂ = 3 cm, andx₃ (variable), define two spatial intervals,l₁ = 3 cm andl₂ (variable), and two temporal intervals,t₁ = 0.5 s andt₂ (variable). Becauset₂ <t₁, at somel₂ >l₁ the two intervals will be perceived to be of equal length (l₂* =l₁*).(B) At each of fivet₂ settings (identified at right of plots),Helson and King (1931) progressively increasedl₂ by shiftingx₃ along the arm in 0.5-cm increments. On each trial, the participant reported whether the second spatial interval was perceived to be shorter than, equal to, or longer than the first interval. To accurately estimate each participant’s point of subjective equality (PSE), we transformed these data into a two-alternative forced-choice format by distributing the participant’s “equal” responses evenly to the “shorter” and “longer” response categories. We then fit each participant’s transformed data (proportion “l₂ is longer” responses) at eacht₂ setting with a Weibull psychometric function (blue curves). Each psychometric function provides a PSE (vertical line): thex₃ at which the psychometric function intersected 0.5 (horizontal line), indicating thatl₂* =l₁*. The PSE shifted progressively to the left ast₂ was increased (note: whenx₃ = 6 cm,l₂ actually does equall₁). The transformed data shown are from one participant (“Observer C”) inHelson and King (1931).(C) Trajectories for whichl₂* =l₁*. Blue points: meanx₃ that resulted inl₂* =l₁* among the six participants tested byHelson and King (1931), at each of the fivet₂ settings. Blue lines: ±1 SD. Red points: best-fit performance of the Bayesian low-speed observer (τ = 0.10 s).

FIGURE 11

Figure 11. The 15-tap rabbit illusion.(A)Geldard (1982) delivered five taps at each of three locations along the arm. When ISI between successive taps was 0.05 s, participants reported perceiving a linear spatial progression of taps 1 through 10 (forearm sketch).(B) The same spatial sequence shown in(A), at three different ISIs, resulted in distinct percepts (Geldard, 1982).Left: at 0.3 s ISI, perception was veridical.Center: at 0.05 s ISI, perception was as shown in(A).Right: at 0.02 s ISI, the taps were perceived to begin at a position between 2 and 3 cm along the arm, and to advance in a non-linear spatial progression. Open circles: true tap positions; blue points: human perceptual report.(C) The Bayesian low-speed observer’s perception with a standard setting of τ = 0.10 s (e.g., σ_s = 1 cm, σ_v = 10 cm/s) shows much similarity to participants’ subjective reports. Open circles: true tap positions; red points: Bayesian observer’s perception (mode of the posterior). Dashed slanted lines have slope 10 cm/s (i.e., 1σ_v). Note that the two rapid jumps in the true trajectory (from tap 5 to tap 6, and from tap 10 to tap 11) occur at a speed much greater than σ_v when the ISI is 0.05 s (center) or 0.02 s (right); thus, perceptual length contraction occurs in these cases. In contrast, at an ISI of 0.3 s (left), the trajectory does not strongly violate the observer’s low-speed expectation; thus, perception is nearly veridical.(D) The Bayesian low-speed observer’s perception can be made even closer to human reports if the value of σ_s varies along the arm. The observer’s percept at each ISI is shown for σ_s = 1, 2, and 0.5 cm around the proximal, middle, and distal arm regions, respectively. Line segments at right have length equal to 1σ_s at each location. The value of σ_v was fixed at 10 cm/s.

FIGURE 12

Figure 12. Comparison between the low-speed-prior and low-acceleration-prior observers.(A) The tau effect. Red points: low-speed-prior observer’s performance, reproduced from Figure10C, and extended to 1 s on thex-axis. Purple points: low-acceleration-prior observer’s performance.(B) The 15-tap rabbit. Red points: low-speed-prior observer’s performance, reproduced from Figure11B. Purple points: low-acceleration-prior observer’s performance. For both observers in(A) and(B), τ was set to 0.10 s (i.e., σ_s = 1 cm, σ_v = 10 cm/s).

In the tau effect experiment, taps at three skin positions define two spatial and two temporal intervals (Figure10).Helson and King (1931) reported that, whent₂ =t₁ andl₂ =l₁, the participants perceived the two lengths as equal: $l_{2}^{*} = l_{1}^{*}$ . Ast₂ was progressively reduced, however, tap 3 had to be located progressively farther down the arm (i.e.,l₂ had to be progressively increased) in order to make $l_{2}^{*}$ equal $l_{1}^{*}$ (Figures10B,C). The best-fit of our low-speed-prior observer to the average of the human data occurred at τ = 0.10 s. The Bayesian observer closely replicated the space-time curve characterizing human perception (Figure10C).

In the 15-tap rabbit experiment, five taps are delivered consecutively at each of three positions along the arm (Figure11).Geldard (1982) found that when the time between consecutive taps was 0.05 s, participants perceived the first 10 taps in the sequence as hopping at an approximately uniform rate up the arm, each tap displaced by a constant spatial increment from the preceding one (Figures11A,B, center). At an ISI of 0.3 s, perception was reportedly veridical (Figure11B, left). At an ISI of 0.02 s, the perceived sequence began partway up the arm and traced a non-linear, somewhat sigmoidal path (Figure11B, right).

The low-speed-prior observer’s perception with τ = 0.10 s agrees qualitatively with the perception of human participants (Figure11C). To understand why, first note that, at an ISI of 0.05 s (Figure11C, center) or 0.02 s (Figure11C, right), the rapid jumps in the stimulus sequence are in clear violation of the observer’s low-speed expectation (see diagonal dotted lines with slope σ_v). Consequently, perceptual length contraction occurs for those tap pairs: the perceived distance between taps 5 and 6, and between taps 10 and 11, is considerably smaller than the actual distance. Now, what causes the progressive perceptual displacement of the many taps that are, in reality, at the same position? Interestingly, each jump in the actual stimulus sequence results in a chain reaction that propagates, with diminishing strength, to more distant taps. The rapid jump from tap 5 to tap 6 induces perceptual length contraction that pulls tap 5 considerably upward in the plot (and tap 6 downward). This places perceived distance between taps 4 and 5, which given the short ISI is sufficient to violate the observer’s low-speed expectation as applied to that tap pair. Consequently, taps 4 and 5 are perceptually attracted, resulting in some upward perceptual displacement of tap 4, placing perceptual distance between it and tap 3, and so on.

How would perception of the 15-tap sequence change if the observer were to direct its spatial attention unequally along the arm? To explore this question, in Figure11D we have plotted the low-speed-prior observer’s perception under conditions of “standard” attention to the proximal arm (σ_s = 1 cm), directed attention to the distal arm (σ_s = 0.5 cm), and relative inattention (σ_s = 2 cm) to the area in-between. Comparison of Figures11D,C indicates that adjustment to spatial attention affects perception in ways that depend upon ISI. For the particular values of σ_s used in this example, perception of the 0.3 s ISI sequence remains nearly veridical (Figure11D, left), whereas perception of the 0.05 s ISI sequence to some extent (center), and of the 0.02 s ISI sequence to a greater extent (right), are shifted upwards in the plots. The result is that the observer’s perception even more closely resembles that of the human participants reported byGeldard (1982).

Unlike the low-speed-prior observer, the low-acceleration-prior observer distinctly fails to match human perception (Figure12). In the tau effect scenario, a discordant feature of the low-acceleration-prior observer is that, whent₂ =t₁ andl₂ =l₁, the observer fails to perceive the lengths as equal, instead perceivingl₂* >l₁*. This perceptual asymmetry occurs because only the first segment of the trajectory is subject to a low-speed prior. Thus, whent₂ =t₁,l₂ must be made shorter thanl₁ in order to be perceived as equal. Consequently, in our simulation ofHelson and King (1931) using the low-acceleration-prior-observer,x₃ fails to converge to 6 cm as the tap 3 time approaches 1 s (Figure12A, purple points). The performance of the low-speed-prior observer, in contrast, does converge as expected (red points).

In the 15-tap rabbit experiment, at 0.05 s ISI and more markedly at 0.02 s ISI, the low-acceleration-prior observer perceives the trajectory to start below the actual tap 1 location and to end above the actual tap 15 location: the perceived trajectory is longer than the actual trajectory (Figure12B, purple points). This is incompatible with human perceptual report, and opposite to the perception of the low-speed-prior observer (red points). The perceptual undershoot and overshoot occur because the rapid jumps in the actual stimulus sequence extend perceptually in both directions at nearly constant velocity, in keeping with the observer’s low-acceleration expectation.

Discussion

Perceptual Length Contraction as Bayesian Inference

Length contraction illusions have long fascinated and puzzled investigators. The tactile tau effect was first reported almost 100 years ago (Gelb, 1914). It was later named and investigated in detail in the early 1930s (Helson, 1930;Helson and King, 1931). The best-known length contraction illusion, the cutaneous rabbit, was discovered serendipitously some 40 years later, when Geldard and colleagues, intending to study the tau effect, mistakenly produced a stimulus pattern similar to the rapid sequences shown in Figure11B (Geldard and Sherrick, 1972;Geldard, 1982). The resulting perception of taps hopping up the arm led a surprised observer to exclaim “who let the rabbit loose?” (Geldard, 1982). Over the years, investigators have proposed creative explanations – geometrical, mathematical, and neural – for these and related illusions (Jones and Huang, 1982;Brigner, 1988;Wiemer et al., 2000;Grush, 2005;Flach and Haggard, 2006).

The Bayesian observer model expounded here provides a concise and coherent explanation for the tau effect, the cutaneous rabbit, and related spatiotemporal illusions. Elapsed time influences the perception of traversed space because the observer expects objects to move slowly. In its simplest form, the model contains a single free parameter, tau: a time constant for space perception (Eqs 2 and 3). While much research remains to be done, we are encouraged by the close fit of the model to human perceptual data. Because a single model replicates the tau effect (Figure10), the rabbit (Figures1C and11), and other spatiotemporal illusions (Figures1A,B; see alsoGoldreich, 2007), we suggest that these illusions are manifestations of a single perceptual assumption: a low-speed prior. Our confidence in this suggestion is strengthened by the finding that a single value of the tau parameter (∼0.1 s) provides good fits to perception on the forearm as measured in experiments using different paradigms and carried out by multiple laboratories.

A central feature of Bayesian perceptual models is that they consider multiple hypotheses – in our case, candidate trajectories. The idea that the brain perceives by evaluating candidates is consistent with the “multiple drafts” theory ofDennett and Kinsbourne (1992). These authors propose that, confronted with stimuli such as those depicted in Figure11, the brain favors a distributed sequence of taps as the most “parsimonious” interpretation. This suggestion is compatible with our model if one equates parsimony with posterior probability. However,Dennett and Kinsbourne (1992) do not explain on what grounds an observer judges a particular interpretation to be the most parsimonious, nor do they explain why the percept changes as a function of ISI.

Bayesian perceptual models make precise, quantitative predictions regarding the relationships among perceptual variables (e.g., Eq. 1). These relationships spring from Bayes’ theorem: the product of a hypothesis’ likelihood and prior probability is proportional to its posterior probability. We liken the prior distribution to the observer’s expectation derived from experience, and the likelihood function to the sensation evoked by the stimulus (Figure2). In our view, then, the Bayesian perceptual framework beautifully formalizes Helmholtz’s suggestion that “previous experiences act in conjunction with present sensations to produce a perceptual image” (Helmholtz, 1925).

Bayesian observers interpret sensory data in light of an internal model – a conception of the structure and statistics of the world. Bayesian perception is optimal when the observer’s internal model accurately represents the world – that is, when the observer’s prior distribution matches the stimulus distribution, and the observer’s likelihood function accurately reflects the process by which stimuli map to measurements (Figure7). Unfortunately, the natural statistics of tactile stimuli have not been sufficiently characterized to constrain a prior distribution, nor is our knowledge of tactile sensorineural responses sufficient to specify the precise shape of a likelihood function. Accordingly, we fit a Gaussian prior and Gaussian likelihood to the human behavioral data. Subtle discrepancies between the human data and the model’s performance could result from our Gaussian assumptions. Future research is needed to determine the precise shapes of the priors and likelihoods used by individual participants. In any event, we speculate that a low-speed prior reflects the natural statistics of tactile stimuli, learned by humans through experience. If so, illusions such as the cutaneous rabbit may reveal the operation of an optimal observer who brings an expectation forged by real-world experience (the low-speed prior) into an artificial setting (the laboratory).

The Wide Applicability of the Low-Speed-Prior Observer

Our Bayesian observer model may explain a variety of perceptual phenomena beyond the tactile illusions we have considered. One such phenomenon is the out-of-body rabbit illusion. In a clever experiment,Miyazaki et al. (2010) showed that humans perceived taps as hopping progressively along an aluminum bar resting across the index fingers of the hands, when in actuality the taps were delivered only to the points on the bar directly above each finger. To apply the model to this scenario, it is necessary only to know the observer’s likelihood function evoked by a tap to the bar:p(measurement | tap location along bar). An interesting twist here is that both hands might detect any single tap to the bar. This does not preclude the construction of a likelihood function; it simply requires consideration of the sensory input to both hands. For instance, a more intense vibration felt with the right hand would result in a likelihood function whose peak lies to the right of the bar’s center. Once the single tap likelihood functions are determined empirically, it would be straightforward to fit the model to the behavioral data with a low-speed prior. Of interest would be to compare the value of σ_v so obtained to the value (∼10 cm/s) that fits the perception of trajectories delivered directly to the skin.

Our model provides insight into crossmodal interactions in length contraction illusions (Kawabe et al., 2008;Asai and Kanayama, 2012). In a 2-location, 3-tap rabbit paradigm,Asai and Kanayama (2012) demonstrated that the cutaneous rabbit was more consistently perceived when a visual flash occurred concurrently with, and at the typical illusory location of, the second tap. The model readily accommodates this cue-combination scenario. As shown in Figure6, stochastic variability in the measurement causes trial-to-trial variability in the perceived location of either tap. Provided the Bayesian observer assumes that the concurrent visual and tactile measurements resulted independently from the same event, the observer’s likelihood function over that event’s location will be the product of the visual and tactile likelihoods. The visual measurement will therefore sharpen and shift the combined likelihood function toward the flash location, increasing the frequency with which the observer perceives the tactile stimulus to fall at that location. To test the model, one would first measure participants’ spatial uncertainty (σ_s) in response to taps and flashes delivered in isolation. The model could then be used to make testable predictions regarding the perceptual influence of the flash.

Finally, our model may account for saltation illusions in both vision (Geldard, 1976;Lockhead et al., 1980;Khuu et al., 2011) and audition (Bremer et al., 1977;Shore et al., 1998;Getzmann, 2009). Provided the brain expects visual and auditory stimuli to move slowly, the model predicts pronounced length contraction when stimulus sequences traverse areas of poor spatial acuity (high σ_s). In vision, this prediction has already been confirmed: the visual rabbit illusion occurs in response to peripheral but not central stimuli (Geldard, 1976). Furthermore, a low-speed prior has been implicated in visual motion perception (Weiss et al., 2002;Stocker and Simoncelli, 2006). Future experimental studies will assess the quantitative fit of our model to visual and auditory saltation illusions.

Despite its apparently wide applicability, we do not suggest that a low-speed prior alone can account for a majority of motion illusions. Interestingly, several visual motion phenomena (Nijhawan, 2002;Hubbard, 2005) involve endpoint overestimation similar to that caused by the low-acceleration prior that did not match the tactile data considered here (Figure12B). Research is needed to clarify the conditions under which perception incorporates a low-acceleration prior.

The Percept as a Combined Pre- and Post-Dictive Inference

Our Bayesian observer’s percept can be viewed as resulting from concomitant pre- and post-dictive inference. For instance, in two-tap trajectories, the first tap predicts the location of the second, while the second postdicts the location of the first (Figure9). We suspect that Bayesian pre- and postdiction will be found to act together in many perceptual scenarios, whether or not these scenarios incorporate a low-speed prior. Indeed, it has already been reported that the two processes collaborate in the flash-lag effect (Rao et al., 2001;Soga et al., 2009), an illusion in which a brief visual flash placed alongside a moving object is perceived to lag behind the object.

By hypothesizing a link between spatial attention and σ_s, as suggested by point localization experiments (Moore et al., 1999;O’Boyle et al., 2001), we have shown how attention can shape the relative influence of pre- and postdiction on the percept (Figure9D). When attention is directed around the location of the first tap (σ_s1 < σ_s2), prediction dominates, and the second tap is perceived as asymmetrically displaced toward the first. When attention is directed around the location of the second tap (σ_s2 < σ_s1), postdiction dominates, and the first tap is perceived as asymmetrically displaced toward the second. Under conditions of imbalanced spatial attention, the trajectory midpoint is therefore perceived as shifted toward the attended location, as specified by Eq. (5). As the spatial attention balance is adjusted from one extreme to another, the model smoothly transitions between a percept influenced predominantly by prediction to one influenced predominantly by postdiction.

Researchers have often referred to the rabbit illusion as a post-dictive phenomenon, without mentioning the involvement of prediction (Bays et al., 2006;Blankenburg et al., 2006;van Wassenhove, 2009;Miyazaki et al., 2010;Asai and Kanayama, 2012). Indeed, initial work on the rabbit described only the perceptual displacement of the earlier tap(s) toward the later one(s) (Geldard and Sherrick, 1972), consistent with an exclusively postdictive process. However, it is clear from modern studies of the rabbit that both earlier and later taps undergo perceptual displacement – whether by equal distances or not (Kilgard and Merzenich, 1995;Flach and Haggard, 2006;Trojan et al., 2010). This supports our conclusion that the illusion involves concomitant predictive and postdictive inference.

Why did initial rabbit illusion investigations describe only the displacement of earlier taps toward later ones? In his three-tap “reduced rabbit” paradigm,Geldard (1982) stimulated with a “locator” (tap 1) followed at large ISI by an “attractee” (tap 2) at the same position, which he reported as perceptually displaced toward the subsequent “attractant” (tap 3) delivered at a different location. The participants’ report that tap 2 was perceptually displaced toward tap 3, but not vice versa, may have owed to the absence of a second locator tap placed at the position of tap 3. Without a locator tap for spatial comparison, participants may have been unaware that tap 3 was perceptually displaced. This hypothesis was considered and discarded byGeldard (1982) upon preliminary investigation, butKilgard and Merzenich (1995), using a 4-tap paradigm that included a second locator tap, did find symmetric perceptual displacement of taps 2 and 3 (Figure1C).

Alternatively, as demonstrated byKilgard and Merzenich (1995) and modeled here, asymmetric rabbit percepts could reflect an imbalance in spatial attention (Figures8 and9D; Eq. 5). An interesting possibility is that – particularly during multi-tap sequences – participants have time to redistribute their spatial attention on the fly. When investigators randomize the direction of movement (up or down along the arm), the participants cannot know where to expect the first tap, so they presumably distribute their spatial attention equally. After the first tap has occurred, however, experienced participants will know where the trajectory is heading, and might direct their attention fully toward the upcoming final location. This would cause a decrease in σ_s at the final location, consequently shifting the percept toward that point (e.g., Figure11D).

Speculations Regarding Neural Implementation

We have described two computational approaches by which our Bayesian observer could obtain its percept: either multi-dimensional inference (e.g., the two-dimensional inference shown in Figure9A) or equivalent one-dimensional prediction-postdiction (Figure9B). Which, if either, approach might the brain implement? The two approaches yield the same percept, but they scale very differently in difficulty as the number of taps increases. In the case of a sequence ofn taps, the joint likelihood function, prior, and posterior would each requiren dimensions. The neural representation of such multi-dimensional distributions would appear to pose considerable challenges. More plausibly, the brain could undertake one-dimensional predictive-postdictive inference recursively.

It is tempting to reinterpret the graphs in Figure9 as plots of activity (e.g., spike rates) of a series of cortical neurons that represent the corresponding skin positions (x-axes). Under this interpretation, the predicted prior is a mound of cortical neural activity evoked by tap 1 that decays and broadens over time (Figure9C). When the second tap initiates a second mound of cortical activity (the tap 2 likelihood function), the two mounds interact (e.g., through summation), resulting in a tap 2 percept that is shifted toward the tap 1 location. For trajectories with greater ISI, the tap 1 mound would have more time to decay, and would thus exert less influence over the tap 2 percept. This idea is similar to a model proposed byFlach and Haggard (2006). The idea is attractively simple; nevertheless, it seems able to account satisfactorily only for prediction, not postdiction. A more complex network model was proposed byWiemer et al. (2000), but that model produces perceptual length dilation at large ISIs, a result contradicted by behavioral data.

Computationally, the perception of multi-tap sequences can be achieved with recursive predictive-postdictive Bayesian inference. The Kalman filter is an algorithm for recursive predictive inference (Haykin, 2001), for which plausible neural implementation schemes have been proposed (Deneve et al., 2007;Beck et al., 2011). Kalman smoothing combines the Kalman filter with recursive postdictive inference (Haykin, 2001). The percepts obtained by our Bayesian observer are identical to those that would result from an appropriately configured Kalman smoother (see “Multi-tap perception” in Appendix). Smoothing has already been implicated in the flash-lag effect (Rao et al., 2001) and proposed to contribute to a variety of motion illusions, including the rabbit (Grush, 2005), though to our knowledge a specific neural implementation for the Kalman smoother has not yet been proposed.

Testable Predictions

Our Bayesian observer model makes many testable predictions; we encourage other investigators to pursue these experimentally.

The model predicts that perceptual length contraction will be more pronounced on body areas with worse spatial acuity or – on a given body area – in response to stimuli that are harder to localize (e.g., weaker taps to the skin). Because σ_s can be independently manipulated and measured using single taps, the length contraction formula (Eq. 1) can be used to make specific testable predictions regarding the effect of body area or stimulus strength on the perception of two-tap trajectories.

Under conditions of imbalanced spatial attention, the model predicts that perceptual length contraction will occur in accordance with Eq. 4 and that the midpoint of the perceived two-tap trajectory will vary in accordance with Eq. 5. These predictions could be tested experimentally by independently measuring an observer’s σ_s1 and σ_s2 under different degrees of directed spatial attention, then measuring the trajectory percepts under the same conditions.

As explained above, the model can be used to make testable predictions regarding a variety of perceptual length contraction phenomena beyond those that we have modeled in this paper. These include the out-of-body rabbit, crossmodal influences on the rabbit percept, and the visual and auditory rabbit illusions.

We encourage readers to generate their own predictions by using our freely downloadable computer program, Leaping Lagomorphs (http://psych.mcmaster.ca/goldreich-lab/LL/Leaping_Lagomorphs.html). This convenient program implements the Bayesian observer, with either balanced or imbalanced spatial attention, and outputs its perception in response to any stimulus sequence that the user cares to enter.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

This research was supported by an Individual Discovery Grant from the Natural Sciences and Engineering Research Council of Canada (NSERC). The authors thank Andy Bhattacharjee, Luxi Li, Ryan Peters, Mike Wong, and Deda Gillespie for their insightful comments.

Footnotes

^Neuroscientists may find it useful to conceive of the measurement as the location of the peak of evoked activity in the underlying receptor population (or its cortical equivalent), or more precisely as the maximal likelihood estimate of stimulus location, based on the neural response.
^We assume here that the observer veridically perceives the time between taps, such that temporal uncertainty is zero.Goldreich (2007) showed that temporal uncertainty exerts a negligible effect on the percept when stimuli occur on a skin region with poor spatial acuity, such as the forearm. Accordingly, here we confine ourselves to modeling stimuli on the forearm, which is also the skin region most often tested in experimental studies of the cutaneous tau and rabbit illusions.
^We note for reference thatGoldreich (2007) defined the model’s free parameter as λ = σ_v/σ_s; thus, the lambda parameter in that paper is simply the reciprocal of the tau parameter.
^Note that “prior” in the Bayesian context does not imply “before” the stimulus occurs, but rather “independent of the measurement.” The predicted prior over tap 2’s position is constructed using all knowledge available to the observer except the tap 2 measurement,x_2m. Similarly, the postdicted prior over tap 1’s position is constructed using all knowledge available to the observer except the tap 1 measurement,x_1m.
^Although we have encountered a four-tap rabbit experiment (Figures1C and8), our approach was to consider the first and forth taps as mere reference points, so we modeled the perception of taps 2 and 3 only. Indeed, the first and forth taps in that sequence do not interact perceptually with the second and third, from which they are separated by large ISIs.

References

Anton-Erxleben, K., and Carrasco, M. (2013). Attentional enhancement of spatial resolution: linking behavioural and neurophysiological evidence.Nat. Rev. Neurosci. 14, 188–200.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Asai, T., and Kanayama, N. (2012). “Cutaneous rabbit” hops toward a light: unimodal and cross-modal causality on the skin.Front. Psychol. 3:427. doi:10.3389/fpsyg.2012.00427

CrossRef Full Text

Bays, P. M., Flanagan, J. R., and Wolpert, D. M. (2006). Attenuation of self-generated tactile sensations is predictive, not postdictive.PLoS Biol. 4:e28. doi:10.1371/journal.pbio.0040028

CrossRef Full Text

Beck, J. M., Latham, P. E., and Pouget, A. (2011). Marginalization in neural circuits with divisive normalization.J. Neurosci. 31, 15310–15319.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Blankenburg, F., Ruff, C. C., Deichmann, R., Rees, G., and Driver, J. (2006). The cutaneous rabbit illusion affects human primary sensory cortex somatotopically.PLoS Biol. 4:e69. doi:10.1371/journal.pbio.0040069

CrossRef Full Text

Bremer, C. D., Pittenger, J. B., Warren, R., and Jenkins, J. J. (1977). An illusion of auditory saltation similar to the cutaneous “rabbit.”Am. J. Psychol. 90, 645–654.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Brigner, W. L. (1988). Saltation as a rotation of space-time axes.Percept. Mot. Skills 66, 637–638.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Cody, F. W., Garside, R. A., Lloyd, D., and Poliakoff, E. (2008). Tactile spatial acuity varies with site and axis in the human upper limb.Neurosci. Lett. 433, 103–108.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Deneve, S., Duhamel, J. R., and Pouget, A. (2007). Optimal sensorimotor integration in recurrent cortical networks: a neural implementation of Kalman filters.J. Neurosci. 27, 5744–5756.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Dennett, D. C., and Kinsbourne, M. (1992). Time and the observer: the where and when of consciousness in the brain.Behav. Brain Sci. 15, 183–201.

CrossRef Full Text

Eagleman, D. M., and Sejnowski, T. J. (2000). Motion integration and postdiction in visual awareness.Science 287, 2036–2038.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Flach, R., and Haggard, P. (2006). The cutaneous rabbit revisited.J. Exp. Psychol. Hum. Percept. Perform. 32, 717–732.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Gelb, A. (1914). “Versuche auf dem Gebiete der Zeit- und Raumanschauung,” inBericht Über Den VI. Kongress für Experimentelle Psychologie: in Göttingen April 1914, ed. F. Schumann (Leipzig: J. A. Barth), 36–42.

Geldard, F. A. (1976). The saltatory effect in vision.Sens. Processes 1, 77–86.

Pubmed Abstract |Pubmed Full Text

Geldard, F. A. (1982). Saltation in somesthesis.Psychol. Bull. 92, 136–175.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Geldard, F. A., and Sherrick, C. E. (1972). The cutaneous “rabbit”: a perceptual illusion.Science 178, 178–179.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Getzmann, S. (2009). Exploring auditory saltation using the “reduced-rabbit” paradigm.J. Exp. Psychol. Hum. Percept. Perform. 35, 289–304.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Goldreich, D. (2007). A Bayesian perceptual model replicates the cutaneous rabbit and other tactile spatiotemporal illusions.PLoS ONE 2:e333. doi:10.1371/journal.pone.0000333

CrossRef Full Text

Grush, R. (2005). Internal models and the construction of time: generalizing from state estimation to trajectory estimation to address temporal features of perception, including temporal illusions.J. Neural Eng. 2, S209–218.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Haykin, S. S. (2001). “Kalman filters,” inKalman Filtering and Neural Networks, ed. S. S. Haykin (New York: Wiley), 1–21.

Helmholtz, H. V. (1925).Treatise on Physiological Optics, III: The Perceptions of Vision (1910). Rochester, NY: Optical Society of America.

Helson, H. (1930). The tau effect – an example of psychological relativity.Science 71, 536–537.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Helson, H., and King, S. M. (1931). The tau effect: an example of psychological relativity.J. Exp. Psychol. 14, 202–217.

CrossRef Full Text

Higashiyama, A., and Hayashi, M. (1993). Localization of electrocutaneous stimuli on the fingers and forearm: effects of electrode configuration and body axis.Percept. Psychophys. 54, 108–120.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Hubbard, T. L. (2005). Representational momentum and related displacements in spatial memory: a review of the findings.Psychon. Bull. Rev. 12, 822–851.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Jones, B., and Huang, Y. L. (1982). Space-time dependencies in psychological judgment of extent and duration: algebraic models of the tau and kappa effects.Psychol. Bull. 91, 128–142.

CrossRef Full Text

Kawabe, T., Miura, K., and Yamada, Y. (2008). Audiovisual tau effect.Acta Psychol. (Amst.) 128, 249–254.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Khuu, S. K., Kidd, J. C., and Badcock, D. R. (2011). The influence of spatial orientation on the perceived path of visual saltatory motion.J. Vis. 11, ii:5.

CrossRef Full Text

Kilgard, M. P., and Merzenich, M. M. (1995). Anticipated stimuli across skin.Nature 373, 663.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Knill, D. C., and Pouget, A. (2004). The Bayesian brain: the role of uncertainty in neural coding and computation.Trends Neurosci. 27, 712–719.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Lechelt, E. C., and Borchert, R. (1977). The interdependence of time and space in somesthesis: the Tau effect reexamined.Bull. Psychon. Soc. 10, 191–193.

Lockhead, G. R., Johnson, R. C., and Gold, F. M. (1980). Saltation through the blind spot.Percept. Psychophys. 27, 545–549.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Marks, L. E., Girvin, J. P., Quest, D. O., Antunes, J. L., Ning, P., O’Keefe, M. D., et al. (1982). Electrocutaneous stimulation II. The estimation of distance between two points.Percept. Psychophys. 32, 529–536.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Martikainen, I. K., and Pertovaara, A. (2002). Spatial discrimination of one versus two test stimuli in the human skin: dissociation of mechanisms depending on the task and the modality of stimulation.Neurosci. Lett. 328, 322–324.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Miyazaki, M., Hirashima, M., and Nozaki, D. (2010). The “cutaneous rabbit” hopping out of the body.J. Neurosci. 30, 1856–1860.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Moore, C. E., Partner, A., and Sedgwick, E. M. (1999). Cortical focusing is an alternative explanation for improved sensory acuity on an amputation stump.Neurosci. Lett. 270, 185–187.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Nijhawan, R. (2002). Neural delays, visual motion and the flash-lag effect.Trends Cogn. Sci. (Regul. Ed.) 6, 387.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

O’Boyle, D. J., Moore, C. E., Poliakoff, E., Butterworth, R., Sutton, A., and Cody, F. W. (2001). Human locognosic acuity on the arm varies with explicit and implicit manipulations of attention: implications for interpreting elevated tactile acuity on an amputation stump.Neurosci. Lett. 305, 37–40.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Rao, R. P., Eagleman, D. M., and Sejnowski, T. J. (2001). Optimal smoothing in visual motion perception.Neural Comput. 13, 1243–1253.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Shore, D. I., Hall, S. E., and Klein, R. M. (1998). Auditory saltation: a new measure for an old illusion.J. Acoust. Soc. Am. 103, 3730–3733.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Soga, R., Akaishi, R., and Sakai, K. (2009). Predictive and postdictive mechanisms jointly contribute to visual awareness.Conscious. Cogn. 18, 578–592.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Sripati, A. P., Yoshioka, T., Denchev, P., Hsiao, S. S., and Johnson, K. O. (2006). Spatiotemporal receptive fields of peripheral afferents and cortical area 3b and 1 neurons in the primate somatosensory system.J. Neurosci. 26, 2101–2114.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Stocker, A. A., and Simoncelli, E. P. (2006). Noise characteristics and prior expectations in human visual speed perception.Nat. Neurosci. 9, 578–585.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Trojan, J., Stolle, A. M., Carl, A. M., Kleinbohl, D., Tan, H. Z., and Holzl, R. (2010). Spatiotemporal integration in somatosensory perception: effects of sensory saltation on pointing at perceived positions on the body surface.Front. Psychol. 1:206. doi:10.3389/fpsyg.2010.00206

CrossRef Full Text

van Wassenhove, V. (2009). Minding time in an amodal representational space.Philos. Trans. R. Soc. Lond. B Biol. Sci. 364, 1815–1830.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Vilares, I., and Kording, K. (2011). Bayesian models: the structure of the world, uncertainty, behavior, and the brain.Ann. N. Y. Acad. Sci. 1224, 22–39.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Weinstein, S. (1968). “Intensive and extensive aspects of tactile sensitivity as a function of body part, sex, and laterality,” inThe Skin Senses: Proceedings, ed. D. R. Kenshalo (Springfield, IL: Thomas), 195–222.

Weiss, Y., Simoncelli, E. P., and Adelson, E. H. (2002). Motion illusions as optimal percepts.Nat. Neurosci. 5, 598–604.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Wiemer, J., Spengler, F., Joublin, F., Stagge, P., and Wacquant, S. (2000). Learning cortical topography from spatiotemporal stimuli.Biol. Cybern. 82, 173–187.

Pubmed Abstract |Pubmed Full Text |CrossRef Full Text

Appendix

Here, we further develop mathematically, and offer new conceptual insights into, thebasic Bayesian observer model put forth byGoldreich (2007). In the following seven sections, we: 1) specify the observer’s generative model, and derive the posterior probability density over tap trajectories and the perceptual length contraction formula; 2) generalize the derivation to include inhomogeneous spatial acuity caused by selective spatial attention; 3) consider useful one-dimensional reductions of the two-dimensional posterior density; 4) reformulate the observer’s percept as a combined predictive-postdictive inference; 5) model the perception of multi-tap sequences; 6) consider extensions of the model that incorporate additional sources of uncertainty; and 7) describe how we fit the model to human perceptual data.

The Bayesian Model

We consider here an observer whose goal is to perceive the locations of two-taps delivered to the skin in rapid succession. We assume that the observer has an internal generative model – a conception of the statistics of moving tactile stimuli – and that it interprets the stimulus sequence optimally within the context of its generative model. Briefly, the observer considers two taps that occur in rapid succession to result from a single moving object, and it considers that tactile objects tend to move slowly. Specifically, according to the generative model: (1) An object briefly touches the skin at a location,x₁, drawn from a uniform density. (2) The object moves away fromx₁ with velocityv, drawn from a Gaussian density with mean zero and standard deviation σ_v; at some elapsed timet (independent ofx₁), the object again briefly touches the skin, at locationx₂. (3) Noisy sensorineural activity evoked by each tap results in measured values for the tap positions,x_1m andx_2m, drawn from Gaussian densities centered on the actual tap positions,x₁ andx₂, with standard deviations σ_s.

Bayes’ formula

The observer’s goal is to infer the positions of the taps (x₁,x₂), which we refer to as the movement trajectory. We assume in this basic model that the observer perceives the time between taps,t, veridically. Thus, the observer knowsx_1m,x_2m, andt, and wishes to inferx₁ andx₂. According to Bayes’ formula, the posterior over trajectories is proportional to the product of likelihood and prior:

p (x_{1}, x_{2} | x_{1 m}, x_{2 m}, t) \propto p (x_{1 m}, x_{2 m} | x_{1}, x_{2}, t) p (x_{1}, x_{2} | t)

(A1)

We now work out the observer’s prior and likelihood.

Prior probability density

The observer’s prior probability density over trajectories is:

p (x_{1}, x_{2} | t) = p (x_{2} | x_{1}, t) p (x_{1} | t)

(A2)

Becauset andx₁ are independent, $p (x_{1} | t) = p (x_{1})$ , and this is a constant (x₁ being drawn from a uniform distribution). Therefore, we can write more concisely:

p (x_{1}, x_{2} | t) \propto p (x_{2} | x_{1}, t)

(A3)

We note that, givenx₁ andt,x₂ is a function of the velocity,v:

x_{2} = x_{1} + v t

(A4)

Thus, the probability thatv resides in the infinitesimal region $(v \pm \frac{d v}{2})$ is equal to the probability thatx₂ resides in the corresponding infinitesimal region $(x_{2} \pm \frac{d x_{2}}{2})$ :

p (x_{2} | x_{1}, t) d x_{2} = p (v) d v

(A5)

It follows that:

p (x_{2} | x_{1}, t) = p (v) |\frac{d v}{d x_{2}}| = \frac{p (v)}{t}

(A6)

Now recall that the observer has a low-velocity prior expectation:

p (v) = \frac{1}{\sqrt{2 π} σ_{v}} exp (- \frac{v^{2}}{2 σ_{v}^{2}}) = \frac{1}{\sqrt{2 π} σ_{v}} exp (- \frac{{((x_{2} - x_{1}) ∕ t)}^{2}}{2 σ_{v}^{2}})

(A7)

Referring to Eqs A3, A6, and A7, we therefore have:

p (x_{1}, x_{2} | t) \propto p (x_{2} | x_{1}, t) = \frac{1}{\sqrt{2 π} σ_{v} t} exp (- \frac{{(x_{2} - x_{1})}^{2}}{2 {(σ_{v} t)}^{2}})

(A8)

The observer’s prior probability density over trajectories is proportional to a Gaussian distribution over the distance between taps, with mean zero and standard deviation $σ_{v} t$ . Reflecting the low-speed prior, when the elapsed time,t, is large, a wide range of displacements is permissible; whent is shorter, the observer expects the two taps to more closely coincide spatially.

For future reference, we note thatx₂, likex₁, is independent oft. We see this by integrating Eq. A8 with respect tox₁:

p (x_{2} | t) = \int_{x_{1}} p (x_{1}, x_{2} | t) d x_{1} \propto \int_{x_{1}} \frac{1}{\sqrt{2 π} σ_{v} t} exp (- \frac{{(x_{2} - x_{1})}^{2}}{2 {(σ_{v} t)}^{2}}) d x_{1} = 1

(A9)

Thus,x₂ is independent oft, and, likep(x₁),p(x₂) is a constant. Eq. A8 shows thatx₂ is conditionally dependent ont, givenx₁.

Likelihood function

The tap positions measured by the observer,x_1m andx_2m, are drawn independently from Gaussian densities centered on the actual tap positions, with standard deviations σ_s. Therefore, the observer’s likelihood function is:

p (x_{1 m}, x_{2 m} | x_{1}, x_{2}, t) = p (x_{1 m} | x_{1}) p (x_{2 m} | x_{2})

(A10)

where

p (x_{1 m} | x_{1}) = \frac{1}{\sqrt{2 π} σ_{s}} exp (- \frac{{(x_{1 m} - x_{1})}^{2}}{2 σ_{s}^{2}}) p (x_{2 m} | x_{2}) = \frac{1}{\sqrt{2 π} σ_{s}} exp (- \frac{{(x_{2 m} - x_{2})}^{2}}{2 σ_{s}^{2}})

(A11)

Posterior probability density

The observer uses Bayes’ formula (Eq. A1) to calculate the posterior density over trajectories. It is useful to express the posterior density in several ways. First, referring to Eqs A3 and A10, we see that Bayes’ formula can be rewritten:

p (x_{1}, x_{2} | x_{1 m}, x_{2 m}, t) \propto p (x_{1 m} | x_{1}) p (x_{2 m} | x_{2}) p (x_{2} | x_{1}, t)

(A12)

Next, from Eqs A8 and A11, we have

p (x_{1}, x_{2} | x_{1 m}, x_{2 m}, t) \propto exp (- (\frac{{(x_{1 m} - x_{1})}^{2} + {(x_{2 m} - x_{2})}^{2}}{2 σ_{s}^{2}} + \frac{{(x_{2} - x_{1})}^{2}}{2 {(σ_{v} t)}^{2}}))

(A13)

Finally, following some rearrangement, Eq. A13 can be written as a two-dimensional (2D) Gaussian distribution

p (x_{1}, x_{2} | x_{1 m}, x_{2 m}, t) \propto exp (- \frac{1}{2 (1 - ρ^{2})} (\frac{{(x_{1} - x_{1^{*}})}^{2} + {(x_{2} - x_{2^{*}})}^{2} - 2 ρ (x_{1} - x_{1^{*}}) (x_{2} - x_{2^{*}})}{σ^{2}}))

(A14)

where the posterior mode $(x_{1^{*}}, x_{2^{*}})$ is given by

\begin{array}{l} x_{1^{*}} = x_{1 m} (\frac{{(σ_{v} t)}^{2} + σ_{s}^{2}}{{(σ_{v} t)}^{2} + 2 σ_{s}^{2}}) + x_{2 m} (\frac{σ_{s}^{2}}{{(σ_{v} t)}^{2} + 2 σ_{s}^{2}}) \\ x_{2^{*}} = x_{1 m} (\frac{σ_{s}^{2}}{{(σ_{v} t)}^{2} + 2 σ_{s}^{2}}) + x_{2 m} (\frac{{(σ_{v} t)}^{2} + σ_{s}^{2}}{{(σ_{v} t)}^{2} + 2 σ_{s}^{2}}) \end{array}

and the variance (σ²) and correlation coefficient (ρ) are given by:

σ^{2} = σ_{s}^{2} \frac{σ_{s}^{2} + {(σ_{v} t)}^{2}}{2 σ_{s}^{2} + {(σ_{v} t)}^{2}} ρ = \frac{σ_{s}^{2}}{σ_{s}^{2} + {(σ_{v} t)}^{2}}

We assume that the observer reads out the posterior mode as the percept. Note that the perceived positions, $x_{1^{*}} and x_{2^{*}}$ , are weighted averages of the measurements,x_1m andx_2m. The perceived positions are drawn toward one another as the time between taps shortens, converging toward the measurement midpoint, (x_1m +x_2m)/2, in the limit thatt approaches zero. Ast approaches infinity, by contrast, $x_{1^{*}}$ and $x_{2^{*}}$ approach the measured values,x_1m andx_2m.

Subtracting $x_{1^{*}}$ from $x_{2^{*}}$ , we find that the perceived distance between taps, $l^{*} = x_{2^{*}} - x_{1^{*}},$ relates to the measured distance, $l_{m} = x_{2 m} - x_{1 m}$ , according to the formula:

l^{*} = x_{2^{*}} - x_{1^{*}} = \frac{x_{2 m} - x_{1 m}}{1 + 2 {(\frac{σ_{s}}{σ_{v} t})}^{2}} = \frac{l_{m}}{1 + 2 {(\frac{τ}{t})}^{2}}

(A15)

where we have defined the parameter tau as the ratio of the observer’s spatial uncertainty to the width of the low-speed prior: $τ = \frac{σ_{s}}{σ_{v}}$ .

Although the measured tap positions will vary stochastically from trial to trial, on average they will equal the actual tap positions. Thus, on average the perceived distance is related to the true distance,l, as:

l^{*} = \frac{l}{1 + 2 {(\frac{τ}{t})}^{2}}

(A16)

This is the perceptual length contraction formula, previously derived – using a different approach and expressed in a slightly different form – byGoldreich (2007).

Generalization to Inhomogeneous Spatial Uncertainty

So far we have assumed equal spatial uncertainty, σ_s, at each point on the skin. Here, we consider the more general situation in which each tap may be associated with a different spatial uncertainty, σ_s1 and σ_s2, as might occur if the participant were to focus spatial attention on one skin region. In this case, the likelihood functions, Eq. A11, become:

p (x_{1 m} | x_{1}) = \frac{1}{\sqrt{2 π} σ_{s 1}} exp (- \frac{{(x_{1 m} - x_{1})}^{2}}{2 σ_{s 1}^{2}}) p (x_{2 m} | x_{2}) = \frac{1}{\sqrt{2 π} σ_{s 2}} exp (- \frac{{(x_{2 m} - x_{2})}^{2}}{2 σ_{s 2}^{2}})

(A17)

Consequently, the posterior density over tap positions (Eq. A13) becomes

p (x_{1}, x_{2} | x_{1 m}, x_{2 m}, t) \propto exp (- (\frac{{(x_{1 m} - x_{1})}^{2}}{2 σ_{s 1}^{2}} + \frac{{(x_{2 m} - x_{2})}^{2}}{2 σ_{s 2}^{2}} + \frac{{(x_{2} - x_{1})}^{2}}{2 {(σ_{v} t)}^{2}}))

(A18)

Following rearrangement, Eq. A18 can be re-written as a 2D Gaussian distribution,

p (x_{1}, x_{2} | x_{1 m}, x_{2 m}, t) \propto exp (- \frac{1}{2 (1 - ρ^{2})} (\frac{{(x_{1} - x_{1^{*}})}^{2}}{σ_{1}^{2}} + \frac{{(x_{2} - x_{2^{*}})}^{2}}{σ_{2}^{2}} - \frac{2 ρ (x_{1} - x_{1^{*}}) (x_{2} - x_{2^{*}})}{σ_{1} σ_{2}}))

(A19)

where the posterior mode ( $x_{1^{*}}, x_{2^{*}}$ ) is given by

\begin{array}{l} x_{1^{*}} = x_{1 m} (\frac{{(σ_{v} t)}^{2} + σ_{s 2}^{2}}{{(σ_{v} t)}^{2} + σ_{s 1}^{2} + σ_{s 2}^{2}}) + x_{2 m} (\frac{σ_{s 1}^{2}}{{(σ_{v} t)}^{2} + σ_{s 1}^{2} + σ_{s 2}^{2}}) \\ x_{2^{*}} = x_{1 m} (\frac{σ_{s 2}^{2}}{{(σ_{v} t)}^{2} + σ_{s 1}^{2} + σ_{s 2}^{2}}) + x_{2 m} (\frac{{(σ_{v} t)}^{2} + σ_{s 1}^{2}}{{(σ_{v} t)}^{2} + σ_{s 1}^{2} + σ_{s 2}^{2}}) \end{array}

and the variances $(σ_{1}^{2}, σ_{2}^{2})$ and correlation coefficient (ρ) are given by:

σ_{1}^{2} = σ_{s 1}^{2} \frac{σ_{s 2}^{2} + {(σ_{v} t)}^{2}}{σ_{s 1}^{2} + σ_{s 2}^{2} + {(σ_{v} t)}^{2}} σ_{2}^{2} = σ_{s 2}^{2} \frac{σ_{s 1}^{2} + {(σ_{v} t)}^{2}}{σ_{s 1}^{2} + σ_{s 2}^{2} + {(σ_{v} t)}^{2}} ρ = \frac{σ_{s 1} σ_{s 2}}{\sqrt{(σ_{s 1}^{2} + {(σ_{v} t)}^{2}) (σ_{s 2}^{2} + {(σ_{v} t)}^{2})}}

It follows that

l^{*} = x_{2^{*}} - x_{1^{*}} = \frac{l_{m}}{1 + \frac{σ_{s 1}^{2} + σ_{s 2}^{2}}{{(σ_{v} t)}^{2}}} = \frac{l_{m}}{1 + 2 {(\frac{σ_{s (rms)}}{σ_{v} t})}^{2}}

(A20)

Thus, the uniform spatial uncertainty, σ_s, of Eq. A15 is replaced by the root-mean-square of the uncertainty at the two locations:

σ_{s (rms)} = \sqrt{\frac{σ_{s 1}^{2} + σ_{s 2}^{2}}{2}} .

Interestingly, when $σ_{s 1} \neq σ_{s 2},$ the midpoint of the perceived trajectory no longer coincides with the midpoint of the measured trajectory. From the expressions (Eq. A19) for $x_{1^{*}}$ and $x_{2^{*}}$ it is easily shown that the shift, $Δ_{midpt},$ in the perceived trajectory midpoint away from the measured trajectory midpoint is:

Δ_{midpt} = \frac{x_{1^{*}} + x_{2^{*}}}{2} - \frac{x_{1 m} + x_{2 m}}{2} = \frac{l_{m}}{2} (\frac{σ_{s 1}^{2} - σ_{s 2}^{2}}{{(σ_{v} t)}^{2} + σ_{s 1}^{2} + σ_{s 2}^{2}})

(A21)

One-Dimensional Reductions

The two-dimensional joint (x₁,x₂) posterior density (Eq. A19) fully represents the observer’s belief distribution over stimulus trajectories, and it captures dependencies between the variables. Nevertheless, it can be useful to express the observer’s belief about a single parameter of interest, although this entails a loss of information about dependencies. One such parameter of interest is the length,l, between taps. Other parameters of interest are the tap positions,x₁ andx₂, considered individually. Here we derive the observer’s one-dimensional posterior densities over each of these parameters.

Posterior density over trajectory length

The posterior over trajectory length,l =x₂ −x₁, can be found by integrating across the joint posterior:

p (l | x_{1 m}, x_{2 m}, t) = \int_{x_{1}} p (x_{1}, x_{2} = l + x_{1} | x_{1 m}, x_{2 m}, t) d x_{1}

(A22)

The posterior overl can also be found by noting that, from Eq. A8, the observer’s prior overl is:

p (l | t) = \frac{1}{\sqrt{2 π} σ_{v} t} exp (- \frac{l^{2}}{2 {(σ_{v} t)}^{2}})

(A23)

Further, from Eq. A17, we see that the observer’s displacement measurement, l_m =x_2m −x_1m, is normally distributed with meanl and variance $σ_{s 1}^{2} + σ_{s 2}^{2}$ :

p (l_{m} | l) = \frac{1}{\sqrt{2 π (σ_{s 1}^{2} + σ_{s 2}^{2})}} exp (- \frac{{(l_{m} - l)}^{2}}{2 (σ_{s 1}^{2} + σ_{s 2}^{2})})

(A24)

Thus, by Bayes’ rule, the posterior overl is proportional to the product of these two Gaussian densities:

p (l | l_{m}, t) \propto p (l_{m} | l, t) p (l | t)

(A25)

The result is a Gaussian posterior density with mean and variance given by:

μ_{l posterior} = \frac{l_{m}}{1 + \frac{σ_{s 1}^{2} + σ_{s 2}^{2}}{{(σ_{v} t)}^{2}}}, σ_{l posterior}^{2} = \frac{1}{\frac{1}{σ_{s 1}^{2} + σ_{s 2}^{2}} + \frac{1}{{(σ_{v} t)}^{2}}}

(A26)

The mean of the posterior overl is again the length contraction formula, Eq. A20. The variance of the posterior overl is smaller than the variance ofl_m, givenl. For this reason, the observer’s length percept is more accurate than the length measurement (see Figure7).

Marginal posterior densities overx₁ andx₂

To express the observer’s belief about each tap’s position individually, we can integrate the joint posterior alongx₂ to find the marginal posterior overx₁, and integrate the joint posterior alongx₁ to find the marginal posterior overx₂:

\begin{matrix} p (x_{1} | x_{1 m}, x_{2 m}, t) = \int_{x_{2}} p (x_{1}, x_{2} | x_{1 m}, x_{2 m}, t) d x_{2} \\ p (x_{2} | x_{1 m}, x_{2 m}, t) = \int_{x_{1}} p (x_{1}, x_{2} | x_{1 m}, x_{2 m}, t) d x_{1} \end{matrix}

(A27)

Because the joint posterior density is a 2D Gaussian (Eq. A19), the marginalization integrals (Eq. A27) have simple solutions:

\begin{matrix} p (x_{1} | x_{1 m}, x_{2 m}, t) = \frac{1}{\sqrt{2 π} σ_{1}} exp (- \frac{{(x_{1} - x_{1^{*}})}^{2}}{2 σ_{1}^{2}}) \\ p (x_{2} | x_{1 m}, x_{2 m}, t) = \frac{1}{\sqrt{2 π} σ_{2}} exp (- \frac{{(x_{2} - x_{2^{*}})}^{2}}{2 σ_{2}^{2}}) \end{matrix}

(A28)

The Prediction-Postdiction Formulation

Here, we show that the observer’s marginal posterior overx₂ can be equivalently derived from predictive inference: upon observing tap 1, the observer predicts (infers forward in time) a prior over tap 2; the observer then combines thispredicted prior with the tap 2 likelihood to obtain the posterior overx₂. Conversely, the marginal posterior overx₁ can be derived from postdictive inference: upon observing tap 2, the observer postdicts (infers backward in time) a prior over tap 1; the observer then combines thispostdicted prior with the tap 1 likelihood to obtain the posterior overx₁.

Predicting tap 2 upon observing tap 1

Replacing the integrand in lower Eq. A27 with the expression from Eq. A1, we have:

p (x_{2} | x_{1 m}, x_{2 m}, t) \propto \int_{x_{1}} p (x_{1 m}, x_{2 m} | x_{1}, x_{2}, t) p (x_{1}, x_{2} | t) d x_{1}

(A29)

Further expanding the integrand, we have:

p (x_{2} | x_{1 m}, x_{2 m}, t) \propto \int_{x_{1}} p (x_{1 m} | x_{1}) p (x_{2 m} | x_{2}) p (x_{2} | x_{1}, t) p (x_{1}) d x_{1}

(A30)

Because $p (x_{2 m} | x_{2})$ does not depend onx₁, we move it outside the integral. Thus, we have:

p (x_{2} | x_{1 m}, x_{2 m}, t) \propto p (x_{2 m} | x_{2}) \int_{x_{1}} p (x_{1 m} | x_{1}) p (x_{2} | x_{1}, t) p (x_{1}) d x_{1}

(A31)

Now we note that, according to Bayes’ formula:

p (x_{1 m} | x_{1}) p (x_{1}) \propto p (x_{1} | x_{1 m})

(A32)

Substituting Eq. A32 into Eq. A31 yields:

p (x_{2} | x_{1 m}, x_{2 m}, t) \propto p (x_{2 m} | x_{2}) \int_{x_{1}} p (x_{2} | x_{1}, t) p (x_{1} | x_{1 m}) d x_{1}

(A33)

Equation A33 is Bayes’ formula for the tap 2 position,x₂. It states that the marginal posterior density overx₂ is proportional to the product of the tap 2 likelihood, $p (x_{2 m} | x_{2})$ , and the tap 2predicted prior density,

p (x_{2} | x_{1 m}, t) = \int_{x_{1}} p (x_{2} | x_{1}, t) p (x_{1} | x_{1 m}) d x_{1}

(A34)

The predicted prior projects belief forwards in time. It reflects the observer’s beliefs about tap 2, given the tap 1 measurement and the elapsed time. Based onx_1m, the observer can generate a posterior over tap 1,p(x₁|x_1m). The predicted prior over a particular tap 2 position is then calculated by integrating across every possible tap 1 the product of this tap 1 posterior with the probability that the particular tap 2 will follow.

Postdicting tap 1 upon observing tap 2

Replacing the integrand in upper Eq. A27 with the expression from Eq. A1, we have:

p (x_{1} | x_{1 m}, x_{2 m}, t) \propto \int_{x_{2}} p (x_{1 m}, x_{2 m} | x_{1}, x_{2}, t) p (x_{1}, x_{2} | t) d x_{2}

(A35)

Further expanding the integrand, we have:

p (x_{1} | x_{1 m}, x_{2 m}, t) \propto \int_{x_{2}} p (x_{1 m} | x_{1}) p (x_{2 m} | x_{2}) p (x_{1} | x_{2}, t) p (x_{2}) d x_{2}

(A36)

Becausep(x_1m|x₁) does not depend onx₂, we move it outside the integral. Thus, we have:

p (x_{1} | x_{1 m}, x_{2 m}, t) \propto p (x_{1 m} | x_{1}) \int_{x_{2}} p (x_{2 m} | x_{2}) p (x_{1} | x_{2}, t) p (x_{2}) d x_{2}

(A37)

Now we note that, according to Bayes’ formula:

p (x_{2 m} | x_{2}) p (x_{2}) \propto p (x_{2} | x_{2 m})

(A38)

Substituting Eq. A38 into Eq. A37 yields:

p (x_{1} | x_{1 m}, x_{2 m}, t) \propto p (x_{1 m} | x_{1}) \int_{x_{2}} p (x_{1} | x_{2}, t) p (x_{2} | x_{2 m}) d x_{2}

(A39)

Equation A39 is Bayes’ formula for the tap 1 position,x₁. It states that the marginal posterior density overx₁ is proportional to the product of the tap 1 likelihood,p(x_1m|x₁), and the tap 1postdicted prior density,

p (x_{1} | x_{2 m}, t) = \int_{x_{2}} p (x_{1} | x_{2}, t) p (x_{2} | x_{2 m}) d x_{2}

(A40)

The postdicted prior projects belief backwards in time. It reflects the observer’s beliefs about tap 1, given the tap 2 measurement and the elapsed time. Based onx_2m, the observer can generate a posterior over tap 2,p(x₂|x_2m). The postdicted prior over a particular tap 1 position is then calculated by integrating across every possible tap 2 the product of this tap 2 posterior with the probability that the particular tap 1 preceded.

Formulas for the predicted and postdicted prior densities

We now solve the predicted and postdicted prior integrals (Eqs A34 and A40). To find the predicted prior, we substitute from Eqs A8 and A17 left, into Eq. A34:

\begin{array}{l} p (x_{2} | x_{1 m}, t) & = \int_{x_{1}} \frac{1}{\sqrt{2 π} σ_{v} t} exp (- \frac{{(x_{2} - x_{1})}^{2}}{2 {(σ_{v} t)}^{2}}) \frac{1}{\sqrt{2 π} σ_{s 1}} exp (- \frac{{(x_{1 m} - x_{1})}^{2}}{2 σ_{s 1}^{2}}) d x_{1} \\ = \frac{1}{2 π σ_{v} t σ_{s 1}} \int_{x_{1}} exp [- (\frac{{(x_{2} - x_{1})}^{2}}{2 {(σ_{v} t)}^{2}} + \frac{{(x_{1 m} - x_{1})}^{2}}{2 σ_{s 1}^{2}})] d x_{1} \end{array}

(A41)

We note that, upon much rearrangement:

\frac{{(x_{2} - x_{1})}^{2}}{2 {(σ_{v} t)}^{2}} + \frac{{(x_{1 m} - x_{1})}^{2}}{2 σ_{s 1}^{2}} = \frac{{(σ_{v} t)}^{2} + σ_{s 1}^{2}}{2 σ_{s 1}^{2} {(σ_{v} t)}^{2}} {(x_{1} - \frac{x_{2} σ_{s 1}^{2} + x_{1 m} {(σ_{v} t)}^{2}}{{(σ_{v} t)}^{2} + σ_{s 1}^{2}})}^{2} + \frac{1}{2} (\frac{{(x_{2} - x_{1 m})}^{2}}{{(σ_{v} t)}^{2} + σ_{s 1}^{2}})

(A42)

Thus, Eq. A41 becomes,

p (x_{2} | x_{1 m}, t) = \frac{1}{2 π σ_{v} t σ_{s 1}} exp (- \frac{{(x_{2} - x_{1 m})}^{2}}{2 ({(σ_{v} t)}^{2} + σ_{s 1}^{2})}) \int_{x_{1}} exp (- \frac{{(x_{1} - \frac{x_{2} σ_{s 1}^{2} + x_{1 m} {(σ_{v} t)}^{2}}{{(σ_{v} t)}^{2} + σ_{s 1}^{2}})}^{2}}{\frac{2 σ_{s 1}^{2} {(σ_{v} t)}^{2}}{{(σ_{v} t)}^{2} + σ_{s 1}^{2}}}) d x_{1}

(A43)

The integrand is a Gaussian function with standard deviation

\frac{σ_{s 1} σ_{v} t}{\sqrt{{(σ_{v} t)}^{2} + σ_{s 1}^{2}}} .

Because the integral of an un-normalized Gaussian function of standard deviation σ is $\sqrt{2 π} σ$ , Eq. A43 simplifies to:

p (x_{2} | x_{1 m}, t) = \frac{1}{2 π σ_{v} t σ_{s 1}} exp (- \frac{{(x_{2} - x_{1 m})}^{2}}{2 ({(σ_{v} t)}^{2} + σ_{s 1}^{2})}) \frac{\sqrt{2 π} σ_{s 1} σ_{v} t}{\sqrt{{(σ_{v} t)}^{2} + σ_{s 1}^{2}}}

(A44)

Therefore, the predicted prior density overx₂ is

p (x_{2} | x_{1 m}, t) = \frac{1}{\sqrt{2 π ({(σ_{v} t)}^{2} + σ_{s 1}^{2})}} exp (- \frac{{(x_{2} - x_{1 m})}^{2}}{2 ({(σ_{v} t)}^{2} + σ_{s 1}^{2})})

(A45)

That is, the predicted prior is a Gaussian with mean and variance

μ_{pre} = x_{1 m} σ_{pre}^{2} = {(σ_{v} t)}^{2} + σ_{s 1}^{2}

(A46)

A similar derivation reveals that the postdicted prior density overx₁ is

p (x_{1} | x_{2 m}, t) = \frac{1}{\sqrt{2 π ({(σ_{v} t)}^{2} + σ_{s 2}^{2})}} exp (- \frac{{(x_{1} - x_{2 m})}^{2}}{2 ({(σ_{v} t)}^{2} + σ_{s 2}^{2})})

(A47)

That is, the postdicted prior is a Gaussian with mean and variance

μ_{post} = x_{2 m} σ_{post}^{2} = {(σ_{v} t)}^{2} + σ_{s 2}^{2}

(A48)

Multi-Tap Perception

So far, we have considered trajectories composed of just two taps. An interesting question arises in modeling the perception of multi-tap stimuli: is the observer’s generative model (a) a direct extension of the one we have considered here, such that a zero-mean low-speed prior applies independently to each pair of consecutive taps, or (b) does the observer expect velocity to be consistent across the multi-tap trajectory, such that the prior applied to each tap pair might be a Gaussian centered on the velocity of the preceding pair (a zero-mean low-acceleration prior)?

Considering trajectories with an arbitrary number of taps,n, and permitting inhomogeneous spatial acuity, possibilities (a) and (b) result in the following generalizations of Eq. A18:

(a)

p (\{x_{i}\} | \{x_{i m}\}, \{t_{i}\}) \propto exp (- (\sum_{i = 1}^{n} \frac{{(x_{i m} - x_{i})}^{2}}{2 σ_{s i}^{2}} + \sum_{i = 1}^{n - 1} \frac{{(x_{i + 1} - x_{i})}^{2}}{2 {(σ_{v} t_{i})}^{2}}))

(A49)

(b)

p (\{x_{i}\} | \{x_{i m}\}, \{t_{i}\}) \propto exp (- (\sum_{i = 1}^{n} \frac{{(x_{i m} - x_{i})}^{2}}{2 σ_{s i}^{2}} + \frac{{(x_{2} - x_{1})}^{2}}{2 {(σ_{v} t_{1})}^{2}} + \sum_{i = 2}^{n - 1} \frac{{(\frac{x_{i + 1} - x_{i}}{t_{i}} - \frac{x_{i} - x_{i - 1}}{t_{i - 1}})}^{2}}{2 σ_{v}^{2}}))

(A50)

Here {x_i} refers to the set of tap positions,x₁,x₂, …x_n; {x_im} to the corresponding set of measurements; {t_i} to the set of times elapsed between each tapi and tapi + 1; and σ_si to the spatial uncertainty associated with tapi.

The observer’s percept ${x_{i}^{*}}$ in case (a) or (b) can be found by taking partial derivatives of Eq. A49 or Eq. A50 with respect to each of the {x_i}, setting these to zero, and solving the simultaneous equations. We used this method to find the percepts depicted in Figures10 and11 [case (a)] and Figure12 [case (b)].

Alternatively, the identical percept can be found through Kalman smoothing (Haykin, 2001), a recursive extension of the predictive-postdictive formulation described above. The Kalman smoother consists of an iterative forward (predictive) pass through the stimulus sequence, followed by a backward (postdictive) pass. For model (a), the algorithm for the forward pass (the Kalmanfilter) is:

\begin{array}{l} K_{i} & = \frac{σ_{i - 1 | i - 1}^{2} + {(σ_{v} t)}^{2}}{σ_{i - 1 | i - 1}^{2} + {(σ_{v} t)}^{2} + σ_{s}^{2}} \\ {\hat{x}}_{i | i} & = {\hat{x}}_{i - 1 | i - 1} + K_{i} (x_{i m} - {\hat{x}}_{i - 1 | i - 1}) \\ σ_{i | i}^{2} & = (1 - K_{i}) (σ_{i - 1 | i - 1}^{2} + {(σ_{v} t)}^{2}) \end{array}

(A51)

Here,K_i is theKalman gain at timei; the notation ${\hat{x}}_{i | j}$ refers to the estimated position of tapi based on all taps up to and including tapj; and $σ_{i | j}^{2}$ is the variance of that estimate. The filter is initialized at the first tap, with ${\hat{x}}_{1 | 1} = x_{1 m}, σ_{1 | 1}^{2} = σ_{s}^{2}$ , and runs forward until tapn is reached. The Rauch-Tung-Striebel algorithm for the subsequent backward pass is:

\begin{array}{l} C_{i} & = \frac{σ_{i | i}^{2}}{σ_{i | i}^{2} + {(σ_{v} t)}^{2}} \\ {\hat{x}}_{i | n} & = {\hat{x}}_{i | i} + C_{i} ({\hat{x}}_{i + 1 | n} - {\hat{x}}_{i | i}) \\ σ_{i | n}^{2} & = σ_{i | i}^{2} + C_{i}^{2} (σ_{i + 1 | n}^{2} - σ_{i | i}^{2} - {(σ_{v} t)}^{2}) \end{array}

(A52)

We verified that Eqs A51 and A52 yielded the same percepts plotted in Figures10 and11.

Extensions

Although skin is a two-dimensional surface, we have so far considered only a single position axis,x, along which stimuli occur. In essence, we have assumed that the orthogonal,y coordinate, of the taps is a known constant. We have also assumed that the time,t, is known. Each of these restrictions can be removed.

Two-dimensional movement

A more realistic generative model would allow stimuli to move in any direction along a two-dimensional skin surface. To accomplish this, we can adopt an (x,y) Cartesian coordinate system in which the orthogonal components of the velocity vector are independently specified by low-speed priors:

\begin{matrix} p (v_{x}) = \frac{1}{\sqrt{2 π} σ_{v}} exp (- \frac{v_{x}^{2}}{2 σ_{v}^{2}}) = \frac{1}{\sqrt{2 π} σ_{v}} exp (- \frac{{((x_{2} - x_{1}) / t)}^{2}}{2 σ_{v}^{2}}) \\ p (v_{y}) = \frac{1}{\sqrt{2 π} σ_{v}} exp (- \frac{v_{y}^{2}}{2 σ_{v}^{2}}) = \frac{1}{\sqrt{2 π} σ_{v}} exp (- \frac{{((y_{2} - y_{1}) / t)}^{2}}{2 σ_{v}^{2}}) \end{matrix}

(A53)

The tap 1 and 2 likelihood functions generalize to:

\begin{matrix} p (x_{1 m}, y_{1 m} | x_{1}, y_{1}) = \frac{1}{\sqrt{2 π} σ_{s 1}} exp (- \frac{{(x_{1 m} - x_{1})}^{2} + {(y_{1 m} - y_{1})}^{2}}{2 σ_{s 1}^{2}}) \\ p (x_{2 m}, y_{2 m} | x_{2}, y_{2}) = \frac{1}{\sqrt{2 π} σ_{s 2}} exp (- \frac{{(x_{2 m} - x_{2})}^{2} + {(y_{2 m} - y_{2})}^{2}}{2 σ_{s 2}^{2}}) \end{matrix}

(A54)

The posterior over trajectories then takes the form:

\begin{array}{l} p (x_{1}, y_{1}, x_{2}, y_{2} | x_{1 m}, y_{1 m}, x_{2 m}, y_{2 m}, t) \\ \propto exp (- (\frac{{(x_{1 m} - x_{1})}^{2} + {(y_{1 m} - y_{1})}^{2}}{2 σ_{s 1}^{2}} + \frac{{(x_{2 m} - x_{2})}^{2} + {(y_{2 m} - y_{2})}^{2}}{2 σ_{s 2}^{2}} + \frac{{({(x_{2} - x_{1})}^{2} + {(y_{2} - y_{1})}^{2})}^{2}}{2 {(σ_{v} t)}^{2}})) \end{array}

(A55)

It is straightforward to show that the length contraction formula resulting from Eq. A55 is identical to Eq. A20. Indeed, if we define thex-axis as the axis along which the tap measurements lie, then marginalization of Eq. A55 overy₁ andy₂ recovers the posterior density Eq. A18.

Temporal uncertainty

Our model has assumed that the time between stimuli,t, is perceived veridically. This assumption can be removed.Goldreich (2007) showed that the Bayesian observer with temporal uncertainty tends to overestimatet in addition to underestimatingl. Thus, the Bayesian observer can model time dilation as well as length contraction illusions.

Fitting to Human Perceptual Data

We found the value of tau that minimized the mean-squared error (MSE) between human and model performance. This was done separately for the perceptual data fromMarks et al. (1982),Lechelt and Borchert (1977), andKilgard and Merzenich (1995), shown in Figures1A–C, and for the data fromHelson and King (1931), shown in Figure10.

The data ofHelson and King (1931) required some processing prior to the fitting procedure. We fit the data reported in Tables 2–6 ofHelson and King (1931). In those experiments, on each trial the participant reported whether the second spatial interval was perceived to be shorter than, equal to, or longer than the first interval (which was fixed at 3 cm). To fit these data, we first transformed them into an equivalent two-alternative forced-choice format by distributing each participant’s “equal” responses evenly to the “shorter” and “longer” response categories. We then fit each participant’s transformed data (proportion “l₂ is longer” responses) at eacht₂ setting with a Weibull psychometric function:

Ψ_{a, b, γ, δ} (l_{2}) = (1 - δ) [γ + (1 - γ) (1 - 2^{- {(\frac{l_{2} - 3 c m}{a})}^{b}})] + \frac{δ}{2}

Here δ is a lapse rate, γ is the probability that the concentrating participant would answer “l₂ is longer” when in factl₂ =l₁ (i.e., 3 cm),a is a position parameter, andb is a slope parameter. We found the maximum likelihood parameter settings, and from them read off the point of subjective equality (PSE:l₂ that the participant judged longer thanl₁ with 50% probability). We fit the Bayesian observer’s tau to minimize the MSE between its performance and the average PSE of the six human participants across the fivet₂ values tested byHelson and King (1931). Before doing these fits, we discarded the data from one of the six participants on one of the fivet₂ points: “Observer B” ofHelson and King (1931) did not have a valid PSE att₂ = 0.25 s because that participant’s transformed “l₂ is longer” response proportion was greater than 50% at alll₂ values.

Keywords: probabilistic inference, sensory saltation, motion illusions, tactile spatial attention, optimal percepts, Kalman smoothing, somatosensory spatiotemporal perception, sensory uncertainty

Citation: Goldreich D and Tong J (2013) Prediction, postdiction, and perceptual length contraction: a Bayesian low-speed prior captures the cutaneous rabbit and related illusions.Front. Psychol.4:221. doi: 10.3389/fpsyg.2013.00221

Received: 20 March 2013;Accepted: 11 April 2013;
Published online: 10 May 2013.

Edited by:

Yuki Yamada, Yamaguchi University, Japan

Reviewed by:

Iris M. D. Vilares, Northwestern University and Rehabilitation Institute of Chicago, USA
Robert Van Beers, VU University Amsterdam, Netherlands

Copyright: © 2013 Goldreich and Tong. This is an open-access article distributed under the terms of theCreative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.

*Correspondence: Daniel Goldreich, Department of Psychology, Neuroscience & Behaviour, McMaster University, 1280 Main Street West, Hamilton, ON L8S 4K1, Canada. e-mail:Z29sZHJkQG1jbWFzdGVyLmNh

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Movatterモバイル変換

Prediction, postdiction, and perceptual length contraction: a Bayesian low-speed prior captures the cutaneous rabbit and related illusions

Introduction

The Fundamentals of the Bayesian Observer

The Perceptual Length Contraction Formula

Bayesian Perception is Optimal because It is Beneficially Biased

Selective Spatial Attention Shifts the Perceived Trajectory

The Predictive-Postdictive Formulation

The Perception of Multi-Tap Sequences

Discussion

Perceptual Length Contraction as Bayesian Inference

The Wide Applicability of the Low-Speed-Prior Observer

The Percept as a Combined Pre- and Post-Dictive Inference

Speculations Regarding Neural Implementation

Testable Predictions

Conflict of Interest Statement

Acknowledgments

Footnotes

References

Appendix

The Bayesian Model

Bayes’ formula

Prior probability density

Likelihood function

Posterior probability density

Generalization to Inhomogeneous Spatial Uncertainty

One-Dimensional Reductions

Posterior density over trajectory length

Marginal posterior densities overx₁ andx₂

The Prediction-Postdiction Formulation

Predicting tap 2 upon observing tap 1

Postdicting tap 1 upon observing tap 2

Formulas for the predicted and postdicted prior densities

Multi-Tap Perception

Extensions

Two-dimensional movement

Temporal uncertainty

Fitting to Human Perceptual Data

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good

Movatterモバイル変換

Prediction, postdiction, and perceptual length contraction: a Bayesian low-speed prior captures the cutaneous rabbit and related illusions

Introduction

The Fundamentals of the Bayesian Observer

The Perceptual Length Contraction Formula

Bayesian Perception is Optimal because It is Beneficially Biased

Selective Spatial Attention Shifts the Perceived Trajectory

The Predictive-Postdictive Formulation

The Perception of Multi-Tap Sequences

Discussion

Perceptual Length Contraction as Bayesian Inference

The Wide Applicability of the Low-Speed-Prior Observer

The Percept as a Combined Pre- and Post-Dictive Inference

Speculations Regarding Neural Implementation

Testable Predictions

Conflict of Interest Statement

Acknowledgments

Footnotes

References

Appendix

The Bayesian Model

Bayes’ formula

Prior probability density

Likelihood function

Posterior probability density

Generalization to Inhomogeneous Spatial Uncertainty

One-Dimensional Reductions

Posterior density over trajectory length

Marginal posterior densities overx1 andx2

The Prediction-Postdiction Formulation

Predicting tap 2 upon observing tap 1

Postdicting tap 1 upon observing tap 2

Formulas for the predicted and postdicted prior densities

Multi-Tap Perception

Extensions

Two-dimensional movement

Temporal uncertainty

Fitting to Human Perceptual Data

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good

Marginal posterior densities overx₁ andx₂