Classifying Factor Velocity with Swarm Intelligence: Market Pricing of Fast- and Slow-Moving Factors

Chen, Ren-Raw; Huang, Mengjie; Tang, Yi

doi:10.3390/a18110682

Open AccessArticle

Classifying Factor Velocity with Swarm Intelligence: Market Pricing of Fast- and Slow-Moving Factors

by

Ren-Raw Chen

^1,*,

Mengjie Huang

²

and

Yi Tang

¹

Gabelli School of Business, Fordham University, 45 Columbus Avenue, New York, NY 10019, USA

²

School of Business and Computer Science, Caldwell University, 120 Bloomfield Avenue, Caldwell, NJ 07006, USA

^*

Author to whom correspondence should be addressed.

Algorithms 2025, 18(11), 682; https://doi.org/10.3390/a18110682

Submission received: 9 September 2025 / Revised: 17 October 2025 / Accepted: 22 October 2025 / Published: 25 October 2025

(This article belongs to the Section Evolutionary Algorithms and Machine Learning)

Download

Browse Figures

Versions Notes

Abstract

Utilizing a dataset of 190 risk factors spanning over three decades, we apply a swarm-based classification model to estimate factor velocity and analyze its implications for asset pricing. Our results show that slower-moving factors generate higher abnormal returns than their faster-moving counterparts, underscoring the critical role of price adjustment speed in market dynamics. Furthermore, our results suggest that trading frictions impede the rapid assimilation of information, contributing to the observed return patterns. This research offers new insights into return predictability and demonstrates the potential of swarm intelligence as a powerful tool for financial modeling.

Keywords:

swarm intelligence; pricing factors; Fama–French factors

1. Introduction

Momentum investments have been the centerpiece of investments in stocks for the last two decades. Momentum investing refers to buying stocks that have had high returns in the past and selling those that have had poor returns over the same period. Researchers have identified persistent momentum trends in stock markets as far back as the Victorian era (c. 1830s to 1900). Richard Driehaus is sometimes considered the father of momentum investing. He once said, “far more money is made buying high and selling at even higher prices.” This strategy reportedly delivered compound annual returns of 30% for Driehaus Capital Management in the 12 years after it was set up in the 1980s. This is taken from Richard Driehaus and Momentum Investing. For the original source, please see [1,2,3].

Momentum supporters believe that there is a fundamental reason for the momentum phenomenon to persistently exist, although momentum in stocks totally lacks economic theory. Pioneering academic research on momentum investing can be traced back to [4]. They documented how strategies of buying recent stock winners and selling recent losers generated significantly higher near-term returns than the U.S. market overall from 1965 to 1989. They established the basic time frame for momentum investing success as a 3- to 12-month window on either side. Since then, it has been booming as one of the largest research areas in finance. Recently, they [4] wrote an excellent review piece of the last 30 years of momentum research. Besides momentum, we believe that all of those technical indicators, although not supported by any economic theory, are supported by psychological behaviors of market participants—herding.

While herding itself cannot be easily observed due to the fact that individual trading data are not publicly (or even legally) available, the consequences are. In other words, although we cannot measure how investors herd in buying/selling stocks, we can observe how stock prices (returns) move as a result of herding. When herding exists, certain stocks will be chased (bought) or avoided (sold). These result in winner stocks and loser stocks. As herding behaviors change, winners can become losers and vice versa.

Past studies demonstrate the diverse applications of swarm intelligence algorithms in financial contexts, including portfolio optimization and stock price prediction (see, e.g., [5,6,7,8,9,10]). These studies leverage swarm intelligence in decision making processes by simulating collective agent behaviors observed in nature, such as those of birds, fish, and insects. However, to our knowledge, no existing research has specifically examined how stock market prices influence or interact with the velocity of agents in a swarm system. This study aims to bridge that gap by exploring the market pricing of velocity in financial swarm dynamics.

Velocity is a crucial component of system intelligence in swarm-based models. In natural and artificial swarm systems, velocity reflects the rate of change in an agent’s position, capturing both speed and directional movement. It serves as a fundamental mechanism by which information is propagated and decisions are adjusted dynamically within the swarm. In financial markets, velocity can be viewed as a proxy for the responsiveness of stocks to new information, as well as a key determinant of trend formation and reversals. Stocks that exhibit higher velocity within the swarm framework may represent “leaders,” entities that drive market sentiment, while lower-velocity stocks may act as “followers,” adjusting their movements based on lagged signals from faster-moving assets.

Incorporating velocity into swarm intelligence models of financial markets enhances our understanding of price formation, momentum effects, and return predictability. A system that lacks velocity awareness may fail to capture the hierarchical influence of stocks, where certain assets dictate the movement patterns of others. By analyzing how velocity distributes across financial agents, we can gain deeper insights into price discovery mechanisms and the differentiation between trend-driving and trend-reactive assets.

While the concept of velocity originates from swarm intelligence and captures the rate of directional movement in a multi-agent system, we argue that it also has a meaningful financial interpretation grounded in economic theory. In our setting, factor velocity serves as a proxy for the speed at which return-predictive information is incorporated into asset prices. This notion aligns with the literature on information diffusion and price discovery, where some factors or assets lead in assimilating value-relevant news, while others adjust more slowly [11,12]. High-velocity factors may reflect conditions of greater liquidity or heightened investor attention, leading to rapid but potentially noisy price movements [13,14]. On the other hand, low-velocity factors may be slower to adjust due to frictions, inattention, or underreaction, conditions consistent with behavioral theories that emphasize delayed price discovery and persistent mispricing [15,16]. Thus, velocity within our swarm-based model is not merely a technical artifact but a behaviorally and economically motivated measure of how promptly different factors respond to new information, with implications for trend persistence, herding, and return predictability.

This study therefore introduces a velocity-driven perspective on swarm intelligence in financial markets. By modeling returns of risk factors (also sometimes called factor portfolios in that these factors are weighted averages of stocks and hence analogous to portfolios) formed based on stylized return predictors through a swarm-based system that accounts for velocity differences, we aim to uncover new dimensions of market behavior and pricing dynamics that have previously been overlooked in the literature.

Crucially, we define two related concepts. Swarm velocity (SV) is the empirically estimated parameter derived from our model, representing the factor’s mathematical rate of change in the swarm space. This SV serves as a proxy for the economic concept of factor adjustment speed or information incorporation speed—the rate at which value-relevant information is assimilated into asset prices.

The primary contributions of this research, which bridges the fields of swarm intelligence and asset pricing, are fourfold. First, we introduce and successfully implement a non-parametric swarm intelligence model (derived from particle swarm optimization principles) to estimate a quantitative measure of factor velocity (SV), or the speed at which 190 risk factors adjust their returns relative to a market leader. This represents the first application of this methodology to systematically classify and analyze asset pricing factors.

Second, we find that factors classified as slow-moving (Low SV) generate significantly higher one-month-ahead abnormal returns (alphas) than their faster-moving (High SV) counterparts, even after controlling for established asset pricing models (e.g., FF5, FF6, Q5). The monthly alpha differential is both economically and statistically significant, ranging from 17 to 22 basis points.

Third, our results suggest that the cross-sectional return pattern documented in this study is likely driven by trading frictions and delayed information assimilation. This is evidenced by (1) the significant outperformance of the slow-mover portfolios and (2) the finding that friction-related factors are disproportionately overrepresented in the slow-mover category (more than twice their unconditional share).

Lastly, we demonstrate that the swarm velocity classification is highly persistent, with factors remaining in the same velocity quintiles for up to five years, confirming that SV represents a stable characteristic of the factor rather than temporary noise.

The remainder of the paper is organized as follows. Section 2 introduces the theoretical background of swarm intelligence (SI) and derives our non-optimizing, leader-following model for empirical factor estimation. Section 3 details the empirical methodology, outlining the three-stage pipeline for transforming monthly factor returns into the swarm velocity (SV) metric. Section 4 provides the literature review, contextualizing our SI approach within the broader fields of modern financial AI, herding, and lead–lag effects. Section 5 presents the empirical findings, including the experimental setup, descriptive statistics, return predictability results (alphas), and a detailed economic analysis of factor persistence and composition. Section 6 concludes the paper.

2. Swarm Intelligence

Swarm intelligence is a powerful artificial intelligence tool used to model animal behaviors and solve high-dimensional optimization problems. The basic idea of swarm intelligence is derived from those animals (such as birds, ants, bees, and fish) that rely on group effort to achieve their basic survival needs—seek food and avoid prey. The intelligence behind this collective behavior is how they communicate among one another.

There are two broadly classified categories in swarm intelligence—boids and particle swarm optimization (PSO). While both share the same fundamental theory, they vary in the following manner. Boids is mainly for behaviors of the swarm, and PSO can be used for optimization. Roughly speaking, PSO is used when there is a loss function (i.e., a landscape) and boids without. We should note that although they have been recognized as two separate models, they can be easily combined if needed. For example, although our estimation seems to follow particle swarm optimization, there is no optimization. If needed, we can estimate alignment and cohesion (which reflect another version of swarm speed) along with leader following. We briefly sketch each model in this section.

2.1. Boids (Without a Leader)

Ref. [17] was the first to “artificialize” such natural intelligence and create a computer algorithm named boids (for bird-oid object). Reynold’s algorithm is amazingly simple. For any given bird, Reynold devises a set of combined linear equations (vectors) that determine how the bird should fly to its next destination. Reynold created boids in 1986: “Boids is an artificial life program, developed by Craig Reynolds in 1986, which simulates the flocking behaviour of birds.”

The factors that determine how various vectors are combined are separation, alignment, and cohesion. As their names suggest, “separation” is to avoid collision with other birds, “alignment” decides how a particular bird should fly in a direction by referencing its fellow birds, and “cohesion” decides how fast (speed) a particular bird should fly to the center of its fellow birds.

There are countless versions of boids. For example, see Google Scholar: https://scholar.google.com/scholar?q=boids+flocking+algorithm&hl=en&as_sdt=0&as_vis=1&oi=scholart (accessed on 7 March 2025). One can add obstacles. One can add an objective destination (swim to target). One can place boids in a maze. The basic boids, as shown in Figure 1, can be described by the following algorithm.

In the swarm model, let there be

m

fish (birds)

F = {f^{(1)}, \dots, f^{(m)}}

that move about in an

n

-dimensional space. We then represent their positions at any given time

t

as

X_{t} = {{\overset{⇀}{x}}_{t}^{(1)}, \dots, {\overset{⇀}{x}}_{t}^{(m)}}

, in which each vector

{\overset{⇀}{x}}_{t}^{(i)}

(

i = 1, \dots, m

) has

n

elements

x_{j, t}^{(i)}

(

j = 1, \dots, n

). In other words,

X_{t}

is an

n \times m

matrix. In our empirical work, each fish is a risk factor (a total of 190 factors), and each dimension is a lagged month (total of 12 lagged months). See the online appendix for the full list of the 190 factors, along with their notations and descriptions.

The velocity of each fish is defined as a weighted average of various forces.

{\overset{⇀}{v}}_{t}^{(i)} = w_{A} {\overset{⇀}{v}}_{A, t}^{(i)} + w_{C} {\overset{⇀}{v}}_{C, t}^{(i)} + w_{S} {\overset{⇀}{v}}_{S, t}^{(i)}

(1)

where the weights,

w_{A}

for alignment,

w_{C}

for cohesion, and

w_{S}

for separation, must sum up to one, and each corresponding velocity is defined as follows:

\begin{array}{l} {\overset{⇀}{v}}_{A, t}^{(i)} = avg ({\overset{⇀}{v}}_{t - 1}^{(j \neq i)} | f^{(j)} \in F) - {\overset{⇀}{v}}_{t - 1}^{(i)} \\ {\overset{⇀}{v}}_{C, t}^{(i)} = avg ({\overset{⇀}{x}}_{t - 1}^{(j \neq i)} | f^{(j)} \in F) - {\overset{⇀}{x}}_{t - 1}^{(i)} \\ {\overset{⇀}{v}}_{S, t}^{(i)} = \max \{|{\overset{⇀}{x}}_{t}^{(i)} - {\overset{⇀}{x}}_{t}^{(j \neq i)}|, ε\} \end{array}

(2)

representing alignment, cohesion, and separation, respectively. As easily seen, alignment for each fish involves seeking the average velocity of its fellow fish (excluding itself), cohesion for each fish involves moving toward the center of its fellow fish (excluding itself), and separation is undertaken in order to avoid colliding with its fellow fish.

The update of position is

{\overset{⇀}{x}}_{t}^{(i)} = {\overset{⇀}{x}}_{t - 1}^{(i)} + {\overset{⇀}{v}}_{t}^{(i)}

(3)

In our empirical work, we shall estimate the weights in Equation (1) using data. Our data contain each of 190 risk factors’ lagged returns from

t - 1

to

t - 12

. Formally,

x_{j, t}^{(i)}

, defined earlier, can be rewritten as the following matrix (190 fish or risk factors and 12 lagged months or dimensions):

[\begin{matrix} x_{t - 1, t}^{(1)} & x_{t - 2, t}^{(1)} & \dots & x_{t - 12, t}^{(1)} \\ x_{t - 1, t}^{(2)} & x_{t - 2, t}^{(2)} & \dots & x_{t - 12, t}^{(2)} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ x_{t - 1, t}^{(190)} & x_{t - 2, t}^{(190)} & \dots & x_{t - 12, t}^{(190)} \end{matrix}]

This is a snapshot of the locations of all fish at a given time

t

. From one snapshot to another, it represents a migration (i.e., velocity). Then, this empirically observed velocity is matched (by minimizing the sum of squared errors) with Equation (1) to solve for the weights.

2.2. Particle Swarm Optimization (With a Leader)

Particle swarm optimization (PSO), from its name, is an optimization tool using swarm. See [18,19]. Particles here are the same as fish or birds in the previous sub-section. The change in terms is mainly due to different researchers in the past. To be consistent, we shall use fish for the remainder of this paper.

In a standard PSO, an objective function (aka a fitness function) is given so that all fish can reach the optimal location. The communication mechanism among the fish is the same as among boids, and yet how they move is different.

One can think of PSO as a swarm with a landscape (i.e., the fitness function). Fish will now try to reach either the peak or the bottom (i.e., the global optimum) of the landscape. In this case, they will stop moving once their objective is met. A simple demonstration of a two-dimensional PSO is given in the Appendix A.

In order to achieve convergence in an optimization problem, the velocity of a standard PSO is given as

{\overset{⇀}{v}}_{t}^{(i)} = w_{t} {\overset{⇀}{v}}_{t - 1}^{(i)} + c_{1} r_{1} ({\overset{⇀}{p}}_{t - 1}^{(i)} - {\overset{⇀}{x}}_{t - 1}^{(i)}) + c_{2} r_{2} ({\overset{⇀}{g}}_{t - 1} - {\overset{⇀}{x}}_{t - 1}^{(i)})

(4)

where

w_{t} < 1

is a decaying weight (e.g., we can let

w_{t} = δ^{t}

and

δ < 1

),

c_{1}

and

c_{2}

are two constants,

r_{1}

and

r_{2}

are two random variables,

{\overset{⇀}{p}}_{t}^{(i)} = \{{\overset{⇀}{x}}_{τ \leq t}^{(i)} | \max_{τ} ϕ ({\overset{⇀}{x}}_{τ}^{(i)})\}

(5)

is the personal best position, and

{\overset{⇀}{g}}_{t} = \max_{i} {\overset{⇀}{p}}_{t}^{(i)}

(6)

is the global best position, which is determined by maximizing (or minimizing) the fitness function,

ϕ (\cdot)

.

The “personal best” (Equation (5)) represents the best position (i.e., the highest value of the fitness function, as demonstrated in Figure A2) a fish has experienced in its past (hence maximizing over all

τ \leq t

of the past). Given that

{\overset{⇀}{p}}_{t}^{(i)}

is the personal best of fish

i

, the “global best” (Equation (6)) is simply the best of the bests, as it finds the fish that has highest fitness value among all fish.

In Equation (4),

c_{1}

and

c_{2}

are learning rates. These are standard parameters in artificial intelligence to attain the most effective learning. First of all, fish tend to move toward the current leader, who is the best among all fish,

{\overset{⇀}{g}}_{t - 1} - {\overset{⇀}{x}}_{t - 1}^{(i)}

, and also tend to move toward its own historical best,

{\overset{⇀}{p}}_{t - 1}^{(i)} - {\overset{⇀}{x}}_{t - 1}^{(i)}

This is known as “exploitation” in artificial intelligence, as they would like to learn from/exploit given information. Yet more importantly, fish also move randomly based on the two random variables

r_{1}

and

r_{2}

. These random movements are known as “exploration,” meaning that the fish do not only learn from exploitation. This is analogous to mutations in a genetic algorithm. This is key to artificial intelligence to avoid local optima. Note that without such random movements, fish can only move within the initial positions that are arbitrarily assigned.

The update of the position is the same as Equation (3) and repeated here:

{\overset{⇀}{x}}_{t}^{(i)} = {\overset{⇀}{x}}_{t - 1}^{(i)} + {\overset{⇀}{v}}_{t}^{(i)}

(7)

2.3. Combining Boids and PSO

Although boids and PSO are based on the same intelligence (i.e., peer-to-peer communication), their purposes are different. The former is a behavioral model, and yet the latter is an optimization tool. As we can see, the parameters in boids, alignment, cohesion, and separation, describe how fish move around, and every fish is identical (i.e., there is no leader). Ref. [17] argues that with only three parameters followed identically by every fish, amazing group intelligence can be achieved to avoid prey (or barriers) or hunt food. On the other hand, PSO has a different use of intelligence. PSO identifies a leader (i.e., whoever is the global best), and other fish follow. The behavior of each fish is not determined by the three parameters but by following the leader (and its own personal best position), and, in doing so, the global optimum can be reached.

We can combine the two versions of the swarm easily. By combining Equations (1) and (4), the velocity equation can be defined as

{\overset{⇀}{v}}_{t}^{(i)} = w_{A} {\overset{⇀}{v}}_{A, t}^{(i)} + w_{C} {\overset{⇀}{v}}_{C, t}^{(i)} + w_{S} {\overset{⇀}{v}}_{S, t}^{(i)} + α_{P, t}^{(i)} ({\overset{⇀}{p}}_{t - 1}^{(i)} - {\overset{⇀}{x}}_{t - 1}^{(i)}) + α_{L, t}^{(i)} ({\overset{⇀}{g}}_{t - 1} - {\overset{⇀}{x}}_{t - 1}^{(i)}),

(8)

where the random coefficients

c_{1} r_{1}

and

c_{2} r_{2}

in Equation (4) are replaced by constants

α_{P, t}^{(i)}

and

α_{L, t}^{(i)}

, respectively. Here, PSO is used for a behavioral model and no longer used for optimization, as we have no fitness function. In PSO, the leader is the one who has the highest fitness value. In other words, there must exist a fitness function upon which each fish can be evaluated. This fitness function is then maximized (minimized in some cases) once the algorithm converges. In our empirical work, there is no such fitness function. The leader is determined by whoever is being followed the most. This way, the new swarm will possess both behaviors of boids and leader following.

It is apparent that one can modify (by adding, dropping, or editing) the above velocity for any specific need. For example, one can eliminate the personal best if one is only interested in group behavior. Or, one can drop separation, as it is not relevant to evaluating financial assets.

In our empirical work, because we only focus on identifying the leader, we simplify Equation (8) as

{\overset{⇀}{v}}_{t}^{(i)} = α_{L, t}^{(i)} ({\overset{⇀}{g}}^{t - 1} - {\overset{⇀}{x}}_{t - 1}^{(i)})

(9)

We drop all of the other terms, as we would like to focus on just one parameter,

α_{L, t}^{(i)}

, which represents the speed of lead/lag (which is in turn used in the empirical work of portfolio sorting).

Note that

α_{L, t}^{(i)}

is a “follow-the-leader” parameter that is different for every fish. That is, each fish can choose its own tendency to follow the leader. We could alternatively replace

α_{L}^{(i)}

by

α_{L}

(without the superscript). In doing so, every fish must adopt the same speed.

Clearly, the particular version of swarm we adopt in this paper is restricted. A general swarm should contain all of the parameters. By estimating only

α_{L, t}^{(i)}

, the effects of other parameters, such as alignment and cohesion, are embedded in

α_{L, t}^{(i)}

. Implicitly, we are assuming that these omitted parameters have the same effect for every fish, implying that these effects will not alter the direction of

α_{L, t}^{(i)}

, although the magnitudes will change.

3. Empirical Methodology

Our empirical design follows a rigorous three-stage pipeline to establish the relationship between swarm-based dynamics and asset pricing. First, we transform the time-series of monthly factor returns into a sequence of instantaneous positions (

{\overset{⇀}{x}}_{t}^{(i)}

) in a 12-dimensional swarm space. The change in position from t to t + 1 provides the empirically observed velocity (

{\overset{⇀}{v}}_{t}^{(i)}

). Second, using the leader-following model (a variant of PSO), we estimate the factor-specific parameter

α_{L, t}^{(i)}

by minimizing the squared error between the model-predicted velocity and the observed empirical velocity. This factor-specific

α_{L, t}^{(i)}

is defined as the swarm velocity (SV) metric. Third, we sort the 190 factors into quintiles each month based on their calculated SV. The core test of predictability involves analyzing the subsequent month’s return differential and risk-adjusted alphas between the slowest- and fastest-moving factor quintiles.

This section provides a detailed explanation of the estimation of the factor-specific swarm velocity metric. We can estimate the parameters of swarm (Equation (1)) using data or factor returns in our study. Specifically, empirical positions (i.e., factor returns) can be labeled as

{\overset{⇀}{ξ}}_{t}^{(i)}

and velocity as

{\overset{⇀}{ν}}_{t}^{(i)}

(to substitute for

{\overset{⇀}{x}}_{t}^{(i)}

and

{\overset{⇀}{v}}_{t}^{(i)}

in Equation (3)). Hence, similar to Equation (3),

{\overset{⇀}{ν}}_{t}^{(i)} = {\overset{⇀}{ξ}}_{t}^{(i)} - {\overset{⇀}{ξ}}_{t - 1}^{(i)}

(10)

Our objective function is to minimize the sum of squared errors between

{\overset{⇀}{ν}}_{t}^{(i)}

and

{\overset{⇀}{v}}_{t}^{(i)}

:

\min_{θ} ({\overset{⇀}{v}}_{t}^{(i)} - {\overset{⇀}{ν}}_{t}^{(i)})^{'} ({\overset{⇀}{v}}_{t}^{(i)} - {\overset{⇀}{ν}}_{t}^{(i)}) = \min_{θ} \sum_{j = 1}^{n} {(v_{j, t}^{(i)} - ν_{j, t}^{(i)})}^{2}

(11)

By taking the partial derivative and setting it to 0,

\sum (v_{j, t}^{(i)} - ν_{j, t}^{(i)}) \frac{\partial v_{j, t}^{(i)}}{\partial α_{L, t}^{(i)}} = 0

(12)

In this paper, we only focus on leaders and laggards, and hence there is only parameter,

α_{L, t}^{(i)}

.

Clearly, Equation (12) in general has no closed-form solution and needs to be solved numerically. However, if we focus on one parameter at a time (i.e., holding other parameters constant), then there is a closed-form solution, which is what we implement in the empirical section.

In our empirical work, we are only interested in leader following. Hence,

{\overset{⇀}{v}}_{t}^{(i)} = α_{L, t}^{(i)} ({\overset{⇀}{g}}_{t - 1} - {\overset{⇀}{x}}_{t - 1}^{(i)})

(13)

The solution (see the Appendix B) is

α_{L, t}^{(i)} = \frac{\sum_{j = 1}^{n} ν_{j, t}^{(i)} (g_{j, t} - x_{j, t}^{(i)})}{\sum_{j = 1}^{n} {(g_{j, t} - x_{j, t}^{(i)})}^{2}}

(14)

While Equation (14) minimizes the empirical sum of squared errors described by Equation (11), it has an interesting interpretation. First of all, we should note that every term in Equation (14) comes from data.

ν_{j, t}^{(i)}

is the empirical velocity of the

i

-th risk factor in the

j

-th dimension at time

t

. This quantity is obtained by taking a difference of returns of the

i

-th risk factor in two consecutive months.

x_{j, t}^{(i)}

is the return of the

i

-th risk factor in the

j

-th dimension at time

t

. Later, as we explain our empirical methodology, it will be clear that

x_{j, t}^{(i)}

is the return of the

i

-th risk factor in the

j

-th lagged month, given that we are using past returns to predict future returns (and hence lagged returns are features/dimensions in the swarm model). Hence,

x_{j, t}^{(i)}

is defined as

t - j

return of the risk factor

i

. We use the last 12 returns (

n = 12

in Equation (14)) in our empirical work, and hence there are 12 lagged returns in Equation (14). That is,

x_{1, t}^{(i)}

is the return of the last month (

t - 1

),

x_{2, t}^{(i)}

is the return of two months ago (

t - 2

), …, until the last one

x_{12, t}^{(i)}

which is the return of the twelfth lagged month.

Finally,

g_{j, t}

is the “leader,” however it is chosen. As mentioned earlier, when there is a landscape (i.e., fitness function), the leader can be whoever possesses the best fitness value. Or, it can simply be chosen subjectively. In our empirical work, we are seeking to find such a leader. As a result, we loop through each risk factor as the leader and compute the speeds of its followers. If a particular risk factor is not a leader, then the other risk factors will not follow it, and the speeds are therefore small. If this risk factor is indeed the leader, then the other risk factors will follow it closely, and the speeds will be high.

In other words, each risk factor’s position (i.e., the last 12 months of returns) when it is assumed to be the leader is

g_{j, t} = x_{j, t}^{(i)}

(15)

and, as a result,

α_{L, t}^{(i)} = 0

.

The average across individual risk factors is

\begin{matrix} {\bar{α}}_{L, t} & = \frac{1}{m - 1} \sum_{i = 1}^{m - 1} α_{L, t}^{(i)} \\ = \frac{1}{m - 1} \sum_{i = 1}^{m - 1} \frac{\sum_{j = 1}^{n} ν_{j, t}^{(i)} (g_{j, t} - x_{j, t}^{(i)})}{\sum_{j = 1}^{n} {(g_{j, t} - x_{j, t}^{(i)})}^{2}} \end{matrix}

(16)

It is noteworthy that the method of looping through each risk factor as a potential leader is a core feature that distinguishes our approach from simpler, purely financial lead–lag models. Alternative leader selection mechanisms, such as designating the largest factor by market capitalization or the factor with the highest trading volume, rely on exogenous financial proxies that may not accurately reflect the psychological mechanism of herding. Herding behavior is driven by which stock (or factor) is currently perceived as the “winner” by market participants, which is not necessarily the largest or most liquid factor. Our data-driven approach is superior because the optimal leader is identified endogenously—it is the factor whose past position best explains the collective subsequent movements of the entire swarm, as measured by minimizing the sum of squared errors (SSE) in Equation (11). This method directly captures the empirical leader–follower dynamic that aligns best with the observed market behavior. Consequently, the chosen leader is the one that historically best dictated the direction of factor movements, providing a more direct proxy for herding than simple size or liquidity metrics.

In the next empirical section, we provide a visualization of how estimation is performed. A more generalized optimization can be found in [20,21].

As we can see, in this paper, we apply data to swarm and “back-out” the parameters of the swarm, and we even try to find out who the leader is if the data indeed comply with our assumption that there is swarm in risk factors. This is analogous to computing the implied volatility of the Black–Schole option pricing model in that the model is not used for pricing but for “backing out” the parameters of the model.

4. The Swarm-Related Literature in Finance

Our research intersects two distinct, yet complementary, streams in the financial literature: (1) the broader application of advanced artificial intelligence and machine learning techniques in financial modeling, and (2) the classical behavioral finance topics of herding and lead–lag effects, which provide the economic interpretation for our velocity measure.

4.1. Contextualizing Swarm Intelligence in Modern Financial AI

Our research utilizes swarm intelligence (SI), a metaheuristic rooted in collective agent behavior, positioning it uniquely within the rapidly evolving landscape of quantitative financial modeling. In contemporary finance, the literature has increasingly moved beyond linear factor models to adopt highly flexible, non-linear methodologies, such as deep learning (DL) and reinforcement learning (RL), which are primarily aimed at superior prediction and dynamic strategy execution [22,23]. This evolution recognizes that complex financial systems are best modeled as complex adaptive systems (CAS), where traditional linear relationships often fail to capture market non-linearities and emergent properties [24].

However, while DL and RL prioritize predictive accuracy, they often operate as black box models, making it difficult to extract economic intuition about why a prediction was made. Swarm intelligence and related evolutionary algorithms offer an important middle ground. They were historically a key component of early artificial intelligence in finance, demonstrating strong capabilities in portfolio optimization and parameter estimation in high-dimensional, rugged landscapes where parametric methods fail [10,25].

The SI framework we employ offers a unique advantage over pure prediction models as it provides a powerful, physics-based, and interpretable lens through which to measure a specific behavioral dynamic, the speed and direction of collective factor movement. By focusing on estimating the factor velocity parameter and linking it directly to asset pricing theory (information diffusion and trading frictions), our contribution is positioned squarely in the realm of advanced behavioral and psychological modeling, offering economic insight alongside a modern computational technique.

4.2. Herding

Swarm, although meant for describing low-intelligent animals like ants and bees, has actually been observed in human behaviors. When gathered, humans tend to act collectively and hastily without a thorough decision making process. This is known as herding.

Ref. [26] is believed to be the first author who studied herding in the financial market. He describes investors influenced by their peers in making investment decisions as a social activity. He asserts that when making investment decisions, humans tend to gossip about others’ successes or failures in investing as opposed to making intelligent decisions of their own.

In fact, humans herd in many other activities, as well. When voting for someone politically, people often go with their close friends and relatives as opposed to thinking independently. Nutrition decisions are another example where people tend to listen to their friends and relatives as opposed to actively conducting research. The main reason is that such research usually takes time and people are “lazy.” Another possible reason is that these decisions (voting, picking supplements, and investing) are often gossip topics when people social.

Herding is a significant research topic because researchers find that herding does not generate better returns. Ref. [27] argues that herding results in market disturbances and destabilization, which can move the market away from its fundamentals. As a result, it can be expected that herding is not persistent. It happens sometimes, ceases to happen sometimes, and furthermore contradicts itself sometimes.

Herding, as a result, can be regarded as a reason why technical analyses have been persistent. Financial economists have long frowned on technical analyses due to their lack of economic support. Although they may not have economic support, they have plenty of psychological support. For this reason, Ref. [28] study the determinants of herding and find that when volatility is high (e.g., a crisis), the size of the firm is small, interest rates are high, or the market is opaque, herding is likely to be observed. It is obvious that such situations are not sustainable situations. They happen only temporarily and are naturally not persistent.

Various authors, given data availability and research focus, use different proxies for herding (see, for example, [29,30]). Yet these proxies suffer from various measurement errors, and, so far, there has been no consensus.

The literature also associates herding with momentum (see [31] for the original momentum research). Ref. [32] define herding as how a group of investors moves in and out of the market simultaneously, which is then connected to momentum trading. Recently, Ref. [33] studied the influence of industry herding on momentum returns. Ref. [34] also studied the impact of herding behavior on momentum. Ref. [35] found that uninformed country-level herding is highly related to momentum. Ref. [28] discovered strong evidence of international herding. Ref. [36] estimated swarm parameters using a proprietary broker/dealer dataset and found mild herding among brokers and dealers. Ref. [37] provides an overview of the recent theoretical and empirical research on herd behavior in financial markets.

4.3. Lead–Lag

Asset returns that move faster, referred to as leaders in our study, could be driven by several factors. One possibility is that they exhibit higher liquidity, allowing investors to incorporate value-relevant information more quickly [14,38]. Another potential explanation is that these assets attract heightened investor attention, often due to media coverage or prevailing market sentiment [13,39]. When assets gain excessive attention, their prices may deviate from fundamentals, leading to a temporary overvaluation that eventually corrects over time. If investor-driven overreaction propels fast-moving assets, we would expect them to subsequently underperform, as measured by alphas relative to standard asset pricing models, such as the Fama–French factor models [40,41] or the Hou–Xue–Zhang factor models [42]. In other words, the overvaluation fueled by heightened investor attention should lead to lower subsequent alphas as prices revert to their intrinsic values. However, if the primary driver of rapid price movement is superior liquidity conditions, then these assets would likely incorporate value-relevant information efficiently, leaving no room for abnormal performance and ensuring fair pricing.

Conversely, assets that move slower, referred to as laggards, may do so for several reasons. One possibility is that they operate in more mature sectors where the information flow is relatively stable, leading to fewer price-adjusting events [40]. As a result, the absence of significant informational updates causes slow price movements. Another potential explanation is that these assets face higher trading frictions, such as lower liquidity, wider bid–ask spreads, and higher transaction costs [43]. In this case, market participants take longer to process and incorporate new information, leading to a slower diffusion of price-relevant signals. If slow movement results from operating in a stable industry with predictable fundamentals, there may be no systematic impact on future returns. However, if frictions and impediments to trading hinder the incorporation of information, the direction of future abnormal returns depends on whether the market predominantly underreacts to good news or bad news. If slow-moving assets experience an underreaction to positive information, they could generate positive alphas, whereas an underreaction to negative information could lead to negative alphas. This remains an empirical question, and we rely on data-driven analysis to uncover the relationship between asset velocity and return predictability.

5. Empirical Findings

In this section, we briefly introduce the data we use and how a swarm model is applied in order to estimate the speed of following the leader, and we investigate the market pricing of slow- and fast-moving risk factors.

5.1. Data

The test assets are monthly Hou–Xue–Zhang 190 risk factors over 23 years obtained from the Q-factor data library, https://global-q.org/index.html (accessed on 7 March 2025). It contains portfolios in six categories with different periods:

Frictions (February 1990~December 2022), 10 factors;
Intangibles (January 1990~December 2022), 31 factors;
Investments (January 1973~December 2022), 32 factors;
Momentum (July 1979~December 2022), 42 factors;
Profitability (January 1985~December 2022), 44 factors;
Value vs. growth (January 1985~December 2022), 31 factors.

In order to run our swarm model, we adopt the common period from February 1990 to December 2022, a total of 395 months.

We use lag for 12 months as features. That is, the universe of the swarm is a 12-dimensional space. In other words, on each month, a risk factor (i.e., a fish) is positioned in a 12-dimensional space according to its last 12 months of returns. As each month passes by, the risk factor moves to the next 12 months of returns.

However, to help visualize how we fit a swarm model via data, we only use lag for 2 months as a demonstration in Figure 2. Figure 2 plots frictions’ 10 risk factors for 4 consecutive months, April through July of 1990. Each panel contains positions (i.e., returns) of the last two months. For example, Panel (A) plots 10 risk factors’ February returns on the x-axis and March returns on the y-axis.

We pick two risk factors as an example; beta_1 (the market beta estimated using daily returns within a month) is marked in red, and tv_1 (total volatility computed from daily returns over the same period) is marked in yellow. From Figure 2, we can clearly observe how they move from month to month. As explained earlier, on each month, the position of a risk factor is its last two months of returns. The returns of the two risk factors are given below.

In Table 1, Panel (A), beta_1’s position is (3.5051, 7.4907), and tv_1’s position is (1.4045, 1.6897). In Panel (B), beta_1’s position is (7.4907, 0.3085), and tv_1’s position is (1.6897, −1.5450). From month to month, we can now see the migration of the two risk factors. We then use these positions to estimate the swarm speed. Then, Panel C illustrates how the velocity is calculated, which is measured as the difference between two consecutive positions.

These numerical results are then fed into Equation (16) to calculate the speed parameter. In our empirical work, 12 lags are used. We also tried different numbers of lags, and the results are qualitatively similar. They are available upon request. In terms of the number of risk factors, we follow [42] and use 190 risk factors. Hence, we use

m = 190

and

n = 12

in our swarm model.

To visualize the movements of the risk factors (i.e., fish), we use beta_1 and tv_1 as an example and plot their movements in Figure 3.

The movement of a risk factor is represented by the velocity. As described in Equation (10), the actual velocity from data can be calculated by taking the difference of the positions of two consecutive months. Panel (A) of Figure 2 plots the positions of chosen risk factors in April 1990, and Panel (B) plots the positions of chosen risk factors in May 1990. Taking beta_1 as an example, Figure 3 plots its positions in these two consecutive months. Its velocity is calculated (as the difference of positions) as (3.98, –7.18), which is the difference in two consecutive positions, (3.5051, 7.4907) and (7.4907, 0.3085), in April and May of 1990, respectively. Similarly, the velocity for tv_1 (orange) is (0.28, –3.23). The result is plotted in Panel (A) of Figure 3. Similarly, the transitions from May to June and June to July of 1990 are plotted in Panels (B) and (C).

In estimating the parameters, such as

α_{L, t}^{(i)}

in Equation (14), we simply iterate the parameter value until the model’s velocity matches the data’s velocity as closely as possible (i.e., minimized sum of squared errors).

5.2. Experimental Design and Procedure

This section clarifies the experimental environment, the fixed parameter settings of our adapted swarm model, and the step-by-step procedure used to conduct the monthly asset pricing tests.

The core empirical methodology is a time-series analysis built on monthly cross-sectional sorting. The fixed parameters of our estimation environment are determined based on the input data and the structural design of the swarm space. The agents, or “particles,” in the swarm are the 190 cross-sectional risk factors sourced from the Q-factor data library. The dimension of the swarm space is fixed at D = 12, representing the preceding 12 months of returns for each factor. This D = 12 choice serves as the input features that define the factor’s position vector (

{\overset{⇀}{x}}_{t}^{(i)}

) in the swarm space. The core estimation period spans T = 395 months, from February 1990 to December 2022.

The experiment is executed via a monthly rolling procedure, which consists of three distinct phases. The first phase involves the observation and modeling of factor dynamics. We first compute the position vector (

{\overset{⇀}{x}}_{t}^{(i)}

) for all 190 factors at the end of each month t using their returns from t − 11 to t. The empirically observed velocity (

{\overset{⇀}{v}}_{t}^{(i)}

) is then calculated as the change in position from the previous month:

{\overset{⇀}{v}}_{t}^{(i)} = {\overset{⇀}{x}}_{t}^{(i)} - {\overset{⇀}{x}}_{t}^{(i)}

. Subsequently, the leader position (

{\overset{⇀}{x}}_{t}^{(leader)}

) is identified as the factor exhibiting the highest cross-sectional return over the preceding 12-month period, t − 11 to t. Finally, using the time-series of

v_{t}^{(i)}

and the position difference (

{\overset{⇀}{x}}_{t}^{(leader)} - {\overset{⇀}{x}}_{t}^{(i)}

), we estimate the factor-specific swarm velocity (SV),

α_{L, t}^{(i)}

, for all 190 factors by minimizing the error via the closed-form solution (Equation (14)).

The second phase establishes the link between dynamics and returns. Specifically, at the end of month t, we sort the 190 factors into five equal-weight quintile portfolios based on their estimated SV. The Low SV portfolio comprises the slow-moving factors, and the High SV portfolio comprises the fast-moving factors. We then calculate the equal-weighted return for each of the five portfolios over the subsequent month, t + 1. This step establishes the core link between factor dynamics (measured at t) and subsequent return predictability (observed at t + 1).

The third phase involves the formal statistical evaluation. We calculate the time-series average of the return differential between the High SV and Low SV portfolios (High−Low). To test whether the predictability constitutes an unmodeled risk premium, the time-series returns of all five portfolios are regressed against various established factor models (FF5, FF6, Q, Q5). The intercept terms (α) are the primary metric for assessing abnormal performance. Finally, we examine the long-term return predictability by analyzing portfolio returns when the portfolios formed at month t are held and tested over horizons extending up to 12 months ahead (as summarized in Table 3). This comprehensive procedure is repeated for every month within the sample period, generating a time-series of monthly portfolio returns used for all subsequent statistical inference.

5.3. Speed Result

The speed results are plotted in Figure 4 and Figure 5. Figure 4 presents a distribution of average speeds across risk factors. Each observation in this distribution is an average speed of a risk factor over the entire sample period. It can be seen that (1) all speeds are positive, demonstrating a definite swarm behavior (somewhat surprising!), and (2) the mode of the distribution is slightly less than 0.5, with the next highest peak slightly higher than 0.5.

Figure 5 presents the time-series of average speed across risk factors. As we can see, the variability of speed over time is not noteworthy, and the mean and the median are very close to each other, demonstrating no significant skewness. Figure 5 also plots the 25th and 75th percentiles, and they indicate a roughly symmetrical distribution across risk factors.

5.4. Market Pricing of Factor Velocity

To investigate how the market prices factors with different velocity measures derived from swarm intelligence, we conduct a portfolio-level analysis. Each month, we sort 190 factors from the Q-factor data library into quintiles based on their swarm velocity (SV). We then compute the equal-weighted returns for each quintile and compare the return differences between the top and bottom quintiles. Our portfolio-level analysis is conducted in an out-of-sample fashion in the time-series sense. Specifically, for each month, factor velocities are computed using only return data from the prior 12 months, and these estimates are then used to sort factors and evaluate one-month-ahead returns. This rolling structure ensures that no future information is used in forming the velocity measure, thereby preserving the out-of-sample integrity of the return predictions. To further address concerns about predictive validity across time, we perform a subsample analysis by splitting the sample into two equal subperiods and examining the return predictability of factor velocity separately within each. The results (available upon request) remain directionally consistent with the full-sample findings, with slow-moving factors continuing to earn higher average returns than fast-moving ones. These findings reinforce the robustness and generalizability of swarm velocity as a predictive signal.

The primary metrics used to evaluate the performance of portfolios sorted by swarm velocity (SV) are the modified Jensen’s Alpha (α), serving as the gold standard for measuring risk-adjusted abnormal returns in empirical asset pricing (e.g., [40,41,44,45]); among many others). Specifically, α is calculated as the intercept term in a time-series regression of the monthly returns of each factor portfolio (

R_{t}^{p}

) on established market risk factors (

F_{t}^{k}

) using the general K-factor model,

R_{t}^{p} = α^{p} + \sum_{k = 1}^{K} {β^{k} F}_{t}^{k} + ϵ_{t}^{p}

. A positive alpha (α > 0) indicates superior performance, returns exceeding those predicted by the model given its risk, suggesting an abnormally high return after accounting for portfolio p’s exposure to market risk factors. Conversely, a negative alpha indicates underperformance. The statistical significance of alpha is determined by its t-statistic (Newey–West adjusted to account for serial correlation), where an absolute value typically exceeding 1.96 confirms that the abnormal return is statistically robust across the alternative factor models tested. Following prior studies, we estimate alphas for each factor portfolio and the return difference between the top and the bottom SV quintiles relative to the following established models: the Fama–French Five-Factor Model (FF5, including excess market return (MKT−RF), size factor (SMB), book-to-market factor (HML), investment growth factor (CMA), and operating profitability factor (RMW)); the Fama–French Six-Factor Model (FF6, extending FF5 with a momentum factor (UMD)); the augmented FF6 model (FF6PS, including the Pastor–Stambaugh illiquidity factor (PS)); the Hou, Xue, and Zhang Four-Factor Model (Q, including excess market return (R_MKT), size factor (R_ME), investment growth factor (R_IA), and operating profitability factor (R_ROE)); and the extended Q-Factor Model (Q5, including the growth-related factor (R_EG)).

Table 2 presents the results. Column 1 reports the average characteristics of the sorting variable, Column 2 provides the mean equal-weighted returns within each quintile, and Columns 3 to 8 display alphas relative to various factor models. These models include (1) the Fama–French Five-Factor Model (FF5), which includes market (MKT), size (SMB), book-to-market (HML), investment (CMA), and profitability (RMW) factors [40], with the resulting intercept term labeled FF5 alpha; (2) the Fama–French Six-Factor Model (FF6), which adds the momentum (UMD) factor to FF5, producing FF6 alpha [40,44]; (3) the Fama–French–Carhart Six-Factor Model augmented with the Pastor–Stambaugh liquidity factor [14], generating FF6PS alpha; (4) the Hou–Xue–Zhang Four-Factor Model (Q-factor model), which includes R_MKT, R_ME, investment (R_IA), and profitability (R_ROE) [42], with the corresponding Q alpha; and (5) the Hou–Mo–Xue–Zhang Five-Factor Model [46], which extends the Q-factor model with the growth (R_EG) factor, producing Q5 alpha. The last row of Table 2 reports the differences in mean returns and alphas between the top and bottom quintiles, with Newey–West-adjusted t-statistics) accounting for serial correlation. Our sample period spans March 1991 to December 2022 to ensure all 190 factors are available each month.

Table 2 reveals that factors in the lowest SV quintile earn significantly higher abnormal returns than those in the highest SV quintile, with an alpha difference ranging from 17 to 22 basis points per month. Importantly, this return differential is primarily driven by the superior performance of slower-moving factors. The time period robustness test shows that the predictive power of the SV factor holds across both the pre-Global Financial Crisis or GFC (March 1993–December 2008) and post-GFC (January 2009–December 2022) periods, with slow-moving factors outperforming fast-moving factors at the one-month-ahead horizon, suggesting that its predictive strength is not confined to a specific economic cycle or market regime.

We further examine whether swarm speed exhibits return predictability beyond a one-month horizon. To test this, we relate the swarm velocity measure to future returns of the previously formed quintile factor values over horizons extending from 2 months to 12 months ahead.

Table 3 presents the Hou–Mo–Xue–Zhang Five-Factor alphas (Q5) for these horizons. The results show that the return spread between the top and bottom SV quintile factor values remains negative across all horizons, ranging from 5 to 17 basis points per month. However, only the two-month-ahead alpha spread is statistically significant, driven by the continued outperformance of slow-moving factors.

Next, we explore the characteristics of fast-moving versus slow-moving factors. Panel A of Table 4 lists the top 10 slowest-moving factors (left panel) and the top 10 fastest-moving factors (right panel). For comparison, values in the table are demeaned. The slowest-moving factors exhibit an average velocity below the mean of all 190 factors, with a probability exceeding 50% of falling into the slow-moving quintile and a 20% lower likelihood of appearing in the fast-moving quintile. Conversely, the fastest-moving factors exhibit significantly higher velocity scores, with probabilities more than 50% above the mean for appearing in the fast-mover quintile and approximately 20% lower for falling into the slow-mover quintile.

To improve interpretability and identify underlying drivers of return differentials, we follow [42,46] and categorize factor-level results by their respective themes, including frictions, intangibles, investments, momentum, profitability, and value vs. growth. Panel B of Table 4 shows that friction-related factors are disproportionately overrepresented in the slow-mover risk factor, appearing at more than twice their unconditional share. In contrast, investment-related factors are significantly underrepresented, comprising less than 50% of their unconditional share. Within the fast-mover risk factor, momentum-related factors dominate, accounting for 36.20% of the risk factor versus their unconditional share of 25.37%, while intangibles and value vs. growth factors are notably underrepresented. The non-random factor categorization bias in Table 4, Panel B, is formally confirmed using a Chi-Squared (

χ^{2}

) goodness-of-fit test. The test compares the observed distribution of factor categories within the extreme quintiles against the unconditional distribution and finds the bias statistically significant at the 1% level or better for both slow- and faster-mover portfolios, validating that the factor bias is non-random.

The concentration of specific factor categories in the extreme velocity quintiles strongly supports our behavioral interpretation. The heavy overrepresentation of friction factors (e.g., beta1, tv1, ivff1) within the slow-mover portfolio directly confirms the hypothesis that the low swarm velocity is driven by trading impediments and delayed information diffusion. These factors, which are often proxies for low liquidity or high idiosyncratic risk, inherently lead to slow price adjustments, as institutional investors may be slow to trade or incorporate new information due to high transaction costs. Conversely, the significant domination of the fast-mover portfolio by momentum factors (e.g., cm12, sim12, abr12) is consistent with the literature on investor attention and herding. Momentum factors are highly sensitive to recent price action and news, leading to rapid collective movements. These factors act as the trend-setters or “leaders” within the swarm system, and their high swarm velocity confirms their role as the fastest responders to market sentiment. This empirical factor alignment validates that the estimated swarm velocity is a direct, data-driven measure of a factor’s speed of price adjustment.

Next, we examine the persistence of swarm velocity (SV) using a portfolio transition matrix, which reports the probability that a factor in the

i

-th SV quintile at month t remains in the

j

-th SV quintile at month

t + n

, where n ranges from 1 to 60 months (i.e., from 1 month ahead to 5 years ahead). Under the null hypothesis of random portfolio assignment, the probability of a factor remaining in the same quintile over time would be 20%. Panel A of Table 5 shows that for the one-month-ahead transition matrix, 79% (81%) of factors in the highest (lowest) SV quintile at month t remain in the same quintile at month

t + 1

. Moreover, for all SV quintiles, the probability of remaining in the same quintile at

t + 1

exceeds the random benchmark of 20%, indicating meaningful short-term persistence. Panels B through H of Table 5 present the SV transition matrices at 3-, 6-, 12-, 24-, 36-, 48-, and 60-month horizons. The results reveal strong and sustained persistence in SV. For example, 70%, 60%, 45%, 42%, 40%, and 41% of factors in the high-SV portfolio remain in the high-SV quintile after 3, 6, 12, 24, 36, 48, and 60 months, respectively. Similarly, the corresponding persistence rates for factors in the low-SV portfolio are 71%, 62%, 50%, 49%, 48%, and 46%. These probabilities are substantially higher than the 20% expected under random assignment and provide compelling evidence that swarm velocity is highly persistent over time. The persistence of swarm velocity (SV) is formally confirmed by testing the factor retention rates against the 20% random benchmark (null hypothesis). The retention rates in the extreme portfolios (Low SV and High SV) are statistically significant at the 1% level across all horizons tested (up to 60 months).

These findings, combined with our risk factor return results, support the hypothesis that trading frictions impede the swift incorporation of value-relevant information, leading to subsequent outperformance of slow-moving factors. On the other hand, the statistically insignificant alphas of future returns of fast movers are inconsistent with the idea that heightened investor attention results in temporary overvaluation of fast movers, leading to subsequent underperformance.

While the persistence of the negative alpha differential across all major asset pricing models (FF5, FF6, FF6PS, Q, Q5) strongly suggests that the abnormal return is not compensation for known, priced risk, we acknowledge that it could theoretically be attributed to an unpriced macroeconomic risk premium. However, we find the behavioral interpretation to be more compelling and coherent; the high concentration of friction factors in the slow-moving quintile (Table 4, Panel B) provides empirical support for the mechanism of delayed price discovery. In this view, the alpha is not a risk premium but rather a premium earned for exploiting market inefficiency caused by trading impediments and slow information assimilation, which is the direct economic counterpart to a factor’s low swarm velocity.

6. Conclusions

This study presents a novel application of swarm intelligence to classify factor velocity and investigate its role in asset pricing. Swarm is an ideal methodology to model the herding behavior of people trading in the investment world. Given that individuals’ trading data are not available, it is common to follow risk factors (or portfolios) as a proxy of investors’ herding.

By leveraging a swarm-based approach, we systematically identify risk factors that act as market leaders and followers based on their return movement speed. Our results indicate that slow-moving factors generate higher one-month-ahead returns than fast-moving ones, a finding that holds across multiple asset pricing models. This return differential is likely due to trading frictions or delayed information processing by market participants.

These findings open new avenues for understanding return predictability and market efficiency, emphasizing how velocity classification can be integrated into existing asset pricing frameworks. Future research could enhance these models by incorporating alternative swarm intelligence techniques and exploring different test assets. Our study underscores the potential of artificial intelligence in finance, offering a fresh perspective on how collective behaviors shape stock returns.

The velocity-based classification of risk factors provides a compelling economic proof of concept regarding the role of price adjustment speed in market behavior. The finding that factors in the slowest quintile earn significantly higher risk-adjusted returns strongly suggests that these returns are compensation for exploiting an information anomaly (delayed incorporation or underreaction) rather than a known risk factor. However, this study’s primary contribution is focused on factor discovery and economic interpretation. Because our analysis relies on aggregate monthly factor portfolio returns and does not incorporate high-frequency data for modeling transaction costs, liquidity impact, or portfolio turnover, we must caution against drawing immediate conclusions for real-world trading strategies. The observed alpha serves as a robust academic measure of predictive power; however, future research is essential for establishing the full empirical robustness of the factor. This includes extending the framework to individual stocks or other asset classes and using granular trading data to determine exploitability after accounting for implementation costs.

Author Contributions

The authors contributed equally to all aspects of this work, including conceptualization, methodology, formal analysis, and writing. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The code used to estimate the swarm velocity is provided in Appendix B. The data used in the study are publicly available.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. A Brief Sketch of PSO

A typical PSO is demonstrated in Figure A1.

There are only two dimensions in Figure A1 (x and y axes). The red solid circle in Figure A1 represents the position of a particular fish at

t - 1

. It is labeled

x_{j, t - 1}^{(i)}

, where

j

is either the x or y axis. Its movement to the next position at time

t

, i.e.,

x_{j, t}^{(i)}

, is determined by three forces: its previous velocity

v_{j, t - 1}^{(i)}

(solid black arrow), its own best history

p_{j, t}^{(i)}

(solid green arrow), and the global best

g_{j, t}

(solid blue arrow).

To decide its velocity and move to the next position, this fish takes into account, as Equation (4) describes, three dotted arrows representing three different forces (velocities) that pull the red solid circle. The dotted black arrow is inertia; the dotted green arrow is personal best; and the blue dotted arrow is global best. The final velocity is a combination of the three different velocities, which is the three solid arrows (black + green + blue). Hence, finally, the new position is the black solid circle.

Figure A1. Particle swarm optimization. Note: three dotted arrows represent three different forces (velocities) that pull the red solid circle. The dotted black arrow is inertia; the dotted green arrow is personal best; and the blue dotted arrow is global best. The final velocity is a combination of the three different velocities, which is the three solid arrows (black + green + blue). Hence, finally, the new position is the black solid circle.

Note that at any given time, a fish’s position

x_{j, t}^{(i)}

, defined earlier, is evaluated via a fitness function. The fitness function can be viewed as a landscape on top of the

n

-dimensional space. For example, in Figure A2, we illustrate a rugged fitness function on a two-dimensional space. Given only two dimensions, say, x and y, we specify a fitness function

ϕ (x, y)

, as in Figure A2, which has multiple local optima. In such a case, it is difficult to reach the global optimum using any parametric search method.

Figure A2. An illustrative fitness function of particle swarm optimization. The global optimum of this fitness function is at (0, 0), and yet there are multiple local optima.

Appendix B. VBA Code of Swarm

′---------------------------------------------------------------

′In this Macro, we loop each fish as leader

′Then calculate alphas to that fish

′Then rank the leader by average alphas

′This way, we have a rank of all the fish - from leader to loser

′---------------------------------------------------------------

Sub sort_leader()

′lag 2: t, t-1, t-2, ..., t-12

lag = 12

Dim fish(190, 12), velo(190, 12)

Dim alfa(190)

Dim port(190)

For icol = 2 To 191

‘retrieving data from sheet “data”

port(icol - 1) = Sheets("data").Cells(1, icol)

nrow = "B" & (190 * (idate - 2) + 2)

For icol = 2 To 191 'total of 190 fish (factors)

ifsh = icol - 1

idim = 0

For irow = idate To idate + lag 'total of 12 lags as dimensions

idim = idim + 1

′getting coordinates of features (which are lagged returns)

If irow < (idate + lag) Then

fish(ifsh, idim) = Sheets("m_all6_common").Cells(irow, icol)

x1 = Sheets("m_all6_common").Cells(irow + 1, icol)

x0 = Sheets("m_all6_common").Cells(irow, icol)

velo(ifsh, idim) = x1 - x0

End If

′getting the current return

If irow = (idate + lag) Then fish(ifsh, 0) = Sheets("data").Cells(irow, icol)

For ildr = 1 To 190

cum_alf = 0

For ifsh = 1 To 190

nume = 0

deno = 0

If ifsh = ildr Then alfa(ifsh) = 0

If ifsh <> ildr Then

For idim = 1 To 12

nume = nume + velo(ifsh, idim) * (fish(ildr, idim) - fish(ifsh, idim))

deno = deno + (fish(ildr, idim) - fish(ifsh, idim)) ^ 2

irow = ildr + 1

icol = ifsh + 1

cum_alf = cum_alf + alfa(ifsh)

End If

End Sub

Appendix C. List of Variables

This appendix provides the complete list of the 190 factors, including notations and category-specific descriptions.

Category 1: Momentum

1. Abr1 (“abr_1”), cumulative abnormal returns around earnings announcement dates, 1-month holding period;

2. Abr6 (“abr_6”), cumulative abnormal returns around earnings announcement dates, 6-month holding period;

3. Abr12 (“abr_12”), cumulative abnormal returns around earnings announcement dates, 12-month holding period;

4. Cim1 (“cim_1”), customer industries momentum, 1-month holding period;

5. Cim6 (“cim_6”), customer industries momentum, 6-month holding period;

6. Cim12 (“cim_12”), customer industries momentum, 12-month holding period;

7. Cm1 (“cm_1”), customer momentum, 1-month holding period;

8. Cm6 (“cm_6”), customer momentum, 6-month holding period;

9. Cm12 (“cm_12”), customer momentum, 12-month holding period;

10. dEf1 (“def_1”), changes in analyst earnings forecasts, 1-month holding period;

11. dEf6 (“def_6”), changes in analyst earnings forecasts, 6-month holding period;

12. dEf12 (“def_12”), changes in analyst earnings forecasts, 12-month holding period;

13. Ile1 (“ile_1”), industry lead-lag effect in earnings surprises, 1-month holding period;

14. Ilr1 (“ilr_1”), industry lead-lag effect in prior returns, 1-month holding period;

15. Ilr6 (“ilr_6”), industry lead-lag effect in prior returns, 6-month holding period;

16. Ilr12 (“ilr_12”), industry lead-lag effect in prior returns, 12-month holding period;

17. Im1 (“im_1”), industry momentum, 1-month holding period;

18. Im6 (“im_6”), industry momentum, 6-month holding period;

19. Im12 (“im_12”), industry momentum, 12-month holding period;

20. Nei1 (“nei_1”), the number of quarters with consecutive earnings increase, 1-month holding period;

21. 52w6 (“p52w_6”), 52-week high, 6-month holding period;

22. 52w12 (“p52w_12”), 52-week high, 12-month holding period;

23. R6_1 (“r6_1”), prior 6-month returns, 1-month holding period;

24. R6_6 (“r6_6”), prior 6-month returns, 6-month holding period;

25. R6_12 (“r6_12”), prior 6-month returns, 12-month holding period;

26. R11_1 (“r11_1”), prior 11-month returns, 1-month holding period;

27. R11_6 (“r11_6”), prior 11-month returns, 6-month holding period;

28. R11_12 (“r11_12”), prior 11-month returns, 12-month holding period;

29. Re1 (“re_1”), revisions in analyst earnings forecasts, 1-month holding period;

30. Re6 (“re_6”), revisions in analyst earnings forecasts, 6-month holding period;

31. Resid6_6 (“resid6_6”), 6-month residual momentum, 6-month holding period;

32. Resid6_12 (“resid6_12”), 6-month residual momentum, 12-month holding period;

33. Resid11_1 (“resid11_1”), 11-month residual momentum, 1-month holding period;

34. Resid11_6 (“resid11_6”), 11-month residual momentum, 6-month holding period;

35. Resid11_12 (“resid11_12”), 11-month residual momentum, 12-month holding period;

36. Rs1 (“rs_1”), revenue surprises, 1-month holding period;

37. Sim1 (“sim_1”), supplier industries momentum, 1-month holding period;

38. Sim12 (“sim_12”), supplier industries momentum, 12-month holding period;

39. Sm1 (“sm_1”), segment momentum, 1-month holding period;

40. Sm12 (“sm_12”), segment momentum, 12-month holding period;

41. Sue1 (“sue_1”), standard unexpected earnings, 1-month holding period;

42. Sue6 (“sue_6”), standard unexpected earnings, 6-month holding period.

Category 2: Value versus growth

1. Bm (“bm”), book-to-market equity;

2. Bmj (“bmj”), book-to-June-end market equity;

3. Bmq12 (“bmq_12”), quarterly book-to-market equity, 12-month holding period;

4. Cp (“cp”), cash flow-to-price;

5. Cpq1 (“cpq_1”), quarterly cash flow-to-price, 1-month holding period;

6. Cpq6 (“cpq_6”), quarterly cash flow-to-price, 6-month holding period;

7. Cpq12 (“cpq_12”), quarterly cash flow-to-price, 12-month holding period;

8. Dp (“dp”), dividend yield;

9. Dur (“dur”), equity duration;

10. Ebp (“ebp”), enterprise book-to-price;

11. Em (“em”), enterprise multiple;

12. Emq1 (“emq_1”), quarterly enterprise multiple, 1-month holding period;

13. Emq6 (“emq_6”), quarterly enterprise multiple, 6-month holding period;

14. Emq12 (“emq_12”), quarterly enterprise multiple, 12-month holding period;

15. Ep (“ep”), earnings-to-price;

16. Epq1 (“epq_1”), quarterly earnings-to-price, 1-month holding period;

17. Epq6 (“epq_6”), quarterly earnings-to-price, 6-month holding period;

18. Epq12 (“epq_12”), quarterly earnings-to-price, 12-month holding period;

19. Ir (“ir”), intangible return;

20. Nop (“nop”), net payout yield;

21. Ocp (“ocp”), operating cash flow-to-price;

22. Ocpq1 (“ocpq_1”), quarterly operating cash flow-to-price, 1-month holding period;

23. Op (“op”), payout yield;

24. Rev1 (“rev_1”), long-term reversal, 1-month holding period;

25. Rev6 (“rev_6”), long-term reversal, 6-month holding period;

26. Rev12 (“rev_12”), long-term reversal, 12-month holding period;

27. Sp (“sp”), sales-to-price;

28. Spq1 (“spq_1”), quarterly sales-to-price, 1-month holding period;

29. Spq6 (“spq_6”), quarterly sales-to-price, 6-month holding period;

30. Spq12 (“spq_12”), quarterly sales-to-price, 12-month holding period;

31. Vfp (“vfp”), analyst-based intrinsic value-to-market;

32. Vhp (“vhp”), Roe-based intrinsic value-to-market.

Category 3: Investment

1. Aci (“aci”), abnormal corporate investment;

2. Cei (“cei”), composite equity issuance;

3. Dac (“dac”), discretionary accruals;

4. dBe (“dbe”), changes in book equity;

5. dCoa (“dcoa”), changes in current operating assets;

6. dFin (“dfin”), changes in net financial assets;

7. dFnl (“dfnl”), changes in financial liabilities;

8. dIi (“dii”), percent changes in investment relative to industry;

9. dLno (“dlno”), changes in long-term net operating assets;

10. dLti (“dlti”), changes in long-term investments;

11. dNca (“dnca”), changes in non-current operating assets;

12. dNco (“dnco”), changes in net non-current operating assets;

13. dNoa (“dnoa”), changes in net operating assets;

14. dPia (“dpia”), changes in PPE and inventory scaled by lagged assets;

15. dWc (“dwc”), changes in net non-cash working capital;

16. I/A (“ia”), investment-to-assets (asset growth);

17. Iaq1 (“iaq_1”), quarterly investment-to-assets (asset growth), 1-month holding period;

18. Iaq6 (“iaq_6”), quarterly investment-to-assets (asset growth), 6-month holding period;

19. Iaq12 (“iaq_12”), quarterly investment-to-assets (asset growth), 12-month holding period;

20. Ig (“ig”), investment growth;

21. 2Ig (“ig2”), 2-year investment growth;

22. Ivc (“ivc”), inventory changes;

23. Ivg (“ivg”), inventory growth;

24. Ndf (“ndf”), net external debt financing;

25. Nxf (“nxf”), net external equity financing;

26. Noa (“noa”), net operating assets;

27. Nsi (“nsi”), net stock issues;

28. Oa (“oa”), operating accruals;

29. Pda (“pda”), percent discretionary accruals;

30. Poa (“poa”), percent operating accruals;

31. Pta (“pta”), percent total accruals;

32. Ta (“ta”), total accruals.

Category 4: Profitability

1. Ato (“ato”), assets turnover;

2. Atoq1 (“atoq_1”), quarterly assets turnover, 1-month holding period;

3. Atoq6 (“atoq_6”), quarterly assets turnover, 6-month holding period;

4. Atoq12 (“atoq_12”), quarterly assets turnover, 12-month holding period;

5. Cla (“cla”), cash-based operating profits-to-lagged assets;

6. Claq1 (“claq_1”), quarterly cash-based operating profits-to-lagged assets, 1-month holding period;

7. Claq6 (“claq_6”), quarterly cash-based operating profits-to-lagged assets, 6-month holding period;

8. Claq12 (“claq_12”), quarterly cash-based operating profits-to-lagged assets, 12-month holding period;

9. Cop (“cop”), operating cash flow-to-assets;

10. Cto (“cto”), capital turnover;

11. Ctoq1 (“ctoq_1”), quarterly capital turnover, 1-month holding period;

12. Ctoq6 (“ctoq_6”), quarterly capital turnover, 6-month holding period;

13. Ctoq12 (“ctoq_12”), quarterly capital turnover, 12-month holding period;

14. dRoa1 (“droa_1”), 4-quarter changes in return on assets, 1-month holding period;

15. dRoa6 (“droa_6”), 4-quarter changes in return on assets, 6-month holding period;

16. dRoe1 (“droe_1”), 4-quarter changes in return on equity, 1-month holding period;

17. dRoe6 (“droe_6”), 4-quarter changes in return on equity, 6-month holding period;

18. dRoe12 (“droe_12”), 4-quarter changes in return on equity, 12-month holding period;

19. Eg1 (“eg_1”), expected growth, 1-month holding period;

20. Eg6 (“eg_6”), expected growth, 6-month holding period;

21. Eg12 (“eg_12”), expected growth, 12-month holding period;

22. Fp6 (“fp_6”), failure probability, 6-month holding period;

23. Fq1 (“fq_1”), quarterly fundamental score, 1-month holding period;

24. Fq6 (“fq_6”), quarterly fundamental score, 6-month holding period;

25. Fq12 (“fq_12”), quarterly fundamental score, 12-month holding period;

26. Gla (“gla”), gross profits-to-lagged assets;

27. Glaq1 (“glaq_1”), quarterly gross profits-to-lagged assets, 1-month holding period;

28. Glaq6 (“glaq_6”), quarterly gross profits-to-lagged assets, 6-month holding period;

29. Glaq12 (“glaq_12”), quarterly gross profits-to-lagged assets, 12-month holding period;

30. Gpa (“gpa”), gross profits-to-assets;

31. Olaq1 (“olaq_1”), quarterly operating profits-to-lagged assets, 1-month holding period;

32. Olaq6 (“olaq_6”), quarterly operating profits-to-lagged assets, 6-month holding period;

33. Olaq12 (“olaq_12”), quarterly operating profits-to-lagged assets, 12-month holding period;

34. Oleq1 (“oleq_1”), quarterly operating profits-to-lagged book equity, 1-month holding period;

35. Oleq6 (“oleq_6”), quarterly operating profits-to-lagged book equity, 6-month holding period;

36. Oleq12 (“oleq_12”), quarterly operating profits-to-lagged book equity, 12-month holding period;

37. Opa (“opa”), operating profits-to-assets;

38. Ope (“ope”), operating profits-to-book equity;

39. Oq1 (“oq_1”), quarterly O-score, 1-month holding period;

40. Pmq1 (“pmq_1”), quarterly profit margin, 1-month holding period;

41. Rnaq1 (“rnaq_1”), quarterly return on net operating assets, 1-month holding period;

42. Rnaq6 (“rnaq_6”), quarterly return on net operating assets, 6-month holding period;

43. Rnaq12 (“rnaq_12”), quarterly return on net operating assets, 12-month holding period;

44. Roa1 (“roa_1”), return on assets, 1-month holding period;

45. Roa6 (“roa_6”), return on assets, 6-month holding period;

46. Roe1 (“roe_1”), return on equity, 1-month holding period;

47. Roe6 (“roe_6”), return on equity, 6-month holding period;

48. Sgq1 (“sgq_1”), quarterly sales growth, 1-month holding period;

49. Tbiq6 (“tbiq_6”), quarterly tax income-to-book income, 6-month holding period;

50. Tbiq12 (“tbiq_12”), quarterly tax income-to-book income, 12-month holding period.

Category 5: Intangibles

1. Adm (“adm”), advertising expense-to-market;

2. Alaq1 (“alaq_1”), quarterly asset liquidity scaled by 1-quarter-lagged total assets, 1-month holding period;

3. Almq1 (“almq_1”), quarterly asset liquidity scaled by 1-quarter-lagged market value of assets, 1-month holding period;

4. Almq6 (“almq_6”), quarterly asset liquidity scaled by 1-quarter-lagged market value of assets, 6-month holding period;

5. Almq12 (“almq_12”), quarterly asset liquidity scaled by 1-quarter-lagged market value of assets, 12-month holding period;

6. Dls1 (“dls_1”), disparity between long- and short-term earnings growth forecasts, 1-month holding period;

7. Eprd (“eprd”), earnings predictability;

8. Etl (“etl”), earnings timeliness;

9. Etr (“etr”), effective tax rate;

10. Hs (“hs”), industry concentration in sales;

11. Ioca (“ioca”), industry-adjusted organizational capital-to-assets;

12. Oca (“oca”), organizational capital-to-assets;

13. Ol (“ol”), operating leverage;

14. Olq1 (“olq_1”), quarterly operating leverage, 1-month holding period;

15. Olq6 (“olq_6”), quarterly operating leverage, 6-month holding period;

16. Olq12 (“olq_12”), quarterly operating leverage, 12-month holding period;

17. R1a (“r1a”), seasonality, return in month t-12;

18. R1n (“r1n”), seasonality, average return from month t-11 to t-1;

19. R [2,5]a (“r5a”), seasonality, average return across months t-24, t-36, t-48, and t-60;

20. R [2,5]n (“r5n”), seasonality, average return from month t-60 to t-13 except for months t-24, t-36, t-48, and t-60;

21. R [6,10]a (“r10a”), seasonality, average return across months t-72, t-84, t-96, t-108, and t-120;

22. R [6,10]n (“r10n”), seasonality, average return from month t-120 to t-61 except for months t-72, t-84, t-96, t-108, and t-120;

23. R [11,15]a (“r15a”), seasonality, average return across months t-132, t-144, t-156, t-168, and t-180;

24. R [16,20]a (“r20a”), seasonality, average return across months t-192, t-204, t-216, t-228, and t-240;

25. Rca (“rca”), R&D capital-to-assets;

26. Rdm (“rdm”), R&D expense-to-market;

27. Rdmq1 (“rdmq_1”), quarterly R&D expense-to-market, 1-month holding period;

28. Rdmq6 (“rdmq_6”), quarterly R&D expense-to-market, 6-month holding period;

29. Rdmq12 (“rdmq_12”), quarterly R&D expense-to-market, 12-month holding period;

30. Rdsq6 (“rdsq_6”), quarterly R&D expense-to-sales, 6-month holding period;

31. Rdsq12 (“rdsq_12”), quarterly R&D expense-to-sales, 12-month holding period;

32. Rer (“rer”), industry-adjusted real estate ratio;

33. Vcf1 (“vcf_1”), cash flow volatility, 1-month holding period.

Category 6: Frictions

1. beta1 (“beta_1”), market beta, 1-month holding period;

2. Dtv12 (“dtv_12”), dollar trading volume, 12-month holding period;

3. Isff1 (“isff_1”), idiosyncratic skewness estimated from the Fama-French 3-factor model, 1-month holding period;

4. Isq1 (“isq_1”), idiosyncratic skewness estimated from the q-factor model, 1-month holding period;

5. Ivff1 (“ivff_1”), idiosyncratic volatility estimated from the Fama-French 3-factor model, 1-month holding period;

6. Ivq1 (“ivq_1”), idiosyncratic volatility estimated from the q-factor model, 1-month holding period;

7. Me (“me”), the market equity;

8. Srev (“srev”), short-term reversal;

9. Sv1 (“sv_1”), systematic volatility, 1-month holding period;

10. Tv1 (“tv_1”), total volatility, 1-month holding period.

References

Andrew, B. Riding the Momentum Investing Wave. 25 July 2007. Available online: https://www.forbes.com (accessed on 4 September 2017).
Marek, L. Richard Driehaus Launches Private-Equity Fund. Crain’s Chicago Business, 13 May 2015. Available online: https://www.chicagobusiness.com/article/20150513/NEWS01/150519906/chicago-s-richard-driehaus-launches-private-equity-fund (accessed on 21 September 2015).
Schwager, J.D. The New Market Wizards: Conversations with America’s Top Traders; John Wiley and Sons: Hoboken, NJ, USA, 1992; p. 224. ISBN 0-471-13236-5. [Google Scholar]
Jegadeesh, N.; Titman, S. Momentum: Evidence and insights 30 years later. Pac.-Basin Financ. J. 2023, 82, 102202. [Google Scholar] [CrossRef]
Chen, R.-R.; Huang, W.; Huang, J.; Yu, R. An Artificial Intelligence Approach to the Valuation of American-Style Derivatives: A Use of Particle Swarm Optimization. J. Risk Financ. Manag. 2021, 14, 57. [Google Scholar] [CrossRef]
Hegazy, O.; Soliman, O.S.; Salam, M.A. LSSVM-ABC Algorithm for Stock Price Prediction. arXiv 2014, arXiv:1402.6366. [Google Scholar] [CrossRef]
Huang, Z.; Zhang, Z.; Hua, C.; Liao, B.; Li, S. Leveraging Enhanced Egret Swarm Optimization Algorithm and Artificial Intelligence-Driven Prompt Strategies for Portfolio Selection. Sci Rep. 2024, 14, 26681. [Google Scholar] [CrossRef] [PubMed]
Kaucic, M.; Piccotto, F.; Sbaiz, G.; Valentinuz, G. A hybrid level-based learning swarm algorithm with mutation operator for solving large-scale cardinality-constrained portfolio optimization problems. Inf. Sci. 2023, 634, 321–339. [Google Scholar] [CrossRef]
Liu, J.; Wei, Y.; Xu, H. Financial Sequence Prediction Based on Swarm Intelligence Algorithms of Internet of Things. Comput. Econ. 2022, 59, 1465–1480. [Google Scholar] [CrossRef]
Zhu, H.; Chen, Y.; Wang, K. Swarm Intelligence Algorithms for Portfolio Optimization. In Advances in Swarm Intelligence; Tan, Y., Shi, Y., Tan, K.C., Eds.; ICSI 2010; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2010; Volume 6145. [Google Scholar]
Hong, H.; Stein, J.C. A Unified Theory of Underreaction, Momentum Trading, and Overreaction in Asset Markets. J. Financ. 1999, 54, 2143–2184. [Google Scholar] [CrossRef]
Hou, K.; Mo, H.; Moskowitz, T.J. Market Frictions, Price Delay, and the Cross-Section of Expected Returns. Rev. Financ. 2005, 18, 981–1020. [Google Scholar] [CrossRef]
Barber, B.; Odean, T. All That Glitters: The Effect of Attention on the Buying Behavior of Individual and Institutional Investors. J. Financ. Stud. 2008, 21, 785–818. [Google Scholar] [CrossRef]
Pástor, L.; Stambaugh, R. Liquidity Risk and Expected Stock Returns. J. Polit. Econ. 2003, 111, 642–685. [Google Scholar] [CrossRef]
Barberis, N.; Shleifer, A.; Vishny, R. A Model of Investor Sentiment. J. Financ. Econ. 1998, 49, 307–343. [Google Scholar] [CrossRef]
Daniel, K.; Hirshleifer, D.; Subrahmanyam, A. Investor Psychology and Security Market under- and Overreactions. J. Financ. 1998, 53, 1839–1885. [Google Scholar] [CrossRef]
Reynolds, C. Flocks, herds and schools: A distributed behavioral model. In Proceedings of the 14th Annual Conference on Computer Graphics and Interactive Techniques, Association for Computing Machinery, Anaheim, CA, USA, 27–31 July 1987; pp. 25–34. [Google Scholar]
Eberhart, R.C.; Kennedy, J. A New Optimizer Using Particle Swarm Theory. In Proceedings of the Sixth International Sympo-sium on Micro Machine and Human Science, Nagoya, Japan, 4–6 December 1995. [Google Scholar]
Shi, Y.; Eberhart, R.C. A Modified Particle Swarm Optimizer. In Proceedings of the IEEE International Conference on Evolutionary Computation, Anchorage, AK, USA, 4–9 May 1998; pp. 69–73. [Google Scholar]
Chen, R.-R.; Miller, C.; Toh, P.K. Search on a NK Landscape with Swarm Intelligence: Limitations and Future Research Op-portunities. Algorithms 2023, 16, 527. [Google Scholar] [CrossRef]
Chen, R.-R.; Miller, C.D.; Toh, P.K. Modeling Firm Search and Innovation Trajectory Using Swarm Intelligence. Algorithms 2023, 16, 72. [Google Scholar] [CrossRef]
Shih, K.H.; Wang, Y.H.; Kao, I.C.; Lai, F.M. Forecasting ETF Performance: A Comparative Study of Deep Learning Models and the Fama-French Three-Factor Model. Mathematics 2024, 12, 3158. [Google Scholar] [CrossRef]
Zhang, A.; Pan, M.; Zhang, X. The pricing ability of factor model based on machine learning: Evidence from high-frequency data in China. Int. Rev. Econ. Financ. 2025, 101, 104153. [Google Scholar] [CrossRef]
Tatarczak, A. Mapping the landscape of artificial intelligence in supply chain management: A bibliometric analysis. Mod. Manag. Rev. 2024, 29, 43–57. [Google Scholar] [CrossRef]
Chen, R.-R. Index Tracking: A Stock Selection Model Using Particle Swarm Optimization. J. Invest. 2023, 32, 53–73. [Google Scholar] [CrossRef]
Shiller, R. Stock Prices and Social Dynamics. Brook. Pap. Econ. Act. 1984, 15, 457–510. [Google Scholar] [CrossRef]
Mavruk, T. Analysis of Herding Behavior in Individual Investor Portfolios using Machine Learning Algorithms. Res. Int. Bus. Financ. 2022, 62, 101740. [Google Scholar] [CrossRef]
Rahayu, A.D.; Putra, A.; Oktaverina, C.; Ningtyas, R.A. Herding Behavior in The Stock Market: A Literature Review. Int. J. Soc. Sci. Rev. 2020, 1, 8–25. [Google Scholar]
Lakonishok, J.; Shleifer, A.; Vishny, R.W. The Impact of Institutional Trading on Stock Prices. J. Financ. Econ. 1992, 32, 23–43. [Google Scholar] [CrossRef]
Ukpong, I.; Tan, H.; Yarovaya, L. Determinants of industry herding in the US stock market. Financ. Res. Lett. 2021, 43, 101953. [Google Scholar] [CrossRef]
Jegadeesh, N.; Titman, S. Returns to Buying Winners and Selling Losers: Implications for Stock Market Efficiency. J. Financ. 1993, 48, 65–91. [Google Scholar] [CrossRef]
Grinblatt, M.; Titman, S.; Wermers, R. Momentum investment strategies, portfolio performance, and herding: A study of mutual fund behavior. Am. Econ. Rev. 1995, 85, 1088–1105. [Google Scholar]
Demirer, R.; Lien, D.; Zhang, H. Industry herding and momentum strategies. Pac.-Basin Financ. J. 2015, 32, 95–110. [Google Scholar] [CrossRef]
Lin, Z.; Wu, W.; Zhang, H. Momentum, Information, and Herding. J. Behav. Financ. 2023, 24, 219–237. [Google Scholar] [CrossRef]
Chen, T. Does Country Matter to Investor Herding? Evidence from an Intraday Analysis. J. Behav. Financ. 2020, 22, 56–64. [Google Scholar] [CrossRef]
Chen, R.-R. Is the Taiwan Stock Market (Swarm) Intelligent. Information 2024, 15, 707. [Google Scholar] [CrossRef]
Bikhchandani, S.; Sunil, S. Herd Behavior in Financial Markets. IMF Staff Pap. 2001, 47, 279–310. [Google Scholar] [CrossRef]
Amihud, Y.; Mendelson, H. Asset Pricing and the Bid-Ask Spread. J. Financ. Econ. 1986, 17, 223–249. [Google Scholar] [CrossRef]
Tetlock, P. Giving Content to Investor Sentiment: The Role of Media in the Stock Market. J. Financ. 2007, 62, 1139–1168. [Google Scholar] [CrossRef]
Fama, E.F.; French, K.R. A Five-Factor Asset Pricing Model. J. Financ. Econ. 2015, 116, 1–22. [Google Scholar] [CrossRef]
Fama, E.F.; French, K.R. Common Risk Factors in the Returns of Stocks and Bonds. J. Financ. Econ. 1993, 33, 3–56. [Google Scholar] [CrossRef]
Hou, K.; Xue, C.; Zhang, L. Digesting Anomalies: An Investment Approach. Rev. Financ. Stud. 2015, 28, 650–705. [Google Scholar] [CrossRef]
Easley, D.; O’hara, H. Information and the Cost of Capital. J. Financ. 2004, 59, 1555–1583. [Google Scholar] [CrossRef]
Carhart, M.M. On Persistence in Mutual Fund Performance. J. Financ. 1997, 52, 57–82. [Google Scholar] [CrossRef]
Fama, E.F.; French, K.R. Size and Book-to-Market Factors in Earnings and Returns. J. Financ. 1995, 50, 131–155. [Google Scholar]
Hou, K.; Mo, H.; Xue, C.; Zhang, L. An Augmented q-Factor Model with Expected Growth. Rev. Financ. 2021, 25, 1–41. [Google Scholar] [CrossRef]

Figure 1. Swarm intelligence boids. Source: https://people.engr.tamu.edu/sueda/courses/CSCE450/2023F/projects/Frank_Martinez/index.html (accessed on 7 March 2025).

Figure 2. Positions

x_{j, t}^{(i)}

of frictions (10 risk factors and lag for 2 months). There are 10 risk factors (i.e., 10 fish) in the diagrams. The red dot is beta_1 (i.e., fish 1), and the yellow dot is tv_1 (i.e., fish 2). The x-axis is 1-month lagged return and the y-axis is 2-month lagged return. In terms of

x_{j, t}^{(i)}

, the two coordinates of beta_1 in Panel (A), because

t

is April 1990, are

x_{1, 4 / 1990}^{(1)}

, which is the return of beta_1 in March 1990 (1-month lagged), and

x_{2, 4 / 1990}^{(1)}

(2-month lagged), which is the return of beta_1 in February 1990. Similarly,

x_{1, 4 / 1990}^{(2)}

and

x_{2, 4 / 1990}^{(2)}

are the returns of tv_1’s (i.e., fish 2) two lagged returns. Panel (B) shows lagged returns when

t

is May 1990. Hence, the two lagged returns are April 1999 (x-axis) and March 1990 (y-axis). Hence, the y-axis of Panel (A) is not the x-axis of Panel (B). Taking beta_1 as an example, its y-axis value is 7.4907 in Panel (A), and now is the value of its x-axis.

Figure 2. Positions

x_{j, t}^{(i)}

of frictions (10 risk factors and lag for 2 months). There are 10 risk factors (i.e., 10 fish) in the diagrams. The red dot is beta_1 (i.e., fish 1), and the yellow dot is tv_1 (i.e., fish 2). The x-axis is 1-month lagged return and the y-axis is 2-month lagged return. In terms of

x_{j, t}^{(i)}

, the two coordinates of beta_1 in Panel (A), because

t

is April 1990, are

x_{1, 4 / 1990}^{(1)}

, which is the return of beta_1 in March 1990 (1-month lagged), and

x_{2, 4 / 1990}^{(1)}

(2-month lagged), which is the return of beta_1 in February 1990. Similarly,

x_{1, 4 / 1990}^{(2)}

and

x_{2, 4 / 1990}^{(2)}

are the returns of tv_1’s (i.e., fish 2) two lagged returns. Panel (B) shows lagged returns when

t

is May 1990. Hence, the two lagged returns are April 1999 (x-axis) and March 1990 (y-axis). Hence, the y-axis of Panel (A) is not the x-axis of Panel (B). Taking beta_1 as an example, its y-axis value is 7.4907 in Panel (A), and now is the value of its x-axis.

Figure 3. Velocities

v_{j, t}^{(i)}

of frictions (beta_1 and tv_1) from April 1990 to May, May to June, and June to July are plotted, in panels A to C, respectively. These velocities are simply differences of positions in Figure 2. The velocity for beta_1 is (3.98, –7.18), (–7.18, 8.19), and (8.19, –9.40), respectively, and for tv_1 it is (0.28, –3.23), (–3.23, 3.88), and (3.88, –3.10), respectively. These velocity values are given in the short table above.

Figure 3. Velocities

v_{j, t}^{(i)}

of frictions (beta_1 and tv_1) from April 1990 to May, May to June, and June to July are plotted, in panels A to C, respectively. These velocities are simply differences of positions in Figure 2. The velocity for beta_1 is (3.98, –7.18), (–7.18, 8.19), and (8.19, –9.40), respectively, and for tv_1 it is (0.28, –3.23), (–3.23, 3.88), and (3.88, –3.10), respectively. These velocity values are given in the short table above.

Figure 4. Average speed distribution.

Figure 5. Average speed over time.

Table 1. This table presents the monthly returns of the beta_1 and tv_1 factors from February 1990 to June 1990 (Panel A), their corresponding mappings to position space (Panel B), and the computation of their swarm-based velocities (Panel C).

Panel A. Monthly return
Data		beta_1				tv_1
2/1990		3.5051				1.4045
3/1990		7.4907				1.6897
4/1990		0.3085				−1.5450
5/1990		8.5011				2.3310
6/1990		−0.9011				−0.7728
Panel B. Position space
Position	beta_1				tv_1
	X		y		X		Y
4/1990	3.5051		7.4907		1.4045		1.6897
5/1990	7.4907		0.3085		1.6897		−1.5450
6/1990	0.3085		8.5011		−1.5450		2.3310
7/1990	8.5011		−0.9011		2.3310		−0.7728
Panel C. Monthly velocity
Velocity	beta_1				tv_1
	X			y	X		Y
5/1990	3.9856			−7.1822	0.2852		−3.2347
6/1990	−7.1822			8.1926	−3.2347		3.8760
7/1990	8.1926			−9.4022	3.8760		−3.1038

Table 2. Univariate portfolio sorts on swarm velocity. Each month, we sort the 190 factors into quintiles based on their swarm velocity (SV). Column 1 presents the average characteristics of swarm velocity (SV), the sorting variable. Column 2 reports the mean equal-weighted returns (Mean) across factors within each quintile. Columns 3 to 8 present the alphas for each quintile portfolio relative to various factor models, including the Fama–French Five-Factor Model (FF5), the Fama–French–Carhart Six-Factor Model (FF6), the Fama–French–Carhart Six-Factor Model augmented with the Pastor–Stambaugh liquidity factor (FF6PS), the Hou–Xue–Zhang Four-Factor Model (Q), and the Hou–Mo–Xue–Zhang Five-Factor Model (Q5). Rows 1 to 5 present the results for each SV-based quintile, from the lowest (Low) to the highest (High). The last row (High–Low) reports the differences in mean returns and alphas between the top and bottom quintiles. Newey–West-adjusted t-statistics with six lags are reported in parentheses. The sample period spans from March 1991 to December 2022. Numbers in boldface indicate statistical significance at the 5% level or better.

Quintile	SV	Mean	FF5	FF6	FF6PS	Q	Q5
Low	0.21	0.43	0.41	0.25	0.25	0.31	0.24
		(4.85)	(3.94)	(3.81)	(3.71)	(3.52)	(2.57)
2	0.37	0.23	0.16	0.13	0.14	0.12	0.06
		(4.43)	(3.18)	(2.52)	(2.52)	(2.36)	(1.20)
3	0.47	0.07	0.06	0.06	0.07	0.04	0.00
		(1.48)	(1.27)	(1.15)	(1.33)	(0.79)	(−0.04)
4	0.58	0.20	0.17	0.10	0.10	0.11	0.07
		(3.46)	(3.27)	(2.40)	(2.38)	(2.37)	(1.55)
High	0.73	0.17	0.16	0.07	0.07	0.09	0.01
		(2.99)	(3.26)	(1.58)	(1.60)	(1.76)	(0.11)
High–Low	0.52	−0.26	−0.25	−0.18	−0.17	−0.22	−0.21
		(−3.39)	(−2.86)	(−2.52)	(−2.42)	(−2.61)	(−2.63)

Table 3. Long-term return predictability. This table presents the Hou–Mo–Xue–Zhang Five-Factor alphas for the top and bottom quintile portfolios sorted by swarm velocity (SV), along with their alpha differentials across horizons ranging from 2 (t + 2) to 12 months (t + 12) ahead. Rows 1 and 2 report the results for the lowest (Low) and highest (High) SV-based quintiles. The last row (High–Low) reports the differences in mean returns and alphas between the top and bottom quintiles. Newey–West-adjusted t-statistics with six lags are reported in parentheses. The sample period spans from March 1991 to December 2022.

Quintile	t + 2	t + 3	t + 4	t + 5	t + 6	t + 7	t + 8	t + 9	t + 10	t + 11	t + 12
Low	0.18	0.17	0.18	0.16	0.21	0.21	0.18	0.17	0.18	0.16	0.16
	(2.05)	(1.88)	(1.90)	(1.53)	(2.27)	(2.28)	(1.82)	(1.78)	(1.99)	(1.81)	(1.60)
High	0.01	0.06	0.05	0.04	0.05	0.03	0.03	0.05	0.01	0.04	0.08
	(0.20)	(1.10)	(0.87)	(0.80)	(0.87)	(0.46)	(0.51)	(0.91)	(0.21)	(0.75)	(1.51)
High–Low	−0.17	−0.11	−0.10	−0.11	−0.11	−0.13	−0.11	−0.09	−0.13	−0.12	−0.05
	(−2.18)	(−1.38)	(−1.29)	(−1.08)	(−1.32)	(−1.64)	(−1.25)	(−1.15)	(−1.58)	(−1.41)	(−0.67)

Table 4. Characteristics of slow-mover and fast-mover portfolios. Panel A presents the top 10 slow-moving factors (left four columns) and the top 10 fast-moving factors (right four columns), based on swarm velocity (SV). For each factor, the table reports the average SV (column labeled “SV”) and the probabilities of inclusion in the slow-mover portfolio (bottom SV quintile, column labeled “Slow mover”) and the fast-mover portfolio (top SV quintile, column labeled “Fast mover”). All values are mean-adjusted to facilitate comparison. Panel B summarizes factor representation by category. The “Factor presence” column indicates the percentage of factors in each of the six categories relative to the total universe of 190 factors. The “Slow mover” and “Fast mover” columns report the proportion of each category represented in the slow- and fast-mover portfolios, respectively. The complete list of factors, including notations and category-specific descriptions, is provided in the online Appendix C. Numbers in boldface indicate statistical significance at the 5% level or better.

Panel A. Top slow-mover and fast-mover factors.
Top 10 Slow-Movers						Top 10 Fast-Movers
Characteristics	SV		Slow Mover	Fast Mover		Characteristics	SV		Slow Mover	Fast Mover
beta_1	−0.28		0.69	−0.20		cm_12	0.37		−0.20	0.78
tv_1	−0.29		0.68	−0.20		sim_12	0.33		−0.20	0.77
r11_1	−0.31		0.67	−0.20		abr_12	0.33		−0.20	0.76
ivff_1	−0.24		0.61	−0.20		ilr_12	0.32		−0.20	0.73
r1n	−0.29		0.57	−0.20		sm_12	0.34		−0.20	0.72
r11_6	−0.25		0.57	−0.20		droe_12	0.28		−0.20	0.70
p52w_6	−0.26		0.56	−0.20		cm_6	0.27		−0.19	0.67
r6_1	−0.26		0.54	−0.19		resid6_12	0.25		−0.20	0.64
ivq_1	−0.22		0.50	−0.20		droe_6	0.23		−0.20	0.57
fp_6	−0.20		0.46	−0.20		tbiq_12	0.25		−0.20	0.55
Panel B. Presence in slow-mover and fast-mover portfolios by category.
Category		Factor Presence			Slow Mover			Fast Mover
Frictions		5.26			11.55			4.64
Intangibles		16.32			17.48			7.71
Investments		16.84			8.31			20.32
Momentum		22.11			25.37			36.2
Profitability		23.16			17.03			24.36
Value vs. Growth		16.32			20.26			6.77
Sum (%)		100			100			100

Table 5. Transition matrix for swarm-velocity-sorted portfolios. This table presents the transition matrix for quintile portfolios sorted by swarm velocity (SV). At the end of each month t, all factors are sorted into ascending SV quintiles. For each t-month SV quintile, the table reports the time-series average of the percentage of factors that transition into each SV quintile at future horizons of t + 1, t + 3, t + 6, t + 12, t + 24, t + 36, t + 48, and t + 60 months, respectively. Numbers in boldface indicate statistical significance at the 5% level or better.

Panel A, month t + 1
	Quintile in Month t
Quintile in Month t + 1	Low	2	3	4	High
Low	79	11	4	4	2
2	10	68	14	4	3
3	4	14	64	14	3
4	4	4	14	68	11
High	2	3	3	11	81
Sum (%)	100	100	100	100	100
Panel B, month t + 3
	Quintile in Month t
Quintile in Month t + 3	Low	2	3	4	High
Low	70	18	6	4	3
2	17	50	22	7	3
3	6	22	45	22	5
4	4	6	22	50	18
High	3	3	5	18	71
Sum (%)	100	100	100	100	100
Panel C, month t + 6
	Quintile in Month t
Quintile in Month t + 6	Low	2	3	4	High
Low	60	23	10	6	3
2	21	37	24	11	5
3	10	24	33	24	8
4	6	11	24	37	22
High	3	5	8	22	62
Sum (%)	100	100	100	100	100
Panel D, month t + 12
	Quintile in Month t
Quintile in Month t + 12	Low	2	3	4	High
Low	45	25	16	10	5
2	24	26	23	17	9
3	16	23	24	23	14
4	10	17	22	27	23
High	5	8	14	24	50
Sum (%)	100	100	100	100	100
Panel E, month t + 24
	Quintile in Month t
Quintile in Month t + 24	Low	2	3	4	High
Low	42	24	18	11	5
2	24	27	22	18	9
3	17	22	24	22	14
4	12	18	22	25	23
High	6	9	14	24	49
Sum (%)	100	100	100	100	100
Panel F, month t + 36
	Quintile in Month t
Quintile in Month t + 36	Low	2	3	4	High
Low	40	24	19	13	5
2	24	26	22	19	9
3	18	23	23	21	15
4	12	18	22	24	23
High	6	9	15	23	48
Sum (%)	100	100	100	100	100
Panel G, month t + 48
	Quintile in Month t
Quintile in Month t + 48	Low	2	3	4	High
Low	40	25	18	12	6
2	24	25	22	19	9
3	17	23	23	21	14
4	13	18	22	25	22
High	6	10	15	23	48
Sum (%)	100	100	100	100	100
Panel H, month t + 60
	Quintile in Month t
Quintile in Month t + 60	Low	2	3	4	High
Low	41	25	18	12	5
2	24	26	23	18	10
3	17	22	23	23	15
4	11	17	22	25	24
High	7	11	15	22	46
Sum (%)	100	100	100	100	100

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chen, R.-R.; Huang, M.; Tang, Y. Classifying Factor Velocity with Swarm Intelligence: Market Pricing of Fast- and Slow-Moving Factors. Algorithms 2025, 18, 682. https://doi.org/10.3390/a18110682

AMA Style

Chen R-R, Huang M, Tang Y. Classifying Factor Velocity with Swarm Intelligence: Market Pricing of Fast- and Slow-Moving Factors. Algorithms. 2025; 18(11):682. https://doi.org/10.3390/a18110682

Chicago/Turabian Style

Chen, Ren-Raw, Mengjie Huang, and Yi Tang. 2025. "Classifying Factor Velocity with Swarm Intelligence: Market Pricing of Fast- and Slow-Moving Factors" Algorithms 18, no. 11: 682. https://doi.org/10.3390/a18110682

APA Style

Chen, R.-R., Huang, M., & Tang, Y. (2025). Classifying Factor Velocity with Swarm Intelligence: Market Pricing of Fast- and Slow-Moving Factors. Algorithms, 18(11), 682. https://doi.org/10.3390/a18110682

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Classifying Factor Velocity with Swarm Intelligence: Market Pricing of Fast- and Slow-Moving Factors

Abstract

1. Introduction

2. Swarm Intelligence

2.1. Boids (Without a Leader)

2.2. Particle Swarm Optimization (With a Leader)

2.3. Combining Boids and PSO

3. Empirical Methodology

4. The Swarm-Related Literature in Finance

4.1. Contextualizing Swarm Intelligence in Modern Financial AI

4.2. Herding

4.3. Lead–Lag

5. Empirical Findings

5.1. Data

5.2. Experimental Design and Procedure

5.3. Speed Result

5.4. Market Pricing of Factor Velocity

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. A Brief Sketch of PSO

Appendix B. VBA Code of Swarm

Appendix C. List of Variables

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI