Next Article in Journal
Predictors of Digital Fraud: Evidence from Thailand
Previous Article in Journal
Does Litigation Risk Affect Meeting-or-Beating Earnings Expectations? Evidence from Quasi-Natural Experiment
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

PCA-Based Investor Attention Index and Its Impact on the KSE-100 Excess Returns

1
Department of Maritime Studies, Faculty of Maritime and Industrial Studies, University of Piraeus, 185-33 Piraeus, Greece
2
Department of Management Sciences, DHA Suffa University, Karachi 74001, Pakistan
3
Faculty of Management Sciences, SZABIST University, Karachi 74001, Pakistan
4
Department of Management Sciences, SZABIST University, Islamabad 44000, Pakistan
5
Department of Management Sciences, HITEC University, Taxila 47080, Pakistan
*
Author to whom correspondence should be addressed.
J. Risk Financial Manag. 2025, 18(12), 670; https://doi.org/10.3390/jrfm18120670
Submission received: 21 August 2025 / Revised: 11 November 2025 / Accepted: 15 November 2025 / Published: 25 November 2025

Abstract

The study employs principal component analysis (PCA) to construct an investor attention index derived from seven key variables: abnormal trading volume, extreme returns, past returns, nearness to the 52-week high, nearness to the historical high, Google search volume, and mutual fund inflows. Subsequently, the research examines the impact of the investor attention index on the KSE-100 index excess returns. The analysis covers monthly data from January 2004 to December 2024. The PCA identified four components and constructed attention indices: A P C A 1 has highest weights of nearness to the 52-week high, abnormal trading volume, past returns, and mutual funds inflows; A P C A 2 has major weights of abnormal trading volume, extreme returns, past returns, and Google search volume; A P C A 3 has nearness to the 52-week high, nearness to the historical high, extreme returns, and mutual funds inflows; and A P C A 4 has nearness to the historical high, extreme returns, Google search volume, and mutual funds inflows. The A P C A 1 and A P C A 4 have a positive and significant impact on the excess returns of the KSE-100 index. This suggests that when investors are more motivated to invest, herding behavior increases, leading to improved index performance and higher returns. Subsequently, A P C A 3 has a negative but significant impact on index returns, indicating that a lack of investor interest leads to reduced trading activity and weaker index performance. The findings of this study have important implications for policymakers, investors, and mutual fund managers to understand the patterns of investor attention, creating policies and procedures to make the financial markets more transparent and protect the investor’s rights.

1. Introduction

Investor attention has long been a subject of interest in financial research, particularly as markets increasingly exhibit patterns that deviate from traditional rational models. One such deviation arises from the concept of investor attention, a limited cognitive resource that shapes how individuals process information and make investment decisions. Contrary to the Efficient Market Hypothesis (Fama, 1970), which assumes that all available information is fully and instantly reflected in asset prices, behavioral finance suggests that investors often rely on selective attention, shaped by information overload, media signals, and cognitive limitations (Kahneman, 1973; Barber & Odean, 2008; Peng & Xiong, 2006). This limited attention can lead to disproportionate focus on certain stocks or market-wide events, resulting in price fluctuations that are not entirely justified by fundamentals. While a growing body of literature supports the link between investor attention and predictability of stock return in developed markets (Da et al., 2011; J. Li & Yu, 2012), evidence from emerging economies remains fragmented. This study contributes to the discourse by examining the impact of investor attention on Pakistan’s KSE-100 index performance in a context where structural reforms, evolving investor attention, and heightened volatility create a compelling environment for such analysis.
In Pakistan, initially, there were three stock exchanges: the Karachi Stock Exchange (KSE), the Lahore Stock Exchange (LSE), and the Islamabad Stock Exchange (ISE) (Jamal & Ashraf, 2024; Sharif, 2018). The existence of multiple stock exchanges created conflicts of interest and frustration for investors, as none of the exchanges coordinated with one another and operated independently regarding trading interfaces, indexes, and listing criteria (Ali & Sharif, 2022). Hence, the government of Pakistan introduced the Stock Exchanges Act in 2012, integrated all three stock exchanges into one operation, and formed the “Pakistan Stock Exchange” (PSX) on 11 January 2016 (Ali & Sharif, 2022). It became operational at the beginning with 559 listed companies across 35 sectors, including domestic and foreign companies (Pakistan Stock Exchange, 2024). The history of PSX has witnessed a significant improvement over time. It started with just five listed companies with a total paid-up capital of Rs 37 million (Pakistan Stock Exchange, 2024). Then, in the 1960s, 81 companies were added with a free float capitalization of Rs 1.8 billion (Pakistan Stock Exchange, 2024). The PSX witnessed an aggregate market capitalization of Rs 10.37 trillion as of 30 June 2024 (Pakistan Stock Exchange Annual Report, 2024).
Among the indices operating under the PSX, the performance of the KSE-100 index is reflected as a barometer to evaluate the country’s economic progress. In a report PSX on KSE 100 Index, it is mentioned that the KSE-100 index was brought into effect in November 1991, and the base value was set at 1000 points. As the number in the index suggests, it comprises 100 blue-chip companies. The criteria for selecting companies for the KSE 100 index are based on two main components: Sectorial Representation and companies with the highest free float market capitalization (Pakistan Stock Exchange, 2025). Initially, 35 companies are selected from 36 available sectors (one from each sector) based on the highest free float capitalization (the Open-End Mutual Fund Sector is excluded). The remaining 65 companies are chosen based on the highest free float capitalization, irrespective of their sector. Hence, the KSE-100 index comprises about 80% of the total free float market capitalization of all listed companies on the Exchange (Pakistan Stock Exchange, 2025). The rationale for selecting the KSE 100 Index includes the broad market coverage, sectoral diversity, and economic importance. In addition to reflecting the collective movements of major publicly traded firms in Pakistan, it serves as a barometer of national economic performance and a critical tool for investors, analysts, and policymakers. With its comprehensive composition, researchers can draw insightful and meaningful inferences about macroeconomic trends, investor behavior, and the movement of the financial market in the Pakistani context.
Figure 1 shows the KSE-100 price index data from January 2004 through December 2024. The data was extracted from LSEG (Formerly Refinitiv and Thomson Reuters) DataStream. The figure shows that the Index experienced a significant financial downturn in 2008, following a minor crash in 2005, during which the market stopped functioning for four months. The military rule ended in 2008, and power was transferred to a civilian democratic government. Following the 2008 financial crisis, numerous discussions were held to implement changes in the stock market. Finally, the Corporatization, Demutualization, and Integration Stock Exchanges Act was enacted in 2012. Reflecting this milestone, Bloomberg ranked the Pakistani index among the five best stock performers in 2016 (Mangi, 2016). The FTSE also considers it an emerging secondary market. In 2017, it was again classified by MSCL (Morgan Stanley Capital International) and NBSP as an emerging market. The economy experienced a boom, with the PSX performing best due to high Chinese Investment in CPEC projects from 2013 to 2018. However, in May 2017, the market showed a bearish trend, and the Index dropped by 28% after an association of China purchased 40% of the shares for $85 million.
Interestingly, in 2020, among the 61 stock exchanges worldwide, the PSX ranked fourth best, and fourth in Asia, despite the COVID-19 outbreak. In addition, PSX is recognized as one of the premier emerging markets globally, characterized by a high growth rate and a highly liquid environment. However, this market is also culturally sensitive and unstable due to macroeconomic differences and political instability (Jaffri et al., 2025; Zahid & Saleem, 2025; Ghani et al., 2022; Joyo & Lefen, 2019). Factors such as political instability, economic crisis, and fluctuating policy significantly impacted the KSE 100 index, which remained volatile from 2021 to 2024. Moreover, the change in government in 2022, ongoing inflation, and the climate-induced floods of 2022 led to a decline in investor confidence. Recovery began in FY 2024, as macroeconomic indicators improved following the successful completion of the IMF program, which led the KSE 100 to achieve a remarkable return of 89.24%.
The current structure of Pakistan’s financial market offers opportunities for conducting a wide range of interesting research, but it also poses several problems. Although significant strides have been made to enhance the depth and stability of the market, further research is needed to compare the PSX with developed markets. This gap provides the necessary background to analyze the role of behavioral factors, such as investor attention, in the historically volatile market surrounding the stocks. Research shows that Pakistan’s stock market is highly sensitive to macroeconomic shocks. Therefore, it is anticipated that greater effort is needed to understand investors’ behavior in this regard.
There is also evidence of heightened investor attention to stock returns due to mirror-neighbors, decision heuristics, and self-organizing map information available in Pakistan’s capital market. Mirror-neighbors refers to the imitation of peers’ trading behavior with geographical and social proximity, leading to effects known as localized herding and information clustering (Banerjee, 1992; Bikhchandani et al., 1992). While decision heuristics—mental shortcuts used in judgment—help make investment decisions in uncertain environments, they sometimes introduce the risk of systematic biases. The use of such heuristics in decision-making, as proposed by Tversky and Kahneman (1973, 1974), corroborates the idea that individuals assess probabilities based on how readily instances come to mind. In other words, recent events and the emotions associated with them in an investor’s life significantly influence their decision making. It challenges objectivity and rationality in decision making. The use of decision heuristics can, therefore, represent a distorted perception, especially in retail-dominated markets like Pakistan (Abideen et al., 2023; Shah et al., 2018). Furthermore, Self-Organizing Map (SOM), a neural network–based unsupervised learning method, is increasingly used in financial analysis to detect hidden patterns and cluster investor behaviors, offering a data-driven lens on market segmentation and volatility (Resta, 2016). Generally, extant finance theories rely on assumptions about self-interest and wealth maximization. However, as obtained from the behavioral finance theory, investors act in ways that can be deemed irrational due to psychological factors and the inability to control their emotions, such as overconfidence, fear, and greed (Gill & Bajwa, 2018). In Pakistan’s case, these behavioral factors, such as heuristics, the herding effect, and market variables, have a meaningful influence over the investment decision.
The main motivation behind this study is to shed light on the growing and significant impact of investor attention in the Pakistani stock market by systematically analyzing attention proxies that affect future stock returns. In Pakistan, a few studies on investors’ attention are available. This study adds valuable insights to the existing body of knowledge by addressing the investor’s attention constructed from the context of the KSE-100 index and its effects on subsequent returns. It builds upon prior research in the following ways: The PCA methodological technique is used to construct the attention index. It will help policymakers understand the patterns of investor attention, creating policies and procedures to make the financial markets more transparent and protect the investors’ rights. From this study, market participants could identify key strategic directions that could assist them in understanding market conditions. In addition, appropriate investment decisions could be of considerable benefit and a competitive advantage in the market. It would help academic scholars in behavioral finance and open a new perspective. Besides this, the researcher would examine the signals that affect the performance of the KSE-100 index, thereby effectively managing their portfolios and risk.
The remaining study is structured as follows. Section 2 covers the existing literature. Section 3 discusses the methodology. Section 4 presents the empirical analysis and discussion, and Section 5 concludes the study.

2. Literature Review

Information must first attract investor attention before it can be processed and incorporated into asset prices through trading. Therefore, understanding how investor attention influences asset price formation is of significant importance (Z. Li et al., 2024, 2025). As attention is a finite cognitive resource that an individual possesses, researchers and industry experts have a keen interest in studying the theoretical and empirical significance of investors’ attention on the prices of stocks. Two ways are identified in the theories through which prices are affected. Informed traders are the first path through which the market learns (Hirshleifer & Teoh, 2003; Huang & Liu, 2007; Peng & Xiong, 2006). The second path is when attention fascinates uninformed traders (Barber & Odean, 2008). The authors documented that an individual investor requires more attention towards stock prices than institutional investors. A buy decision requires an average investor to gauge the worthiness of thousands of instruments before making a final decision, whereas a selling decision need a smaller evaluation.
Da et al. (2011) identified that one of the major limitations in empirically testing attention theories is the unavailability of a direct measure of investor attention. Hence, the researchers must take indirect proxies such as extreme returns, trading volume, news and headlines, advertising expenses, and price limits. Building on these foundations, the literature of this study is organized around three central themes. The first theme presents the theoretical discussion, and the second theme presents studies that elaborate on the proxies or methodologies used to construct the attention index. The third part of the same studies examines the impact of investor attention on stock returns.

2.1. Theoretical Discussion

Economist Herbert A. Simon, in 1955, proposed the Bounded Rationality Theory, which posits that decision-makers have cognitive and informational limitations (Simon, 1955). They “satisfice”—therefore making decisions that are adequate in the light of available cognitive abilities, time, and information. Merton (1987) emphasized that individual investors save resources by seeking only portions of the financial information. On this premise, Kahneman (1973) and Pashler et al. (2001) posited that, given their low cognitive abilities, investors were able to apprehend a simple take on attention. Hence, attention paid by investors is a link between bounded cognition and securities’ price determination. Information that targets this short span of attention from investors can skew their decision-making, thus producing consequences in the market that can include short-run changes in prices, volatility, and returns. Thus, this theory of bounded rationality can be used to explain why investor attention is not distributed evenly and how the process of selective attention contributes to nonrandom patterns of stock returns, especially in the less-developed emerging markets.
Miller’s (1977) Heterogeneous Beliefs Theory states that the correlation between heterogeneity of beliefs and the risk of stock is direct. He states that “in practice, uncertainty, disagreement over a security’s return, and risk correlate.” Therefore, ‘high-risk, low volume stocks are also high information uncertainty stocks; hence, the market clearing price for a high-risk stock will be higher than that for a low-risk stock. Miller also opines that the overvaluation of high-risk stocks is attributed to short sale constraints among the heterogeneous investors.
Prospect Theory (Kahneman & Tversky, 1979), the foundational theory in behavioral finance, helps to explain the impact that attention has on stock returns in emerging markets due to investor psychology. The theory deviates from the rational expectations model by stating that investors compare economic outcomes to a reference level and are characterized by systematic decision-making biases about risk. There are three types: (1) Loss aversion speaks of why investors respond more aggressively to negative signals: when attention is given to the potential of a loss, the suffering felt is always more than that resulting from an equivalent amount of profit, which we sell off more aggressively, a factor that occurs in emerging markets, due to increased volatility and information asymmetry (Barberis & Huang, 2001). (2) Probability weighting explains why investors overestimate the stock that provides ‘lottery-like’ investments, a finding supported by theoretical (Tversky & Kahneman, 1992) and empirical research (Da et al., 2011). (3) The editing phase of Prospect Theory discusses how investors employ attention as an information filter and base their choices on easily recognizable but potentially irrelevant cues in situations of uncertainty (Peng & Xiong, 2006). This dilemmatic-heuristic behavior results in short-run mispricing, with prices rising in conditions of increased focus and then experiencing a backlash. Theoretical frameworks predict that such shocks equate to short-term outperformance and subsequent underperformance for the stocks receiving the attention boost (Hong & Stein, 1999). Such patterns are particularly pronounced in markets where institutions are immature and where more retail investors trade, such as the emerging markets suggested by Seasholes and Wu (2007). The combination of Prospect Theory and investor attention explains these return anomalies, hence shedding light on how behavioral prejudice plays out in structural imperfections in these markets. Limited Attention Theory by Peng and Xiong (2006) studies theoretically the outcome of inadequate investor attention on asset–price dynamics. However, this study is based on Prospect Theory.

2.2. The Measurement of Investor Attention

Barber and Odean (2008) studied the New York Stock Exchange equity market and tested attention-driven buying by sorting stocks based on events that capture investor attention. These include abnormal trading volume, extreme one-day returns (positive or negative), and news coverage. Aboody et al. (2010) have taken top percentile stocks, ranked based on prior 12-month price performance, as a proxy measure for observing investor attention from 1971 to 2005. J. Li and Yu (2012), motivated by the psychological evidence on limited investor attention, considered two proxies to test whether attention plays its role in the U.S. stock market. The two proxies were nearness to the Dow 52-week high and nearness to the Dow historical high. W. Zhang et al. (2013) advocate the search frequency of stock names in Baidu Index (the largest search engine in China) as a novel and direct proxy for investor attention. Aouadi et al. (2013) studied the French equity market and considered abnormal trading volume as a proxy measure of investor attention. Hirshleifer et al. (2013) used Analyst Coverage as a substitute measure to gauge investor attention. Lou (2014) also took advertising spending as a proxy measure of investor attention.
Choi and Choi (2019) discovered the influence of trading volume on investor attention in the Korean market. For measuring investor attention, the top 30 stocks were considered, and they were discussed online frequently on the message board from 2010 to 2014 using Daum (a widely used internet portal in Korea). Chemmanur and Yan (2019) studied the US stock market from 1996 to 2005. In the first hypothesis, they tested the relationship between advertising and stock returns. Then, in the second hypothesis, their research investigated whether advertising predicts future stock returns. Another study by Düz Tan and Taş (2019) examined whether firm-specific investor attention predicts future stock returns for the Borsa Istanbul All Shares Index, Turkey. The researchers developed the Abnormal Google Search Volume index (ASVI) as a direct measure of investor attention, which quantifies investor interest based on abnormal search volumes. Their data spanned from 2013 to 2017. Moreover, Subramaniam and Chakraborty (2020) explored whether investor attention influences the prices of cryptocurrencies. They used the daily Google Search Volume index as a proxy for attention. In their work, they analyze Bitcoin, Ethereum, Ripple, and Litecoin from January 2013 to March 2018 and utilize a quantile causality approach to consider effects through various market phases. Ibikunle et al. (2020) investigated whether high investors’ attention disturbs the price discovery process of Bitcoin. For this, they used two data sources: First, 30.5 million trades from Bitstamp between 13 September 2011 and 10 April 2019, including time, price, and volume. The second measure of investor attention is calculated using Google Trends data for the keyword “Bitcoin”, which is the most popular search term for potential investors.
The COVID-19 pandemic represents an unparalleled disruption in the modern era, having an impact on global markets and institutions, including the financial sector. Unlike past pandemics, it leads to unprecedented volatility, liquidity shocks, and market instability (S. R. Baker et al., 2020). The epidemic triggered widespread sell-offs, increased systemic risks, and pressurized central banks to use aggressive monetary and fiscal policies to stabilize economies (D. Zhang et al., 2020). COVID-19 intensifies the profit gaps between banks and non-financial markets, undermining the development of the real economy (Guan et al., 2021). The pandemic set the finest example of analyzing market response to variations in investors’ attention. According to Smales’ (2021) research, GSV for the keyword “coronavirus” rose sharply from late February, peaked in mid-March, and then declined significantly. Therefore, he attempted to explain the changing dynamics of market response by considering Google Search Volume (GSV) as a proxy measure for investor attention. Similarly, recent research by Yahya et al. (2021) examined the impact of various factors, including the outbreak of novel COVID-19, investor attention as assessed by the Google search volume index, social isolation, and investor sentiments, on stock returns in the German Stock Market. The study investigated whether social isolation influences the investor’s attentiveness to the stock market amid fluctuating investor sentiments.

2.3. The Impact of Investor Attention on Stock Markets

Barber and Odean (2008) suggest that individual investors exhibit attention-driven buying behavior. They tend to be net buyers on high-volume days, reacting to both extremely negative and positive one-day returns, and when stocks receive media coverage. The results of Aboody et al. (2010) showed that strong past returns attract individual investors’ attention, particularly before earnings announcements. Another study by J. Li and Yu (2012) found that, due to scarce attention, future stock returns are positively predicted by nearness to the 52-week high, whereas this is the opposite for nearness to the historical high. According to W. Zhang et al. (2013), attention could be the new barometer for information in financial markets due to the heterogeneous behavior of investors. Consequently, investor attention is the major factor in predicting abnormal stock returns and exhibits contemporary relationships with it. Aouadi et al. (2013) identified a positive and significant relationship between investor attention and abnormal trading volume at the market level. Hirshleifer et al. (2013) found that stock returns tend to be high for those firms that are efficient in research and development. Furthermore, the authors identified it because such firms’ earnings per share (EPS) tend to increase in the future. Lou (2014) concluded that if a firm increases advertising spending, the stock price will grow simultaneously, followed by a complete return reversal.
Choi and Choi (2019) concluded that when investors paid high attention to the stocks, the trading volume of such stocks increased suddenly and unexpectedly on that day. Thus, due to short-sell constraints, high investor attention results in low future returns through active trading. Chemmanur and Yan (2019), in the first hypothesis, found that more investor attention is drawn to firms if firms invest heavily in advertising. Then, in the second hypothesis, the results showed that advertising significantly increases investors’ attention to the firms’ stocks and negatively affects their future returns. The study of Düz Tan and Taş (2019) found that firms attracting high ASVI tend to generate higher returns. The study emphasized that ASVI outperforms indirect proxies of investor attention, such as abnormal turnover, excess returns, and news volume, in predicting stock returns, as it is more closely linked to actions or decisions made by investors. These findings demonstrate the significant impact of investor attention (constructed by ASVI) on stock prices in Turkey, particularly in the context of the All-Shares Index. Moreover, Subramaniam and Chakraborty (2020) concluded that retail/individual investors pay high attention to highly ranked cryptocurrencies such as Bitcoin and Ethereum. Therefore, the returns of that cryptocurrency increase. However, the prices of new cryptocurrencies such as Ripple are influenced by investor attention, but only at the time of their superior performance. Ibikunle et al. (2020) concluded that irrational, uninformed trading activity increases when attention is given to the bitcoin market. However, attention is unaffected by fully rational and informed traders. It implies that only uninformed traders paid high attention, leading them to participate more in the bitcoin market, which is unrelated to any increase in informed trading (Ibikunle et al., 2020).
The study findings of Smales (2021) suggest that as news events or other information drew investors’ attention, the prices responded by incorporating the same news or information. So, it was proved that stock returns are marginally low and that price volatility grew strong as the news or certain information drew investors’ attention. Moreover, Yahya et al. (2021) showed a significant impact of the coronavirus on the Frankfurt Stock Exchange after considering all the anomalies, such as weather disruptions, calendar variations, and oil price volatility. Furthermore, the higher stock returns during the social isolation period resulted from investors’ attention to buying underpriced stock. Thus, a temporary lockdown helped increase investors’ precision in decision making, leading them to make better investment decisions.
Principal component analysis (PCA) is a widely used technique for simplifying complex multivariate systems with high dimensionality. PCA reduces the large number of variables to a concise representation without compromising the characteristics of the original data. Thus, brevity is achieved through the PCA model, but not at the cost of clarity in the data. PCA is used in numerous prominent studies on investors’ attention and stock return predictability (Ludvigson & Ng, 2007; M. Baker & Wurgler, 2006; Neely et al., 2014; B. Wang et al., 2018; He et al., 2021; J. Chen et al., 2022; Chu et al., 2022; Song et al., 2023), among many others. Resultantly, this study has been finalized to use the PCA prediction technique. Through PCA, market-level measures were aggregated into a single index. By doing so, this study first aggregates firm-level measures into measures at the market level whenever needed. Thus, an aggregated attention index would employ all information on the proxies and stock market returns.

3. Methodology

3.1. Data Sources and Variables

This study uses monthly time-series data from January 2004 to December 2024 to construct an investor aggregate attention index. The time period is selected based on the data availability for all the constituents of the KSE-100 index, thereby eliminating survivorship bias. Also, this time period captures significant market cycles and regulatory changes in the Pakistan stock market, impacting investors’ sentiments. The data is collected from Refinitiv Eikon and DataStream. We collect mutual funds’ data from the Mutual Funds Association of Pakistan (MUFAP) website, and Google search volume from Google Trends. The stock’s closing price is used for the calculations of company-specific returns. We follow the literature and use principal component analysis (PCA) to construct the investor attention index (Ludvigson & Ng, 2007; M. Baker & Wurgler, 2007; Neely et al., 2014; B. Wang et al., 2018; He et al., 2021; J. Chen et al., 2022; Chu et al., 2022; Song et al., 2023). Based on the behavioral finance theories and empirical evidences of attention proxies, this study has taken seven major individual attention proxies, namely A A T V i t —abnormal trading volume and A E R i t —extreme returns (Barber & Odean, 2008), A P R i t —past returns (Aboody et al., 2010), A N K 52 W H t —nearness to the 52-week high and A N K I H H t —nearness to the historical high (J. Li & Yu, 2012), A G S V t —google search volume (Da et al., 2011; Aouadi et al., 2013; Ding & Hou, 2015; Yao et al., 2017), and A N A V E M F t —mutual funds inflow (J. Chen et al., 2022). These seven proxies were first used to construct individual investor indexes at the firm level; then, this research aggregated them to the market level by following the J. Chen et al. (2022) methodology. At first, the variables are computed using the formulas provided in Table 1. Hence, Table 1 exhibits the variables, their definitions, and operationalization.

3.2. Aggregate Attention Index and Principal Component Analysis (PCA)

The aggregated attention index is created using all seven proxies together. A factor structure is developed in Equation (5).
A i t = η i , 0 + η i , 1 A t * + η i , 2 E t + e i t
In Equation (7), A t * is the true but unobservable investor attention (J. Chen et al., 2022), A i t is the proxy of attention sensitivity to A t * which is the true attention, and is summarized by factor loadings, that is η i , 1 , the common error term of all the proxies is E t , while e i t denotes the idiosyncratic noise solely associated with i . The estimation of A t * helps in identifying investor attention impact in the stock market, mainly to impose factor structure and reduce E t (the common approximation error) and e i t (estimation idiosyncratic noise).
According to J. Chen et al. (2022), PCA is the simplest but popular method that helps in extracting the principal component A i t , the aggregate attention index measure. It represents the maximum total variation in the seven proxies used in this study. Furthermore, this approach apprehends the information in an index and separates A t * from e i t . Researchers have widely used PCA in the literature, specifically in forecasting stock returns (M. Baker & Wurgler, 2006; Ludvigson & Ng, 2007; Neely et al., 2014; J. Chen et al., 2022; and others). The main rationale for using PCA for constructing an index is that it reduces dimensionality, transforming high-dimensional data into a low dimension while retaining total variance or information. Furthermore, it creates components that reduce noise and explain the most variance, helps in removing multicollinearity, and constructs orthogonal components. Also, it compresses the dataset and retains the components with the most variance. In short, to ensure that no a priori weighting is imposed, the PCA allows for an endogenous data-driven scheme based on the variance–covariance among the standardized variables. The PCA approach follows established practice in financial economics for aggregating indicators into latent constructs (M. Baker & Wurgler, 2006; Ben-Rephael et al., 2017). The main steps in PCA methodology and in constructing an index are as follows: It computes the covariance matrix, mainly to understand how factors vary with each other. Then it computes and sorts eigenvalues and eigenvectors to define the direction of principal components.
Similarly, the rationale for this methodological choice and its significance are supported by the literature. The classical PCA technique is effective for high-dimensional datasets, particularly correlated variables, and successfully reduced the feature space while retaining the maximum variance, enabling clear interpretation of the underlying patterns without the overfitting risks associated with more complex models. Jolliffe and Cadima (2016) emphasized the importance of PCA, which remains a cornerstone in exploratory data analysis mainly due to its simplicity, interpretability, and robustness across diverse sets of applications. The results of this study demonstrate that PCA adequately captures the essential structure of the data, aligning with the study’s aim to construct an investors’ attention index derived from seven key proxies. Hair (2009) highlighted that PCA is particularly valuable when the goal is to identify key components driving variability in a dataset without unnecessary complexity. However, hybrid or novel models may introduce assumptions that could obscure insights or limit generalizability, especially given our dataset’s characteristics in developing an index.
Abdi and Williams (2010) stated that PCA often outperforms more complex hybrid models in scenarios where linear relationships dominate, as observed in our dataset. Moreover, in many contexts, the primary drivers of financial phenomena can be well-approximated by linear structures, making PCA a robust and parsimonious tool for initial dimension reduction and factor identification. For instance, PCA has been successfully applied to analyze yield curve dynamics, market risk factors, and portfolio construction, where the lending principal components often correspond to well-understood market movements such as level, slope, and curvature (X. Wang, 2006; Oprea, 2022). Wold et al. (1987) specified that PCA’s value lies in its ability to uncover meaningful patterns, which this study achieves to minimize risk and promote efficient financial management.
In summary, the findings of this study are grounded in PCA, providing actionable insights directly applicable to researchers, portfolio managers, and financial analysts in gauging investors’ herding behavior, specifically in the KSE-100 benchmark index. Based on this, it constructs the matrix and predicts the data. J. Chen et al. (2022) used an equally weighted (EW) aggregation approach and normalized all the proxies; however, according to the authors, the EW fails to eliminate the common observation errors and only captures the maximum common variations in predictors. Hence, we have not used this technique.
The STATA software version 17.0 is employed for index construction and empirical testing. We standardize and normalize all the variables using z-score ( z = X μ σ ). Please note that the descriptive statistics and correlation matrix of standardized variables are provided in Appendix A and Appendix B. Additionally, to examine whether all the standardized series are stationary, the Augmented Dickey–Fuller (ADF) test was run (See Appendix C). All the variables are significant except for A N K I H H t ; hence, by taking the first difference, it is converted into stationary ( A D N K I H H ) and is used in the index. The line graphs of each variable in Appendix D give the graphical presentation of the index variables. Through PCA, four indices are constructed; mainly, these indices have eigenvalues approximately equal to 1 or greater than 1. Please see screeplot of eigenvalues in Appendix E. The scoreplot and loadings of components are shown in Appendix F. The loadings are estimated, and based on the loadings, four indices are created, mainly because all the components explain 12% to 26% and cumulatively 73% of the variations. Then, Kaiser–Meyer–Olkin (KMO) is run to examine the sampling adequacy. The line graphs and ADF results of all the four attention indices are provided in Appendix G. The authors examine the linearity assumption of OLS regression through a scatter plot (see Appendix H). The OLS regression and its four assumptions (heteroscedasticity, autocorrelation, multicollinearity, and normality) empirical test results are shown in Appendix I. As some of the assumptions are not fulfilled, we have to use GLS regression. Based on the attention index, the forecasting regression Equation (8) mentioned below is considered for the current research.
K S E 100 E M R t + 1 = α i + β i A i t + ε t + 1
where K S E 100 E M R t + 1 is KSE-100 index excess return, β i denotes the slope of regression on the i t h attention proxy A i t at time t , and ε t + 1 is the noise term. The primary aim of this study is to examine the impact of investor attention index on KSE 100 stock returns while keeping all other factors constant; therefore, the univariate forecast model is empirically tested. To estimate the model above, the GLS regression is used in line with the literature to examine the impact of investor attention on excess stock market returns, also known as market risk premium.

4. Results and Discussion

4.1. Descriptive and Correlation Analysis

This study uses monthly data comprising 252 observations (months) as shown in Table 2.
The mean of Nearness to KSE-100 Index 52-Week High ( A N K 52 W H ) is 0.861 with SD of 0.115, a minimum value of 0.395, maximum of 1, p1 of 0.552, and p99 of 1; skewness is −0.974, showing negatively skewed data, kurtosis is 3.628, which is greater than 3, meaning the data is leptokurtic. Nearness to KSE-100 Index Historical High ( A N K I H H ) has a mean of 0.241, SD of 0.164, minimum and maximum are 0.041 and 1, p1 of 0.044, p99 of 0.772, positively skewed with 1.061, and 4.872 is kurtosis, showing leptokurtic data. The mean of Abnormal Trading Volume ( A A T V ) is 0.996, SD is 0.365, the minimum value is 0.004, the maximum value is 2.361, p1 is 0.212, and p99 is 2.013. The variable has positively skewed data with skewness of 0.58, and kurtosis of 3.707, which is greater than 3, representing leptokurtic data.
The mean value of Google Search Volume ( A G S V ) is 10.442, with an SD of 2.862, a minimum search of 1.373, and a maximum of 19.529. It has p1 of 2.41 and p99 of 18.049, along with the skewness of −0.22, and kurtosis of 3.967, indicating the data is leptokurtic. Extreme Returns ( A E R ) represents extreme returns with a 0.080 mean value and 0.021 SD, minimum extreme returns of 0.001 and 0.233 with maximum extreme returns, p1 of 0.041, p99 of 0.157, skewness of 2.585, and kurtosis of 18.498, showing leptokurtic data series. The mean value of Past Returns ( A P R ) is 0.238 with SD of 0.346, −0.56 of minimum past returns and 1.397 maximum returns, with p1 of −0.529 and p99 of 1.19, skewness of 0.269, and kurtosis of 2.965, showing data is platykurtic. Equity Mutual Funds Inflow ( A N A V E M F ) mean value is 2,404.89, SD of 659.002, a minimum of 1258.84, a maximum of 4666.71, p1 of 1306.07, and p99 of 4163.15, with skewness of 1.257, and kurtosis of 4.088. K S E 100 E M R descriptive statistics show that the index has produced average −0.088 excess returns along with a SD of 0.081. The index produced a minimum excess return of −0.483, and a maximum of 0.167, during the period of study. The lowest percentile p1 is −0.297, and the largest percentile p99 is 0.107, with −0.504 skewness, and kurtosis of 5.309 showing leptokurtic behavior.

4.2. Attention Index Construction Through PCA and Its Impact on Stock Market Returns of KSE-100 Index

For the construction of an attention index, it is important to have relationship among the variables. Table 3 showed a negative and weak to moderate correlation of A G S V , and A E R of −0.288, and −0.176 with the dependent variable K S E 100 M R . The variables A N K 52 W H , A D N K I H H , A A T V , A P R , and A N A V E M F have positive but weak to moderate correlations of 0.387, 0.566, 0.112, 0.271, and 0.163 with K S E 100 E M R . The results showed that the variables are significant and do not have the problem of multicollinearity.
Table 4 shows eigenvalues for each component. As per the rules of PCA, if eigenvalues are greater than and/or equal to one (eigenvalues >= 1), such components should be included for further analysis. The eigenvalue suggested by (Cattell, 1966) stated that four components should be retained for factor analysis of this research. The eigenvalue for Comp1 is 1.831, Comp2 is 1.334, Comp3 is 1.030, and Comp4 is 0.924. The eigenvalues show the total variance each factor accounts for (Mooi et al., 2018). The proportion of Comp1 explains 26.2%, Comp2 explains 19.1%, Comp3 explains 14.7%, and Comp4 explains 13.2% of the overall variance. Based on the PCA results, four attention indices are predicted; Comp1 is the first attention index denoted by A P C A 1 , Comp2 is the second index A P C A 2 , Comp 3 is the third index A P C A 3 , and Comp 4 is the fourth index A P C A 4 . Please note that for the construction of indices standardized values are used.
The eigenvectors of each variable of each component of PCA are mentioned in Table 5. According to Mooi et al. (2018) PCA eigenvectors comprise factor weights that help in obtaining the maximum variance until a significant part of the variables’ variance is explained. The unexplained column in Table 5 shows the unexplained variance of four components.
The aggregate attention index is constructed using a combination of factor weights of the individual attention proxies available in Table 5:
A P C A 1 = 0.479 A N K 52 W H + 0.161 A D N K I H H + 0.484 A A T V 0.170 A E R + 0.510 A P R 0.150 A G S V + 0.447 A N A V E M F
A P C A 2 = 0.111 A N K 52 W H + 0.291 A D N K I H H + 0.417 A A T V + 0.411 A E R 0.309 A P R + 0.679 A G S V + 0.062 A N A V E M F
A P C A 3 = 0.363 A N K 52 W H 0.582 A D N K I H H + 0.133 A A T V + 0.559 A E R + 0.261 A P R 0.025 A G S V + 0.362 A N A V E M F
A P C A 4 = 0.427 A N K 52 W H + 0.287 A D N K I H H + 0.320 A A T V + 0.073 A E R + 0.271 A P R + 0.164 A G S V + 0.341 A N A V E M F
The coefficients of eigenvalues of each variable in each component are provided in Table 6. In order to identify which factors load the most in the respective index, the coefficients below 0.3 are removed for graphical representation of Table 6.
According to the results of Table 6, the following proxies contribute higher to the indices A P C A 1 , A P C A 2 , A P C A 3 , and A P C A 4 . Hence, the major contributions are of these factors:
A P C A 1 = 0.4792 A N K 52 W H + 0.4836 A A T V + 0.5095 A P R + 0.4465 A N A V E M F
A P C A 2 = 0.4168 A A T V + 0.4108 A E R 3087 A P R + 0.6793 A G S V
A P C A 3 = 0.3634 N K 52 W H 0.5816 A D N I K H H + 0.5585 A E R + 0.3624 A N A V E M F
A P C A 4 = 0.4700 A D N K I H H + 0.5958 A E R 0.4397 A G S V 0.4084 A N V E M P
The attention indices loadings are estimated, and the indices are constructed based on the loadings. The Kaiser–Meyer–Olkin (KMO) results are used to analyze the adequacy of samples because the correlations and partial correlations between variables are compared by KMO value in Table 7.
The KMO value for this study is 0.5248, which indicates a middling sample and highlights that correlations are high compared to partial correlations (Field, 2024). We examine the data pattern and stationarity of A P C A 1 , A P C A 2 , A P C A 3 , and A P C A 4 through line graphs and ADF tests (Appendix E). All the indices are stationary. The descriptive statistics of the aforementioned variables are shown in Table 8. The mean value of all the four indices is equal to zero and standard deviation (SD) is in the range of 0.961 and 1.353, indicating higher volatility in A P C A 1 , with the main index explaining 26.2% of variations (see Table 4). A P C A 1 descriptive statistics show the minimum value is −5.408, maximum is 3.188, p1 is −3.393, p99 is 2.859, skewness is −0.388, and kurtosis is 3.797, showing non-normal and leptokurtic pattern. The second index is A P C A 2 which explains the 19.1% variations (see Table 4) has SD of 1.155, minimum is −3.866, maximum is 3.736, p1 is −2.487, p99 is 3.454, skewness is 0.336, and kurtosis is 4.042. These statistics show that it has relatively fewer variations as compared to the first index. The third index A P C A 3 explains 14.7% of variations (see Table 4), has a comparatively lower SD of 1.015, minimum of −3.505, and maximum of 4.62 along with p1 of −2.102, p99 of 3.904, skewness of 0.919, and kurtosis of 6.316, showing leptokurtic curve. The fourth and the last index is A P C A 4 , which explains 13.2% of total variance (see Table 4), has lowest SD of 0.961, minimum of −3.232, maximum of 4.544, p1 of −2.201, p99 of 3.267, skewness of 0.683, and kurtosis of 6.118 showing positively skewed and leptokurtic curve. All these four indices cumulatively explain the total variance of 73.1% (see Table 4).
Table 9 shows the correlation of A P C A 1 , A P C A 2 , A P C A 3 , and A P C A 4 with K S E 100 E M R . The results show A P C A 1 and A P C A 4 have a positive moderate and significant relation with K S E 100 E M R . A P C A 2 has a negative and insignificant relation, and A P C A 3 shows negative but significant correlation.
Once the components were analyzed, we regressed each of the attention indices with the K S E 100 M R t + 1 to investigate whether the investors’ attention index developed using PCA influences index returns or not. The GLS regression results are presented in Table 10:
Table 10 presents the empirical results examining the impact of each attention index on the KSE 100 Index excess returns while holding all other factors constant. The probability values of F-test of A P C A 1 , A P C A 3 , and A P C A 4 models are significant at 1% significance level except A P C A 2 which is insignificant. The R 2 of model 1 is 0.167, of model 3 is 0.430, and of model 4 is 0.080, explaining 16.7%, 43%, and 8% of the variations in the K S E 100 E M R t + 1 returns. The low R 2 supports results of Ben-Rephael et al. (2017) and J. Chen et al. (2022). In model 1 A P C A 1 is regressed on K S E 100 E M R t + 1 , with β 1 of 0.361 and p-value of 0.000 showing positive and significant impact on K S E 100 E M R t + 1 . Among all the four models, model 3 ( A P C A 3 regressed on K S E 100 E M R t + 1 ) has the better explanatory power along with negative but significant β 1 of −0.632 and p-value of 0.000 and 1% significance level. In model 4, the impact of A P C A 4 is examined, and the results showed β 1 of 0.320 is positive and significant at 1% significance level, meaning an increase in A P C A 4 would increase K S E 100 E M R t + 1 market returns. Concisely, models 1 and 4 showed that the more investors’ attention, the better the KSE 100 index performs, along with higher returns. However, model 3 showed the negative impact on the index returns. Furthermore, the results support Prospect Theory (Kahneman & Tversky, 1979), which helps to explain the impact of attention on stock returns. Investors make investment decisions and pay attention to stocks that significantly impact stock returns based on their psychology and available information.
From all seven proxies, the results of A P C A 3 showed four variables that significantly contribute to the attention index along with an R 2 of 43%. It comprises A N K 52 W H , A D N K I H H , A E R , and A N A V E M F proxies, as shown in Table 6. Thus, other factors that affect the attention index can explain the remaining 57% of the variability in K S E 100 E M R t + 1 returns. As J. Chen et al. (2022) mentioned, the sign of beta coefficients can be positive or negative or inconclusive. This study’s results also support their proposition, as the beta coefficient of A P C A 3 is negative and of A P C A 1 and A P C A 4 are positive, showing both negative and positive significant impacts of attention indices on excess stock returns. Furthermore, the results of A P C A 1 and A P C A 4 supports Barber and Odean’s (2008) and Peng and Xiong’s (2006) findings that individual investors are more attentive to attention grabbing stocks and are the net buyers, which simultaneously leads to positive pressure on stock prices but lower future stock returns (J. Chen et al., 2022). In addition, the authors created an index using an orthogonal rotation; the results are provided in Appendix J. Figure 2 displays the KSE 100 market returns and the investors’ attention indices. The results show that the 2008 financial crisis had a significant impact in Pakistan, particularly in 2008 and 2009. All four indices, specifically A P C A 1 , showed a sharp decline, indicating lower investors’ interest. Another downturn was observed during COVID-19, from 2019 to 2020. In 2023, a decline in investors’ interest was observed mainly due to a liquidity crunch in the economy. On the other hand, the highest investors’ interest was observed during 2004, 2006, 2008, 2014, 2016, and 2017 as per A P C A 1 . A P C A 3 shows upper turning points during 2005, 2006, 2007, 2008, 2016, 2017, 2020, and 2024. A P C A 4 illustrates highest attention in 2004, 2016, 2017, 2020, and 2024.

5. Discussion

The current findings support the studies by Peng and Xiong (2006), Barber and Odean (2008), and J. Chen et al. (2022) which demonstrate that investor attention is a crucial factor in the dynamics of stock markets. The same is true in the Pakistan stock market. The aggregate attention index used in this study performs well, as it incorporates relevant and recent information by using proxies that help measure investors’ true attention to the Pakistan stock market. The findings show that greater investor attention leads to more active trading in the KSE-100 index and higher returns. However, the negative attention index means that the lower investors’ attention leads to poor index performance. Other factors, including expansionary and contractionary monetary and fiscal policies and different phases of the business cycle, may influence the changes in investors’ attention. The empirical findings also support previous literature and underpinned theories.
Some previous studies used different proxies or factors, such as Hirshleifer et al. (2013), who studied Analyst Coverage, and found that stock returns tend to be high for firms that are efficient in research and development and that have increasing expected earnings per share. Lou (2014) used Advertising Spending as a proxy, and the results showed that an increase in advertising spending is associated with a simultaneous increase in stock prices, followed by a complete return reversal. Chemmanur and Yan (2019) found that advertising in the US stock market attracts investors’ attention to the firm’s stocks and affects future returns. Media coverage was also considered an attention measure (Hillert & Ungeheuer, 2015; Zou et al., 2019). Various researchers have already studied the factors above, but in Pakistan, the major limitation was the lack of data on them. In addition, macroeconomic factors and monetary policy also affect investors’ attention. Therefore, Choi and Choi (2019) examined the US equity market and found a negative, significant relationship between investor attention and abnormal trading volume at the market level. Their study concluded that when investors paid close attention to the stocks, the trading volume of those stocks increased suddenly and unexpectedly that day. Thus, due to short-sell constraints, high investor attention leads to lower future returns through active trading. Aboody et al. (2010) identified the top-performing stocks ranked by prior 12-month price performance for the 1971 through 2005 period and showed that strong past returns tend to attract individual investors’ attention, particularly before earnings announcements.
In this study, attention index proxies affect excess returns. The proximity to the 52- historical high in A P C A 3 has a negative influence on excess returns. The results support J. Li and Yu (2012), who found that heightened attention to this factor adversely affects future stock returns. Chang et al. (2018) investigated the impact of a historically high ratio on stock index returns of G7 countries. The authors found an inverse correlation between the historical high and the stock returns, attributing this to the significant gap. Our findings also support the results as historical high has a negative relationship with KSE 100 index excess returns. Google Search Volume index (GSVI) was used as a proxy for investor attention, as the authors considered it a reliable indicator (Aouadi et al., 2013; Ding & Hou, 2015; Yao et al., 2017). In this study, the Google Search Volume index shows a positive relationship: the more investors that are interested in a stock, the more attention they give it and search for the company’s technical and fundamental analysis on Google.
The findings are consistent with the literature. J. Chen et al. (2022) also concluded that the index had a negative but significant impact on future stock returns of the NASDAQ market. The negative coefficient is consistent with the literature review on the investors’ attention index. Yuan (2015) found that market-wide events that captured investors’ attention led to a decrease in future stock returns.
From the study, market participants could identify key strategic directions to help them understand market conditions. In addition, appropriate investment decisions could be highly beneficial and confer a competitive advantage in the market. It would help academic scholars in behavioral finance and open a new perspective. Besides this, the researcher would examine the signals that affect the performance of the KSE-100 index, enabling effective portfolio management and risk mitigation.

6. Conclusions, Limitations, Implications, and Recommendations

This paper aimed to illuminate the critical and growing phenomenon of investor attention in the Pakistani stock market by systematically analyzing attention proxies affecting stock market returns. For the first time, A N K 52 W H , A N K I H H , A A T V , A E R , A P R , A G S V , and A N A V E M F . These proxies were used to construct the investor attention index using the PCA approach, which is renowned for its predictive capabilities. The findings reveal that A P C A 1 and A P C A 4 have a positive and significant impact on KSE-100 index excess returns. However, A P C A 3 has a negative but significant impact on the index excess returns. The results are consistent with J. Chen et al. (2022) and T. Chen (2017).
This study highlights the importance of investor attention in Pakistan’s equity market and its substantial role in behavioral finance. Two primary economic sources were identified: First, lower investor attention leads to greater future uncertainty, prompting investors to demand higher returns to compensate for increased risk, thereby driving higher stock returns. Second, heightened investor attention can drive a surge in trading volume, particularly through active trading and short-sell constraints, ultimately leading to lower future returns. Furthermore, this research underscores two significant points: investor attention substantially influences the aggregate stock market, both statistically and economically, and individual attention measures understate the true predictive power of investor attention in Pakistan’s stock market when analyzed using PCA. In conclusion, this study has provided valuable insights into investor attention in Pakistan’s stock market, paving the way for future research to build on these findings and explore new dimensions of behavioral finance.
However, this research encountered several limitations, including fluctuations in advertising expenditures, media coverage, analyst coverage, and other economic and firm-specific variables, which were excluded due to data unavailability in Pakistan. The study period was limited from 1 January 2004 to 31 December 2024, due to the availability of Google Search Volume data from 2004. J. Chen et al. (2022) also used the dataset from January 2004. The analysis was also confined to the KSE-100 index, which was assumed to represent the Pakistani stock market. Additionally, due to time constraints and the COVID-19 pandemic, the research was limited to market-based variables and in-sample analysis, with data analysis conducted solely using STATA. Furthermore, in this study, 30% to 40% of the variation is unexplained, and the KMO of 0.5248 highlights limitations and opportunities for future research. We have examined the impact of Investor’s Attention Index only and used bivariate model. The variance remains unexplained, mainly due to other factors influencing investor sentiment, such as political instability, the application of capital gains taxes, terrorist attacks, the US–Afghanistan war, COVID-19, Sino–US trade war, natural disasters such as earthquakes and floods, and other socio-political issues. We have not accounted for exogenous factors that directly affect investors’ sentiment.
In addition, we have considered bivariate regression, resulting in lower R 2 . To improve explanatory power, multivariate regression analysis can be conducted, including macroeconomic variables and other proxies, depending on data availability. Furthermore, it would be valuable to analyze the impact on other indices and conduct comparative analyses.
The major implications of this study are the need for greater financial literacy and a better understanding of investment strategies to earn higher returns. Policymakers can develop and implement transparent policies and procedures to increase effective trading in the stock market. Furthermore, they can offer incentives to attract more investors to equity trading, thereby increasing market liquidity, making it more transparent, and motivating both national and international investors to participate. Policymakers can also plan a capital gains tax, which would increase their revenues and not hurt investor sentiment, a crucial factor for sustainable growth. Furthermore, understanding investors’ attention patterns would help reduce the risk of a bubble that could lead to a financial crisis in the future and eliminate speculation. Individual investors can also increase returns by understanding the market sentiments and information. Mutual fund managers and practitioners can design portfolios based on individual investors’ behavior and risk appetite to maximize future returns. Scholars and academicians can conduct further research studies to understand investors’ attention behavior in different market scenarios and contribute to strengthening knowledge of behavioral finance.
In this study, a few recommendations are given to researchers and financial institutions who are highly inspired to conduct research, especially in this domain. The authors can consider economic variables for constructing an investor attention index and investigate their impact on returns. The authors can use firm-specific variables such as dividend-to-price ratio, earnings-to-price ratio, dividend payout ratio, book-to-market ratio, and the like. One of the limitations of this study is that PCA does not capture nonlinear relationships; it is still often preferred by researchers because of its greater transparency and ease of application. However, in future studies, the extension of PCA is Kernel PCA could provide additional information on potential nonlinear dependencies. Furthermore, it is recommended to perform comparative analysis on emerging or frontier markets like Turkey, China, India, and alike, and can explain how Pakistan’s unique institutional context adds new insights. Also, researchers can predict in-sample and out-of-sample stock returns using different forecast models including ARIMA and SARIMA forecasts. The researchers may consider specific industries in which investor attention is analyzed, including health, manufacturing, energy, telecommunications, etc. Moreover, future studies may compare this study with other studies that used different methodologies.

Author Contributions

Conceptualization, S.P. and H.Z.; methodology, R.M. and S.A.; software, H.Z. and E.T.; validation, S.P. and R.M.; formal analysis, S.A.; investigation, E.T.; resources, S.A.; data curation, H.Z.; writing—original draft preparation, S.P.; writing—review and editing, S.A.; visualization, E.T.; supervision, H.Z.; project administration, R.M.; funding acquisition, E.T. All authors have read and agreed to the published version of the manuscript.

Funding

The study did not receive any external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Upon request from the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Descriptive Statistics of Standardized Variables

VariablesObsMeanStd. Dev.MinMaxp1p99Skew.Kurt.
K S E 100 E M R 25201−4.8563.132−2.5662.394−0.5045.309
A N K 52 W H 25201−4.0461.203−2.6871.203−0.9743.628
A N K I H H 25201−1.2114.616−1.1983.2281.0614.872
A A T V 25201−2.7183.743−2.1492.7880.583.707
A G S V 25201−3.1693.175−2.7922.658−0.223.967
A E R 25201−3.6887.115−1.813.5742.58518.498
A P R 25201−2.313.351−2.222.7520.2692.965
A N A V E M F 25201−1.7393.432−1.6672.6681.2574.088

Appendix B. Matrix of Correlations of Standardized Variables

Variables K S E 100 E M R A N K 52 W H A N K I H H A A T V A G S V A E R A P R A N A V E M F
K S E 100 E M R 1.000
A N K 52 W H 0.3871.000
A N K I H H −0.0770.0071.000
A A T V 0.1120.3820.0941.000
A G S V −0.288−0.0510.0070.1171.000
A E R −0.176−0.1600.1570.0940.1431.000
A P R 0.2710.187−0.0630.229−0.335−0.0951.000
A N A V E M F 0.1630.131−0.2550.2640.043−0.0890.3271.000

Appendix C. Augmented Dickey–Fuller Test Results of Standardized Index Variables

VariableADF p-Value
A N K 52 W H 0.0000
A N K I H H 1.0000
A D N K I H H 0.0000
A A T V 0.0000
A G S V 0.0207
A E R 0.0000
A P R 0.0140
A N A V E M F 0.0138
K S E 100 E M R 0.0000

Appendix D. Graphical Analysis

Jrfm 18 00670 i001

Appendix E. Screeplot

Jrfm 18 00670 i002

Appendix F. Score and Loading Plots

Jrfm 18 00670 i003

Appendix G. Indices Line Graphs and Augmented Dickey–Fuller Test Results of Indices

Jrfm 18 00670 i004
VariableADF p-Value
A P C A 1 0.0001
A P C A 2 0.0000
A P C A 3 0.0000
A P C A 4 0.0000

Appendix H. Scatterplots-Linearity

Jrfm 18 00670 i005

Appendix I. OLS Regression Estimation and Assumptions Results

OLS Regression Estimation and Assumptions Results
RegressDependent
Variable
Investor
Attention Index
Beta
Coefficients
p-ValueAlpha
Coefficients
p-ValueF-StatsF-Test
p-Value
Adj R2Durbin-WatsonJarque–BeraHettestVIF
1 K S E 100 E M R A P C A 1 0.3320.000−0.008−0.89063.750.0000.2001.4240.0000.0001.000
2 K S E 100 E M R A P C A 2 −0.0520.335−0.0080.9020.930.3346−0.0001.3130.0000.9121.000
3 K S E 100 E M R A P C A 3 −0.4060.000−0.0080.89251.720.0000.1690.9160.0000.0001.000
4 K S E 100 E M R A P C A 4 0.2980.000−0.0080.89822.530.0000.0791.290.0000.2071.000

Appendix J. Orthogonalized Rotation Results

Rotation: Orthogonal varimax (Kaiser off)
ComponentVarianceDifferenceProportionCumulative
Comp11.6750.4010.2390.239
Comp21.2740.1800.1820.421
Comp31.0940.0780.1560.577
Comp41.076.0.1540.731
Rotated components
VariableComp1Comp2Comp3Comp4Unexplained
A N K 52 W H 0.346 0.4189 0.427
A D N K I H H 0.8159 0.287
A A T V 0.577 0.320
A G S V 0.810 0.164
A E R 0.9300.073
A P R 0.390−0.57 0.271
A N A V E M F 0.609 −0.339 0.341
Component rotation matrix
Comp1Comp2Comp3Comp4
Comp10.883−0.3320.264−0.200
Comp20.2850.7790.3540.432
Comp30.308−0.122−0.7210.609
Comp4−0.210−0.5200.5340.635

References

  1. Abdi, H., & Williams, L. J. (2010). Principal component analysis. Wiley Interdisciplinary Reviews: Computational Statistics, 2(4), 433–459. [Google Scholar] [CrossRef]
  2. Abideen, Z. U., Ahmed, Z., Qiu, H., & Zhao, Y. (2023). Do behavioral biases affect investors’ investment decision making? Evidence from the Pakistani equity market. Risks, 11(6), 109. [Google Scholar] [CrossRef]
  3. Aboody, D., Lehavy, R., & Trueman, B. (2010). Limited attention and the earnings announcement returns of past stock market winners. Review of Accounting Studies, 15(2), 317–344. [Google Scholar] [CrossRef]
  4. Ali, A., & Sharif, S. (2022). Stock market efficiency: The Pakistan stock exchange merger. Afro-Asian Journal of Finance and Accounting, 12(4), 455–478. [Google Scholar] [CrossRef]
  5. Aouadi, A., Arouri, M., & Teulon, F. (2013). Investor attention and stock market activity: Evidence from France. Economic Modelling, 35, 674–681. [Google Scholar] [CrossRef]
  6. Baker, M., & Wurgler, J. (2006). Investor sentiment and the cross-section of stock returns. The Journal of Finance, 61(4), 1645–1680. [Google Scholar] [CrossRef]
  7. Baker, M., & Wurgler, J. (2007). Investor sentiment in the stock market. Journal of Economic Perspectives, 21(2), 129–151. [Google Scholar] [CrossRef]
  8. Baker, S. R., Bloom, N., Davis, S. J., & Terry, S. J. (2020). COVID-induced economic uncertainty (No. w26983). National Bureau of Economic Research. Available online: https://www.nber.org/system/files/working_papers/w26983/w26983.pdf (accessed on 20 May 2024).
  9. Banerjee, A. V. (1992). A simple model of herd behavior. The Quarterly Journal of Economics, 107(3), 797–817. [Google Scholar] [CrossRef]
  10. Barber, B. M., & Odean, T. (2008). All that glitters: The effect of attention and news on the buying behavior of individual and institutional investors. Review of Financial Studies, 21(2), 785–818. [Google Scholar] [CrossRef]
  11. Barberis, N., & Huang, M. (2001). Mental accounting, loss aversion, and individual stock returns. The Journal of Finance, 56(4), 1247–1292. [Google Scholar] [CrossRef]
  12. Ben-Rephael, A., Da, Z., & Israelsen, R. D. (2017). It depends on where you search: Institutional investor attention and underreaction to news. The Review of Financial Studies, 30(9), 3009–3047. [Google Scholar] [CrossRef]
  13. Bikhchandani, S., Hirshleifer, D., & Welch, I. (1992). A theory of fads, fashion, custom, and cultural change as informational cascades. Journal of Political Economy, 100(5), 992–1026. [Google Scholar] [CrossRef]
  14. Cattell, R. B. (1966). The scree test for the number of factors. Multivariate Behavioral Research, 1(2), 245–276. [Google Scholar] [CrossRef]
  15. Chang, S. L., Chien, C. Y., Lee, H. C., & Lin, C. (2018). Historical high and stock index returns: Application of the regression kink model. Journal of International Financial Markets, Institutions and Money, 52, 48–63. [Google Scholar] [CrossRef]
  16. Chemmanur, T. J., & Yan, A. (2019). Advertising, attention, and stock returns. Quarterly Journal of Finance, 9(3). [Google Scholar] [CrossRef]
  17. Chen, J., Tang, G., Yao, J., & Zhou, G. (2022). Investor attention and stock returns. Journal of Financial and Quantitative Analysis, 57(2), 455–484. [Google Scholar] [CrossRef]
  18. Chen, T. (2017). Investor attention and global stock returns. Journal of Behavioral Finance, 18(3), 358–372. [Google Scholar] [CrossRef]
  19. Choi, S., & Choi, W. Y. (2019). Effects of limited attention on investors’ trading behavior: Evidence from online ranking data. Pacific Basin Finance Journal, 56, 273–289. [Google Scholar] [CrossRef]
  20. Chu, G., Goodell, J. W., Shen, D., & Zhang, Y. (2022). Machine learning to establish proxies for investor attention: Evidence of improved stock-return prediction. Annals of Operations Research, 318(1), 103–128. [Google Scholar] [CrossRef]
  21. Da, Z., Engelberg, J., & Gao, P. (2011). In search of attention. The Journal of Finance, 66(5), 1461–1499. [Google Scholar] [CrossRef]
  22. Ding, R., & Hou, W. (2015). Retail investor attention and stock liquidity. Journal of International Financial Markets, Institutions and Money, 37, 12–26. [Google Scholar] [CrossRef]
  23. Düz Tan, S., & Taş, O. (2019). Investor attention and stock returns: Evidence from Borsa Istanbul. Borsa Istanbul Review, 19(2), 106–116. [Google Scholar] [CrossRef]
  24. Fama, E. F. (1970). Efficient capital markets. Journal of Finance, 25(2), 383–417. [Google Scholar] [CrossRef]
  25. Field, A. (2024). Discovering statistics using IBM SPSS statistics. Sage publications limited. [Google Scholar]
  26. Ghani, M., Guo, Q., Ma, F., & Li, T. (2022). Forecasting Pakistan stock market volatility: Evidence from economic variables and the uncertainty index. International Review of Economics & Finance, 80, 1180–1189. [Google Scholar] [CrossRef]
  27. Gill, R. K., & Bajwa, R. (2018). Study on behavioral finance, behavioral biases, and investment decisions. International Journal of Accounting and Financial Management Research, 8(3), 1–14. [Google Scholar] [CrossRef]
  28. Guan, C., Yu, B., & Bi, S. (2021). A novel profit cutting mechanism for Chinese banks: Theory and multi-dimensional evidence. The North American Journal of Economics and Finance, 58, 101541. [Google Scholar] [CrossRef]
  29. Hair, J. F. (2009). Multivariate data analysis. Pearson. [Google Scholar]
  30. He, M., Zhang, Y., Wen, D., & Wang, Y. (2021). Forecasting crude oil prices: A scaled PCA approach. Energy Economics, 97, 105189. [Google Scholar] [CrossRef]
  31. Hillert, A., & Ungeheuer, M. (2015). Ninety years of media coverage and the cross-section of stock returns. SSRN Electronic Journal, March. 1–2. [Google Scholar] [CrossRef]
  32. Hirshleifer, D., Hsu, P. H., & Li, D. (2013). Innovative efficiency and stock returns. Journal of Financial Economics, 107(3), 632–654. [Google Scholar] [CrossRef]
  33. Hirshleifer, D., & Teoh, S. H. (2003). Limited attention, information disclosure, and financial reporting. Journal of Accounting and Economics, 36, 337–386. [Google Scholar] [CrossRef]
  34. Hong, H., & Stein, J. C. (1999). A unified theory of underreaction, momentum trading, and overreaction in asset markets. The Journal of Finance, 54(6), 2143–2184. [Google Scholar] [CrossRef]
  35. Huang, L., & Liu, H. (2007). Rational inattention and portfolio selection. The Journal of Finance, 62(4), 1999–2040. [Google Scholar] [CrossRef]
  36. Ibikunle, G., McGroarty, F., & Rzayev, K. (2020). More heat than light: Investor attention and bitcoin price discovery. International Review of Financial Analysis, 69, 101459. [Google Scholar] [CrossRef]
  37. Jaffri, A., Shirvani, A., Jha, A., Rachev, S. T., & Fabozzi, F. J. (2025). Optimizing portfolios with Pakistan-exposed exchange-traded funds: Risk and performance insight. Journal of Risk and Financial Management, 18(3), 158. [Google Scholar] [CrossRef]
  38. Jamal, A. H., & Ashraf, A. (2024). How demutualization of Pakistan stock exchange affected determinants of foreign portfolio investment in Pakistan? International Journal of Management Research and Emerging Sciences, 14(2), 87–101. [Google Scholar] [CrossRef]
  39. Jolliffe, I. T., & Cadima, J. (2016). Principal component analysis: A review and recent developments. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 374(2065), 20150202. [Google Scholar] [CrossRef]
  40. Joyo, A. S., & Lefen, L. (2019). Stock market integration of Pakistan with its trading partners: A multivariate DCC-GARCH model approach. Sustainability, 11(2), 303. [Google Scholar] [CrossRef]
  41. Kahneman, D. (1973). Attention and effort (Vol. 1063, pp. 218–226). Prentice-Hall. [Google Scholar]
  42. Kahneman, D., & Tversky, A. (1979). Prospect theory: An analysis of decision under risk. Econometrica, 47(2), 263–292. [Google Scholar] [CrossRef]
  43. Li, J., & Yu, J. (2012). Investor attention, psychological anchors, and stock return predictability. Journal of Financial Economics, 104(2), 401–419. [Google Scholar] [CrossRef]
  44. Li, Z., Liu, J., Liu, J., Liu, X., & Wu, C. (2025). Investor attention and stock price manipulation: Evidence from daily quasi-natural experiments. Journal of Banking & Finance, 179, 107528. [Google Scholar] [CrossRef]
  45. Li, Z., Liu, J., Liu, X., & Wu, C. (2024). Investor attention and stock price efficiency: Evidence from quasi-natural experiments in China. Financial Management, 53(1), 175–225. [Google Scholar] [CrossRef]
  46. Lou, D. (2014). Attracting investor attention through advertising. Review of Financial Studies, 27(6), 1797–1829. [Google Scholar] [CrossRef]
  47. Ludvigson, S. C., & Ng, S. (2007). The empirical risk–return relation: A factor analysis approach. Journal of Financial Economics, 83(1), 171–222. [Google Scholar] [CrossRef]
  48. Mangi, F. (2016). Pakistan stocks jump most in a year as MSCI seen luring Cash, bloomberg. Available online: https://www.bloomberg.com/news/articles/2016-06-15/pakistani-stocks-jump-most-this-year-after-upgrade-from-msci (accessed on 10 March 2025).
  49. Merton, R. C. (1987). A simple model of capital market equilibrium with incomplete information. The Journal of Finance, 42, 483–510. [Google Scholar] [CrossRef]
  50. Miller, E. M. (1977). Risk, uncertainty, and divergence of opinion. The Journal of Finance, 32(4), 1151–1168. [Google Scholar] [CrossRef]
  51. Mooi, E., Sarstedt, M., & Mooi-Reci, I. (2018). Principal component and factor analysis. In Market research (Springer Texts in Business and Economics). Springer. [Google Scholar] [CrossRef]
  52. Neely, C. J., Rapach, D. E., Tu, J., & Zhou, G. (2014). Forecasting the equity risk premium: The role of technical indicators. Management Science, 60(7), 1772–1791. [Google Scholar] [CrossRef]
  53. Oprea, A. (2022). The use of principal component analysis (PCA) in building yield curve scenarios and identifying relative-value trading opportunities on the Romanian government bond market. Journal of Risk and Financial Management, 15(6), 247. [Google Scholar] [CrossRef]
  54. Pakistan Stock Exchange. (2024, April 1). PSX—A historical perspective. Available online: https://www.psx.com.pk/psx/psx-blog-articles/psx-a-historical-perspective (accessed on 9 August 2025).
  55. Pakistan Stock Exchange. (2025, June). KSE100 index. Available online: https://www.psx.com.pk/psx/themes/psx/uploads/KSE-100-and-KSE100-PR-Brochure-Jun-2025-Updated.pdf (accessed on 9 August 2025).
  56. Pakistan Stock Exchange Annual Report. (2024). Available online: https://www.psx.com.pk/psx/themes/psx/uploads/Annual-Report-2024.pdf (accessed on 9 August 2025).
  57. Pashler, H., Johnston, J. C., & Ruthruff, E. (2001). Attention and performance. Annual Review of Psychology, 52(1), 629–651. [Google Scholar] [CrossRef]
  58. Peng, L., & Xiong, W. (2006). Investor attention, overconfidence and category learning. Journal of Financial Economics, 80(3), 563–602. [Google Scholar] [CrossRef]
  59. Resta, M. (2016). Enhancing self-organizing map capabilities with graph clustering: An application to financial markets. Intelligent Systems in Accounting, Finance and Management, 23(1–2), 21–46. [Google Scholar] [CrossRef]
  60. Seasholes, M. S., & Wu, G. (2007). Predictable behavior, profits, and attention. Journal of Empirical Finance, 14(5), 590–610. [Google Scholar] [CrossRef]
  61. Shah, S. Z. A., Ahmad, M., & Mahmood, F. (2018). Heuristic biases in investment decision-making and perceived market efficiency: A survey at the Pakistan stock exchange. Qualitative Research in Financial Markets, 10(1), 85–110. [Google Scholar] [CrossRef]
  62. Sharif, S. (2018). From mutualized exchange to investor-owned demutualized entity: The case of Pakistan stock exchange. International Journal of Experiential Learning & Case Studies, 3(2), 1–8. [Google Scholar] [CrossRef]
  63. Simon, H. A. (1955). A behavioral model of rational choice. The Quarterly Journal of Economics, 69(1), 99–118. [Google Scholar] [CrossRef]
  64. Smales, L. A. (2021). Investor attention and global market returns during the COVID-19 crisis. International Review of Financial Analysis, 73, 101616. [Google Scholar] [CrossRef]
  65. Song, Z., Gong, X., Zhang, C., & Yu, C. (2023). Investor sentiment based on scaled PCA method: A powerful predictor of realized volatility in the Chinese stock market. International Review of Economics & Finance, 83, 528–545. [Google Scholar] [CrossRef]
  66. Subramaniam, S., & Chakraborty, M. (2020). Investor attention and cryptocurrency returns: Evidence from quantile causality approach. Journal of Behavioral Finance, 21(1), 103–115. [Google Scholar] [CrossRef]
  67. Tversky, A., & Kahneman, D. (1973). Availability: A heuristic for judging frequency and probability. Cognitive Psychology, 5(2), 207–232. [Google Scholar] [CrossRef]
  68. Tversky, A., & Kahneman, D. (1974). Judgment under uncertainty: Heuristics and biases: Biases in judgments reveal some heuristics of thinking under uncertainty. Science, 185(4157), 1124–1131. [Google Scholar] [CrossRef]
  69. Tversky, A., & Kahneman, D. (1992). Advances in prospect theory: Cumulative representation of uncertainty. Journal of Risk and Uncertainty, 5, 297–323. [Google Scholar] [CrossRef]
  70. Wang, B., Long, W., & Wei, X. (2018). Investor attention, market liquidity and stock return: A new perspective. Asian Economic and Financial Review, 8(3), 341. [Google Scholar] [CrossRef]
  71. Wang, X. (2006). On the effects of dimension reduction techniques on some high-dimensional problems in finance. Operations Research, 54(6), 1063–1078. [Google Scholar] [CrossRef]
  72. Wold, S., Esbensen, K., & Geladi, P. (1987). Principal component analysis. Chemometrics and Intelligent Laboratory Systems, 2(1–3), 37–52. [Google Scholar] [CrossRef]
  73. Yahya, F., Shaohua, Z., Abbas, U., & Waqas, M. (2021). COVID-19-induced returns, attention, sentiments and social isolation: Evidence from dynamic panel model. Global Business Review. [Google Scholar] [CrossRef]
  74. Yao, T., Zhang, Y. J., & Ma, C. Q. (2017). How does investor attention affect international crude oil prices? Applied Energy, 205, 336–344. [Google Scholar] [CrossRef]
  75. Yuan, Y. (2015). Market-wide attention, trading, and stock returns. Journal of Financial Economics, 116(3), 548–564. [Google Scholar] [CrossRef]
  76. Zahid, S., & Saleem, H. M. N. (2025). A comparison of machine learning algorithms to predict stock price during COVID-19 outbreaks: An empirical investigation of Pakistan stock exchange. Global Business Review, 09721509251316121. [Google Scholar] [CrossRef]
  77. Zhang, D., Hu, M., & Ji, Q. (2020). Financial markets under the global pandemic of COVID-19. Finance Research Letters, 36, 101528. [Google Scholar] [CrossRef] [PubMed]
  78. Zhang, W., Shen, D., Zhang, Y., & Xiong, X. (2013). Open source information, investor attention, and asset pricing. Economic Modelling, 33, 613–619. [Google Scholar] [CrossRef]
  79. Zou, L., Cao, K. D., & Wang, Y. (2019). Media coverage and the cross-section of stock returns: The Chinese evidence. International Review of Finance, 19(4), 707–729. [Google Scholar] [CrossRef]
Figure 1. LSEG data for the KSE-100 index.
Figure 1. LSEG data for the KSE-100 index.
Jrfm 18 00670 g001
Figure 2. KSE 100 market returns and investors’ attention indices.
Figure 2. KSE 100 market returns and investors’ attention indices.
Jrfm 18 00670 g002
Table 1. Definition and Operationalization of Variables.
Table 1. Definition and Operationalization of Variables.
VariablesSymbol and OperationalizationDefinition
Abnormal Trading Volume A A T V i t = T V i t T V i t ¯ (1)In Equation (1), A A T V i t is the equal-weighted abnormal trading volume ( A T V i t ) at time t across all stocks. It is calculated as a ratio of the stock’s trading volume ( T V i t ) at time t to its average trading volume ( T V i t ¯ ) over the previous year. According to Barber and Odean (2008), when abnormally heavy trading volumes are observed in the stock market, investors are more attracted to these shares than usual. Hence, it is a measure of the market-level attention index.
Extreme Returns A E R i t = E R i t E R i t 1 (2)According to Barber and Odean (2008), investors notice extreme one-day returns, positive or negative, mostly when news about the company occurs. Hence, extreme returns play a significant role in measuring investors’ attention. In Equation (2), A E R i t is ratio of extreme returns ( E R i t ) at the end of each month to the average of extreme returns over the previous one year for each stock ( E R i t 1 ). Hence, it A E R i t is the equal-weighted extreme returns at time t across all stocks.
Past Returns A P R i t = s = t 12 t 1 ( 1 + r i , s ) 1 Aboody et al. (2010) highlighted that stocks with sharp past returns attract investors’ attention. Also, the company’s earnings announcements grab attention; hence, there is a need to include a past returns proxy in the investors’ attention index. A P R i t is the cumulative monthly returns for the past twelve months for each stock i at time t is calculated which is the equally weighted past return across all stocks (J. Chen et al., 2022).
Nearness to the KSE-100 Index 52-Week High A N K 52 W H t = p t p 52 W H t (3)To empirically test psychological anchoring and limited investor attention, J. Li and Yu (2012) used Nearness to the Index 52-week high ( A N K 52 W H t ) and historical high proxies. Furthermore, the authors stated that A N K 52 W H t helps in measuring the extent of underreaction and in predicting the aggregate market returns. In Equation (3), A N K 52 W H t is the ratio of the current level of KSE-100 index ( p t ) at the end of month t to its 52-week high ( p 52 W H t ) at month t .
Nearness to the KSE-100 Index Historical High A N K I H H t = p t p max t (4)J. Li and Yu (2012) used Nearness to the KSE-100 index historical high ( A N K I H H t ) specifically to examine when the current price is far from the historical high price, which is the time when investors react to the bad news. The authors stated that investors overreact because there has been a series of bad news in the past, and prolonged news has a negative impact on future market returns.
In Equation (4), A N K I H H t is the ratio of the current KSE-100 index ( p t ) at the end of the month t to its historical high ( p max t ) at the end of month t . The p max t is the maximum index value of KSE-100 price index dataset from January 2004 to December 2024.
Google Search Volume A G S V t Da et al. (2011) proposed Google aggregate search frequency as a direct measure of investors’ attention. A G S V t is the monthly aggregate search frequency of companies listed on KSE-100 index from Google Trends using their stock ticker. The data is available from January 2004 onwards.
Equity Mutual Funds Inflow A N A V E M F t A N A V E M F t denotes the net asset value of mutual fund inflow for each equity mutual fund i at the end of the month t . Like J. Chen et al. (2022), we computed the equal-weighted mutual funds inflow as a proxy for the market level.
KSE 100 Index Returns K S E 100 M R t = ln ( I P t I P t 1 ) (5) K S E 100 M R t is the index returns, I P t is the KSE-100 index price at time t and I P t 1 is the index price at time t 1 .
KSE 100 Index Excess Returns K S E 100 E M R t = K S E 100 M R t R F t (6) K S E 100 E M R t is the KSE-100 index excess market returns. It is the difference of K S E 100 M R t and R F t (three-month treasury bill rate)
Table 2. Descriptive statistics of investors’ attention proxies and KSE-100 index returns.
Table 2. Descriptive statistics of investors’ attention proxies and KSE-100 index returns.
VariablesObsMeanSDMinMaxP1P99SkewKurt
A N K 52 W H 2520.8610.1150.39510.5521−0.9743.628
A N K I H H 2520.2410.1640.04110.0440.7721.0614.872
A A T V 2520.9960.3650.0042.3610.2122.0130.583.707
A G S V 25210.4422.8621.37319.5292.45118.049−0.223.967
A E R 2520.080.0210.0010.2330.0410.1572.58518.498
A P R 2520.2380.346−0.561.397−0.5291.190.2692.965
A N A V E M F 2522404.885659.0021258.8384666.7111306.074163.1471.2574.088
K S E 100 E M R 252−0.0880.081−0.4830.167−0.2970.107−0.5045.309
Note. Obs represents the number of observations, Mean is the average of investors’ index proxies, SD is the standard deviation, Min is the minimum value, Max is the maximum value in the data series, P1 is the first percentile, P99 is the ninety-ninth percentile, Skew represents skewness, and Kurt is kurtosis. The Table exhibits the descriptive statistics of key variables: A N K 52 W H (Nearness to KSE-100 Index 52-Week High), A N K I H H (Nearness to KSE-100 Index Historical High), A A T V (Abnormal Trading Volume), A G S V (Google Search Volume), A E R (Extreme Returns), A N A V E M F (Equity Mutual Funds Inflow), A P R (Past Returns), and K S E 100 E M R is the KSE 100 index excess returns. The data frequency is monthly, and the period is from January 2004 to December 2024.
Table 3. Pairwise correlation matrix of investors’ attention proxies.
Table 3. Pairwise correlation matrix of investors’ attention proxies.
Variables K S E 100 E M R A N K 52 W H A D N K I H H A A T V A G S V A E R A P R A N A V E M F
K S E 100 E M R 1.000
A N K 52 W H 0.387 ***1.000
(0.000)
A D N K I H H 0.566 ***0.123 *1.000
(0.000)(0.052)
A A T V 0.112 *0.382 ***0.107 *1.000
(0.077)(0.000)(0.091)
A G S V −0.288 ***−0.0510.0830.117 *1.000
(0.000)(0.420)(0.188)(0.065)
A E R −0.176 ***−0.160 **−0.0100.0940.143 **1.000
(0.005)(0.011)(0.877)(0.137)(0.023)
A P R 0.271 ***0.187 ***0.0490.229 ***−0.335 ***−0.0951.000
(0.000)(0.003)(0.442)(0.000)(0.000)(0.131)
A N A V E M F 0.163 ***0.131 **0.0210.264 ***0.043−0.0890.327 ***1.000
(0.009)(0.038)(0.737)(0.000)(0.497)(0.160)(0.000)
Note. Table 3 illustrates the relationship between K S E 100 E M R and seven attention index proxies. *** p < 0.01, ** p < 0.05, * p < 0.1.
Table 4. Principal components eigenvalue shows total variance explained by variables/factors.
Table 4. Principal components eigenvalue shows total variance explained by variables/factors.
ComponentEigenvalueDifferenceProportionCumulative
Comp11.8310.4980.2620.262
Comp21.3340.3030.1910.452
Comp31.0300.1070.1470.599
Comp40.9240.0400.1320.731
Comp50.8830.3630.1260.857
Comp60.5200.0420.0740.932
Comp70.4780.0000.0681.000
Table 5. Principal components (eigenvectors).
Table 5. Principal components (eigenvectors).
VariableComp1Comp2Comp3Comp4Unexplained
A N K 52 W H 0.4790.111−0.363−0.0170.427
A D N K I H H 0.1610.291−0.5820.4700.287
A A T V 0.4840.4170.1330.0490.320
A E R −0.1700.4110.5590.5960.073
A P R 0.510−0.3090.2610.2480.271
A G S V −0.1500.679−0.025−0.4400.164
A N A V E M F 0.4470.0620.362−0.4080.341
Table 6. The coefficients of eigenvalues results.
Table 6. The coefficients of eigenvalues results.
VariableComp 1
(PCA1)
Comp 2
(PCA2)
Comp 3
(PCA3)
Comp 4
(PCA4)
Unexplained
A N K 52 W H 0.4792 −0.3634 0.4267
A D N K I H H −0.58160.47000.287
A A T V 0.48360.4168 0.3195
A E R 0.41080.55850.59580.0728
A P R 0.5095−0.3087 0.2707
A G S V 0.6793 −0.43970.1642
A N A V E M F 0.4465 0.3621−0.40840.3407
Table 7. KMO measure of sampling adequacy.
Table 7. KMO measure of sampling adequacy.
VariableKMO
A N K 52 W H 0.5595
A D N K I H H 0.6101
A A T V 0.5368
A E R 0.4403
A P R 0.5400
A G S V 0.4170
A N A V E M F 0.5693
Overall0.5248
Table 8. Descriptive statistics.
Table 8. Descriptive statistics.
VariablesObsMeanStd. Dev.MinMaxp1p99Skew.Kurt.
A P C A 1 25101.353−5.4083.188−3.3932.859−0.3883.797
A P C A 2 25101.155−3.8663.736−2.4873.4540.3364.042
A P C A 3 25101.015−3.5054.62−2.1023.9040.9196.316
A P C A 4 25100.961−3.2324.544−2.2013.2670.6836.118
Table 9. Pairwise correlations.
Table 9. Pairwise correlations.
Variables K S E 100 E M R A P C A 1 A P C A 2 A P C A 3 A P C A 4
K S E 100 E M R 1.000
A P C A 1 0.451 ***1.000
(0.000)
A P C A 2 −0.0610.0001.000
(0.335)(1.000)
A P C A 3 −0.415 ***0.0000.0001.000
(0.000)(1.000)(1.000)
A P C A 4 0.288 ***0.0000.0000.0001.000
(0.000)(1.000)(1.000)(1.000)
*** p < 0.01.
Table 10. Generalized least square regression results of each attention index.
Table 10. Generalized least square regression results of each attention index.
VariablesCoef.St. Er.t-Valuep-ValueR2F-TestProb > FTransformed DW
Model 1:  K S E 100 E M R t + 1 = α 0 + β 1 A P C A 1 t + ε t + 1
A P C A 1 0.3610.0517.060.000 ***0.16749.850.000 ***2.01
Constant−0.0090.076−0.120.904
Model 2:  K S E 100 E M R t + 1 = α 0 + β 1 A P C A 2 t + ε t + 1
A P C A 2 0.0890.0661.360.1760.0071.850.1762.10
Constant−0.0120.096−0.120.904
Model 3:  K S E 100 E M R t + 1 = α 0 + β 1 A P C A 3 t + ε t + 1
A P C A 3 −0.6320.046−13.690.000 ***0.430187.330.000 ***2.22
Constant−0.0310.120−0.260.796
Model 4:  K S E 100 E M R t + 1 = α 0 + β 1 A P C A 4 t + ε t + 1
A P C A 4 0.3200.0694.650.000 ***0.08021.670.000 ***2.05
Constant−0.0080.088−12.380.928
Table 10 presents the results of four GLS regressions examining the impact of A P C A 1 , A D P C A 2 , A P C A 3 , and A P C A 4 attention indices on K S E 100 E M R t + 1 . *** shows significance at the 1% level, respectively. DW denotes Durbin–Watson.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Thalassinos, E.; Parveen, S.; Mughal, R.; Zada, H.; Ahmed, S. PCA-Based Investor Attention Index and Its Impact on the KSE-100 Excess Returns. J. Risk Financial Manag. 2025, 18, 670. https://doi.org/10.3390/jrfm18120670

AMA Style

Thalassinos E, Parveen S, Mughal R, Zada H, Ahmed S. PCA-Based Investor Attention Index and Its Impact on the KSE-100 Excess Returns. Journal of Risk and Financial Management. 2025; 18(12):670. https://doi.org/10.3390/jrfm18120670

Chicago/Turabian Style

Thalassinos, Eleftherios, Samina Parveen, Riffat Mughal, Hassan Zada, and Shakeel Ahmed. 2025. "PCA-Based Investor Attention Index and Its Impact on the KSE-100 Excess Returns" Journal of Risk and Financial Management 18, no. 12: 670. https://doi.org/10.3390/jrfm18120670

APA Style

Thalassinos, E., Parveen, S., Mughal, R., Zada, H., & Ahmed, S. (2025). PCA-Based Investor Attention Index and Its Impact on the KSE-100 Excess Returns. Journal of Risk and Financial Management, 18(12), 670. https://doi.org/10.3390/jrfm18120670

Article Metrics

Back to TopTop