4.1. Descriptive Statistics and Panel Diagnostics
The following
Table 3 reports summary statistics for the model-ready panel of observations. The dependent variable, daily net flow, has a mean of USD 10.30 million with a standard deviation of USD 99.78 million, reflecting enormous heterogeneity in capital allocation across funds and over time. The median flow is exactly zero, indicating that on a typical day, most ETFs experience negligible net creation or redemption activity, while the extremes range from USD
642.5 million to USD 1119.9 million. The distribution is positively skewed (
) and strongly leptokurtic (excess kurtosis
); the Jarque–Bera test statistic
decisively rejects the null of normality (
), confirming that the flow distribution exhibits fat tails, motivating the use of robust inference procedures throughout.
BrandScore has a mean of 1.91 and a standard deviation of 4.845 but a median of zero, confirming that the majority of ETF-day observations involve no issuer-specific mentions. This concentration is a defining feature of the empirical setting: on any given day, most issuers are “invisible” in community discourse, while a small number (predominantly IBIT and FBTC) dominate the conversation. The right tail is substantial (maximum = 60.98, skewness = 4.03), driven by episodic spikes in discussion around specific issuers during salient market events. The interaction variable inherits this concentration, with a median of zero, a mean of 0.27, and extreme kurtosis (47.99), indicating that the joint occurrence of non-zero brand visibility and strongly directional sentiment is a relatively rare but potentially impactful event.
Table 4 and
Figure 1 present the pairwise correlation structure. Net flow is most strongly correlated with BrandScore (
,
) and the sentiment–brand interaction (
,
), providing preliminary evidence that issuer visibility is associated with capital allocation even before conditioning on other variables. By contrast, the correlation between flow and lagged sentiment is negligible (
,
), foreshadowing the regression result that sentiment alone has no predictive power for fund-level flows. Importantly, sentiment and attention are negatively correlated (
,
), indicating that high-volume discussion days are not necessarily positive in tone, a finding consistent with attention spikes during negative market events such as large drawdowns or regulatory uncertainty. The correlation between BrandScore and the interaction term is high (
) by construction, since the interaction is the product of sentiment and BrandScore; however, VIF diagnostics reported below confirm that this mechanical collinearity does not compromise coefficient precision.
In addition, we conduct a battery of panel diagnostic tests prior to estimation.
Table 5 reports the results of unit root tests, which confirm that all panel variables are stationary in levels. For market-wide variables (which are common across all entities), the ADF test rejects the unit root null at the 5% level or better, with ADF statistics ranging from
(Attention) to
(BTC Return).
For entity-varying variables, the IPS panel unit root test produces strongly negative statistics: for Net Flow () and for BrandScore (). These results confirm panel stationarity and validate the use of the fixed-effects estimator in levels without differencing.
VIF values for the Model 2 regressors are all below 2.2 (the highest being 2.18 for the interaction term), well below the conventional threshold of 5, confirming that multicollinearity does not compromise the precision of the coefficient estimates despite the mechanical correlation between BrandScore and the interaction term.
The Hausman specification test yields a negative test statistic (
), a finite-sample phenomenon that arises when the estimated difference in variance matrices
is not positive semi-definite, a known issue in panels with few clusters [
50]. Although the negative statistic precludes a standard chi-squared rejection, the theoretical justification for fixed effects is compelling: the 10 ETFs differ in persistent, unobserved ways (fee schedules, custody arrangements, distribution networks, institutional reputation) that are almost certainly correlated with BrandScore and flow levels, violating the orthogonality condition required for random-effects consistency.
The Pesaran test for cross-sectional dependence yields
(
), with a mean absolute pairwise residual correlation of
. This substantial cross-entity residual co-movement is expected in a panel of ETFs tracking the same underlying asset: common Bitcoin price shocks and market-wide sentiment shifts generate correlated residuals even after controlling for observed market variables. The significant cross-sectional dependence motivates the Driscoll–Kraay robustness check reported in
Section 4.3.
4.2. Panel Regression Results
Table 6 reports the panel regression results. Column (1) reports the baseline model estimated on the observations. The coefficient on lagged sentiment is positive (
) but statistically insignificant (
,
), indicating that aggregate community mood does not reliably predict next-day ETF-level flows. Similarly, lagged attention carries a positive but insignificant coefficient (
,
). Among the market controls, lagged Bitcoin return is positive and economically large (
) but falls short of conventional significance (
). The within-
of 0.030 indicates that the baseline specification explains approximately 3% of the within-ETF temporal variation in daily flows, a figure that is consistent with the well-documented difficulty of explaining daily fund flow variation in the broader ETF literature. The key takeaway from Model 1 is that generic Reddit sentiment and attention (without any issuer-specific conditioning) have no statistically detectable effect on Bitcoin ETF flows. This null result is itself informative: it suggests that aggregate community mood is too diffuse a signal to predict product-level capital allocation in a market where 10 near-identical products compete for the same investor base.
Column (2) reports the brand-augmented model, which yields the central empirical finding of this paper. The BrandScore level coefficient is not statistically significant (, , ), indicating that issuer mention density alone does not predict next-day fund-level flows. This non-result reflects the countervailing influence of GBTC (Grayscale): Grayscale’s product has high brand visibility (mean BrandScore 2.0, third after IBIT at 9.9 and FBTC at 3.6) but experienced persistent large outflows as investors rotated from GBTC’s 1.50% expense ratio to lower-cost alternatives, creating a negative BrandScore–flow association that offsets the positive association observed for other issuers. The Sentiment BrandScore interaction is positive and significant (, , ), confirming that the effect of sentiment on flows is amplified when the issuer is more visible in community discourse. The standalone sentiment coefficient shrinks from 12.68 in Model 1 to 7.63 in Model 2 and remains insignificant (), consistent with the hypothesis that generic sentiment is not independently meaningful for ETF-level capital allocation; rather, sentiment operates through a brand-mediated channel. The within- changes marginally from 0.030 to 0.031.
The incremental -test for the joint restriction yields (), indicating that the brand variables do not jointly achieve significance at conventional levels in terms of incremental improvement. This reflects the fact that the direct BrandScore coefficient is insignificant; the brand channel operates through the interaction with sentiment rather than through a direct level effect. The interaction coefficient alone is significant (), and the economic magnitude of the interaction is substantial.
Under Model 2, the marginal effect of sentiment on flows is
The threshold BrandScore at which the marginal effect would equal zero is . Since BrandScore is non-negative by construction, the marginal effect is always positive across the observed data range (but its magnitude and statistical significance vary dramatically with issuer visibility). At zero visibility, the marginal effect is USD 7.63 per unit of sentiment but is statistically indistinguishable from zero. As BrandScore increases, the marginal effect grows linearly and gains statistical power. This pattern (where the mechanism ‘turns on’ only at sufficient brand salience) is the empirical signature of the brand-filtered sentiment transmission hypothesis.
Table 7 quantifies these marginal effects at representative BrandScore values, and
Figure 2 visualizes the relationship. For IBIT (BlackRock), with a mean BrandScore of 9.92, the marginal effect is 36.70 million per unit of sentiment (
, 95% CI
). This means that a one-standard-deviation increase in community sentiment (
) is associated with an additional 5.23 million in net inflows to IBIT the following day. For an issuer at the sample mean BrandScore (1.91), the same one-standard-deviation sentiment shock translates into only 1.89 million, that is, a nearly threefold reduction arising entirely from the brand channel. At zero visibility, the effect is USD 1.09 million and is statistically indistinguishable from zero. The progressive ‘activation’ of sentiment as BrandScore rises, visualized in
Figure 2, provides compelling evidence that issuer brand visibility is the mechanism through which diffuse community sentiment becomes translated into concrete capital allocation decisions.
To contextualize the economic magnitude of the brand-sentiment interaction, we compute standardized effect sizes for the Model 2 coefficients. The BrandScore direct effect is million (not statistically significant), while the joint interaction effect (representing a simultaneous one-standard-deviation increase in both Sentiment and BrandScore) is million, or 27.2% of the sample mean daily flow (). The VIX effect is million, confirming that market fear exerts a meaningful dampening effect on ETF inflows. Among the significant regressors, BTC Return has the largest standardized effect (, or 143% of the mean daily flow), consistent with the dominant role of underlying asset momentum. The brand–sentiment interaction, while smaller in absolute magnitude, captures the novel product-selection mechanism that is this paper’s primary contribution.
Columns (3) and (4) report the baseline model estimated separately on TradFi and crypto-native subsamples (with the corrected classification: IBIT, FBTC, ARKB, EZBC, and BTCW as TradFi; BITB, HODL, BRRR, BTCO, and GBTC as crypto-native). For TradFi ETFs, the VIX coefficient is negative and significant (, ), consistent with the interpretation that flows into institutionally backed ETFs are more sensitive to aggregate risk conditions; this is a pattern that resonates with the risk management frameworks employed by the financial advisors and wealth managers who disproportionately channel capital through TradFi products. The within- of 0.066 is the highest among all specifications. For crypto-native ETFs, lagged Bitcoin return is highly significant (, ), suggesting that investors in specialist products are more responsive to the cryptocurrency’s own price momentum, while the within- of 0.009 indicates that the included variables explain very little of the within-ETF flow variation for smaller issuers, which is consistent with flows to niche products being driven by idiosyncratic factors such as promotional campaigns, fee reductions, or platform-specific distribution arrangements that our model does not capture.
4.3. Robustness and Sensitivity Analysis
We subject the core findings to a comprehensive battery of robustness checks spanning alternative inference procedures, lag specifications, and distributional modeling. The objective is to determine which features of the baseline results survive the most stringent methodological stress tests and which are sensitive to inferential assumptions.
To address the concern that common daily shocks not spanned by the three market controls may confound the entity-FE estimates, we re-estimate the brand-augmented specification with two-way (entity + date) fixed effects. The date fixed effects absorb all market-wide daily variation, including Sentiment, Attention, and the three market controls, so that identification relies exclusively on within-day cross-sectional differences in BrandScore and the interaction term. In this demanding specification, BrandScore level is not independently significant (, ), consistent with the entity-FE result and reflecting the offsetting role of GBTC (high visibility, persistent outflows). Crucially, however, the Sentiment BrandScore interaction remains significant (, ), confirming that the brand-filtered sentiment transmission mechanism survives even when all common daily shocks are absorbed. This is the paper’s most demanding identification test: it asks whether, on a given day, the ETF whose brand visibility is high relative to its peers receives disproportionately more flows when community sentiment is positive, controlling for everything common across ETFs on that day.
We note that a market-share specification, in which the dependent variable is each ETF’s share of total daily category net flow, does not yield significant results for either BrandScore or the interaction term. This null finding likely reflects noise introduced by the ratio normalization: on days when total category net flow is near zero or negative (approximately 12% of the sample), the market-share variable becomes extremely volatile or undefined, degrading power. The two-way fixed-effects specification, which absorbs all common daily variation through date fixed effects, achieves the same conceptual objective of isolating within-day cross-sectional allocation differences while avoiding the denominator-volatility problem inherent in ratio-based dependent variables. We therefore interpret the two-way FE result () as the more reliable test of the product-allocation mechanism.
Given the significant cross-sectional dependence documented in
Section 4.1 (
,
), we re-estimate Model 2 using Driscoll–Kraay kernel-based HAC standard errors with a Bartlett kernel and bandwidth
. Under Driscoll–Kraay SE, the BrandScore coefficient remains insignificant (
), and the interaction coefficient also loses significance (
). Conversely, Attention becomes significant (
), BTC Return becomes highly significant (
), and VIX achieves marginal significance (
). This divergence reflects a fundamental tension in panel inference with a small number of cross-sectional units. Entity-clustered standard errors treat the
ETFs as the source of sampling uncertainty, which is appropriate when the research question concerns the within-entity temporal relationship between discourse and flows. Driscoll–Kraay standard errors additionally account for cross-entity contemporaneous correlation but assume large-
asymptotics, and they penalize regressors whose cross-sectional variation is itself correlated with common market factors. Because BrandScore is mechanically zero for most ETF-day observations and concentrated in the few dominant issuers (IBIT, GBTC), the cross-sectional dispersion in BrandScore-driven residuals is inherently correlated across entities during high-visibility episodes, precisely the variation that identifies the brand effect. We interpret the Driscoll–Kraay results as a useful stress test that reveals the limits of identification in a panel of products tracking the same underlying asset, rather than as a refutation of the primary finding.
Table 8 reports the bootstrap results. The bootstrap 95% CI for BrandScore is
, which includes zero, consistent with the finding that the direct brand visibility effect is not statistically significant. The bootstrap mean (
) is close to the OLS estimate (
), with a moderate downward bias of
, indicating that the non-significance of BrandScore is confirmed under finite-sample resampling. The bootstrap SE for BrandScore (
) is comparable to the entity-clustered SE (
). For the interaction term, the bootstrap 95% CI is
, which is wide and includes zero, tempering the strength of the interaction result under the most conservative finite-sample assessment. The bootstrap mean for the interaction (
) is below the OLS estimate (
), suggesting some attenuation that arises from the concentration of the identifying variation in a few high-BrandScore entities. These bootstrap results delineate the boundary of what the data can firmly establish: the interaction effect (significant under clustered inference at
) requires interpretation with appropriate caution given the small cross-sectional dimension (
), while the direct BrandScore level effect is clearly not robust.
We further verify that the results are not sensitive to the choice of lag length by re-estimating Model 2 using contemporaneous (lag 0) and two-day lagged (lag 2) discourse variables. The BrandScore coefficient is not significant at any lag specification ( in all cases), with point estimates of (lag 0), (lag 1), and (lag 2). The interaction coefficient is significant at lag 0 (, ) and lag 1 (, ), but not at lag 2 (, ). The interaction coefficient is highest at lag 0 and declines at longer horizons, consistent with sentiment having the most immediate impact when it is contemporaneous with the flow decision. As a further robustness check against the influence of extreme flow observations, we winsorize the dependent variable at the 1st and 99th percentiles and re-estimate Model 2. The Sentiment BrandScore interaction coefficient remains positive and highly significant (, ), in fact strengthening relative to the unwinsorized specification (). This confirms that the headline interaction result is not driven by a handful of extreme inflow or outflow events; if anything, the brand-filtered sentiment mechanism is more precisely estimated once extreme flow days are trimmed.
We also verify sensitivity to the engagement weighting specification. Replacing the original weight (Equation (10)) with the logarithmic alternative , a great suggestion by an anonymous reviewer, compresses the right tail and provides better differentiation among low-engagement posts. The logarithmic transformation does meaningfully alter the daily Sentiment time series ( between the original and alternative series), primarily because the original weighting allows a small number of highly upvoted posts (maximum weight 40,693) to drive the daily aggregate, whereas the logarithmic compression reduces the maximum weight to approximately 11.0. Nevertheless, re-estimating Model 2 with the alternative weighting produces qualitatively reinforcing results: the Sentiment BrandScore interaction coefficient remains positive and is in fact highly significant (, ), substantially stronger than under the original weighting (, ). The larger magnitude reflects the reduced variance of the alternative Sentiment series (standard deviation of 0.047 versus 0.141), which compresses the Sentiment scale and inflates the interaction coefficient to compensate. The key qualitative finding, that the brand-filtered sentiment mechanism is significant regardless of the weighting scheme, is confirmed.
The quantile regression results in
Table 9 reveal substantial heterogeneity in the relationship between discourse variables and flows across the conditional flow distribution, i.e., heterogeneity that the conditional-mean model necessarily obscures. The BrandScore coefficient displays a striking sign-reversal pattern across quantiles: strongly negative in the lower tail (
at
;
at
), near zero at the median (
), and strongly positive in the upper tail (
at
;
at
). This pattern reflects the role of GBTC: at lower quantiles (large outflow days), GBTC’s high brand visibility is associated with disproportionately negative flows, pulling the BrandScore coefficient negative. At upper quantiles (large inflow days), the positive association between brand visibility and flows for IBIT and other net-inflow ETFs dominates. The conditional-mean OLS estimate (
, insignificant) averages over these offsetting effects. The interaction coefficient follows a complex distributional pattern, being strongly positive at
(
), declining through the interior quantiles, and re-emerging at
(
,
).
Figure 3 visualizes these quantile coefficient estimates with their 95% confidence intervals.
The VIX coefficient displays the expected distributional pattern, being most strongly negative in the lower quantiles ( at ) and weakening toward the upper tail ( at , insignificant). This asymmetry suggests that market fear disproportionately amplifies outflow days rather than symmetrically dampening all flow quantiles, a result with implications for risk management and market stability. The BTC Return coefficient is positive and significant across all quantiles, being largest at the extremes ( at ; at ) and smallest near the median (), consistent with cryptocurrency momentum driving the tails of the flow distribution.
4.4. Dynamic Analysis
Table 10 reports the Granger non-causality test results, which address a key identification concern: whether the observed associations reflect sentiment influencing flows, or flows influencing sentiment through a feedback loop.
The Sentiment Flow tests yield -statistics well below 1.0 at all lag orders ( for , respectively), confirming that lagged sentiment does not Granger-cause flows independently, which is consistent with the regression evidence that generic sentiment has no standalone predictive content and that the brand channel is the operative mechanism. The reverse direction, Flow Sentiment, produces a marginally significant result at (, ) but insignificant results at longer lags ( and for and ). This weak, short-lived feedback (where large flows on day may slightly shift community sentiment on day ) does not generate a persistent endogeneity problem for our lagged specification, since the effect dissipates within a single day and does not accumulate over the lag horizons relevant to our model. The Attention Flow tests are insignificant at all lag orders, confirming that discussion volume per se does not predict fund flows.
To examine the temporal stability of the brand-mediated sentiment mechanism, we estimate Model 2 over rolling 90-trading-day windows, yielding 422 subsample estimates.
Figure 4 displays the rolling coefficient estimates with their 95% confidence bands. The rolling window analysis reveals that the brand-mediated sentiment mechanism is not uniformly persistent but episodic in nature (a finding with important implications for both theory and practice). The BrandScore coefficient exhibits substantial time variation, achieving statistical significance at the 5% level in 62.3% of windows. The interaction coefficient shows a similar pattern, with significance in 62.8% of windows. Both coefficients tend to spike during bull-market phases and periods of elevated market attention (such as the Bitcoin halving in April 2024 and the post-election rally in late 2024) when positive sentiment and high brand visibility jointly attract capital. Conversely, during risk-off periods and market drawdowns, the coefficients shrink toward zero or become negative, consistent with the interpretation that the brand channel is most potent when investors are actively seeking exposure rather than defensively reducing positions.
Figure 5 plots the rolling within-
over time. The rolling within-
averages 0.055 across all windows (somewhat higher than the full-sample
of 0.031) with a range of [0.027, 0.080]. The explanatory power of the model varies meaningfully over time, peaking during periods of elevated market activity and declining during quieter intervals. This temporal variation in model fit is consistent with the view that social discourse becomes a more important driver of ETF flows during periods of heightened market salience, when retail attention is high and the information content of community discussion is most relevant to investment decisions.
The episodic nature of the brand effect connects to the theoretical framework of salience-driven demand [
47]. During high-salience regimes, when cryptocurrency markets are in the news, prices are moving sharply, and Reddit discussion is intense, the brand channel is “active” because investors are paying attention to issuer-specific discourse and making product-selection decisions based on brand recognition. During low-salience regimes, flows are driven predominantly by common market factors. To identify which market conditions correspond to the episodic activation and deactivation of brand-sentiment channel, we classify each rolling window into three market regimes based on within-window financial conditions: (i) a bull/low-stress regime, defined as windows in which the mean daily Bitcoin return is positive and the mean VIX level is below the sample-wide median VIX (16.31); (ii) a bear/high-stress regime, defined as windows in which the mean daily Bitcoin return is negative or the mean VIX exceeds the 75th percentile of the full-sample VIX distribution (18.59); and (iii) a neutral regime for the remaining windows. Of the 422 rolling windows, 45 are classified as bull/low-stress, 228 as bear/high-stress, and 149 as neutral. The Sentiment
BrandScore interaction achieves significance (
) in 73.3% of bull/low-stress windows, 77.9% of neutral windows, but only 50.9% of bear/high-stress windows. This pattern is consistent with the theoretical mechanism: during favorable or stable market conditions, investors are actively selecting among competing ETFs, and the combination of issuer visibility with positive community sentiment matters for product-level capital allocation. During bear/high-stress periods, flows are predominantly driven by aggregate risk-off behavior, which pushes capital symmetrically out of the category and renders the brand–sentiment interaction less consequential. The asymmetry between bull/neutral and bear regimes complements the qualitative narrative in
Section 4.4 by providing a formal link between market conditions and the activation of the brand-mediated mechanism.
4.5. Discussion
The results of this study yield three principal findings that, taken together, provide convergent evidence consistent with a brand-filtered sentiment transmission mechanism in digital asset markets. First, aggregate Reddit sentiment and community attention do not independently predict daily net flows into individual Bitcoin ETFs. This null result holds across all four model specifications and all three lag structures tested, and is confirmed by the Granger non-causality analysis (
Table 10). Second, the direct BrandScore level effect is not statistically significant (
), reflecting the countervailing role of GBTC (high visibility, persistent outflows); however, the Sentiment
BrandScore interaction is positive and significant (
), survives two-way fixed effects (
) and winsorization (
), and achieves significance in approximately 63% of rolling 90-day windows. Third, the effect of sentiment on flows is moderated by brand visibility, indicating that positive community mood translates into capital allocation only when it is directed at or associated with a recognizable issuer. The interaction finding is significant under standard clustered inference and economically meaningful, but its sensitivity to conservative finite-sample inference (entity-bootstrap and Driscoll–Kraay standard errors) calls for appropriate caution.
The brand-filtered sentiment transmission mechanism documented here represents a novel contribution to the intersection of intelligent financial systems, behavioral finance, information economics, and digital asset markets. Unlike existing studies that examine the aggregate relationship between cryptocurrency sentiment and market returns [
14,
51], our analysis operates at the product level in a setting where all 10 ETFs track the same underlying asset. This identical-exposure design allows us to isolate the role of issuer-specific discourse in directing capital allocation, a mechanism that is invisible in aggregate return studies. Moreover, unlike attention-based studies that rely on Google Trends or Wikipedia page views to measure aggregate interest, BrandScore captures whom investors are discussing, not merely how much they are searching. This issuer-level granularity is what enables the interaction specification to identify the brand-moderation channel.
We emphasize that the interaction specification captures whether aggregate community mood translates into differential capital allocation across ETFs as a function of issuer visibility. It does not isolate sentiment expressed specifically about a given issuer. The mechanism we identify is therefore best characterized as a brand-salience-moderated demand channel: positive community mood becomes actionable for fund-level allocation decisions only when a specific issuer is sufficiently prominent in the discourse to serve as a focal point for investor attention. This interpretation is weaker than a pure issuer-directed sentiment channel but remains economically meaningful and empirically supported by the within-day cross-sectional identification provided by the two-way fixed-effects specification.
The insignificance of generic sentiment resonates with models of limited attention and rational inattention. In a market with 10 near-identical products, investors face a product-selection problem that cannot be resolved by aggregate community mood alone. Positive sentiment about Bitcoin in general does not indicate which ETF to buy. BrandScore resolves this ambiguity by providing issuer-specific salience: when a particular ETF dominates community discourse, investors who are already positively disposed toward Bitcoin have a focal point for directing their capital. This interpretation aligns with the salience theory of Bordalo et al. [
47], which predicts that disproportionate attention to particular attributes (in our setting, particular issuers) can shift demand independently of fundamentals. The quantile regression evidence reinforces this connection: the BrandScore coefficient displays a striking sign-reversal across quantiles, shifting from −6.86 at the 25th percentile to +10.73 at the 90th percentile. This pattern reflects the offsetting role of GBTC at lower quantiles and the dominance of IBIT at upper quantiles, indicating that the brand-salience mechanism is most potent during large inflow episodes when marginal investors are actively choosing among competing products.
The interaction effect connects to trust-based models of financial intermediation. In the cryptocurrency space, where counterparty risk, custody security, and regulatory uncertainty are salient concerns, brand recognition serves as a heuristic for credibility. Investors may be willing to act on positive sentiment signals only when those signals are associated with a trusted issuer, which is formally captured by the interaction term . The fact that the standalone sentiment coefficient is insignificant while the interaction is significant is the precise empirical signature predicted by this trust-mediation hypothesis. One can also frame BrandScore through the lens of information economics: investors who actively discuss a specific issuer have already incurred the cognitive cost of learning about that product, making them more likely to act on sentiment signals. BrandScore thus proxies not only for visibility but for the depth of investor engagement with a particular issuer, which reduces the information asymmetry barrier to capital commitment.
The quantile regression results merit further theoretical reflection. The finding that the BrandScore effect is strongest in the upper tail of the flow distribution (on days of large net inflows) is consistent with positive feedback models of fund flows [
52]. During bullish momentum episodes, initial inflows driven by brand visibility may attract additional capital from momentum-chasing investors, creating an amplification loop that concentrates flows in the most visible products. This mechanism has implications for market structure: if brand-mediated sentiment transmission amplifies flow concentration during bull markets, it may contribute to the winner-take-all dynamics observed in the Bitcoin ETF market, where IBIT has captured a disproportionate share of cumulative inflows despite offering an economically identical product to its competitors. While the monotonically increasing BrandScore coefficient across quantiles is consistent with the behavioral amplification interpretation, alternative explanations deserve consideration. The strong upper-quantile effect could also reflect a mechanical concentration effect: because a small number of ETFs (primarily IBIT and GBTC) account for the vast majority of both BrandScore variation and large-flow days, the upper-quantile coefficient may be identifying a relationship specific to these dominant products rather than a general feature of brand-mediated sentiment transmission. To the extent that BlackRock and Grayscale have unique institutional characteristics, such as superior distribution networks, broader brokerage platform integration, and pre-existing investor familiarity, the upper-tail brand effect may partially capture these confounders rather than pure discourse-driven demand. Disentangling these channels would require a larger cross-section of financial products and more granular data on distribution arrangements.
The robustness analysis delineates a clear boundary between what the data firmly establish and what remains suggestive. The Sentiment BrandScore interaction, which operationalizes the paper’s central hypothesis of brand-filtered sentiment transmission, emerges as the more robust finding. The interaction is significant under entity-clustered inference in the baseline specification (, ), survives the two-way (entity + date) fixed-effects specification that absorbs all common daily variation (, ), and strengthens under winsorization of the dependent variable (, ). The interaction achieves significance in 62.8% of rolling 90-day windows, with activation rates higher during bull/low-stress (73.3%) and neutral (77.9%) market conditions than during bear/high-stress periods (50.9%). The direct BrandScore level effect, by contrast, is not statistically significant in the corrected specification (), reflecting the countervailing influence of GBTC: Grayscale’s product has high brand visibility but experienced persistent large outflows as investors rotated from GBTC’s 1.50% expense ratio to lower-cost alternatives. This pattern is substantively clarifying rather than problematic: it demonstrates that brand visibility alone does not mechanically attract capital. Rather, visibility operates through its interaction with community sentiment, precisely as the brand-filtered transmission hypothesis predicts. When sentiment is positive and an issuer is highly visible, flows increase; when sentiment is negative or neutral, high visibility does not by itself drive inflows. We acknowledge that the interaction estimate is sensitive to conservative finite-sample inference procedures: the entity-bootstrap 95% confidence interval includes zero, and the Driscoll–Kraay -value exceeds conventional thresholds, consistent with the small cross-sectional dimension (). We therefore interpret the interaction as a well-supported but not definitively established mechanism that warrants replication with a larger cross-section of financial products.
For ETF issuers, brand visibility on social media platforms is not merely a marketing metric but a material driver of capital inflows: the finding that the Sentiment BrandScore interaction produces an estimated USD 2.80 million daily flow effect for a joint one-standard-deviation increase in both sentiment and brand visibility suggests that community engagement strategies have direct economic consequences for asset gathering, particularly when combined with positive community discourse. For regulators, the brand-mediated sentiment channel raises questions about whether social media visibility creates herding effects that concentrate flows in a small number of dominant products, potentially increasing systemic risk through excessive assets under management (AUM) concentration. For researchers, the interaction specification and quantile regression evidence demonstrate the value of non-linear and distributional modeling techniques in capturing mechanisms that conditional-mean models miss, highlighting the contribution of mathematical methods to uncovering economically meaningful heterogeneity in financial data. The finding that the brand channel is significant in approximately 63% of rolling windows raises a natural question: why does it deactivate in the remaining 37%? Two non-exclusive explanations merit consideration. First, during risk-off periods, investor behavior may shift from product selection (which ETF to buy) to market timing (whether to hold crypto exposure at all), rendering brand differentiation irrelevant. Second, the 37% of insignificant windows may reflect periods of low statistical power rather than genuine mechanism deactivation: when BrandScore variation is compressed (because no issuer dominates discourse), the interaction effect is difficult to detect even if the underlying mechanism is present. Distinguishing between these explanations, whether mechanism deactivation or power attenuation, would benefit from a formal regime-switching model that allows the coefficient itself to follow a latent Markov process, an approach we identify as a promising direction for future research.