1. Introduction
Color in urban public spaces extends far beyond aesthetic enhancement to function as a critical system variable influencing psychological and physiological responses, attention and behavior, spatial legibility and safety, place identity, and thermal environment and energy consumption. In this review, we use the term “colour as a system variable” in a pragmatic systems-engineering sense, referring to aspects of color that can be described by measurable metrics, adjusted through design decisions, and embedded in feedback loops with human perception and performance. The central research question of this review is: How can existing urban color research be translated into actionable, measurable guidelines for urban planning and design? However, prior reviews have largely remained aesthetic- or single-domain-oriented and rarely provide performance-based thresholds, cross-context comparability, or audit-ready procedures that connect color choices to verifiable outcomes in public space. Over the past three decades, research paradigms have evolved from controlled laboratory experiments to immersive virtual reality (VR) and wearable physiological monitoring, integrated with street-view machine learning and architectural/urban-scale physical simulations, providing interdisciplinary evidence and operational frameworks for understanding how color transforms urban experience and system performance [
1,
2,
3,
4,
5]. Building on this foundation, this review systematically examines why color should be considered a fundamental variable in public space design, identifies which effects demonstrate cross-contextual robustness versus high context-dependency, and explores how empirical evidence can be translated into engineering thresholds, processes, and metrics [
1,
2,
3,
6,
7,
8].
Foundational architectural-color scholarship provides the intellectual ground for this review. Caivano traces the historical development of color research in architecture and environmental design and advances a semiotic program that links perceptual variables with cultural meaning and design decisions [
9,
10,
11]. Earlier functional-color writing by Birren systematically related hue, lightness and chroma to emotional, cognitive and behavioral responses in work, therapeutic and institutional settings, establishing a practical paradigm for “color functionality” that anticipated later empirical research [
12,
13,
14]. Complementing this lineage, Mahnke’s interdisciplinary corpus consolidates evidence on how color affects human response in built environments and sets out functional planning principles that go beyond decoration to health, comfort, and performance [
15,
16,
17,
18]. In parallel, Serra Lluch and collaborators analyze color composition and preference structures in architecture using systematized attributes that can be operationalized in design workflows [
19,
20], while landscape-related work examines chromatic organization with respect to setting and viewpoint [
21]. These contributions frame urban color as both a measurable system and a cultural practice, which this review synthesizes into operational guidance.
Responsibility for urban color schemes is shared but led by urban planners and architects as conveners of spatial and cultural objectives; signage and lighting engineers co-own legibility and night-time optics; and facility/maintenance teams own life-cycle control. This division of labor aligns with the framework adopted in this review, where objectives are set by planning and design, strategies are co-developed across disciplines, metrics are instrumentable for audit, and validation includes handover and in-use monitoring [
22].
At the individual mechanistic level, color influences emotional valence and arousal through hue, brightness, saturation, and contrast, subsequently affecting attention allocation, risk assessment, and task performance. Classical experimental psychology has established robust associations between brightness and pleasantness, and between saturation and arousal [
1]. However, hue effects are context-dependent rather than universally directional: red cues may induce avoidance motivation and impair performance in examination contexts while potentially enhancing detail-oriented tasks compared to blue, whereas blue facilitates divergent and creative thinking [
2,
3]. Immersive and neurophysiological evidence demonstrates that even blue, typically categorized as a “cool” color, significantly elevates electrodermal responses and alters frontal EEG indicators relative to achromatic/white baselines, highlighting the relativity and baseline sensitivity of color effects [
4]. These findings align with cross-cultural landscape research revealing differential emotional and physiological responses to warm/cool color schemes across populations, supporting a “universal principles plus local calibration” design approach [
5].
In the empirical corpus, this context-dependence of hue manifests in several ways. Studies that equate luminance contrast while varying hue report mixed and sometimes contradictory preference or arousal effects, with outcomes moderated by task semantics (warning vs. calming settings), background complexity, cultural associations and adaptation state. For this reason, the quantitative guidance in this review is deliberately framed in terms of luminance contrast, ΔE00 and related psychophysical metrics, while hue is treated as a locally calibrated design degree of freedom that must be adjusted to project-specific cultural, climatic and functional contexts rather than prescribed universally.
Environmental-visual research demonstrates that in complex outdoor settings, enhancing target-background salience (quantified through Δ
E00 color difference and luminance contrast) improves detection rates and legibility more effectively than pursuing color harmony alone, with this directionality remaining stable across varying background complexity and illuminance conditions [
6]. Virtual wayfinding experiments similarly reveal that color-based zone coding significantly reduces immediate wayfinding time and path length, though its contribution to long-term cognitive mapping remains limited [
7]. For accessibility and inclusivity, color-only information encoding proves unreliable for individuals with color vision deficiency (CVD), necessitating redundant encoding through color, shape, and text/icon combinations with high luminance contrast as a baseline, while adhering to conservative font size-viewing distance and stroke width ratios [
8,
23].
Urban color palettes emerge from the coupling of climate, materials, and culture. Cross-city preference studies (Tokyo/Taipei/Tianjin) reveal commonalities amid differences, with universal high preference for white and structural variations elucidated through multidimensional scaling, providing psychological support for dual-layer “universal-local” strategies [
24,
25]. Field research emphasizes the foundational role of materials and craftsmanship in shaping regional colors (e.g., Mediterranean white/blue, temperate ochre/terracotta), informing anchor colors and component-level area ratios [
26]. Computer vision advances have enabled parametric translation from empirical observation to regulation: tri-layer façade parsing and rule-based structure recognition bind primary/secondary/accent colors to semantic components like windows, cornices, and railings, establishing auditable area thresholds and Δ
E00 corridors [
27]. At the urban scale, street-view analysis combined with crowdsourcing and deep learning validates positive correlations between similarity/balance and perceived safety, vitality, and aesthetic ratings [
28,
29].
Color parameters, particularly brightness/albedo and nocturnal light color, bridge aesthetic strategies with engineering performance. Increasing roof albedo from 0.08 to 0.30–0.50 significantly reduces cooling loads and peak demand while lowering surface temperatures, with maximum benefits in hot, high-insolation climates [
30]. Urban-scale net albedo increases of 0.10–0.20 for roofs and pavements yield concurrent negative radiative forcing and air conditioning cost reductions, achieving both mitigation and adaptation benefits [
31]. Recent urban and regional studies on pavement albedo enhancement confirm significant daytime surface cooling and modest near-surface air temperature reductions, while revealing context-sensitive increases in pedestrian mean radiant temperature (MRT) under certain conditions, necessitating coordination with shade provision and ventilation corridors alongside lifecycle glare and maintenance assessments [
32,
33]. For nocturnal lighting, correlated color temperature (CCT), blue light proportion, shielding/spillage control, and color rendering quality (e.g., TM-30) require integrated optimization within a multi-objective framework balancing visibility, accuracy, and ecological impact [
34,
35].
This review synthesizes interdisciplinary advances and methodological progress to establish color as a measurable, verifiable, and manageable urban system variable. By integrating salience thresholds (Δ
E00, luminance contrast, font size-viewing distance), cultural parameters (area ratios, Δ
E00 corridors), and thermal/optical metrics (albedo, CCT, TM-30, glare) as constraints, and implementing evidence pipelines from VR/eye-tracking/physiological monitoring through pilot installations to operational causal evaluation, we ensure traceability and external validity from laboratory evidence to public space implementation [
4,
6,
8,
28,
29,
30,
32,
33,
34]. The proposed “objective-strategy-metric-validation” framework aims to achieve multi-objective optimization across psychological health, wayfinding safety, place identity, and climate resilience.
As shown in
Table 1, the sample spans 2012–2025 and multiple regions (Europe, Asia, and Latin America), with citation impact ranging from highly cited foundational studies to recent contributions. The topics cover environmental color and legibility, use and health benefits of small urban green spaces, lighting and women’s safety, brightness/contrast evaluation, thermal perception, and algorithmic palette design. This snapshot complements the detailed search and eligibility protocol in the Methods and helps readers gauge both the breadth and maturity of the literature synthesized here.
This review advances the field in two ways. First, it integrates dispersed findings from perception and cognition, cultural identity, wayfinding engineering, and environmental performance to treat urban color as a measurable system variable rather than a solely aesthetic choice [
45]. Second, it proposes a unifying OSMV workflow—Objective → Strategy → Metric → Validation—that connects project goals to color strategies, to instrumentable metrics (e.g., luminance contrast and Δ
E, letter-height–distance coupling, surface reflectance/albedo, and nighttime color-quality indices), and to verification protocols (laboratory/VR/field pilots; before–after or quasi-experimental designs). In
Figure 1, the four letters of OSMV correspond to the central boxes labeled “Objective”, “Strategy”, “Metric”, and “Validation”, which are preceded by an “Inputs” block that summarizes contextual evidence and regulatory constraints. Guided by this workflow, our synthesis yields actionable insights and audit-ready criteria for planning, design review, and post-occupancy evaluation. Throughout, we distinguish (i) robust “red-line” requirements that generalize across contexts from (ii) context-calibrated parameters that depend on background complexity, illumination, and cultural semantics. Psychosocial and cultural responses are therefore handled through locally negotiated palette typologies and semantic assessments rather than through universal numerical optimization.
The synthesis indicates that legibility is governed primarily by luminance contrast, with Δ
E00 ≈ 18–25 and Weber/Michelson ≥ 0.4–0.6 as conservative bands for identification and wayfinding, while hue benefits remain context-dependent. Cultural identity is operationalized via primary/secondary/accent façade area ratios (accents ~10–15%) and Δ
E00 corridors that preserve recognizability while allowing local calibration. Environmental performance is advanced by albedo increases of ~0.10–0.20 for roofs and pavements and by night-time lighting configured with CCT ≤ 3000 K, appropriate TM-30 color-quality ranges, and glare/spill limits. Equity and accessibility are secured through redundant encoding (color + shape + text/icons + luminance contrast). In this framing, color is treated as a measurable system variable;
Figure 1 framework specifies how objectives map to strategies, normalized metrics, and tiered validation, and how non-negotiable red-line requirements are distinguished from context-calibrated parameters linked to background complexity, illumination, and cultural semantics, thereby implementing a “universal principles plus local calibration” strategy for culturally diverse and climatically varied settings. These parameter bands are obtained through the multi-step synthesis protocol described in
Section 2, in which convergent breakpoints from higher-quality studies are aggregated and then benchmarked against existing signage, accessibility and cool-material standards to ensure that the resulting thresholds are conservative, auditable and technically feasible.
Despite extensive scholarship, prior reviews and practice notes remain fragmented or primarily aesthetic-led. They seldom translate color evidence into measurable thresholds and verification procedures for outdoor public space. Baseline conditions and key moderators—such as background complexity, illumination regime and cultural semantics—are reported inconsistently. Equity provisions for people with color-vision deficiency or low vision are often omitted, and links from laboratory or VR studies to field pilots, operational evaluation and life-cycle maintenance are uneven.
The present review addresses these gaps by synthesizing and operationalising cross-domain evidence for outdoor urban public spaces, by day and by night. Robust findings are consolidated into audit-ready parameters and acceptance bands and organized through the Objective–Strategy–Metric–Verification (OSMV) pathway illustrated in
Figure 1. In this framework, Objectives specify the task class and user groups (e.g., wayfinding and egress, placemaking, daytime thermal comfort, night-time visibility), together with deployment context, hypothesized moderators and equity requirements for users with CVD or low vision. Strategies assign roles and constraints to color and light within that context, such as the structure of primary, secondary and accent palettes or rules that prioritize conspicuity over harmony. Metrics translate these strategies into instrumentable variables and conservative bands, including color-difference and luminance-contrast measures, typography geometry, surface reflectance and key night-time lighting indicators. Verification then escalates evidence from controlled laboratory or VR screening, through site pilots with luminance/illuminance and accessibility audits, to quasi-experimental field evaluations linked to explicit decision rules, inspection intervals and maintenance triggers. Inputs in the form of evidence, context and risk flow from left to right towards outputs such as specification sheets, metric registers and validation plans, closing the loop through life-cycle monitoring.
Within this framing, the synthesis indicates several cross-cutting trends. Legibility in public spaces is governed primarily by luminance contrast, while hue effects are strongly context-dependent. Cultural identity can be operationalised via structured façade palettes and area-ratio bands that preserve recognisability yet allow local calibration. Environmental performance is improved by carefully increasing surface reflectance for roofs and pavements and by configuring night-time lighting with warm spectra, adequate color quality and strict limits on glare and spill. Equity and accessibility are secured through redundant encoding, in which color is consistently combined with shape, text or icons and sufficient luminance contrast. Taken together, these elements treat color as a measurable system variable and clarify how
Figure 1 maps objectives to strategies, normalized metrics and tiered validation, while distinguishing non-negotiable baseline requirements from context-calibrated parameters linked to background complexity, illumination and cultural semantics.
2. Methods: Review Protocol and Synthesis Approach
2.1. Search Strategy and Identification of Evidence
The review followed a structured protocol designed to identify empirical and practice-oriented studies on color in outdoor urban public spaces. Major bibliographic and technical databases were searched, including Web of Science Core Collection, Scopus, ScienceDirect, Google Scholar and IEEE Xplore. The search window covered publications from 1990 to early 2025, reflecting the period in which laboratory color research, urban design practice and environmental simulation began to converge.
Boolean keyword strings combined four groups of terms: (i) setting terms (“urban”, “city”, “street*”, “square”, “public space”, “facade/façade”, “signage”); (ii) color and appearance terms (“colour/color”, “paint”, “finish”, “albedo”, “light*”); (iii) outcome terms (“perception”, “emotion”, “physiolog*”, “wayfinding”, “safety”, “thermal”, “comfort”, “identity”); and (iv) methodological terms (“experiment*”, “field study”, “mock-up”, “simulation”, “evaluation”). Searches were limited to English-language, peer-reviewed journal articles and conference papers, supplemented by a small number of standards and design guidelines where they reported empirical evidence. All records were imported into a reference manager and de-duplicated before screening.
2.2. Eligibility, Screening and Coding
Screening proceeded in three stages—title, abstract and full text—against pre-specified inclusion and exclusion criteria. Studies were **included** when they met all of the following conditions:
- (1)
The setting was an outdoor urban public space or a building-related exterior space (streets, squares, parks, plazas, transport nodes, building facades and signage) or an indoor/VR environment explicitly designed to inform such spaces;
- (2)
The study reported human perception, behavior, or environmental performance outcomes relevant to public-space design (e.g., legibility, safety, preference, thermal or lighting performance);
- (3)
Color, surface finish or lighting were manipulated or measured as independent variables; and
- (4)
Outcomes were reported quantitatively or through systematically coded qualitative data.
Studies were **excluded** if they were indoor-only with no discussion of outdoor transferability, purely theoretical or descriptive essays, or rendering-only studies without photometric or colourimetric calibration. When reports were ambiguous, at least two authors independently screened the full text and reached consensus.
For each included study we coded: (i) bibliographic information; (ii) setting (indoor, outdoor, mixed) and task class (e.g., wayfinding/egress, placemaking, thermal comfort); (iii) method type (laboratory/VR experiment, field mock-up or pilot, quasi-experimental field evaluation, observational study); (iv) participant characteristics where available (including aging and vision diversity); and (v) manipulated or observed variables and outcome measures. Rather than recording every parameter value in
Section 2, variables were grouped into a small set of operational categories used throughout the review: luminance contrast, color difference (Δ
E, preferably Δ
E00), typography geometry (letter-height–distance ratios, stroke-width and spacing rules), surface reflectance/albedo and daytime glare measures, and night-time lighting metrics (illuminance, correlated color temperature, color-quality indices, glare indices and light-spill limits). Potential moderators—including background complexity, illumination regime (day/overcast/night), and cultural or semantic context—were also coded. This coding enabled later synthesis by outcome family and by level (psychological/physiological, cultural identity, functional zoning and wayfinding, environmental adaptation).
2.3. Quality Appraisal, Synthesis and Derivation of Thresholds
Because the corpus spans different disciplines and methods, evidence quality was appraised with criteria tailored to study type. Laboratory and VR experiments were assessed for randomisation or counter-balancing, calibration of stimuli, clarity or preregistration of protocols, and reporting of effect sizes with confidence intervals. Field mock-ups and pilots were assessed for instrument reliability, worst-case testing conditions (e.g., overcast daylight and night-time), and checks for accessibility using color-vision-deficiency filters or aging-vision simulations. Quasi-experimental field evaluations were assessed for design strength (e.g., interrupted time series or difference-in-differences), control of seasonality and secular trends, treatment fidelity and possible spillover effects. Each study was assigned a four-level quality rating (high, moderate, low, unclear), and these ratings are reported in the evidence tables.
Synthesis then proceeded by outcome family rather than by study, using a structured narrative approach. Where measures were commensurable, we did not attempt formal meta-analysis; instead, we identified ranges where multiple high- and moderate-quality studies showed convergent performance breakpoints (for example, rapid drops in legibility or comfort, onset of glare, or plateauing of cooling benefits). From these convergent regions, we derived **conservative threshold bands** for key metrics such as luminance contrast, ΔE00 and albedo, and for lighting parameters relevant to circadian and ecological effects. These bands were subsequently **benchmarked against existing signage, lighting and cool-materials standards**, and, where necessary, adjusted to be at least as conservative as current practice. Sensitivity checks repeated the synthesis after excluding lower-quality or indoor-only studies and considered evidence on aging and maintenance drift (e.g., soiling and UV abrasion) to judge whether deterioration could push performance outside the recommended bands.
In the remainder of the paper, these threshold bands and acceptance ranges are translated, via the OSMV (Objective–Strategy–Metric–Validation) framework, into audit-ready guidance: objectives and strategies are linked to metrics, and these relationships are defined by the derived threshold bands and validation procedures.
4. Cultural Expression & Place Identity
In the cultural and identity domain, architectural color functions as a compositional and communicative resource rather than a purely ornamental choice. Building on Caivano’s semiotic perspective [
9,
10,
11] and empirical analyses by Serra Lluch and colleagues [
19,
20], effective schemes articulate explicit relationships among value/lightness, chroma, and contextual cues so that legibility is preserved while identity is expressed. Landscape-oriented studies further indicate that chromatic organization should be tuned to setting and viewpoint, reinforcing coherence at street and district scales [
21].
Urban color serves as the core visual encoding carrier of “placeness,” bearing the function of translating material properties, climatic conditions, and socio-cultural semantics into recognizable symbolic systems for the public. This system exhibits “dual structure” characteristics: on one hand, it manifests universal cross-cultural preference patterns. Classical studies examining Tokyo, Taipei, and Tianjin revealed that despite regional differences in hue-tone preference curves across the three cities, high preference for white demonstrates cross-cultural consistency, with dual-scaling analysis successfully converting this phenomenon into a comparable statistical coordinate system [
24]; on the other hand, it embodies region-specific cultural expressions. Lenclos’s pioneering “color geography” theory, through systematic field investigations, constructed a theoretical coupling model of “color genealogy = cultural background × climatic conditions × material properties,” which has been validated across different climatic zones: white-blue color combinations in Mediterranean regions counter intense solar radiation environments, ochre color systems in temperate towns respond to fired material characteristics, while high-saturation accent colors in oceanic climates compensate for visual needs in low-illumination environments [
26]. This empirical framework established the concept of “anchoring colors,” whereby a few dominant colors maintain visual continuity in neighborhoods while low-proportion decorative colors satisfy contemporary expressive needs.
Advances in computer-vision techniques have begun to turn traditional façade color rules into parametric analysis workflows. Façade semantic segmentation can now automatically identify components such as windows, doors, string courses and wall fields, extract dominant and accent colors together with their area proportions, and encode these patterns using color-difference metrics (e.g., Δ
E00 in CIELAB space). Three-layer parsing frameworks that distinguish component bands, inter-storey moldings and background wall surfaces, combined with template-matching methods for historic districts, have been shown to robustly recover façade topology and support precise links between color parameters and component semantics [
44,
45,
46,
47,
48].
In parallel, studies based on street-view imagery and crowdsourced perception have modeled the coupling between façade color structure and public evaluations. These analyses indicate that façade color similarity and visual balance are positively associated with ratings of safety and prosperity [
28,
29], whereas excessive chromatic diversity reduces the perceived sense of order. At the same time, overly uniform palettes do not yield continuous gains in perceived quality; the most favorable responses tend to occur when a stable dominant palette is combined with a limited set of accent colors [
49,
50,
51,
52,
53,
54].
Table 3 systematically organizes key research evidence in the field of cultural identity and place expression, demonstrating the methodological evolutionary trajectory from traditional field surveys to computer vision analysis. This table encompasses six representative studies spanning from 1994 to 2017, with research scope expanding from single-city cross-cultural comparisons to global big data analysis. From a methodological perspective, early research primarily relied on questionnaire surveys and field observations to establish theoretical frameworks, while recent studies extensively adopt emerging technologies such as deep learning, crowdsourcing perception, and large-scale image analysis. In terms of research outcomes, these studies collectively established the fundamental understanding of “coexistence of cultural universality and regional specificity,” providing a solid theoretical foundation and technical pathway for subsequent parametric design and intelligent governance.
Based on the evidence chain demonstrated in
Table 3, the design and governance of “cultural expression—place identity” can be operationalized through three interlocking steps. The anchoring phase integrates historical imagery, documentary records, and field sampling data, determining the site’s primary color domains and their allowable lightness/saturation variation ranges according to Saito’s cross-cultural preference model and Lenclos’s geographical atlas framework, employing CIELAB coordinate systems and Δ
E00 thresholds to quantitatively define “color identity” boundaries, distinguishing this approach from traditional perceptual descriptions. The calibration phase applies validated three-layer facade parsing and regular structure template methods to implement component-level semantic segmentation and color clustering of existing streetscapes, calculating area ratios of dominant/auxiliary/accent colors, establishing verifiable compliance boundaries for historic districts, and setting color–material coupling technical constraints in combination with material craft characteristics. The validation phase utilizes street view crowdsourcing data and image learning perception models to conduct external validity assessments of candidate color schemes, quantifying trends in safety, orderliness, and visual quality. In specific scenarios, small-scale pilots can be conducted to verify nighttime color rendering and light pollution boundaries, employing interrupted time series or difference-in-differences methods to identify net effects. Through this closed-loop mechanism, anchoring colors are transformed into “public contracts” that possess both cultural semantics and engineering executability, maintaining neighborhood memory and recognizability while providing measurable flexibility for functional updates. These anchoring–calibration–validation steps map to the OSMV chain and produce audit-ready acceptance bands (Δ
E00 and area ratios) for design review and post-occupancy checks.
Addressing “traditional–contemporary” tensions, this framework provides negotiable mediating pathways. To avoid cultural tokenism and selection bias, pilots should include local stakeholders and diverse vision profiles (aging and CVD), and report night-time rendering and glare audits alongside daytime checks. Based on the material-climate-culture triangular model, when cities need to increase light color proportions for energy efficiency and thermal environment objectives, this can be achieved through “fine-tuning” strategies of lightness enhancement and saturation reduction within established color domains, thereby satisfying albedo and glare engineering constraints while maintaining semantic continuity. Conversely, when local culture has rigid demands for high-saturation symbolic colors, these can be limited to component or temporary levels, with visual impact constrained through area limits and street-facing proportions, while statistically maintaining the overall “stable dominant colors + moderate diversity” structure. As demonstrated in Seresinhe et al.’s national study shown in
Table 1, crowdsourcing perception and deep learning research indicate that consistent dominant color tones + bilateral similarity + appropriate accent colors can maximize the coexistence interval of “orderliness and recognizability,” while extreme uniformity or extreme mixing lack sustained statistical advantages [
28,
29]. Therefore, the “authenticity” of cultural expression is not a static replication of historical color palettes, but rather a dynamic calibration process between local statistical characteristics and public perception.
Illustrative example.
Figure 2b illustrates a historic commercial street where a restricted dominant palette, controlled accent colors and Δ
E00 corridors are applied to façades, shopfronts and street furniture to maintain a recognizable place identity while avoiding both monotonous uniformity and visually noisy patchworks.
6. Sustainability and Environmental Adaptation
This section reframes sustainability and environmental adaptation as two sets of objectives that can be verified in practice: (i) daytime thermal and visual performance, and (ii) night-time visibility, health/ecology and light-spill control. In line with the OSMV workflow, we state objectives, select layered strategies, bind them to instrumentable metrics, and indicate how they are validated in pilots and field operations.
6.1. From Buildings to Cities: Multi-Scale Evidence of Albedo, Energy Consumption and Heat Islands
Urban public space color within the sustainability framework not only affects visual aesthetics and readability but also exerts systematic impacts on building energy consumption, urban heat island intensity, and nighttime ecosystems through surface solar reflectance and spectral characteristics. Research indicates that rooftops and paved surfaces account for approximately 60% of urban exposed surfaces (rooftops ~20–25%, pavements ~40%), making “roof brightening + pavement brightening” through enhanced surface albedo one of the highest unit-area benefits, rapidly deployable passive cooling strategies [
20]. At the building scale, field measurements and simulation evidence demonstrate that increasing roof albedo can substantially reduce building air conditioning loads: elevating roof albedo from typical values of 0.08 to 0.3–0.5 can reduce air conditioning cooling energy consumption by approximately 20–90%, decrease peak cooling demand by about 11–27%, and significantly lower roof surface temperatures [
19]. These energy reduction and cooling effects are most pronounced in hot, high-insolation regions. Urban-scale research further confirms the collective effects of high-albedo strategies: broadly increasing average urban roof and pavement albedo by approximately 0.1 (through promoting high-reflective materials, increasing roof and pavement albedo by about 0.25 and 0.15, respectively) results in significant summer reductions in near-surface urban air temperature, with corresponding decreases in total air conditioning demand and costs. Simultaneously, due to reduced solar radiation absorption by the surface-troposphere system, negative radiative forcing effects are produced, equivalent to offsetting billions of tons of CO
2 greenhouse gas emissions [
20]. This “urban brightening” pathway provides dual benefits of climate change mitigation and extreme heat adaptation: locally alleviating urban heat stress, reducing peak air conditioning loads and improving air quality, while macroscopically providing some offset to global warming.
This strategy is technically and economically feasible, primarily involving changes in color coatings and material selection rather than expensive system upgrades (such as adopting cool roofs/cool pavements with high-reflective coatings, light-colored exterior wall coatings, or high-reflective roofing membranes). It should be noted that the specific benefits of increased albedo vary with climate zones, building envelope characteristics, roof construction, and shading conditions; design should incorporate local typical meteorological year data for energy consumption simulation and sensitivity analysis. The Phoenix “cool pavement” pilot study provides empirical support: roads coated with high-reflective coatings showed average daytime surface temperature reductions of approximately 6 °C compared to aged black asphalt, with limited impact on near-surface air temperature [
59]. Subsequent monitoring revealed that new-generation high-reflective coatings maintained stable road surface cooling effects across multiple communities, but coating albedo declined with dust accumulation and wear, with absolute albedo values dropping an average of 5–12% within seven months, necessitating regular cleaning and recoating schedules to maintain performance [
32]. Synthesizing multi-scale building and urban evidence, an actionable heat mitigation strategy emerges: high-temperature, high-insolation cities should prioritize roof brightening, while neighborhood renewal should adopt high-reflective pavements combined with tree shade coverage and ventilation corridors to synergistically reduce pedestrian mean radiant temperature (MRT) and thermal stress during extreme heat periods.
Daytime performance is characterized by surface reflectance/albedo (for radiant load and legibility background), disability/discomfort glare checks at critical viewpoints, and luminance contrast for safety/wayfinding tasks placed in sunlit contexts. These variables are recorded with calibrated instruments and tied to acceptance bands defined at project start; baseline colorimetry plates are archived to monitor drift (ΔE00) due to soiling and UV exposure. Validation proceeds from laboratory/VR screening of contrast and glare scenes to site mock-ups tested under worst-case overcast and clear-sky conditions, before city-scale deployment. Because the magnitude of these cooling and energy benefits depends strongly on local climate, sky conditions and urban morphology, the suggested albedo ranges are framed as climate-sensitive starting bands that should be verified by local simulation or monitoring before large-scale deployment.
6.2. Nighttime Light Color, Color Rendering and Ecology: From “Being Visible” to “Being Seen Correctly”
Night-time sustainability in public spaces pursues three concurrent objectives: adequate visibility for safety and recognition, ecological and human-health protection, and the prevention of sky glow and light trespass. Consistent with the OSMV workflow, strategies couple finishes with optics rather than treating lighting in isolation. We therefore specify full cut-off or well-shielded luminaires, task-focused distributions, and finishes whose night appearance preserves figure–ground contrast without excessive luminance. These strategies are audited with instrumentable metrics rather than narrative principles.
Metrics and acceptance bands are defined per task: (i) horizontal and vertical illuminance at pedestrian and sign planes; (ii) correlated color temperature and color quality (TM-30 Rf/Rg) appropriate to recognition without over-cool appearance; (iii) glare control at observer positions using UGR or TI; and (iv) property-line spill limits to protect neighbors and ecology. For legibility tasks, luminance contrast (Weber or Michelson) remains primary, with hue assisting conditionally; baseline colorimetry plates are archived to track ΔE00 drift of finishes. Worst-case test points are pre-declared and sampled during pilots and at handover so that results are auditable and comparable across sites.
Validation escalates with risk and cost. Tier 1 screens candidate palettes and optics in laboratory/VR tasks (search, recognition) with preregistered protocols and effect sizes; Tier 2 verifies mock-ups on site under worst-case overcast and night conditions using luminance cameras or photometers and accessibility checks (aging-vision and color-vision-deficiency filters); Tier 3 evaluates operational impact with interrupted time-series or difference-in-differences designs and links outcomes with maintenance logs. Decision rules are tied to acceptance bands, including inspection intervals and retouch triggers when ΔE00 or luminance-contrast drift exceeds thresholds. This layered approach keeps the legibility/safety layer conspicuous while allowing placemaking palettes to vary without eroding readability.
Sustainable management of urban environmental color also encompasses optimization of nighttime artificial light environments: unlike daytime reliance on high albedo for thermal environment improvement, nighttime primarily involves multi-objective trade-offs between lighting correlated color temperature (CCT) and blue light content, luminaire shielding and light control (full cutoff), and light source color rendering quality. The American Medical Association (AMA) 2016 policy report recommends that outdoor road lighting should prioritize warm color temperature light sources ≤ 3000 K, and through good shielding design and nighttime time-division dimming, reduce glare impacts on driving safety and human circadian rhythm disruption; under the premise of meeting visual task requirements, minimize exposure to cool white light with high blue light content [
60]. The International Commission on Illumination (CIE) 2019 position statement clarifies that “blue light hazard” (BLH) should be limited to describing photochemical damage risk to the retina from high-intensity light exposure [
61]. Under normal lighting use conditions, general LED white light sources do not cause substantial harm to human retina, but when exposure levels approach safety thresholds for extended periods, vigilance is still required along with adherence to good lighting design principles [
34].
In practice, nighttime lighting optimization does not simply involve comprehensively reducing color temperature or brightness, but rather comprehensively evaluating human visual needs, health impacts, and ecological environment within the same framework. Specific strategies include: adopting lower correlated color temperature light sources (typically ≤ 3000 K) to reduce blue light proportion in the spectrum, selecting full cutoff luminaires to prevent light pollution spillage, providing appropriate illumination levels meeting visual task requirements combined with graduated dimming during late night periods, while utilizing advanced color rendering evaluation methods such as ANSI/IES TM-30 to ensure good visibility and accurate color presentation of key targets including road surfaces, traffic signs, and pedestrian skin tones [
35]. Recent industry standards tend toward quantified implementation of these principles, such as the North American DLC consortium’s LUNA technical specification emphasizing “low light pollution and high visibility” indicators for outdoor lighting products, strictly limiting upward light and glare while considering impacts on insects and other wildlife, aiming to achieve “visible, accurate, low-impact” lighting effects. Compliance is checked at declared test points for horizontal and vertical illuminance, CCT and TM-30 Rf/Rg, glare at observer positions (UGR/TI), property-line spill, and figure–ground contrast; results are recorded at handover for life-cycle drift monitoring.
Truly sustainable “urban nightscape” does not involve unilaterally reducing brightness or suppressing color temperature, but rather achieving the lighting objective of “seeing what should be seen at the minimum necessary brightness, at the right time and place” through scientific light distribution design, light shielding and control, intelligent dimming, and color rendering quality management. From daytime high-albedo cooling to nighttime low color temperature, strict light shielding, and high color rendering quality control, a complete urban color-light environment strategy can achieve multi-objective optimization balance among energy conservation, thermal comfort, human health, and ecological protection.
Aging and maintenance are explicitly integrated. Pollution and abrasion shift reflectance and chroma, which can degrade luminance contrast and legibility over time. Projects, therefore, specify inspection intervals and retouch triggers tied to contrast and ΔE00 drift, and link field performance with maintenance logs to close the life-cycle loop.
Takeaway. Sustainability and adaptation are operational only when objectives, strategies, and metrics are stated in verifiable terms. Daytime outcomes are driven by reflectance/albedo and glare management; night-time outcomes by illuminance, CCT and color quality, glare control, and spill limits. These domains are audited with the same OSMV chain used elsewhere in the paper and fed back into acceptance bands and maintenance plans.
Compliance is checked at declared test points for horizontal/vertical illuminance, CCT, TM-30 Rf/Rg, glare (UGR/TI), property-line spill, and figure–ground contrast; results are recorded at handover for life-cycle drift monitoring.
Synthesizing existing evidence, legibility is predominantly governed by luminance contrast rather than hue; for signage and wayfinding tasks, conservative ranges of ΔE00 ≈ 18–25 and Michelson/Weber ≥ 0.4–0.6 are recommended. Regarding the thermal environment, an albedo increase of approx. 0.10–0.20 typically yields significant surface cooling and energy consumption benefits. These conclusions show high consistency across studies and across a range of cultural and climatic contexts, but hue effects are highly sensitive to task semantics, lighting regimes, and background complexity; therefore, local calibration within these thresholds is necessary. The main limitations involve differences in baseline control and ΔE00/contrast drift caused by material aging; to address this, re-measurement and recalibration will be conducted on a semi-annual to annual basis during the operational phase.
Figure 2d depicts a public square that combines high-albedo paving, shaded seating zones, vegetated edges and layered warm-CCT lighting. The vignette visualizes how the proposed daytime albedo and nighttime lighting bands can be implemented to reduce heat stress, maintain visual comfort and limit ecological disturbance.
8. Conclusions
This review set out to address a practical gap in the literature on color in urban public spaces. Although many studies have examined color preferences, perception and environmental performance, designers and regulators still lack integrated, audit-ready guidance that links color decisions to verifiable outcomes in real outdoor settings. By treating color as a measurable system variable rather than a purely aesthetic attribute, the paper consolidates scattered evidence into a coherent basis for performance-based design and governance.
The first contribution is a cross-domain synthesis of how color operates in four key application areas: psychological and physiological effects, cultural expression and place identity, functional zoning and wayfinding, and sustainability and environmental adaptation. Across these domains, the review distinguishes robust patterns—such as the central role of luminance and saturation in shaping affect, attention and legibility—from findings that are strongly conditioned by context, baseline choices and cultural meaning. This organization helps clarify which results are transferable across projects and which require local calibration.
The second contribution is the formulation of the Objective–Strategy–Metric–Validation (OSMV) framework, which connects high-level design intent to concrete implementation and evaluation steps. Objectives articulate the performance goals and user groups for a project. Strategies translate these goals into coordinated palette, contrast and material–lighting decisions. Metrics provide a compact register of measurable levers that can be monitored over the life cycle of a space. Validation links laboratory and VR experiments, site pilots and operational monitoring into a tiered pathway for evidence building. Together, these elements provide a common language for planners, designers, engineers and regulators to negotiate and audit color-related decisions.
A third contribution lies in the translation of research findings into parameterised prescriptions that can be applied without relying on copyrighted images. Instead of proposing a single prescriptive standard, the review offers indicative bands and relationships that can be adapted to different climatic, cultural and regulatory contexts while maintaining equity for users with color-vision deficiency or low vision. This approach is intended to support both everyday design practice and the development of more explicit color-related clauses in guidelines, standards and audits.
Future work should extend and stress-test this framework. Methodologically, there is a need for more studies that link controlled experiments, immersive simulations and longitudinal field observations using common metrics, including under-represented regions and user groups. Substantively, promising directions include integrating color decisions more tightly with health and well-being agendas, biodiversity and dark-sky protection, and data-driven planning tools such as street-view analytics and digital twins. Advancing along these lines would allow color in urban public spaces to be managed as an ongoing performance variable, supporting comfort, safety, identity and environmental adaptation in a transparent and accountable way.