1. Introduction
As cities get denser and larger, the urban atmosphere changes significantly. The wind flow in urban areas plays an important role in determining the urban heat island intensity, pollution dispersion characteristics, ventilation characteristics, pedestrian wind and thermal comfort. It is also used in building energy consumption studies and mesoscale numerical weather prediction (NWP) models. More specifically, in NWP models, the surface layer modeling schemes require characterized momentum and scalar fields within the roughness sublayer. The wind flow in the streets near the buildings highly influence the convective heat flux at the walls [
1] and plays an important role in the determination of convective heat transfer coefficient in urban canopy models (UCM). Therefore, the knowledge of horizontally averaged wind flow profiles within the roughness sublayer are essential for numerous applications. Despite the immense need, currently available analytical models lack the flexibility and accuracy for realistic and diverse urban neighborhoods. In this study, we propose a machine learning approach and demonstrate its superiority in comparison to the existing methods.
Several studies in the past have tried to determine the mean wind profiles in an urban or vegetative environment based on wind-tunnel experiments, theoretical formulations or CFD simulations. A mathematical model for airflow in a vegetation canopy was developed by Cionco [
2] and an exponential wind profile of the form
was suggested. The attenuation coefficient
a could vary between 0.3 and 3 depending on the nature of the vegetative canopy [
3]. Within homogeneous plant canopies, the dominant eddies through the depth of a vegetation canopy are produced from the mixing-layer instability of the shear layer at the top of the canopy [
4]. This mixing is observed to lead to an exponential mean wind profile in vegetation canopies according to Finnigan [
5] and in canopies of cubical urbanlike roughness elements according to Macdonald [
6]. Macdonald modified the simple model for vegetation canopy to 3D arrays of obstacles with a constant mixing length assumption using empirical methods and proposed the use of the same exponential profile inside sparse urban canopies. The averaging was performed on a few points assumed to represent the entire space. The exponential coefficient
a was estimated to have a linear relationship with the frontal aspect ratio
of the cubes (
). There were two main limitations to this model. First, there was an upper limit on the validity of the model (
) where skimming flow occurred. Second, it was only valid for neutral atmospheric scenarios. Similarly, Coceal and Belcher [
7] developed a model for spatially averaged mean winds within and above urban areas assuming neutral atmospheric conditions using a simple mixing-length scheme to represent the dynamical effects of urban areas. The attenuation coefficient of the exponential function was then defined in terms of the mixing-length and the canopy morphology parameters. On the other hand, Harman and Finnigan [
8] provided analytical functions of spatially averaged mean RSL wind profiles that were analogical to the standard Monin–Obhukhov similarity theory (MOST) for a wide range of atmospheric stabilities with the help of a mixing-layer flow analogy at the canopy top. An exponential equation of the form
was suggested for in-canopy wind flow profiles where
is the atmospheric stability parameter as mentioned in
Table 1 which can be determined iteratively and
is the mixing length which is also dependent on
. Despite being a good starting point for the analytical solutions of stability-dependent wind flow profiles, the empirical relations and stability function calculations that were used in the
solution procedure made its application limited. However, Kono et al. [
9] performed a series of large-eddy simulations (LES) on cube canopies with different plan area densities
ranging from 0.05 to 0.33 in neutral conditions and argued that the exponential approximation of the average velocity profile inside the urban canopy was not always valid. The discrepancy between the spatial average of wind velocity and Macdonald’s five-point average was found to be higher for dense canopies. It was also found that the mixing length varied significantly inside the canopy with a height which was contrary to Macdonald’s constant-mixing-length assumption. On the other hand, Yang et al. [
10] found that for a wide range of urban morphology values
from 0.03 to 0.25, an exponential profile was a very good representation of the mean velocity profile within the RSL with a series of LES studies in neutral conditions. It was found that in the upper 70–80% of the RSL, a generic exponential velocity profile with respect to the wall normal distance provided a very good description of the mean velocity. An iterative procedure was proposed to determine the attenuation coefficient with the help of a geometric sheltering model. Castro [
11], through many direct numerical simulation (DNS) and LES studies of staggered and aligned cubic array canopies for a range of
from 0.04 to 0.25 in neutral conditions, was not able to reasonably fit an exponential curve to the horizontally averaged mean velocity profiles. It was also found that the mixing length was not constant with height, and the author concluded that the urban canopy profiles might not be exponential in nature over a significant height range. Theeuwes et al. [
12] evaluated the performance of MOST based models with roughness correction for modeling the momentum and scalar exchange within the urban RSL using measurement towers’ data from the Basel UrBan Boundary Layer Experiment (BUBBLE). The horizontally averaged velocity and temperature profiles of these models were compared and analyzed for various stability conditions occurring in a real urban environment. Inside the urban canopy, it was observed that the Harman and Finnigan parameterization performed better than the Macdonald’s model when compared against the measurement data. Awol et al. [
13] recently developed a new 1D analytical model for in-canopy wind flow in neutral atmospheric conditions using a different drag parameterization relating mean velocity and turbulent stress from first principles assuming a horizontal homogeneity and vertically uniform canopy. The model’s wind profile resulted in a combination of logarithmic and exponential functions making it applicable for various packing densities. At very low
, the wind profile tended to be logarithmic and at very high
, it tended to be exponential in nature. These legacy models are advantageous in a sense that they are simple in nature and usually easy to implement. However, in practice, their accuracy and flexibility are limited due to underlying assumptions regarding atmospheric stability and the exponential nature of the flow profile.
There are many parameters that influence the horizontally averaged mean flow profiles in an urban canopy such as the urban morphology, the urban layout, external atmospheric scenarios, the presence of trees and cars, etc. Many previous urban wind flow studies through CFD simulations point out the most significant parameters. However, only a few of them have used nonisothermal CFD simulations to incorporate temperature boundary condition like in a realistic atmosphere in order to show the effect of surface heating and buoyancy on the horizontally averaged mean wind speed. Kim and Baik [
14] investigated flow patterns in an urban street canyon between two buildings with aspect ratios ranging from 0.6 to 3.6 for every 0.2 interval and nine different thermal forcing scenarios. Later, Kim and Baik [
15] studied the effects of ground and roof heating in a 3D urban canyon consisting of a 2 × 3 array of cuboid buildings (9.56 m tall) with periodic boundary conditions at the lateral boundaries. Both studies led to a conclusion that the 3D vortex structure varied significantly for different heating scenarios. Dimitrova et al. [
16] performed 3D steady-state RANS simulations for a building configuration consisting of two buildings with
using
k-
turbulence models with Boussinesq’s approximation and validated their model against wind-tunnel measurements. They concluded that the solar heating of building walls influenced the flow behavior significantly compared to the isothermal case. Santiago et al. [
17] studied the effect of buoyant forcing scenarios in the resulting flow fields in a street cavity formed by two buildings with
. Steady-state RANS simulations were performed with a standard
k-
turbulence model along with a radiation model to obtain temperatures at different walls in order to capture the heating scenarios at various solar positions. For realistic wall-heating scenarios, the flow field showed variation from the neutral case and the highest variation from neutral case was observed when the sun was directly above the walls and ground when the radiation effects were maximum. It was therefore concluded that the horizontally averaged mean flow properties were more dependent on the ratio of buoyant-to-inertial forces and less dependent on the solar position. These studies showed the significance of buoyant effects on the urban wind flow and thereby led to the inclusion of different surface-heating scenarios in the present parametric study.
Another important parameter which also plays a major role in mean wind profile estimation is the canyon aspect ratio
, where
H is the building height and
W is the canyon width. Few studies have investigated the effect of the canyon aspect ratio on the mean wind speed inside the canyon. Kovar-Panskus et al. [
18] performed 2D steady-state CFD simulations on a single street cavity formed by two buildings using a standard
k-
turbulence model and observed that the flow regimes, vortex structures and vortex center positions varied for different canyon aspect ratios. Xie et al. [
19] studied wind flow and pollutant transfer for the same 2D canyon for different wall-heating scenarios and a series of aspect ratios (between 0.1 and 2) covering various flow regimes. It was concluded that the mean flow behavior inside the canyon was highly dependent on the aspect ratio and stability condition. Memon et al. [
20] simulated diurnal heating scenarios using a 2D RANS simulation with a renormalization group (RNG)
k-
turbulence model on a row of building cavities. A linear regression model was used to obtain the relationship between air temperature and aspect ratio. To complement these studies, the canyon aspect ratio was also included in the present parametric study.
Even urban layouts are important as suggested by Allegrini et al. [
21]. In their study, the influence of urban morphology on the urban microclimate was investigated using 3D steady-state RANS simulations with Boussinesq’s approximation and standard
k-
turbulence model. These simulations were performed for six different urban morphologies to mainly study the building surface temperatures and outdoor air temperatures. Buildings of dimensions H = 10 m, B = 10 m and L = 10 to 70 m were simulated for two reference wind speeds at a 10 m height (1 m/s and 5.5 m/s) and using a building energy simulation (BES) model to determine the surface temperatures of building walls and ground. The temperatures were found to be significantly lower for an aligned configuration compared to a staggered configuration. Moreover, in general, zones with high temperatures often coincided with low winds which could lead to decreased thermal comfort.
Most previous studies have not considered the effect of the ambient wind direction on the mean wind flow. Kim [
22] used 3D CFD simulations with an array of 4 × 4 buildings to study the effect of wind direction on neutral urban flow and dispersion characteristics. Simulations with ten different wind angles ranging from 0
to 45
in intervals of 5
were performed to investigate the vortex structure variations with respect to the wind angle. The flow patterns were classified into three regimes based on incoming air flow direction and it was concluded that the ambient wind angle significantly affected the mean flow behavior inside an urban canyon. Allegrini et al. [
21] also suggested to perform more simulations with oblique flow to study the urban microclimate in detail. While all other studies considered wind angles that were perpendicular to the buildings (0
) and ignored oblique ones, the present parametric study included the investigation of oblique winds on mean wind profiles.
Some studies have also used LES or vortex dynamics model in order to improve the accuracy of nonisothermal simulation results. Wang and Cui [
23] simulated the Uehara’s wind-tunnel experiment conditions using an LES model with an idealized building array geometry mainly to compare the results against the previous RANS simulation studies and suggest improvements. The model geometry consisted of a 10 × 6 array of regular cubes similar to the wind-tunnel model described in Uehara et al. [
24] with periodic boundary conditions at the lateral sides to simulate an extension of building rows in the lateral direction. It was concluded that the momentum effects due to buoyancy were overestimated by RANS simulations and therefore LES could be a reasonable choice for accurate simulations of unsteady flows. Chen et al. [
25], on the other hand, compared LES and RANS models for nonisothermal applications and argued that the horizontally averaged flow properties were similar. The urban air flow subjected to diurnal variations of atmospheric stability located at five different climatic zones in China was studied for an idealized building array geometry of 6 × 6 buildings with
,
m,
m and
. Steady-state RANS simulations with a standard
k-
model, standard wall functions and Boussinesq’s approximation were used to obtain the mean flow in stratified atmospheric conditions with good accuracy. An alternate approach based on vortex dynamics for neutral atmospheric simulations was given by Furtak-Cole and Ngan [
26] which are computationally much cheaper than RANS or LES simulations. As an extension to that study, Wang et al. [
27] also used the same approach to evaluate the model’s capability to incorporate geometric and thermal effects. Few canyon types and stratification scenarios were simulated, and reasonably accurate mean profiles were obtained. Although faster than conventional CFD models, their method was based on the numerical resolution of the Poisson partial differential equation and used a very simple canyon configuration and flow properties.
In the last decade, the use of artificial intelligence and machine learning techniques in wind-engineering applications have greatly increased. The study by García-Gutiérrez et al. [
28] used an ANN trained with lidar (light detection and ranging) wind profile measurements at low levels (30 m) to estimate the vertical atmospheric boundary layer (ABL) wind profile. It was observed that the ANN model prediction results were better than simple traditional analytical models such as log-law and power law. In a similar study by Türkan et al. [
29], machine learning models trained with annual wind-speed measurement data in Kutahya (Turkey) at 10 m were used to predict wind speeds at 30 m. Seven machine learning models were compared and the results showed that the support vector machine (SVM) and multilayer perceptron (MLP) were superior for wind-speed predictions compared to the rest. On the other hand, some studies have successfully used CFD simulation data rather than measurement data for training the models. Ding and Lam [
30] successfully developed machine learning models such as gradient boosted nonlinear and multivariate linear regression models that were trained using isothermal CFD simulation data to determine cross-ventilation potential in an urban environment. These studies exhibited the potential of machine learning algorithms and the ability of surrogate models to replace CFD simulations in wind-engineering studies.
The current study aims to develop and validate a comprehensive machine learning scheme to estimate the horizontally averaged velocity and temperature profiles in an idealized urban canopy for a wide range of urban morphologies and thermal forcing scenarios. Multiple steady-state RANS simulations with a standard k- model, standard wall functions and Boussinesq’s approximation for various atmospheric and geometric scenarios were performed. The simulation results were used for training machine learning models which were shown to have great potential to replace the legacy empirical and semiempirical models. Our main focus was to build a state-of-the-art surrogate model that provided an accurate solution for mean profiles inside the urban canopy. The novelty in this research lies in the development of a model that takes into account the combined effect of the most important parameters affecting the average wind flow for specific regions in the urban canopy.
3. Validation and Simulation Setup
Many wind-tunnel studies have measured wind flow over an urban-like environment considering nonisothermal atmospheric conditions in the past [
24,
42,
43]. In this study, Uehara’s wind-tunnel data were used for the validation since the simulation geometry and conditions were similar. For nonisothermal flows, the flow similarity was established by Reynolds number independence (
) and Richardson number similarity criteria according to Snyder [
44]. The bulk Richardson number (
) formulation was similar to that in Kim and Baik [
14], as shown here:
where
g is the gravitational constant, and
is the reference height. The reference height is the height at which the known values of flow quantities are present.
is the potential temperature of the air at the reference height,
is the potential temperature of the surface, and
is the mean potential temperature within street canyon. The simulation geometry used for the validation consisted of a regular array of 10 × 6 cubes similar to Uehara’s wind-tunnel model scaled by 100:1. Building dimensions were
and the street width was
such that the canyon aspect ratio was
. The atmospheric stability scenario for
was simulated, which corresponded to
,
,
,
,
and
. Since styrofoam was used to represent cubic elements in the wind tunnel, a zero gradient temperature boundary condition was applied at the buildings walls and roof in the CFD simulation. This was in accordance with Wang and Cui [
23], who used LES to compare the results with Uehara’s wind-tunnel measurements. The thermal instability was generated with the help of a fixed temperature
at the ground. The air temperature
was fixed at the top boundary. The inlet temperature was also maintained at the reference air temperature. For the simulations, the Prandtl number
, the turbulent Prandtl number
, the kinematic viscosity
and the volumetric thermal expansion coefficient of air
were used. Relaxation factors can determine the stability and the rate of convergence of a simulation. They artificially introduce a diagonal dominance to the matrices and introduce a stability to the solution of coupled equations. Hence, they were adjusted to find the best combinations and the optimal values were found to be 0.7 for pressure (
) and 0.3, 0.5, 0.7 and 0.7 for
U,
T,
k and
, respectively. The effect of wall functions, turbulence closure schemes and layer addition near walls were investigated and the best simulation configuration suitable for multiple simulations was obtained, which is explained in detail in [
45]. Moreover, Xie et al. [
46] performed 2D steady-state RANS simulations for a finite row of nine identical street canyons with standard, RNG and realizable
k-
turbulence models to compare the performance against Uehara’s wind-tunnel results. It was concluded that all three turbulence models performed almost the same for unstable atmospheric simulations. Therefore, a standard
k-
model with standard wall functions was chosen for the final simulations in this study.
The simulation results plotted against Uehara’s wind-tunnel measurements are shown in
Figure 2. The RMSE for the velocity was 0.104 m/s and
was 0.935. For the temperature, the RMSE was 0.185
and
was 0.622. The quantitative results showed that the velocity was modeled reasonably well using simulations but the temperature modeling was not very close to the measurements. This could be due to three reasons. First, the validation simulation was performed in real scale by geometric scaling considering dynamic similarity conditions to ensure its applicability in realistic urban conditions. As a result, the temperature difference between walls and the atmosphere reduced to 9 K in real scale from 40 K in wind-tunnel scale. Second, the trade-off between computational cost and accuracy restricted the use of a very fine mesh for such a large domain where 10 rows of roughness elements were present. Third, the discrepancy could also be explained to some extent by the adiabatic wall constraint of the model, which may be slightly different from the actual wind-tunnel conditions. Moreover, to increase the ease of applicability of wind angles to the simulations, periodic boundary conditions were used. Steady-state results were verified for a 4 × 4 building geometry for wind angles up to 45° thereby enabling the use of periodic boundary conditions at the lateral walls, which is also mentioned in detail in [
45].
4. Case Study of an Idealized City Model
Real urban canopies are complex due to random building heights and due to the presence of obstacles such as cars, trees, etc. In every locality, the configuration of urban elements differ.
Therefore, an idealized city model consisting of a regular array of 4 × 4 cubes as shown in
Figure 3 was chosen for the parametric study. The choice of simulation geometry is a trade-off between model complexity and detailed averaging region. The model should not be too complex and at the same time, a good region for averaging the flow quantities is required. A 4 × 4 cubic arrangement of buildings also benefited us with a central crossroad area between four corners of the inner-most buildings for this study. Allegrini et al. [
1] analyzed the velocity and turbulent kinetic energies inside a scaled model of an urban canyon to study the influence of buoyancy due to heated surfaces on the flow field using wind-tunnel measurements and observed distinct changes in the flow fields for different heating scenarios (isothermal, all surfaces heated, windward wall heated, leeward wall heated and ground heated). The maximum turbulent kinetic energy was found with the all-surface-heating scenario. Santiago et al. [
17] concluded that the horizontally averaged mean flow properties were less dependent on the solar position (i.e., differential-wall-heating scenarios). Therefore, in this study, all-surface heating was adopted to create the atmospheric stratification. The choice of mesh for this simulation affected the accuracy of the results. Therefore, the mesh independence was verified with the grid convergence index (GCI) procedure as explained by Roache et al. [
47] for three different grid spacing values inside the city boundary (
and
). The GCI results showed an average deviation of 8.33% within the canopy and 0.4% above the canopy for horizontally averaged wind profiles. The mesh independent study is explained in detail in [
45]. As a result, a
grid spacing was chosen as an optimal trade-off between accuracy and simulation time. The highest
values among all the cases in this parametric study were also verified to be within 10,000. At the building walls, the
value was 6412 and at the ground, the
value was 5395.
4.1. Basel Annual Climate Data
The atmospheric flow variables are directly affected by microclimatic conditions and diurnal temperature variations. There exists a range of stability conditions within a day and within a year in an urban neighborhood. As a part of BUBBLE conducted in Basel (Switzerland), the urban boundary layer was investigated in detail by Rotach et al. [
48]. One of the experimental sites which was located at Basel Sperrstrasse (city center) was used in that study. A tall tower equipped with anemometers and thermometers at six height levels was raised at the site to measure wind velocities, wind direction and temperatures.
That experimental site consisted of an urban canyon similar to the geometry of interest. The location at which measurements were taken went up to the top of the tower (around 30 m) which was within the inertial sublayer (ISL). The raw data consisted of measurements for every 10 min for which hourly averaging was performed. The period of analysis was chosen as January 2002 to mid July 2002. This period had a good quality data and was spread among different seasons. The velocity magnitude at a height of 31.7 m was used as the reference velocity. The temperature at a height of 31.2 m was used as the reference temperature. However, surface temperature values were not available with BUBBLE data. Hence, the wall surface temperature results were obtained from the model explained in Afshari and Ramirez [
49], which was simulated for Basel conditions.
The atmospheric stability was defined by the bulk Richardson number (
) and it was calculated for the given period based on the flow variables at every hour using Equation (
7). Its frequency distribution is shown in
Figure 4, which indicates that extreme bulk Richardson number events occur rarely in a realistic urban environment. The critical value of
was taken as zero in this study. If
, the atmosphere was considered stable; if
, it was considered unstable; and if
, it was neutral. In this parametric study, we aimed to simulate atmospheric scenarios that fell within a selected range of
so that the simulations were more realistic and probabilistic. However,
could not be directly given as input to the simulation. It can be seen from Equation (
7) that
depends mainly on the reference velocity (
) and the potential temperature difference between the air at the reference height and the wall (
). Therefore, a realistic range of
and
was needed for which BASEL’s Gaussian kernel density estimation (kde), as shown in
Figure 5, was used. The central 90th-percentile data were chosen as an appropriate range and the 5th-percentile data at both extremes were ignored since their occurrence was minimal.
The selected range of the quantities for
and
are shown in
Table 2; some of them fell slightly outside the 90th percentile range. Values chosen within this range were expected to be more realistic and probabilistic since the data were obtained from a real urban environment in Basel.
4.2. Choice of Simulation Parameters
From a study of the literature, it was concluded that the canyon aspect ratio (), stability condition characterized by the bulk Richardson number () and wind angle () were the significant factors that affected the mean wind profile and the effect of these parameters needed to be investigated in the study.
Wind angle (): Due to symmetries present in the simulation geometry, considering wind angles between 0 and 45 was sufficient to simulate all the wind angles. Longitudinal and lateral symmetry enabled the reproduction of any wind angle above 90° by an equivalent wind angle below 90. Further, the diagonal symmetry enabled the reproduction of any wind angle above 45 by an equivalent wind angle below 45. Therefore, the 4 wind angles 0, 15, 30 and 45 were chosen in this study.
Canyon aspect ratio (
): The urban canyon can be generally classified into three categories based on the canyon aspect ratio
. A regular canyon (
1) as shown in
Figure 6a, a narrow canyon (
2) as shown in
Figure 6b and a wide canyon (
0.5) as shown in
Figure 6c.
Santiago and Martilli [
50] used a range of values for
and
(0.11 to 0.44) for their study. Flow regimes were classified for symmetric building configurations based on the canyon aspect ratio (
) such as skimming flow for
= 1 or 2 and wake interference flow for
= 0.5 [
19,
51].
affects
and similarly,
affects
. If
A for a 0
wind angle, then
for wind angle
. The values of
and
for the 12 combinations of wind angles and canyon aspect ratios are shown in
Table 3. It can be clearly seen that this covers a maximum range of
and
. Therefore, the
values of 0.5, 1 and 2 were chosen in this parametric study.
Bulk Richardson number (
): The bulk Richardson number is a function of flow parameters such as
and
. The term
which is the potential temperature difference between the wall and the ambient air at a reference height, is termed as
hereafter. A small variation in
does not affect the bulk Richardson number significantly and therefore it was kept constant (
K) for this study. Nakamura and Oke [
52] measured wind and temperature in a real urban canyon and pointed out that the maximum temperature difference between air and walls lay between 12
C to 14
C. This also served as a starting point for the temperature difference between surface and the ambient air. The wind velocity and temperature difference were chosen in a systematic way from the selected range in
Table 2. Allegrini et al. [
34] simulated a flow involving a heat transfer in a 2D simulation domain with a single street cavity (
) for global Richardson numbers ranging from 0.14 to 13.7 in an unstable regime, which was used as a reference range in this study. Basel’s annual distribution of flow shows that
and
occur in pairs and therefore cannot be chosen independently. Hence, a multivariate probability distribution estimated using a kernel density estimation (kde) as shown in
Figure 7 was used to choose the pair of
and
. These contour levels are the isocurves corresponding to the mass probability. The white region in the plot represents the bottom 5% of the probability mass. The remaining 95% of the probability mass is represented by the colored isolevels. The contours represent the mass probability within 5%, 25%, 50%, 75%, 90% and 100 % of the total mass. Twenty one combinations of
and
were sampled within the selected range in such a way that they were well distributed and covered all kinds of scenarios. This was also limited, keeping the computational cost in mind. It comprised ten unstable atmospheric scenarios, ten stable atmospheric scenarios and one neutral atmospheric scenario. The combinations and the probability of occurrence of these combinations within a year are tabulated in
Table 4 and are also plotted in
Figure 7. A full factorial sampling method was used to generate the design of experiments which resulted in a total of 252 simulations (
).
4.3. Simulation Workflow and Postprocessing Automation
In order to avoid human errors while performing repetitive simulations, an automation approach was used. The whole simulation workflow was automated using the pyfoam utility to make the process simple, efficient and error-free. For each simulation, the flow fields were spatially averaged to obtain a horizontally averaged wind profile along the ground normal direction. Traditionally, OpenFOAM simulation postprocessing is done using Paraview software. However, executing a repetitive postprocessing action for 252 simulations can be exhausting. Hence, a python code was developed to automate the Paraview postprocessing operations in an efficient and error-free manner.
4.4. Spatial Averaging
The spatial averaging methodology and the different regions used for averaging in this study are discussed here. The velocity glyphs for one simulation case with a 45° inlet wind angle are shown in
Figure 8. It can be seen that the flow near the first row of buildings was slightly different from that of the next rows. To avoid boundary effects that could strongly affect the average wind profile, the middle region of the urban layout was primarily used for averaging. The characteristic wind profiles were obtained with respect to the following regions shown in
Figure 9 for this study. The urban in-canopy region was divided into volumes of 1 m from the ground up to a height of 25 m, and the flow fields of the corresponding volumes were averaged. The horizontally averaged velocity was obtained from mean velocity components in the x and y directions given as
.
4.5. Curve Fitting of Horizontally Averaged Profiles
As we recall from the literature review, the exponential nature of the in-canopy wind profile has been suggested by many researchers [
6,
7,
8,
10,
13]. The main disadvantage of most of these models (except Harman and Finnigan’s model) is that they do not take the atmospheric stability into account when calculating the attenuation coefficient (
a). We intended to verify these models with the horizontally averaged wind profiles obtained from CFD simulations in this study. For Awol’s model, an additional parameter (
) was specified which was defined as the aerodynamic roughness length scale in the absence of specific roughness elements. In addition to these models, analytical functions such as exponential, linear polynomial, quadratic polynomial and cubic polynomial functions were used to fit the mean wind profiles using python’s (
) module to determine which analytical function could be used to accurately characterize the nature of mean wind profiles. It was observed that a linear or a quadratic function could not adequately fit the complex velocity data. This was mainly due to the wide range of atmospheric and geometric simulation conditions used in this study. However, third-order polynomials were found to fit the data reasonably well. For one of the cases (case 13) with inputs
and
°, the exponential fit and cubic polynomial fit characterized by Equations (
8) and (
9) along with existing legacy schemes for the crossroad and spanwise regions are shown in
Figure 10.
It should be noted that the mean wind profiles obtained from the CFD simulations were very different for different averaging regions. Legacy schemes lacked the capability to determine wind profiles for these specific regions, which was achieved in this study. The nature of wind profiles in specific regions of an urban canopy was characterized using surrogate models which was also a novelty in this study. The quantitative comparison for all four regions is tabulated in
Table 5. It can be clearly seen that the cubic polynomials fit the flow profiles better than the exponential fit and the analytical models. Therefore, it was concluded that cubic polynomial functions were better approximations for the velocity and temperature profiles compared to exponential functions for specific regions in the urban canopy, a wide range of stability conditions and oblique wind angles. The main idea was to use the polynomial coefficients obtained using curve fitting to train the machine learning model for the given set of inputs. Like velocity, cubic polynomials were also used for temperature.
7. Conclusions
The main focus of this study was to characterize the horizontally averaged wind and temperature profiles within the urban canopy using a data-driven approach. The fundamental idea was to create a surrogate model using machine learning techniques utilizing data from CFD simulation results.
A steady-state RANS model in OpenFOAM was used for the CFD simulations. The model was validated using Uehara’s wind-tunnel data, which helped to select the best simulation configuration. A standard k- model with standard wall functions was used in this case study. Previous literature studies and experiments showed that wind flow inside an urban canopy was mostly affected by the street canyon’s aspect ratio, wind angle and atmospheric stability. Thus, a case study was conducted for various simulation conditions using an idealized simulation geometry that consisted of a regular array of 4 × 4 cubes. The ranges of velocity and temperature input values for the simulation were obtained from Basel’s BUBBLE experiment data. These data provided the realistic atmospheric conditions of a European city for a period of about half a year. Overall, 252 simulations were carried out for different realistic urban atmospheric conditions. The velocity and temperature field data from the simulations were averaged in the horizontal direction to obtain a horizontally averaged wind profile along the vertical direction. CFD simulations and postprocessing operations were automated in order to increase the efficiency of the process.
The horizontally averaged vertical wind and temperature profiles inside the urban canopy were normalized to make them nondimensional. Several studies have suggested that the wind profile can be approximated by an exponential function. Therefore, fitting the simulation data with an exponential curve was tried at first. The exponential fit was not accurate enough. Hence, polynomial fits were tried, and third-order polynomials were found to be optimal. The wind and temperature profiles were fitted using cubic polynomials and the coefficients were saved for each simulation case. Discrete data points were also saved for each simulation case.
Each simulation had its own set of inputs for the reference wind speed, temperature difference between surface and air, wind angle and aspect ratio. Both cubic polynomial and discrete-point profiles were predicted using machine learning algorithms. Outputs were the three polynomial coefficients for the cubic profiles and the simulation output points for the discrete-point profiles. First, a linear regression model was used. The deviation between the actual and the predicted coefficients were around 9.39% for the velocity and 9.49% for the temperature on an average. Then, artificial neural network models were used with model architectures obtained using a hyperparameter optimization method. The deviation came down to 5.70% for the velocity and 8.43% for the temperature on average. It was observed that for the velocity, the ANN performed better than the linear regression. The complex nonlinear relationships due to the nonlinear activation functions in the ANN helped to build a better surrogate model compared to the linear regression. However, for the temperature data set, the linear regression model performance was comparable to that of the ANN. This might be due to the fact that data samples for training were not enough for the temperature. If more simulations results had been available, a much more accurate model could have been built with the help of the ANN. Then, ANN models trained directly using the simulation output data were used to predict the discrete data points. In this way, the error occurring due to the curve fitting of the profiles was eliminated. The test error for the velocity was 4.75% on average for the four regions and 6.09% for the temperature compared to the actual simulation results. For all the simulation cases, the velocity profiles obtained from the existing models were compared against the ANN models from the current study. It was observed that the ANN models (mainly the discrete-point prediction model) outperformed the legacy schemes. However, the downside was that the profile could be discontinuous, resulting in a nondifferentiable function.
The main conclusions are as follows:
The horizontally averaged wind and temperature profiles inside the canopy are mainly affected by input parameters such as the street canyon’s aspect ratio, wind angle and atmospheric stability.
The exponential approximation of the mean velocity profiles given by the legacy schemes provide reasonable results for large averaging regions (such as midcity regions), are based on simplifying assumptions and are mostly applicable for neutral atmospheric conditions except the Harman and Finnigan model [
8]. However, cubic polynomials are more versatile and flexible to represent wind profiles at specific regions (midcity, crossroad, spanwise and streamwise regions) in the urban canopy for a wide range of atmospheric stability conditions and oblique wind angles.
Machine learning models such as ANN and LR can be used for the prediction of horizontally averaged wind and temperature profiles inside the canopy. An ANN proved to be superior to an LR for this dataset and application.
Cubic polynomial and discrete-point velocity and temperature profiles inside the urban canopy can be predicted for a specified input scenario using surrogate ANN models.
The discrete-point prediction ANN model of this study outperforms the current state-of-the-art models in the prediction of horizontally averaged mean wind profile inside an urban canopy.
The novelty in this research lies in the study of the combined effect of the most important parameters using multiple CFD simulations and the development of surrogate models that cater the need of velocity and temperature profiles requirements for specific regions in an urban canopy.
This study can be extended further to obtain a data-driven model with a better accuracy. LES can be used instead of a RANS model, but the simulation time has to be taken into consideration. The influence of other factors such as the roughness length of the incoming wind profile, custom temperature inlet profile, different urban layouts (staggered arrangements) and variable building heights (instead of idealized cubes) can be included. Above-canopy profiles (within the RSL) can also be predicted in a similar manner. The effect of evapotranspiration due to the presence of vegetation in an urban canopy can be included in the parametric study to derive humidity profiles. Anthropogenic heat emissions from automobiles, indoor space air conditioning and industries, which also impact the urban microclimate, can be included. As part of future work, the models discussed in this study will be validated against real measurement data and an enhanced version shall be made available to the scientific community.