The Design of a Metro Network Using a Genetic Algorithm

Featured Application: A quick method that allows one to obtain a draft of a metro network. Abstract: The process of planning the metro system is very complex. This is due to the large amount of required input data originating from many domains. Additionally, the required extraordinary budget makes the design phase difficult and forces some trade-offs to meet most expectations. Designing a metro system requires often a multi-variant analysis to examine possible future ways of urban agglomeration development. In such circumstances there is a need to apply an efficient method, which allows for obtaining a draft design of a metro system. This paper presents an application of a genetic algorithm for two stages of a metro system design. The first stage determines the layout of metro stations, which optimally meets the transportations demands of the inhabitants. The second stage generates the optimal topology of lines connecting the stations, taking into account the minimization of construction costs and the total travel time of all passengers. The parameters, which control the operation of the optimization procedures, were tuned with the use of the Taguchi method.


Introduction
Metro systems is the best way to improve the quality of public transport in large cities due to their large capacity, rapidity, time reliability, and almost total separation from the urban space [1]. The construction of a metro network entails extraordinary costs; therefore, the design phase is of great importance. Since a metro network should serve the inhabitants living in a given area, their transportation demands should be the main criterion while designing such a system [2]. Obviously, there are always some budget constraints that could significantly limit the range of the investment. However, a feedback relationship also exists-if a new metro station is built in a relatively underdeveloped place, it could stimulate the development of such an area [3][4][5][6]. Furthermore, the cost of such a metro infrastructure location could be significantly lower. In such a situation, there are two approaches when considering the planned location of a new infrastructure: the first oriented towards the current transportation demands and the second oriented toward future land development [7]. After the above-mentioned conceptual choice has been decided, a detailed study on station location can take place. The main issue of this stage is accessibility, which is twofold: • Attraction accessibility, which means the ease of reaching the station using any mode of transport; • Radiation accessibility, which means the ease of reaching other stations from the given station [8].
Accessibility is the main criterion of assessing the degree of satisfaction of passengers' transportation demands [9][10][11]. In general, when considering the optimization of the structure of the The design process is divided into two stages: in the first stage, the optimal station location is determined, and in the second stage the optimal layout of lines connecting the stations is found. Such an approach comes from quite different and independent objectives of both sub-issues. Therefore, the problem can be split since decreasing the number of the degrees of freedom significantly reduces the computational complexity. Additionally, if the task is not split, it might happen that much computational time will be spent on optimizing the lines topology to the as yet not optimal station layout.
The total number of lines and number of stations are assumed in advance. These numbers can be estimated after the review of the structures of some metro networks existing in the world.

Criteria of Station Locations
A station is regarded as an object whose location must assure meeting the transportation demands of inhabitants. It was assumed that the transportation demands are proportional to the number of people living in an area. Therefore, the main input here is a map of population density covering the considered region. In the rough approach, which involves neglecting the street layout and existing possibilities of inhabitants' mobility, the main criterion of passenger satisfaction is the Euclidean distance to the nearest metro station-e.g., circular shape of the catchment areas was assumed [21]. The present work expresses the number of inhabitants being satisfied with the location of i-th metro station as [22,23] where r i -distance to i-th station; ρ-local population density; σ-standard deviation, determining the distance regarded as satisfactory; and A-considered area. The model also takes into account the existence of point generators of transportation demands like stadiums, multi-arenas, railway stations, airports, or big factories. In this case, the number of people related to j-th generator whose transportation demands are satisfied by i-th station can be expressed as where r ij -distance between j-th generator and i-th station, and g j -number of people related to j-th generator. The important issue is to establish the proper value of σ parameter. This depends on the way the passengers reach the station. Schlossberg and Brown considered walking accessibility and adopted the values in range 400-800 m [24]. Brinckerhoff used questionnaire research to determine the acceptable distance to the nearest metro station: about a half of respondents claimed it to be less than a 1500 m, and one fifth of respondents claimed more than 3000 m [25]. The distance up to 1000 is commonly accepted as comfortable [26]. The accessibility of a metro station can be significantly enhanced by the introduction of Park & Ride systems [27]; it concerns especially areas of a low population density.
In the optimization process, the locations of all stations are varied in such a way as to maximize the objective function, which is simply the number of people whose transportation demands are satisfied, as follows: After optimization is complete, the best locations of all stations are known and the number of potentially serviced people is known for each station as well. Let us denote this value for i-th station as s i and then it can be expressed as

Criteria of Lines Layout
A metro line is a series of stations. There are no preliminary assumptions determining the network topology, except for one requirement: the network must be coherent. In other words, it must be possible to travel from any origin station to any destination station. The aim of the optimization is to find such a layout of metro lines that the total cost of the network is minimized and the total travel time of all the passengers is minimized too. The model does not take into account the time needed for transfers; what is more, the travel time is assumed to be dependent only on journey length, neglecting the contribution of transfers entirely. Such an assumption could be partially justified when the frequency of trains is relatively high. The above criteria are contradictory: if the travel time is expected to be minimized, the topology of the network has to be somewhat like a grid. This, in turn, means that some connections are duplicated, which causes the increase of the costs. Generally, some basic patterns of Appl. Sci. 2019, 9,433 4 of 17 metro network topology can be listed: a triangle, a star, and a cartwheel [28], but having in mind the irregular shape of the populated areas and the irregular distribution of the population density in the considered case, no such assumption was made.
The construction costs of a line segment depend on its length; additionally, it is possible to take into account the cost dependency on some local conditions like rivers, lakes, built up areas, or industry zones. For this purpose, a map describing the cost distribution is introduced into the model, and the building cost of i-th segment of j-th line between two stations can be expressed as where k-local unit building cost and L-segment length. The second criterion taken into account is the overall network efficiency. There are many different measures developed to asses it [29][30][31], but from the passengers' point of view the expected travel time is the most important. The total travel time of all potential passengers calculated below is only an estimation, but it reflects the passengers' expectations. In the optimization process, the shortest distance between each pair of stations is calculated (d ij ); it could include transfers, but the transfer time is not taken into account. Then, the total travel time between a pair of origin and destination stations for all their potential passengers can be estimated in the following way: If the total travel time is taken into account, the constructed objective should result in a highly interconnected network topology. It will lead to the construction's cost increase, but the passengers' demands will be satisfied in a higher degree [32]. This criterion will remain valid if a lesser number of passengers is taken into account, as long as the population distribution is the same. Just the proportional factor will change. Of course, considering the explicit costs, the investment will appear less beneficial because the costs per a single passenger will increase.
The applied optimization method requires an increasing objective function; thus, both components of the objective function are expressed as inverts Finally, the objective function is a product of both factors Such formulation of the objective function allows for simultaneous optimization in relation to both criteria without the use of arbitrarily adopted weights.

Optimization Method
The idea of genetic algorithms is based on the evolution processes in the world of living nature [33]. It was first described by Holland [34]. Due to their advantages, genetic algorithms have quickly become an efficient tool in science and technology [35]. The optimization process is based on the following assumptions: • Individuals representing different versions of solutions compete with each other; • The structure of each individual is coded by an entity called a genotype; • A genotype is randomly varied by mutations; • Across-over operation causes random pairs of individuals to partially exchange their genotypes; • The quality of each individual can be evaluated as a single value called fit function; • Fit function determines the probability of transition to the next generation, which corresponds to the selective pressure; • The combination of random mutation and cross-over with aimed selective pressure leads the whole population towards the optimal solution.
The main advantage of the genetic algorithms is they are able to find a solution close to the optimal one investigating only a scant part of the solution's space. Additionally, they are able to avoid getting stuck in the local optimum. A specific cross-over operation gives the generally faster convergence in comparison with other methods based on the solutions space exploration.
The selective pressure means that individuals with a higher value of the fit function are preferred to pass to the next generation. In the presented work, the method of the roulette wheel was used, in which the probability of passing to the next generation is directly proportional to the value of the fit function. Additionally, a few of the best individuals pass unconditionally (elite selection). The size of the elite selection should be relevant to avoid ousting from the population individuals that are poor now but could be improved.
Additionally, the current leader of each generation is compared with the stored general leader. The better of both becomes a new general leader. This stored solution does not take part in the optimization process, but it contains the result.
The described issue involved two optimization problems-optimization of stations locations and optimization of lines layout.

Optimization of Stations Locations
The genotype for the first sub-problem is a simple table containing the following coordinates of stations (x i and y i denote coordinates of i-th station): {(x 1 , y 1 ), . . . , (x i , y i ), . . . (x n , y n )}.
A mutation is a shift of a randomly selected station. There are random numbers added to its coordinates. The distribution of these shifts is normal with the mean equal to zero, and the standard deviation is a program parameter.
The number of the lines is the same for all individuals, and a station can appear at most once in a line (no loops). However, there are no other limitations apart the above: Any station can belong to many lines, so lines can go parallel and can cross each other.
Because of the specific structure of this genotype, the special mutation operators were designed. There are six types of mutations, which allow for completely free changing of a genotype.

1.
Exchange of the position of two randomly selected station for a randomly selected line, as follows: 2.
The order reversion of the randomly selected sub-series of stations for a randomly selected line, as follows: 3.
Exchange of a pair of randomly selected stations between randomly selected lines, as follows:

5.
Removing of a randomly selected station from a randomly selected line, as follows: 6.
Addition of a randomly selected station to a randomly selected line, as follows: The cross-over operation involves an exchange of a random single metro line between a pair of individuals, as follows: After each operation of mutation or cross-over the network structure is checked, special attention is paid to its coherency. If the test fails, the operation is canceled.
The shortest distance between a pair of stations (6) is calculated using fast Dijkstra's algorithm with a priority queue implemented by the heap.

Results-A Case Study
The presented method was applied to obtain a preliminary design of a metro system for Upper Silesia conurbation in Southern Poland.

Case Study Description
Commonly, the development of a metro network is a process lasting many years. The considered case differs here, because the center of Upper Silesia region is an agglomeration of a dozen of cities, which practically form a uniform urban structure, but due to the lack of funding there was no attempt to build a common metro communication system in the past. There are lots of bus and trams lines, which serve over 1 million passengers daily. There is also a network of railways, but the train frequency is much too low to serve as typical urban trains. Very dense road network including two motorways and an urban motorway crossing the centers of most of cities cause many citizens to use their own cars; this results in congestions, city centers overcrowding and the environment pollution. Nowadays, in such situation an idea has arisen to consider a possibility to build a metro network. Figure 1 presents a map of considered area, and Table 1 lists the biggest cities forming the center of Upper Silesia region. The region extends from E18.6150 to E19.3030 and from N50.1118 to N50.4575 (65 km × 41 km).  Mysłowice 75,000

Determination of the Main Metro System Parameters
The first issue to be discussed is the number of metro lines and stations. As there are no strict guidelines, research on a few of the world's metro systems was done. Some of this research is gathered in Table 2. The data are commonly available, but sometimes it is difficult to interpret them because metro networks often cross the city boundaries and the size of serviced population cannot be determined exactly.

Determination of the Main Metro System Parameters
The first issue to be discussed is the number of metro lines and stations. As there are no strict guidelines, research on a few of the world's metro systems was done. Some of this research is gathered in Table 2. The data are commonly available, but sometimes it is difficult to interpret them Appl. Sci. 2019, 9,433 8 of 17 because metro networks often cross the city boundaries and the size of serviced population cannot be determined exactly. The dependence between the population and the number of metro lines is not strict, but some regularity can be observed ( Figure 2). The dependence between the population and the number of metro lines is not strict, but some regularity can be observed ( Figure 2). As is can be noted except the case of New York (off-standing red square), all other points are more or less aligned along the regression line. The conclusion is as follows: one metro line should be for about 800,000 citizens. In the same way, the overall number of stations can be determined ( Figure  3). In contrast to the previous issue, there is no exception here-the regression line fits the data points very well, and the average value of 19 stations per line can be stated as a conclusion.
There is a population of about 2,700,000 inhabitants living in considered region; therefore, three metro lines were assumed with, respectively, 54 stations. Additionally, some calculations were done As is can be noted except the case of New York (off-standing red square), all other points are more or less aligned along the regression line. The conclusion is as follows: one metro line should be for about 800,000 citizens. In the same way, the overall number of stations can be determined (Figure 3). The dependence between the population and the number of metro lines is not strict, but some regularity can be observed ( Figure 2). As is can be noted except the case of New York (off-standing red square), all other points are more or less aligned along the regression line. The conclusion is as follows: one metro line should be for about 800,000 citizens. In the same way, the overall number of stations can be determined ( Figure  3). In contrast to the previous issue, there is no exception here-the regression line fits the data points very well, and the average value of 19 stations per line can be stated as a conclusion.
There is a population of about 2,700,000 inhabitants living in considered region; therefore, three metro lines were assumed with, respectively, 54 stations. Additionally, some calculations were done for four lines, keeping the same number of stations. In contrast to the previous issue, there is no exception here-the regression line fits the data points very well, and the average value of 19 stations per line can be stated as a conclusion.
There is a population of about 2,700,000 inhabitants living in considered region; therefore, three metro lines were assumed with, respectively, 54 stations. Additionally, some calculations were done for four lines, keeping the same number of stations.

Selection of the Relevant Values of Optimization Parameters
An optimization process is controlled by a number of parameters. In the case of genetic algorithm, the most important are the population size, the number of generations, the probability of mutation, and the probability of cross-over. Some additional parameters are possible depending on the details of the problem. The increase of the population size leads to better convergence and allows for avoiding local optima, but causes the calculation time to grow up. The size of population was preliminary determined by trial and error; finally, it was equal to 250 individuals at the first stage (stations locations) and 1000 at the second stage (lines layout). To select the relevant number of generations, the optimization process was monitored, and then it was stopped when the progress of increasing the objective function value significantly slowed down.
The most important of the other parameters for the first stage are the probability of mutation (PM), the probability of cross-over (PX), the elite size (ES), and the standard deviation for the station shift (ND, expressed as a fraction of the region diameter). The parameters taken into account in second stage are PM, PX, and ES. A genetic algorithm is a non-deterministic procedure, so there is a need to perform a series of algorithm runs for each combination of parameters. If each parameter took only two values (from a typical range), the full analysis of all possible parameters combinations would require 16 series at the first stage and 8 series at the second stage. Therefore, the values of the parameters were determined by the use of Taguchi method [36]. The L8 and L4 orthogonal tables were used to design these preliminary tests. Taguchi proposed a procedure of determining the relative importance of the examined parameters. This idea involves an application of a quantity called signal to noise ratio (S/N). This quantity is taken from signal theory, and it expresses the relation between the signal-a desired feature and the noise-undesired disturbing factors. In the considered case, as high value of the objective function as possible is desired, so according to Taguchi the S/N value should be expressed as follows: Tables 3 and 4 show the selected parameters values and the optimization process outputs (scaled values of the objective function and S/N) for both stages. The input data described a simplified metro system with 30 stations and three lines to make these preliminary calculations quick. Four procedure runs were carried out in each series. All the values of the objective function were referred to the highest one (normalized to unity).  In the next step, the average values of S/N for each parameter and for each value level must be calculated, and then charts showing the relative importance of each parameter can be constructed. Figure 4a,b show this and then allow for determining the optimal parameters for both stages of thegenetic optimization. In the next step, the average values of S/N for each parameter and for each value level must be calculated, and then charts showing the relative importance of each parameter can be constructed. Figures 4a,b show this and then allow for determining the optimal parameters for both stages of thegenetic optimization. Such adjusted parameters were further applied for the final optimization. The described above procedure allowed for determining the best conditions for the optimization process. What is worth noting, parameters PM and NE were adjusted in a different way for both optimization stages (lower values of PM and NE are preferred in the first stage; meanwhile, higher are more appropriate for the second).  Such adjusted parameters were further applied for the final optimization. The described above procedure allowed for determining the best conditions for the optimization process. What is worth noting, parameters PM and NE were adjusted in a different way for both optimization stages (lower values of PM and NE are preferred in the first stage; meanwhile, higher are more appropriate for the second). Figure 5 presents the adopted map of population density [37]. Cities are marked according to Table 1. Due to avoiding the uncertainty resulting from non-deterministic nature of genetic algorithm, a series of 10 runs of optimization procedure was carried out. The obtained results were similar but not the same. Figure 6 shows as examples two of the obtained stations layouts. The shaded circles mark the degree of satisfying the transportation demands, and bold black circles mark the additional generators of the transportation demands. These generators and relevant number of people demanding the transport have been added after analyzing the list of such facilities at the examined area. As it can be seen, most of the stations are located mainly on highly populated areas, which was expected. However, the detailed locations of stations differ significantly. Some of the transportation demands generators are not served as well (however, it concerns those of lesser significance). The main reason is probably that the number of stations was underestimated, and it was too low to sate all the highly populated areas. Additionally, as was stated, the objective function did not take into account the details of local topography.

Obtained Structures of the Metro Network
values of PM and NE are preferred in the first stage; meanwhile, higher are more appropriate for the second). Figure 5 presents the adopted map of population density [37]. Cities are marked according to Table 1.  Due to avoiding the uncertainty resulting from non-deterministic nature of genetic algorithm, a series of 10 runs of optimization procedure was carried out. The obtained results were similar but not the same. Figure 6 shows as examples two of the obtained stations layouts. The shaded circles mark the degree of satisfying the transportation demands, and bold black circles mark the additional generators of the transportation demands. These generators and relevant number of people demanding the transport have been added after analyzing the list of such facilities at the examined area. As it can be seen, most of the stations are located mainly on highly populated areas, which was expected. However, the detailed locations of stations differ significantly. Some of the transportation demands generators are not served as well (however, it concerns those of lesser significance). The main reason is probably that the number of stations was underestimated, and it was too low to sate all the highly populated areas. Additionally, as was stated, the objective function did not take into account the details of local topography. In the light of such variability of the obtained results, a conclusion arose that the found stations layouts should be regarded just as the preliminary. The determination of the exact location of each station would require further work and far more detailed data. Anyway, such a matter could be expected due to simplicity and rawness of the model. On the other side, the values of fit function were quite close to each other. This is shown in Figure 7, which presents the progress of the optimization process for five runs. The value of fit function was normalized to unity.  In the light of such variability of the obtained results, a conclusion arose that the found stations layouts should be regarded just as the preliminary. The determination of the exact location of each station would require further work and far more detailed data. Anyway, such a matter could be expected due to simplicity and rawness of the model. On the other side, the values of fit function were quite close to each other. This is shown in Figure 7, which presents the progress of the optimization process for five runs. The value of fit function was normalized to unity.

Obtained Structures of the Metro Network
As it can be noted, all the solutions were very close to the final value of fit function about 2000th generation. All of them converged in the same manner: the optimization progress was very quick during the first thousand generations; next, it slowed down; and; finally; it came up to the same value.
In the next step, the obtained stations layout with the highest value of fit function was the input data for lines topology optimization. The cost of new-build metro line per unit length depends on local conditions, and this amount can vary even by a factor of 12 [38]. This variety comes mainly from difficult urban infrastructure, geology land acquisition, and type of construction. Table 5 shows the indicative costs of the construction of a metro line per one kilometer. Since the considered region includes also some less urbanized areas, a big part of the metro network could be constructed at the ground level. The underground or elevated facilities should be built in the centers of the cities.
Because the presented idea of building of a metro system is just in the initial phase, no cost analyses, even superficial ones, have been carried out on this issue. In the light of such variability of the obtained results, a conclusion arose that the found stations layouts should be regarded just as the preliminary. The determination of the exact location of each station would require further work and far more detailed data. Anyway, such a matter could be expected due to simplicity and rawness of the model. On the other side, the values of fit function were quite close to each other. This is shown in Figure 7, which presents the progress of the optimization process for five runs. The value of fit function was normalized to unity. As it can be noted, all the solutions were very close to the final value of fit function about 2000th generation. All of them converged in the same manner: the optimization progress was very quick during the first thousand generations; next, it slowed down; and; finally; it came up to the same value.
In the next step, the obtained stations layout with the highest value of fit function was the input data for lines topology optimization. The cost of new-build metro line per unit length depends on local conditions, and this amount can vary even by a factor of 12 [38]. This variety comes mainly from difficult urban infrastructure, geology land acquisition, and type of construction. Table 5 shows the indicative costs of the construction of a metro line per one kilometer. Since the considered region  In such a situation, no explicit costs were entered into the model and the costs were treated relatively. Therefore, just the simplified map of the building cost coefficient (k) was introduced ( Figure 8). Light gray areas mark the normal cost for less urbanized areas (ratio = 1); dark gray areas mark the increased cost due to dense built-up terrains (ratio = 2), and black areas mark the costs for the city centers (ratio = 4). includes also some less urbanized areas, a big part of the metro network could be constructed at the ground level. The underground or elevated facilities should be built in the centers of the cities. Because the presented idea of building of a metro system is just in the initial phase, no cost analyses, even superficial ones, have been carried out on this issue. In such a situation, no explicit costs were entered into the model and the costs were treated relatively. Therefore, just the simplified map of the building cost coefficient (k) was introduced ( Figure 8). Light gray areas mark the normal cost for less urbanized areas (ratio = 1); dark gray areas mark the increased cost due to dense built-up terrains (ratio = 2), and black areas mark the costs for the city centers (ratio = 4). The optimization of lines topology was carried out for different assumptions concerning the number of lines and the formulation of the objective function according to the below scheme (Table  6). Disabling Equation (8) means the coherency of the metro network is assured, but the average travel time is not minimized.

Objective Function Full (Equation(9)) Equation(8)Disabled
The results of optimization of lines topology are shown in Figure 9. As previously, a series of 10 calculations was carried out for each case, and the best solutions are shown. The same regularity was The optimization of lines topology was carried out for different assumptions concerning the number of lines and the formulation of the objective function according to the below scheme (Table 6). Disabling Equation (8) means the coherency of the metro network is assured, but the average travel time is not minimized. Table 6. Scheme of the optimization of lines topology.

Objective Function Full (Equation (9)) Equation (8) Disabled
No of lines The results of optimization of lines topology are shown in Figure 9. As previously, a series of 10 calculations was carried out for each case, and the best solutions are shown. The same regularity was observed too: despite almost the same values of fit function, the obtained topologies of metro network differed significantly. Because the fit function was determined strictly here, a different explanation is suggested. The possible reason for such an effect is the equifinality of the problem. This means the same desired result can be achieved in different ways.
Appl. Sci. 2019, 9, x 13 of 16 Figure 9. The best solutions of lines topology for all examined cases (according to Table 6).
There is a conspicuous difference between pairs of cases A, C and B, D: the latter pair assures the coherency of the metro network, but sometimes it is needed for a long journey between stations despite their physical proximity. In contradiction, very dense network for cases A and C allows for fast travelling between any pair of stations, although sometimes with more than one transfer. This difference of the radiation accessibility is the obvious effect of the different formulations of the objective function. The short summary of the obtained results is shown in Table 7. Two quantities (building cost and total distance travelled) are expressed relatively (referring to the maximal values) due to avoid large numbers without meaningful units. Having in mind the nondeterministic nature of applied optimization method, one can easily conclude that the metro network performance does not depend strongly on the number of the lines. In the light of data in Table 6, cases A and C seem to be almost the same; this concerns cases B and D as well. Therefore, a conclusion has Figure 9. The best solutions of lines topology for all examined cases (according to Table 6).
There is a conspicuous difference between pairs of cases A, C and B, D: the latter pair assures the coherency of the metro network, but sometimes it is needed for a long journey between stations despite their physical proximity. In contradiction, very dense network for cases A and C allows for fast travelling between any pair of stations, although sometimes with more than one transfer. This difference of the radiation accessibility is the obvious effect of the different formulations of the objective function. The short summary of the obtained results is shown in Table 7. Two quantities (building cost and total distance travelled) are expressed relatively (referring to the maximal values) due to avoid large numbers without meaningful units. Having in mind the non-deterministic nature of applied optimization method, one can easily conclude that the metro network performance does not depend strongly on the number of the lines. In the light of data in Table 6, cases A and C seem to be almost the same; this concerns cases B and D as well. Therefore, a conclusion has appeared that the total construction cost does not depend really on the number of lines. The proposed model does not provide the premises to make a decision on the number of lines: one could expect the lower number of transfers for three lines, but a system with four lines seems to be less prone to disturbances. The choice between coherency (variants B and D) and highly interconnected network (variants A and D) is dependent obviously on the available budget. Figure 10 shows the progress of optimization process for 5 runs of case A. The progress of the optimization slowed down at about the 5000th generation. Therefore, the 10,000 generations can be regarded as the number sufficient to obtain the converged solution. As previously, the optimization process ran in a similar manner: quick and significant improvements of the solution were made in the beginning phase; afterwards, only rare and small corrections were made.

Conclusions
The paper presents a method allowing one to obtain a quick draft of a metro network. Since the task is computationally complex, the two-stage approach with the use of the genetic algorithms was proposed. The method was applied to prepare a preliminary design of the metro network for the central part of the Upper Silesia region in Southern Poland. In the first stage, the optimal locations of stations accounting for population density were found. In the second stage, the optimal lines topology was established; here, two alternative assumptions were considered: just the network coherency was required, or the high radiation accessibility was expected. Two variants of the metro network with three or four metro lines were checked too.
The obtained results were the effect of a few hours of calculations using a standard PC with i7 CPU 2.9 GHz and 8GB RAM. Some effort was required to tune the optimization parameters. The input data were in the form of simple bitmaps, which allowed one to examine different variants of the assumed inputs.
The presented approach can be helpful regarding preliminary considerations of a metro system structure. As no precise data concerning transportation demands and construction costs were introduced into the model, the obtained results cannot be regarded as the final design of the metro system. This concerns mainly the problem of stations locations-this issue requires additional considerations on street topology and the location of other urban facilities. The presented results can be the basis for discussing some variants of stations layout or lines topology. They can be subjected to the public discussion. They can also be used when estimating the required budget.
An important finding is that the requirement of the high radiation accessibility will result in a significant increase of building costs (of about 30%). Otherwise, the increase of the average trip length should be expected (of about 60%).The finally chosen option depends on the budget available. Obviously, a trade-off solution is reasonable as well, but it cannot be found using the presented model The progress of the optimization slowed down at about the 5000th generation. Therefore, the 10,000 generations can be regarded as the number sufficient to obtain the converged solution. As previously, the optimization process ran in a similar manner: quick and significant improvements of the solution were made in the beginning phase; afterwards, only rare and small corrections were made.

Conclusions
The paper presents a method allowing one to obtain a quick draft of a metro network. Since the task is computationally complex, the two-stage approach with the use of the genetic algorithms was proposed. The method was applied to prepare a preliminary design of the metro network for the central part of the Upper Silesia region in Southern Poland. In the first stage, the optimal locations of stations accounting for population density were found. In the second stage, the optimal lines topology was established; here, two alternative assumptions were considered: just the network coherency was required, or the high radiation accessibility was expected. Two variants of the metro network with three or four metro lines were checked too.
The obtained results were the effect of a few hours of calculations using a standard PC with i7 CPU 2.9 GHz and 8GB RAM. Some effort was required to tune the optimization parameters. The input data were in the form of simple bitmaps, which allowed one to examine different variants of the assumed inputs.
The presented approach can be helpful regarding preliminary considerations of a metro system structure. As no precise data concerning transportation demands and construction costs were introduced into the model, the obtained results cannot be regarded as the final design of the metro system. This concerns mainly the problem of stations locations-this issue requires additional considerations on street topology and the location of other urban facilities. The presented results can be the basis for discussing some variants of stations layout or lines topology. They can be subjected to the public discussion. They can also be used when estimating the required budget.
An important finding is that the requirement of the high radiation accessibility will result in a significant increase of building costs (of about 30%). Otherwise, the increase of the average trip length should be expected (of about 60%). The finally chosen option depends on the budget available. Obviously, a trade-off solution is reasonable as well, but it cannot be found using the presented model as it is currently. At present, the model just minimizes the relative construction costs. Thus, it should be extended to take into account the explicit budget limitations and the real construction costs. It was also shown that the level of service and the costs do not depend significantly on the number of the metro lines.
Despite the mentioned limitations, the obtained results give an insight into the designed metro system structure. The main advantage of the presented method is the possibility to quickly make a draft of topology of the metro network by introducing only the basic data describing the analyzed region. The simplicity of the presented model makes it possible to easily change the introduced data to analyze different scenarios of the future development of the region.
All the above remarks do not mean that the applied method-the genetic algorithm-is unable to provide a detailed design of a metro system or any of its components. Artificial intelligence methods, including genetic algorithms, can be applied at any level of the design process. The accuracy of the obtained results is limited only by the model structure, not by the applied optimization method.
Author Contributions: A.K. participated in the preparation of the research. He built the numerical models, tuned the controlling parameters, and took part in the optimization analyses. He participated in the writing of the article. M.K. took part in the data acquisition and in the optimization analyses. She analyzed the obtained results. She participated in the writing of the article as well.
Funding: This research received no external funding.