1. Introduction
Competitive city indices have existed for over three decades. The first such initiative was the 1970 Prices and Earnings Survey created by UBS, a financial services company, which compared the purchasing power of citizens in 72 cities globally [
1].
While indices were initially developed to guide investment decisions, or for multinational companies to calculate compensation packages for expatriate executives (such as indices designed by the Economist Intelligence Unit, a research and advisory services company, and Mercer, a human capital consulting company), over the years their focus broadened to include a wide range of themes and to target a wider audience, which includes business stakeholders but also policy and decision makers, practitioners, and concerned citizens [
1].
City indicators are used to capture changing trends, for example, demographics, infrastructure service delivery, rate of city growth, or to measure and monitor performance [
2,
3,
4]. Indices, which are combinations of indicators, are designed to measure overall performance or progress [
4,
5].
In the context of international development, the United Nations Human Settlements Programme (UN-Habitat) first initiated the Housing Indicators Programme in 1991, which focused on monitoring shelter performances. It was rebranded the Urban Indicators Programme in 1993 to reflect its expanded focus on multiple policy-oriented urban issues [
3,
4].
Nigeria has witnessed a decade of social and economic growth. This has been facilitated in part by a transformative urbanisation process and extensively documented social and demographic change. Urban centres concentrate dense populations, businesses, and infrastructure, which can bolster agglomeration economies, innovation, and economic growth. These centres often serve as the engines of economic growth.
The positive contribution of infrastructure towards socio-economic growth and development is broadly acknowledged in the academic and policy literature [
6,
7,
8]. Infrastructure investments have contributed more than half to Africa’s economic growth in recent years in terms of Gross Domestic Product (GDP) [
9]. However, and therefore of major relevance to economic growth and development, Nigeria suffers from obsolete infrastructure [
10,
11,
12].
The sudden revenues received from oil in the 1970s created the conditions for major investments in infrastructure capacity, notably ports, roads, bridges, and airports [
13,
14]. Reductions in Federal budgets due to fluctuations in oil prices in the 1980s resulted in no significant additions to infrastructure capacity, as well as a lack of funds destined to the maintenance of existing infrastructure. This caused a deterioration of the country’s road network [
15]. As roads are essential for the distribution of goods, a failing road system is a major constraint for economic growth. The deficiencies of the power supply and the impacts on growth and productivity is another major issue in Nigeria [
16].
The remainder of this article catalogues the rationale, steps, and findings resulting from the statistical development of a composite indicator for evaluating the quality of urban infrastructure across Nigerian cities. In developing the City Infrastructure Quality Index (CIQI), we were conscious that its relevance and practical application would depend upon the clarity about the aims of measurement and the fundamental principles, quality, and robustness that underpin it. The design process therefore adopts a number of quality assurance principles which include integrity, open, sound, and transparent methodology, robust and reliable official data, serviceability in terms of a planned revision cycle over the long term, and accessibility.
  2. Priority Policy Areas for the CIQI
The CIQI aims at providing a relative measure of performance in urban infrastructure in Nigeria, focusing specifically on factors of production and resources for inclusive economic growth and development. The focus is on four broad policy areas identified for the development of the CIQI, based on consultations with local stakeholders, and by taking into consideration policy and academic literature. These are: power, transport, industrial land, and Information and Communication Technologies (ICT).
  2.1. Transport-Related Infrastructure
Transport problems are a significant problem in Nigeria, accounting for annual sales losses of 2.4 percent [
17]. Road transport is the primary means of transport in the country and poor quality roads and congestion are the main causes of these losses. An adequate inter-regional road and rail network can improve regional connectivity, which helps to promote regional and international trade and can significantly contribute to economic growth and development.
Good roads help to stimulate economic growth by improving travel times for road users and the delivery of goods and services [
18]. Additionally, evidence shows that quality investment in road infrastructure helps to cut the costs that commuters pay for transportation services [
18]. Similarly, when public urban transportation receives adequate investment attention, the economy benefits [
19,
20,
21,
22]. There is evidence of a positive relationship between commuting time and average gross product per capita [
21]. Furthermore, good transportation infrastructure makes it easier for new firms to enter the business environment, thereby stimulating business competitiveness [
20].
  2.2. Industrial Land
Within Nigerian cities, the space and accommodation needs of formal and informal economic activities are often not provided for in the land use and urban development process. Access to land often regularly receives one of the highest rankings when comparing different business constraints in Nigeria. The low availability of land and buildings restricts investment. Due to widespread bureaucratic obstructions, Nigeria is one of the most difficult and expensive places to register landed property in the world [
23]. A key issue is therefore how the formal and informal sector accesses the urban facilities required for its operations and better performance.
  2.3. Power
The biggest constraint to productivity in Nigeria, as perceived by businesses, is power. Almost all Nigerian firms experience power outages, averaging eight hours per calendar day, which results in indirect costs equivalent to 4.3 percent of sales for manufacturing firms and 5.3 percent for retail firms [
17].
To combat this situation, the majority of firms (88 percent) have their own power generating sets, which adds significantly to their operating costs. Approximately 69 percent of the total electricity utilised by manufacturing firms does not come from the national grid. Larger manufacturers are more dependent on self-generation of electricity than smaller ones. On average, the cost of acquiring and maintaining an electricity generator amounts to 9 percent of the total value of machinery budgets and 13 percent of operating expenses [
17].
  2.4. Information and Communication Technologies (ICT)
Recent research proves that there is a positive relationship between increasing penetration of mobile phones and the growth of the monetary measure of the market value of goods and services [
24]. With an average of 10 additional mobile phones per 100 people, a country has the potential to unravel 0.6 percent of supplementary GDP growth. An increase in telecommunications capability also attracts foreign direct investment (FDI) [
25].
In Niger, it was recently discovered that mobile phone penetration can lead to a 29 percent increase in average daily profits of traders [
26]. Evidence also shows that the introduction of mobile phones minimises the variability of the prices of grain by around 10 percent [
27].
  3. Data, Cities, and Preliminary List of Indicators
Like other developing countries, Nigeria has its fair share of challenges in establishing reliable statistical data infrastructure for evidence-based research. However, in recent years, collaborative engagements between statutory agencies like the National Bureau of Statistics (NBS) and international development partners (such as the World Bank and the Department for International Development [DFID]), have yielded some improvements in the gathering of statistical datasets. Advancements in data management, statistical analysis and retrieval techniques have further expanded opportunities for researchers to undertake novel interdisciplinary research using the datasets available. One of the central aims of this research study was to utilise existing datasets as much as possible. For piloting the CIQI consideration was therefore given to existing data sources.
  3.1. Data Source
Following a thorough review of variables subsumed in three potential data sources (The World Bank’s Enterprise Survey, United States Agency for International Development’s Demographic and Health Survey (DHS) and the NBS General Household Survey (GHS)-Panel survey), the World Bank’s Enterprise Survey was chosen as the appropriate dataset for piloting CIQI. The DHS and the GHS-Panel surveys do not contain city-level identifiers. The Nigerian Enterprise Survey is funded by the World Bank and DFID. The broad aim of the survey is to assist the Government of Nigeria in developing a diagnostic base for measuring and benchmarking the country’s enterprise and investment climate. Apart from providing comparative indices across all the 36 states of Nigeria and the Federal Capital Territory (FCT), the survey also allows for benchmark comparators against other countries [
28]. The Primary Sampling Unit (PSU) used for the Enterprise Survey is the establishment. In addition to its appropriate alignment with the thematic remits of the CIQI, a major strength of the Enterprise Survey is that it contains city-level identifiers.
  3.2. Data Time Points
This study sought to create a baseline CIQI upon which future indices could be created and compared over time. To achieve this, the 2007 and 2009 Enterprise Survey datasets were fused together and used to construct the baseline CIQI. This was done for two reasons. First, both datasets were reported at the city geography. Secondly, evidence shows that most cities are less susceptible to infrastructural change within a short time lag [
4]. The time lag between the 2007 and 2009 datasets is two years.
  3.3. Selection of Cities
The baseline CIQI is constructed for 37 cities across Nigeria for which data is available. 
Table 1 shows the 37 cities for which the CIQI is constructed together with the source of data for each corresponding city.
Figure 1 shows the spatial divisions of Nigeria’s key administrative areas. As illustrated in 
Figure 2, the proportional representation of the CIQI cities across the six geopolitical zones is fairly balanced. This helps to capture regional dynamics of city infrastructure quality across the entire country.
   3.4. Preliminary Indicators
A total of 21 indicators were initially considered for the development of the CIQI. In the fields of quantitative social science and urban analytics, indirect or proxy indicators are used where direct measures are not feasible. 
Table 2, 
Table 3, 
Table 4 and 
Table 5 show the proxy indicators from the World Bank’s Enterprise Survey that were used together with their description or method of computation [
28].
Data availability can also be limiting factor that imposes the choice of proxy indicators. To the best of our knowledge, there is no consensus in academic literature on the specific validated list of indicators that should be used to construct infrastructure quality indices. Different scholars have combined different indicators to suit the purpose of the index which they have constructed [
3,
4,
5]. For the CIQI, restricted availability of city-level data compelled us to utilise numerous proxy indicators to represent the initial list of indicators.
  4. Statistical Modelling
As described in the previous section, a total of 21 indicators were initially considered for inclusion in the data fusion process. Analytical soundness of each indicator was tested using a series of statistical analysis. These are briefly discussed.
  4.1. Multicollinearity
Multicollinearity is a situation where two or more variables are highly correlated [
29]. Consideration was given to the relationships between the indicators. The Pearson’s coefficient of correlation was used as the statistic for examining the relationship between variables. Correlations can be positive (where the values of the pair of variables increases or decreases in the same direction) or negative (where the value of a variable while its related variable decreases). High correlations inform redundancy. Multicollinearity was also assessed by examining which indicators represent the principal dimensions of the original dataset. This task was achieved using Principal Components Analysis (PCA).
  4.2. Likely Influence of Indicators on the CIQI
A usefulness of PCA as a data reduction technique is its ability to identify which variables are likely to power the data fusion process [
30]. One of the outputs of the PCA analysis is a weights table also called a components loading matrix which is indicative of the power that each variable exerts. Typically, for a variable, the higher the value of this weight, the better its variance can be explained by the corresponding principal component and consequently the greater the power it is likely to exert on the cluster analysis [
30].
  4.3. Assessing Internal Consistency
When assessing internal consistency, the Cronbach Coefficient Alpha (c-alpha) is the most popular statistic that has been used [
31,
32,
33]. It is used to evaluate how well a set of variables measure a single uni-dimensional object. The c-alpha appraises the quota of total inconsistency of the individual indicators due to the association of indicators. The c-alpha is equivalent to zero if individual variables are independent and no correlation exists. However, c-alpha is equal to one if the indicators are perfectly correlated.
  4.4. Normalisation
For each policy area of CIQI, the aim was to construct a single summary measure which can be expressed in meaningful units and easily interpreted. The underlying metrics of the indicators are different which means it would be impossible to simply do a straight forward combination of these indicators. It is inappropriate to combine datasets that are measured in different scales because the resultant index will not be a true reflection of reality. Indicators measured on a scale with a large range will be given undue advantage as opposed to those measured on a smaller range [
34]. The range standardisation was used to equalise the scale of measurement of different indicators.
        
        where:
		 
-  = standardised value 
-  = original value of the indicator for a city  
-  = minimum value of the indicator within the distribution 
-  = maximum value of the indicator within the distribution 
The range standardisation converts all values for each indicator into a range of 1 and 0. A value of 1 being the maximum and 0 being the minimum. Unlike other standardisation methods (z-score and inter-decile range), the range standardisation coped well with skewed indicators.
  4.5. Weighting
Adding a weight to an indicator signifies the level of importance of that indicator to the index [
35]. This is precisely why this process is subjective. What one researcher or policy maker considers important may not be important to another. Even when an indicator is considered important, the level of importance assigned to that indicator may vary from person to person.
Applying a weight to an indicator will ultimately influence the resultant output of the analysis [
34]. Whilst weighting is not compulsory [
36], one reason why weights were deployed in the creation of the CIQI was to compensate for the quality of the underlying statistical datasets available for analysis.
Factor analysis was to create the weights for the indicators prior to combining them into policy area scores. The underlying conjecture for the generation of weights with factor analysis is the assumption of the existence of latent constructs of the policy areas. The Maximum Likelihood technique was used because of its ability to help overcome problems associated with different levels of statistical accuracy [
37]. The analysis generated weights for indicators in each domain. These were amalgamated to produce domain index scores for each of the 37 cities.
  4.6. Creating the Overall CIQI
In order to create the overall CIQI, the domain index scores for the four policy areas had to be pooled together. If they were summed up, this would suggest that exert equal importance in the CIQI. However, some policy issues are considered more important in terms of their relevance to economic growth and development within an urban infrastructure framework. For this reasons, each policy domain was also weighted in terms of their perceived importance.
  5. Data Reduction Procedure
A total of 21 indicators listed in 
Table 2, 
Table 3, 
Table 4 and 
Table 5 were originally considered for inclusion in the data fusion exercise. The suitability of these indicators was appraised in order to minimise the number of indicators if there was a compelling empirical evidence for such decision.
Table 6 shows the correlation matrix for the 21 indicators. The blue colours indicate high positive correlations while the red colours identify high negative correlations. Telecoms Penetration for instance was found to be positively correlated with Transport Availability while Authorised Housing was negatively correlated with Access to Land.
 Indicators exhibiting a normal distribution are often most ideal for inclusion in this sort of analysis but this is often not the case with area-based infrastructure data. The Fisher-Pearson coefficient of skewness given in Equation (2) was used to evaluate the degree of skewness of each variable in the dataset [
38,
39].
      
      where:
	   
-  = value of skew 
-  = mean of the distribution 
-  = standard deviation the distribution 
-  = number of data points 
Positively skewed indicators are particularly undesirable. This was used as a key criterion in selecting or deselecting indicators for the creation of the final CIQI. 
Table 7 shows the skew values for the indicators with the largest positive skew.
Internal consistency and reliability are important for this analysis because, in their absence, it is impossible to have any validity associated with the composite scores which will be generated from the analysis. The Cronbach Coefficient Alpha (c-alpha) was used here to validate internal consistency. Basically, it helps us to determine if it is justifiable to interpret the scores that will be aggregated together. To interpret the c-alpha, we adapted a commonly accepted rule of thumb proposed by George and Mallery [
39] for describing internal consistency. The results from the analysis yielded a high c-alpha value of 1.19 suggesting excellent internal consistency within the dataset.
Principal Components Analysis (PCA) was used to determine the indicators that may exert significant influence on the CIQI. 
Table 8 shows the loadings of the first principal component with indicators in descending order of level of influence on the CIQI. The analysis suggests that Telecoms Penetration would have the largest influence on the overall CIQI.
Deciding the final choice of indicators is an important and rigorous activity. Following extensive review of the analysis, the 21 preliminary indicators were reduced to 14. Results from statistical analysis were juxtaposed next to one another. Each indicator was assessed independently on the basis of its performance on each statistical metric. Additional expert judgement based on policy literature was also considered. The final list of indicators used to create the CIQI is listed in 
Table 9 with all four policy areas equitably represented.
  6. Fusion of Final Indicators
It is improper to combine the fourteen selected indicators directly because they are measured on different scales. For instance, the Power Reliability indicator measures the average monthly power outages while the Access to Electricity indicator is the percentage share of business establishments for which electricity constitutes minimal or no obstacle to operations. Percentages cannot be directly combined with counts without standardisation. Prior to proceeding with the fusion of the indicators to create separate policy area scores and the final CIQI, the different scales of measurement of the indicators were standardised using a range standardisation method. The range standardisation helped to re-calibrate the values for each indicator within a range of 1 and 0. A value of 1 being the maximum and 0 being the minimum.
The natural exponential function [] was used to transform the standardised data ahead of factor analysis. The function helps to correct for the presence of high skew in the distribution by adjusting the data to conform as closely as possible to normality.
In order to create the baseline CIQI, the scores for the four policy areas were combined. Simply adding the scores together would be inappropriate as that would signify that each policy area has equal importance in the measurement of city infrastructure quality. Businesses consider some policy areas to be more important than others in terms of their influence to productivity in particular, and economic development and growth in general. For this reason, each policy area was weighted in accordance with their perceived importance. The overall CIQI was constructed by combining the policy area scores using weights constructed from the ranking of the top business environment obstacles for firms based on the 2014 Enterprise Survey. Businesses indicated that power presents the most pressing obstacle, followed by transport, land and ICT. 
Table 10 shows the weights allocated to each of the four policy domains and the fourteen indicators.
  7. Results and Discussion
The CIQI is the amalgamated summation of the weighted and transformed policy area scores. It is important to note, however, that as a result of the exponential transformations, a city with a CIQI score of 200 does not necessarily translate into twice that with a score of 100. It is suggested that rankings should be utilised when making comparisons between cities. The baseline CIQI is ranked in the same manner as the policy area scores. The higher the value of the CIQI, the better the overall quality of city infrastructure.
We use the median (169) to benchmark the distribution in 
Figure 3. The maximum value of the CIQI (204.3) is assigned to Katsina whilst the minimum (141.7) is assigned to Maiduguri. A spatial distribution of the cities by their CIQI score ranges and corresponding population class sizes is presented in 
Figure 4 whilst 
Figure 5, 
Figure 6, 
Figure 7 and 
Figure 8 show the spatial distribution of scores for each policy area.
It is important to point out that the results are based on a relative measure of infrastructure quality. Eight cities make it to the top 20 percent of cities exhibiting above average quality transport infrastructure. These include Lafia (North Central), Lokoja (North Central), Yenagoa (South South), Katsina (North West), Birnin Kebbi (North West), Gusau (North West), Jos (North Central), and Abakaliki (South East).
Cities that belong to the bottom 20 percent are characterised by below average quality transport infrastructure. The cities in this category include Enugu (South East), Asaba (South South), Maiduguri (North East), Ado-Ekiti (South West), Lafia (North Central), Calabar (South South), Jalingo (North East), and Owerri (South East). Enugu and Asaba are rated below average partly because of lower use of multi-modal transportation by businesses. For Maiduguri, Calabar, Jalingo, and Owerri, the key problem is travel time whilst in Ado-Ekiti business enterprises seem to grapple with the problem of transport availability.
Analysis of the top 20 percent cities exhibiting relatively higher quality power infrastructure was conducted. A total of eight cities belong to this category. They include Lagos (South West), Katsina (North West), Akure (South West), Umuahia (South East), Calabar (South South), Kaduna (North West), Enugu (South East), and Sokoto (North West). Conversely, the bottom 20 percent of cities with the lowest power quality score include Lokoja (North Central), Jos (North Central), Damaturu (North East), Maiduguri (North East), Uyo (South South), Ado-Ekiti (South West), Makurdi (North Central), and Asaba (South South). With the exception of Asaba, all the other low scoring cities are severely impacted by unreliable power supply. For Asaba, the main factor contributing to its low score was the low rate at which businesses secured connectivity to the electricity grid within the statutory period of one week.
Eight cities make it to the top 20 percent of cities exhibiting relatively high quality of industrial land. The cities include Sokoto (North West), Katsina (North West), Port-Harcourt (South South), Asaba (South South), Ado-Ekiti (South West), Gombe (North East), Ibadan (South West), and Lokoja (North Central). A similar analysis was conducted to determine the cities that belong to the bottom 20 percent. The cities with the lowest quality scores for industrial land infrastructure include Yola (North East), Abuja (North Central), Makurdi (North Central), Umuahia (South East), Jalingo (North East), Bauchi (North East), Kano (North West), and Uyo (South South). One of the reasons why Yola is rated low is because of the disproportionately low numbers of business establishments for which land accessibility constitutes minimal obstacle to operations. In Abuja, Umuahia, and Uyo, disputed land ownership is the main factor inhibiting business growth and expansion. For the other low scoring cities (Makurdi, Jalingo, Bauchi, and Kano), the drawback factor is the difficulty in accessing government land.
Eight cities are included in the top 20 percent of cities exhibiting relatively high quality ICT infrastructure. They include Ibadan (South West), Benin City (South South), Lagos (South West), Akure (South West), Ilorin (North Central), Awka (South East), Calabar (South South), and Umuahia (South East). A similar analysis was conducted to determine the cities that belong to the bottom 20 percent. The eight cities with the lowest ICT quality score are Birnin Kebbi (North West), Damaturu (North East), Lafia (North Central), Dutse (North West), Lokoja (North Central), Ado-Ekiti (South West), Katsina (North West), and Yola (North East). 
Figure 7 shows the spatial distribution of the cities by their ICT policy area score and corresponding population class sizes. The map shows an apparent north-south divide. Relatively higher quality of ICT infrastructure appears to favour cities in southern Nigeria. Comparatively lower telecoms penetration is main driver of low ICT quality for five of the lowest scoring cities (Birnin Kebbi, Damaturu, Lafia, Dutse, and Lokoja). For the other three cities the level of internet use amongst business establishments is comparatively lower and it accounts for a relatively low ICT score.
  8. Conclusions
This research allowed the piloting of the CIQI for Nigeria, and the drawing of valuable lessons which may also have relevance for other developing country urban contexts.
In particular, we demonstrated that the application of multivariate data analysis can be helpful in understanding and interpreting the dynamics of the quality and coverage of urban infrastructure. We have shown that city level relative diversity in the quality of infrastructure exists within and between the six geopolitical regions of Nigeria. An interesting finding from this preliminary analysis is the influence that the quality and extent of industrial land (and transport to some extent) is having on the overall CIQI, and particularly the positive impact this has had on the overall performance of cities in the North West of Nigeria, which rank higher than would have been anticipated. On the contrary, the influence of industrial land on the overall CIQI has had a negative impact to the performance of several cities in the southern and central parts of the country, including Lagos and Abuja.
However, we note a number of limitations and weaknesses in the analysis. First, we acknowledge that much of the indicators used are proxy indicators due to some deficiency in data availability. That said, there are volumes of data that are largely under-utilised in Nigeria. For piloting CIQI, consideration was given to existing data sources (such as the World Bank’s Enterprise Survey), as this seemed to be more appropriate, feasible and certainly cost-effective.
We also recognise that for some of the policy areas (notably transport and industrial land) there are very few available indicators in the World Bank’s Enterprise Survey. It would be useful if future editions of this survey considered including more questions related to transportation and industrial land and their effects on business enterprises, given their important contributions to productivity as demonstrated by academic and policy literature alike.
Insufficiency in the number of input indicators means there is scope for the integration of multiple datasets. Future analysis could consider bringing together multiple surveys to generate the CIQI. In addition, where possible, informal but validated crowd-sourced data could be considered when trying to fill in data gaps. It is our experience that such datasets also tend to be under-utilised in urban research. This could obviously present complex methodological challenges. However, we believe these challenges could be overcome. Advancements in data management, statistical analysis and retrieval techniques have expanded opportunities for researchers to undertake novel inter-disciplinary research using the datasets available.
The research, policy, and programming relevance and significance of CIQI is three-fold. Firstly, it provides a coherent set of methods and tools for measuring the performance of service and infrastructure quality and extent in Nigerian cities. This includes the CIQI structure and corresponding indicators that are intended to provide a useful analytical framework, the identification and design of innovative and cost-effective methodologies for utilising formal data sources (official statistics) in data-scarce environments like Nigeria, and the development of data visualisation tools utilising Geographic Information System (GIS) for effective communications of spatial information and data on infrastructure.
Secondly, it provides a source of data and information for Nigeria’s development partners, researchers and interested practitioners, as well as non-government and private sector stakeholders, as it intends to provide useful data and information that can be utilised for performing evidence-based analyses on urban infrastructure in cities. This can be enhanced by also looking at the relationship between infrastructure and economic growth and development, through comparing the CIQI results to other economic indicators (such as subnational GDP output) to identify key trends and relationships.
Lastly, CIQI can be used to bolster government and citizen awareness and support at the local, state, and national levels for identifying and establishing priorities in relation to the provision of urban infrastructure and services, and specifically on factors and resources for economic growth and sustainable development [
38,
40].