You are currently viewing a new version of our website. To view the old version click .
Sustainability
  • Article
  • Open Access

7 September 2022

Airlines’ Sustainability Study Based on Search Engine Optimization Techniques and Technologies

,
and
1
Department of Informatics and Telecommunications, University of Peloponnese, Akadimaikou G. K. Vlachou Street, 221 31 Tripoli, Greece
2
Department of Agribusiness and Supply Chain Management, School of Applied Economics and Social Sciences, Agricultural University of Athens, 118 55 Athens, Greece
*
Author to whom correspondence should be addressed.
This article belongs to the Collection Advertising and Sustainable Development

Abstract

Digital marketing, especially search engine optimization (SEO), is an integral part of websites today. Airlines in the COVID-19 era have to use every possible means to survive despite the adverse conditions for both entrepreneurship and travel. Many of them have allocated resources and money to develop SEO strategies by applying SEO techniques to their websites to gain more visitors and bookings. Thus, this research is focused on analyzing airlines’ website presence as regards the implemented SEO techniques and their effect on airlines’ website traffic. In the first phase of the research, we gathered web data from 243 airline firms during a one-year observation period (December 2020–December 2021) using our own-developed tool. Furthermore, we proceeded to create an exploratory model using fuzzy cognitive mapping. From the technical SEO point of view and the descriptive analysis, we conclude that the traffic on airlines’ websites and, consequently, their sustainability are inseparably linked to the corresponding SEO techniques and technologies used.

1. Introduction

Today more than ever, the application of SEO techniques and technologies to websites, regardless of their industry, is imperative. The rising usage of the web, the new habits of web searchers, the growth of competition, the great need to get organic traffic without paid ads, and the changes in search engine algorithms are just the beginning for a website to survive in the current COVID-19 conditions.
There are countless sources for a website to increase its traffic. The most important, as presented in the Google Analytics App, are: Organic Search, Social, Direct, Referral, and Paid traffic [1,2]. Organic traffic is the traffic that comes from users who use search engines to find (free of charge) what they are looking for. The results presented after each keyword search consist of a list of available websites that use the keyword searched [3]. Thus, the traffic of a website depends on its ranking in search engines and its organic traffic. Since ranking in organic results cannot be paid, a high organic search ranking is, in most cases, hard to achieve [1]. In order for a website to achieve high rankings in web searches, it has to meet certain specifications—ranking factors as they are described in the guidelines that are regularly published by the companies that own the search engines [2]. These factors are known, but their impact on search ranking has not been fully reported, since search engines neither make public their ranking algorithms nor disclose information regarding the factors used in the ranking [4]. However, the dominant SEO techniques and their impact on organic traffic can be found in several studies [5,6].
Through our study, we analyze a large sample, i.e., 243, of airlines websites. Each website is stress tested against several SEO techniques and technologies in order to find out how airlines define their SEO strategy and what specific SEO techniques are used to impose themselves against the competition. Through a fuzzy cognitive map, we determine the relationships between SEO techniques and their contribution to the corresponding traffic. Finally, our descriptive analysis attempts to correlate the contribution of SEO and the corresponding sustainability of the airline industry, by using research hypotheses and problem formulation techniques.
To achieve the above results, we created a php-based software tool, which undertakes to check for the SEO techniques used in the source code of each website and create the corresponding reports used in our descriptive analysis that follows. To deal with access problems from the websites’ firewalls, we used proxy servers and desktop User-Agents.
Our study provides a handful of time-accurate insights for both existing and potential future marketeers regarding SEO strategies that airlines follow and which specific SEO techniques they apply to their websites to overtake their competition in search results.

3. Materials and Methods

The purpose of this paper is the proposal of an innovative methodology for deploying an efficient framework for understanding the strong connection between search engine optimization and web traffic, providing valuable insights for airline firms’ websites. To reach the desired results, we follow nine research stages, presented in Figure 1.
Figure 1. Overall step-by-step representation of the proposed methodology, reflecting the stages for comparing SEO techniques and SEO metrics to Web traffic.

3.1. Websites’ Traffic Sources and Key Performance Indicators

Airlines’ online presence, as well as their bookings, are directly affected by various factors, which should be analyzed and taken into consideration over our estimation. To obtain a more specific vision of the airlines’ online presence, we use the organic web traffic source, which indicates the main source of websites’ traffic. Organic traffic, as mentioned in Section 1, is the traffic coming from users who use search engines (e.g., Google, Yahoo, etc.). It is an unpaid form of web traffic, whose role in business success is vast. The majority of websites rely on organic traffic results, since it represents over 60% of a website’s total traffic [40]. Organic traffic, as an unpaid source of traffic, has many benefits for businesses, such as relatively small investment in contrast with PPC campaigns, long-lasting results enhanced year by year with the appropriate effort, and major increase in brand awareness [41].
Four more factors that search engines take into consideration while evaluating a website, before ranking it in the SERPs, are the organic keywords, the backlinks, the website speed, and the domain authority (DA) [42,43]. Organic keywords are the keywords for which websites are ranked in the organic SERPs of search engines [44]. Backlinks are links from third pages to the target page that act as an up-vote on that page [1]. Website speed is the time in seconds it takes for a web page to be fully downloaded and loaded from the webserver to the user’s browser. Domain authority (DA) is a search engine ranking score developed by Moz that predicts the strength of a website in terms of its ability to rank in SERPs. Domain authority score ranges from 1 to 100, with higher scores corresponding to greater likelihood of ranking [45]. Domain authority as a metric is not visible to search engines. It tries to simulate how search engines rank web pages. In most cases, the domain authority score is representative. DA is important to compare websites with each other. Websites with a higher DA are more likely to appear higher in SERPs than others with a lower DA, respectively.
The web traffic metrics proposed as KPIs in this paper are presented in Table 1.
A performance indicator or key performance indicator (KPI) is a type of performance measurement that evaluates a business’ success in certain activities that it takes over [46,47]. KPIs examined for the purposes of this paper alongside the web metrics are included in Table 1. It is valuable to examine website traffic on a monthly basis to export more accurate results concerning traffic and ranking performance.
The main objective of our KPIs is to achieve higher results in organic traffic as well as higher positions in search results. In our research, we have five key performance indicators, i.e., Organic Traffic, Domain Authority, Website Speed, Organic Keywords, and Backlinks. Each one should be accomplished to gain higher organic traffic and potential customers.
Table 1. Description of the examined web analytics metrics and suggested KPIs and performance measurements.
Table 1. Description of the examined web analytics metrics and suggested KPIs and performance measurements.
KPIs (Unit)Description
Organic Traffic/month
(number of visitors per month)
Organic search refers to the non-paid search results from a search engine. These results can’t be bought or influenced by advertisers, so they are the ones the search engine considers most relevant to the user’s search query [48].
Domain Authority—DA (score in the range (0–100) integer)DA score can be used when comparing websites or tracking the “ranking strength” of a website over time [45]. Websites with higher DA are more likely to rank higher in SERPs than those with lower DA.
Website Speed (seconds)Website speed refers to how quickly a browser can load fully functional web pages from a given website.
Organic Keywords (number of keywords)Organic keywords are keywords used in SEO to attract “free” traffic. When a user types a keyword on the search engines gets as a result websites which rank for this specific keyword [44].
Backlinks (number of backlinks)Backlinks are the number of external pages that show the target page with a link. Backlinks help secure a higher SERP ranking [23]. Backlinks affect mostly the position of a website followed by relevant content [5].

3.2. Sample Selection and Data Retrieval

For the purpose of this paper, we retrieved data from the International Air Transport Association (IATA), which supports aviation with global standards for airline safety, security, efficiency, and sustainability [49]. The sample is based on 243 IATA members. Some of the most well-known airlines in our sample are Air Canada, Air France, American Airlines, British Airways, China Airlines, and Emirates. For the above airline firms, data were collected concerning their websites’ organic, direct, and referral traffic, as well as their websites’ session duration and the bounce rate. The sample was considered representative, providing knowledge about the search engine optimization applied by airlines as well as how this translated into traffic. Data were collected on a monthly basis from the airlines’ websites, for more accurate examination and comprehension of web metrics’ variance. The testing period extended to 12 months, which is limited to specific dates from December 2020 to December 2021. Since this period coincides with the COVID-19 transportation restrictions, the results will provide a clear view of the decline in bookings during this period.
To collect the sample, we created a PHP-based web crawler tool, which undertook to extract information, such as legal name, ICAO code, region, and website from each airline listed as a member in IATA’s website [49,50]. After the data extraction, we stored the entire dataset in a MySQL database, which we used to conduct the SEO checks as a next step [50].
The software tool created for the purpose of this article draws the dataset from IATA’s website and stores each result in a MySQL database. It then runs through the dataset and extracts the source code from each airline’s website via cURL. CURL is a computer software project providing a library (libcurl) and a command-line tool (curl) for transferring data using various network protocols [51]. For each of the 15 SEO techniques presented in Section 2, a separate function was created, which searches the source code of the targeted website to find out if the SEO technique has been applied. Finally, the results from the checks are stored back in the database. A comprehensive example of the generated code and its main functions is shown in Figure 2.
Figure 2. PHP-based SEO software.
To expand our research and extend our results, four third-party API tools were used, i.e., Mobile-Friendly Test Tool API, PageSpeed Insights API, Mozscape API, and Ubersuggest Traffic Analytic tool. Each of these tools obtains the website’s URL and returns results based on its own measurements.
  • Mobile-Friendly Test Tool API, created by Google, checks the given URL against responsive techniques and returns a list of any mobile-usability issues that can affect a user that visits the page on a mobile device [52].
  • PageSpeed Insights API, created by Google, measures the performance of a web page returning suggestions on how to improve the page’s performance, such as page speed, accessibility, and SEO [53].
  • Mozscape API, created by MOZ, takes as an input the website’s URL returning accurate metrics such as Domain Authority [45].
  • Ubersuggest, created by Neil Patel, obtains and returns a 360° overview of any website’s metric and its sources [54].
After conducting SEO evaluations on 243 airline websites, the exported results imported to IBM SPSS 27 (IBM, Armonk, NY, USA). Organizations use IBM SPSS Statistics to understand data, analyze trends, forecast and plan to validate assumptions, and drive accurate conclusions [55]. Descriptive analysis in different correlations was conducted, confirming the research hypotheses that follow. Finally, a fuzzy cognitive map (FCM) was created to graphically represent the relationship between SEO techniques, traffic, and backlinks, showing the direct correlation of SEO techniques with traffic and, consequently, the bookings and sustainability of airlines.

3.3. Problem Formulation and Research Hypotheses

The rising competition, the COVID-19 era, and travel restrictions around the world tend to increase companies’ efforts for efficiency in digital marketing campaigns. Airlines use every available tool at their disposal, from copywriting, email marketing, social media marketing to search engine optimization to reach their potential customers through numerous methods. By understanding the consumer habits of their audience, airlines develop marketing strategies to target this audience more effectively.
Airline websites are the main source of bookings and companies dedicate resources to organize and maximize their website’s efficiency and effectiveness. Their position in search engine results (SERPs) plays a crucial role in gaining organic visitors. The web page title, the meta description, and even the rich results must be applied with attention to detail to win the coveted click. Content, user interface, and website speed will pique the visitor’s interest converting him/her to a loyal client. Knowing that traffic and conversion to client is the main point of airlines, we define five hypotheses on which we base our research, attempting to confirm them, aiming to extend the practical knowledge over the significance and impact of search engine optimization to airline website traffic.
Hypothesis 1 (H1).
SEO techniques impact airlines’ website traffic.
The main goal of this paper is to acknowledge whether search engine optimization impacts airlines’ website traffic. By knowing which SEO techniques airlines implement on their websites, comparing the findings with the results in traffic provides us with valuable information and potential benefits from using SEO techniques. As mentioned in Section 3.2, a dataset was collected from the IATA website [49], consisting of 243 airline firms’ websites. Scanning and performing SEO tests on each website source code, using our own developed software [50], we gather information regarding the SEO techniques each website has implemented. Using third-party software tools described in Section 3.2, we scan each website gathering information concerning their domain authority, website speed, organic keywords, organic traffic, and backlinks. All data collected are stored in an excel sheet. Using the IBM SPSS 27 software descriptive analysis was conducted in order to find connections/patterns between SEO techniques used by websites and their organic traffic. The ultimate goal of this research is to uncover which SEO techniques used by the most well-known airlines are more effective in organic traffic, assuming that the websites with the highest organic traffic applied the appropriate SEO techniques in their source code.
Hypothesis 2 (H2).
On-page SEO techniques impact airlines’ website ranking.
Hypothesis 2 is a segment of Hypothesis 1, targeting the on-page SEO techniques. Our second hypothesis is based on on-page SEO techniques and how they affect website ranking. Targeting multiple keywords, as well as the implementation of on-page SEO techniques, is assumed to offer an increase in organic results.
Hypothesis 3 (H3).
Off-page SEO techniques—backlinks impact airlines’ website ranking.
Hypothesis 3 is a segment of Hypothesis 1, targeting off-page SEO techniques. Hypothesis 3 focuses on the implementation of off-page SEO techniques and how they affect the traffic of airlines’ websites. We assume that the creation of backlinks to third-party websites will lead to strengthening website’s domain authority and consequently will lead to higher positions in the SERPs.
Hypothesis 4 (H4).
Airlines’ organic keywords impact airlines’ website ranking.
In Hypothesis 4 we assume that organic keywords are valuable for airlines’ website leading to higher rankings on SERPs and increased Web traffic.
Hypothesis 5 (H5).
Airlines’ backlinks impact the airlines’ domain authority.
In Hypothesis 5 we assume that the backlinks are integrally linked with domain authority.

4. Results

4.1. Data Normalization

In this section, we examine the collected data. As mentioned in Section 3.2, our tool performed SEO checks on 243 websites. The results were stored in excel so that we can use them in further data analysis [56].
Collected data are separated into three groups.
  • Group 1: Metrics are checks performed by third-party tools and are not SEO techniques. The contents of the Group Metrics are as follows: DA (integer 0 to 100), speed_test (decimal), number of keywords (integer), and number of backlinks (integer).
  • Group 2: SEO techniques are checks performed by our tool regarding the existence (or not) of SEO techniques on each website in the dataset. All metrics returned 1 if the SEO technique was applied to the website and 0 otherwise. The contents of the Group SEO techniques are as follows: images_alt, links_title, rss, sitemap, robots, heading1, heading2, web_ssl, meta_description, opengraph, url_seo_friendly, amp, minified_css, minified_js, title, structured_data, and responsive_test.
  • Group 3: Web Traffic (organic traffic) is the monthly statistical data collected for each website. All web traffic data are integers and refer to website data for the last twelve months. The contents of the Group Web Traffic are as follows: traffic_1, traffic_2, traffic_3, traffic_4, traffic_5, traffic_6, traffic_7, traffic_8, traffic_9, traffic_10, traffic_11, and traffic_12.

4.2. Limitations

During the examination, the responsive_test technique found that the Mobile-Friendly Test Tool API created by Google considers non-responsive pages that have even one non-responsive element. Manual checks revealed that these specific websites partially adopt responsive techniques and have several elements in their source code that are non-responsive. These pages are marked with the number 0, which means that they do not follow that specific SEO technique. Further, in 17 out of the 243 websites, the speed measurements were performed manually with the Pingdom website speed test by Solar Winds, because the PageSpeed Insights API by Google could not operate a speed test on these pages.
The AMP and Sitemap SEO technique tests found that none of the websites applied these techniques, although they are suggested by Google on Webmaster Guidelines [2]. Consequently, t-test cannot be performed on these techniques, excluding them from our final results.
We collected the monthly organic traffic for 12 months to have a more representative view of the website traffic. For the data analysis that follows, we calculated the mean value of 12 months of organic traffic for each website and saved it in a new column, called Web Traffic. All the following comparisons regarding web traffic refer to the mean value of organic traffic for each website.
Raw data collected from multiple websites do not convey any significant trends or the behavior of the individuals before the corresponding analysis. Data analysis refers to the process of converting raw data into meaningful information, using mathematical, statistical, or computational algorithms for better comprehension [57]. For the quantitative data analysis, some major steps need to be considered to firmly execute the assessment and generate the intended results. The procedure consists of two major steps: (1) reviewing the data by descriptive analysis and (2) conducting inferential and descriptive statistics to answer the research questions [58]. Τhe study is presented in the following sections of the chapter and each section is intended to answer the research questions, as the analysis must be aligned with the research objectives.

4.3. Descriptive Statistics

Descriptive analysis helps to describe, demonstrate, or summarize the collected data in a constructive manner so that the trends and patterns can be easily observed and analyzed [59]. Mean, median, mode, standard deviation, skewness, and kurtosis are some of the important measures. Table 2 indicates that the maximum domain authority, speed test, keywords, and backlinks in this study are 39,833,333.33, 88, 17.67, 2,272,818, and 117,837,438, respectively. According to Tabachnick, Fidell, and Ullman 2017, the values of skewness or kurtosis within −1.5 and +1.5 is considered right-skewed distribution of the data [60].
Table 2. Descriptive statistics on SEO metrics.

4.4. Inferential Statistics

Inferential statistics refers to the procedure of using data analysis to infer attributes of an underlying distribution of probability. Inferential statistics deals with the information acquired from the sample of the population to draw conclusive statements about the entire population [61]. Inferential statistics make use of statistical models to compare the sample data to that of earlier studies. It varies from descriptive statistics in that it allows one to draw conclusions based on extrapolations rather than merely reporting the data that have been seen, like descriptive statistics does [62]. The theoretical framework of this study is given in Figure 3.
Figure 3. Testing research model.

4.4.1. Impact of Group: SEO Techniques on Web Traffic

At this point, 15 different t-tests were performed for each of the 15 SEO techniques: Group 2 using SPSS 27 by IBM software. After, the t-test mean values were observed. The mean value is equal to the sum of all the values in the dataset divided by the total number of values. The “YES” mean reflects the average web traffic from the websites that applied the technique. The “NO” mean reflects the average web traffic from the websites that did not apply the technique. If the “YES” mean is greater than the “NO” mean, the websites that implemented this technique in their source code have greater traffic and the technique is valuable for the website. Another component to consider when comparing Yes or No is the p-value. In statistics, the p-value is the probability of obtaining results at least as extreme as the observed results of a statistical hypothesis test, assuming that the null hypothesis is correct. The p-value is used as an alternative to rejection points to provide the smallest level of significance at which the null hypothesis would be rejected. A smaller p-value means that there is stronger evidence in favor of the alternative hypothesis [63]. A p-value less than 0.05 (typically ≤ 0.05) is statistically significant. A p-value higher than 0.05 (>0.05) is not statistically significant and indicates strong evidence for the null hypothesis [64]. The results from 2 of 15 t-tests are presented in detail below and the rest are presented briefly in Table 3 for the sake of brevity.
Table 3. Independent t-test on the meta description technique.
Impact of meta description technique on the web traffic
H0: There is no difference in website traffic due to the meta description technique (null hypothesis).
H1: There is a difference in website traffic due to the meta description technique.
In this study, independent samples t-test is applied to make a comparison of the mean web traffic between websites that implement the meta description technique (n = 169) and websites that do not implement the meta description technique (n = 74). The t-test was statistically significant, as the mean web traffic of meta-description-technique-implemented websites (M = 1,268,843, SD = 4,712,815) was higher than the meta description technique non-implemented websites (M = 559,083, SD = 1,682,786), t = −0.260, p < 0.05, two-tailed. Therefore, the null hypothesis H0 that there is no difference in website traffic due to the meta description technique can be rejected. Consequently, it can be concluded that websites that implement a meta description technique have greater traffic and, by extension, this technique is valuable for the websites (Table 3).
Impact of image alt technique on web traffic
H0: There is no difference in website traffic due to the image alt technique (null hypothesis).
H1: There is a difference in website traffic due to the image alt technique.
In this study, independent samples t-test is applied to make a comparison of the mean web traffic between websites that implement image alt technique (n = 56) and websites that did not implement the image alt technique (n = 187). The t-test was not statistically significant since the p-value is higher than 0.05 and the mean web traffic of image-alt-technique-implemented websites (M = 1,156,760, SD = 4,532,209) was not significantly lower than the image alt technique non-implemented websites (M = 705,223, SD = 1,564,573), t = 0.732, p > 0.05, two-tailed. Therefore, the null hypothesis H0 that there is no difference in website traffic due to the image alt technique cannot be rejected (Table 4).
Table 4. Independent t-test on the image alt technique.
Table 5 presents the 15 t-tests performed for the 15 corresponding SEO techniques. In the last column of Table 5, SEO techniques that resulted in a positive impact on the traffic are marked as 1, while the techniques that did not have a significant contribution to the traffic are marked as 0.
Table 5. T-test on 15 SEO techniques.

4.4.2. Impact of Group: SEO Metrics on the Web Traffic

Regression is a statistical technique to formulate the model and analyze the relationship between the dependent and independent variables. It aims to study the degree of relationship between two or more variables. This is achieved with the help of hypothesis testing.
In this part of our research, three different tests (one-way analysis of variance (ANOVA), coefficients, and scatterplots) were performed for each of the four metrics: Group 1 using the SPSS 27 by IBM software. The analysis is intended to determine the relationship between metrics and web traffic.
The one-way ANOVA is used to determine whether there are any statistically significant differences between the means of two or more independent (unrelated) groups or not. Unstandardized coefficients indicate how much the dependent variable varies with an independent variable when all other independent variables are held constant. A simple scatterplot can be used to (a) determine whether a relationship is linear, (b) detect outliers, and (c) graphically present a relationship between two continuous variables [65].
The null and alternate hypotheses remain the same as the techniques.
  • H0: (Metric) does not have a positive impact on the web traffic.
  • H1: (Metric) has a positive impact on the web traffic.
The model summary table reports the strength of the relationship between the model and the dependent variable (Table 6).
Table 6. Model summary. Strength of the relationship between the model and the dependent variable.
  • Domain Authority: The regression model summary is explained by 15.9% variability in the web traffic, R2 = 0.159, adjusted R2 = 0.155.
  • Keywords: The regression model summary is explained by 65.2% variability in the web traffic, R2 = 0.652, adjusted R2 = 0.650.
  • Website Speed: The regression model summary is explained by only 0.1% variability in the web traffic, R2 = 0.001, adjusted R2 = −0.003.
  • Backlinks: The regression model summary is explained by 86.9% variability in the web traffic, R2 = 0.869, adjusted R2 = 0.869.
The ANOVA table represents the overall significance of the model, which is determined by the web traffic (Table 7). The F ratio is the ratio of two mean square values. Degrees of freedom (df) of error was calculated by total sample (n = 243)—2 = 241 and total degrees of freedom (df) was calculated by total sample (n = 243)—1 = 242.
Table 7. ANOVA results. Overall significance of the model.
  • Domain Authority: The result shows F (1, 241) = 45.524, p < 0.05, which suggests the model is highly significant. Therefore, the independent variable in this model is related to the dependent variable.
  • Keywords: The result shows F (1, 241) = 450.989, p < 0.05, which suggests the model is highly significant. Therefore, the independent variable in this is related to the dependent variable.
  • Website Speed: The result shows F (1, 239) = 0.277, p > 0.05, which suggests the model is not statistically significant. Therefore, the independent variable in this is not related to the dependent variable.
  • Backlinks: The result shows F (1, 241) = 1604.989, p < 0.05, which suggests the model is highly significant. Therefore, the independent variable in this model is related to the dependent variable.
The coefficient table represents how much the dependent variable is expected to increase when that independent variable increases by one, holding all the other independent variables constant [66] (Table 8). When interpreting the coefficient table, the B variable is important, representing the increase in the dependent variable as soon as the independent increases by one unit. On the other hand, the Beta variable compares the strength of the effect of each individual independent variable to the dependent variable. The higher the value of the beta coefficient, the stronger the effect, based on absolute numbers [67].
Table 8. Coefficient results. How much dependent variable is expected to increase when that independent variable increases by one.
  • Domain authority: When the beta value of domain authority is increased by one unit, web traffic will have increased by 108,135 visitors per month. Therefore, domain authority does not have a positive impact on the web traffic; thus, the null hypothesis is rejected. It can be concluded that the domain authority scores have a significant positive effect on the web traffic.
  • Keywords: When the beta value of keywords is increased by one unit, the web traffic will have increased by 10.24 visitors per month. Therefore, keywords do not have a positive impact on the web traffic; thus, the null hypothesis is rejected. It can be concluded that the keywords have a significant positive effect on the web traffic.
  • Website speed: Since the p value of the following table is greater than 0.05, the impact of speed is not statistically significant. Therefore, the null hypothesis that speed test does not have positive impact on the web traffic is rejected.
  • Backlinks: When the beta value of backlinks is increased by one unit, the web traffic will have increased by 0.399 visitors per month. Therefore, backlinks do not have a positive impact on the web traffic; thus, the null hypothesis is rejected. It can be concluded that the backlinks have a significant positive effect on the web traffic.
Scatterplots are essential for identifying trends and patterns. In a scatterplot, each observation (or point) has two coordinates [68]. The strength of the link between the variables is determined by calculating the correlation coefficient. The plot shows the first variable’s value on the x-axis and the second variable’s value on the y-axis for each data point [69]. The relationship between two quantitative variables is shown in a scatter plot (Figure 4, Figure 5, Figure 6 and Figure 7).
Figure 4. Scatterplot of Domain Authority by web traffic.
Figure 5. Scatterplot of Keywords by web traffic.
Figure 6. Scatterplot of Website Speed by web traffic.
Figure 7. Scatterplot of Backlinks by web traffic.
  • Domain Authority: The following graph indicates the coordinates of domain authority and web traffic. The web traffic has been gradually increased because of the increase in DA scores, indicating a linear relationship between domain authority and the web traffic.
  • Keywords: The following graph indicates that the more keywords used on a website, the more traffic it will get. Therefore, there is a strong positive linear relationship between keywords and web traffic.
  • Website speed: The following shows the relationship between speed test and web traffic. It shows that an increase in speed test does not result in a proportional increase or decrease in web traffic. Therefore, there is a constant and nonlinear relationship between these two variables.
  • Backlinks: Like keywords, the following graph indicates that the more backlinks used on a website, the more traffic it will get. Therefore, it can be concluded that there is a strong positive linear relationship between keywords and web traffic.
We used exactly the same data analysis methodology as the SEO metrics to identify the correlation between backlinks and domain authority. The findings of the regression model found that backlinks have a strong connection with domain authority. When the beta value of backlinks is increased by one unit, on average, the domain authority will have increased by 6.004 units.

4.5. Diagnostic Exploratory Model Development

Results of the t-tests on SEO techniques and ANOVA, coefficients, and scatterplots on SEO metrics present significant relationships with the Web traffic. Thus, we aim to provide additional insights to marketing strategists over the impact of SEO techniques and metrics usage on airline websites, in favor of their digital marketing strategy. Therefore, we deployed fuzzy cognitive mapping (FCM), which will implement selected web metrics from both SEO techniques and metrics depicting the relationships to Web metrics, so as to obtain a more adaptive model for process assessment. Fuzzy cognitive mapping deploys a descriptive and consolidated stochastic classification methodology, mainly used to represent the correlations between airlines’ web metrics [70]. We use the orange color to mark the SEO techniques, green color to mark the SEO metrics, and blue color to mark the airlines’ Web traffic. Blue arrows in Figure 8 illustrate variables with positive relations with Web traffic and black arrows illustrate variables with neutral relations with Web traffic. Development of the FCM was conducted via the Mental Modeler cloud-based application [71].
Figure 8. Using fuzzy cognitive map to locate the correlations between SEO techniques/metrics and Web traffic.
Fuzzy cognitive maps are fuzzy graph structures that represent causal reasoning. Exploratory modelling implementation is vital when digital marketing agencies need to make a decision or a digital marketing plan [72].
As we notice in Figure 8, most of the SEO techniques and metrics positively affect Web traffic. Some of them have a relationship with each other, such as Backlinks and DA.

5. Discussion

The main focus of this paper is the development of a precise methodology, containing pioneering context, aiming to provide useful insights concerning search engine optimization usage and its contribution to the development of airlines’ web traffic as well as airlines’ sustainability. Using our own-developed tool, we retrieve airlines’ data from IATA’s website. For each airline website (243 in total), we performed SEO checks evaluating SEO techniques implemented on their source code. At the same time, we used four SEO third-party APIs to collect data for each airline’s website, such as 12-month web traffic, domain authority, website’s speed, and organic keywords. The average domain authority of the airlines’ websites in our dataset is 56, the average loading speed is 10.5 s, and the average web traffic is 1,052,702 users per month.
We analyzed our data through descriptive analysis and t-tests to find out the contribution of each technique and metric to airlines’ web traffic.
Figure 9 presents the SEO techniques and airlines’ website adoption scores. It was noticed that the majority of the airlines’ websites implemented SEO techniques, such as SEO-friendly URLs by 99.59%, SSL certificates by 97.53%, title tag by 84.36%, and meta description by 69.55%. As mentioned earlier, the AMP and Sitemap SEO technique tests showed that none of the airlines’ websites applied these techniques, although they are suggested by Google on Webmaster Guidelines [2].
Figure 9. SEO techniques adoption percentage.
Conducting descriptive regression analysis and t-tests accordingly in Section 4, we confirm the research hypotheses listed in Section 3.3. Our study underlines that on-page SEO impacts airlines’ website traffic on a great scale. The same applies to off-page SEO, where we confirm that backlinks influence airlines’ web traffic. More specifically, when the beta value of backlinks increases by one unit, the web traffic will be increased by 0.399 units. From the organic keywords point of view, when the beta value of keywords increases by one unit, the airlines’ web traffic will be increased by 10.24 units, respectively. On the contrary, website speed seems not to have any impact on airlines’ web traffic, although mentioned as an important SEO factor in Section 2.1.11. In addition, through our analysis, it was found that there is a strong connection between backlinks and domain authority.
Although each individual SEO technique cannot influence the rankings on its own, the appropriate combination of SEO techniques can lead to greater search results. Expanding this research, future research could focus on how the combination of specific SEO techniques, creating potential groups, can affect the corresponding ranking in search results.

6. Conclusions

COVID-19-era travel restrictions have led airline firms to adopt new digital marketing strategies to survive against their rising competition. Most airlines invested in search engine optimization to increase their websites’ rankings on SERPs and, by extension, increase their organic traffic. This study undertakes to uncover the SEO strategies airline websites follow. Using our own-developed tool, we scan each website’s source code, mining the corresponding SEO techniques used by airlines. At the same time, we use four third-party APIs to collect valuable time-accurate data, such as domain authority, organic keywords, backlinks, and 12-month period web traffic. Meanwhile, through our data analysis, we end up with the most-adopted SEO techniques and how each one of them can affect airlines’ websites’ traffic. We conclude that SEO techniques and metrics have a great correlation with web traffic, which, in turn, increase airlines’ conversions and bookings. Our research identifies the SEO techniques that are most widely used, assuming that to be applied by large companies, such as airlines, they can also deliver results on smaller websites.

Author Contributions

Conceptualization, K.I.R. and N.D.T.; methodology, K.I.R., N.D.T. and D.K.N.; software, K.I.R.; validation, K.I.R. and N.D.T.; formal analysis, K.I.R., N.D.T. and D.K.N.; investigation, K.I.R., N.D.T. and D.K.N.; resources, K.I.R.; data curation, K.I.R.; writing—original draft preparation, K.I.R. and N.D.T.; writing—review and editing, K.I.R. and, N.D.T.; visualization, K.I.R.; supervision, N.D.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

Data supporting reported results can be found at https://github.com/kroumeliotis/airlines-seo (accessed on 8 August 2022).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Matoševic, G.; Dobša, J.; Mladenic, D. Using Machine Learning for Web Page Classification in Search Engine Optimization. Future Internet 2021, 13, 9. [Google Scholar] [CrossRef]
  2. Webmaster Guidelines, Google Search Central, Google Developers. Available online: https://developers.google.com/search/docs/advanced/guidelines/webmaster-guidelines (accessed on 20 August 2022).
  3. Luh, C.-J.; Yang, S.-A.; Huang, T.-L.D. Estimating Google’s search engine ranking function from a search engine optimization perspective. Online Inf. Rev. 2016, 40, 239–255. [Google Scholar] [CrossRef]
  4. Iqbal, M.; Khalid, M. Search Engine Optimization (SEO): A Study of important key factors in achieving a better Search Engine Result Page (SERP) Position. Sukkur IBA J. Comput. Math. Sci. 2022, 6, 1–15. [Google Scholar] [CrossRef]
  5. Ziakis, C.; Vlachopoulou, M.; Kyrkoudis, T.; Karagkiozidou, M. Important Factors for Improving Google Search Rank. Future Internet 2019, 11, 32. [Google Scholar] [CrossRef]
  6. Roumeliotis, K.I.; Tselikas, N.D. An effective SEO techniques and technologies guide-map. J. Web Eng. 2022, 21, 1603–1650. [Google Scholar] [CrossRef]
  7. Patil, V.M.; Patil, A.V. SEO: On-Page + Off-Page Analysis. In Proceedings of the International Conference on Information, Communication, Engineering and Technology (ICICET), Pune, India, 29–31 August 2018. [Google Scholar]
  8. Kumar, G.; Paul, R.K. Literature Review on On-Page & Off-Page SEO for Ranking Purpose. United Int. J. Res. Technol. 2020, 1, 30–34. [Google Scholar]
  9. Wang, F.; Li, Y.; Zhang, Y. An empirical study on the search engine optimization technique and its outcomes. In Proceedings of the 2nd International Conference on Artificial Intelligence, Management Science and Electronic Commerce (AIMSEC), Zhengzhou, China, 8–10 August 2011. [Google Scholar]
  10. (Meta) Title Tags + Title Length Checker [2021 SEO]–Moz. Available online: https://moz.com/learn/seo/title-tag (accessed on 20 August 2022).
  11. Van, T.L.; Minh, D.P.; Le Dinh, T. Identification of paths and parameters in RESTful URLs for the detection of web Attacks. In Proceedings of the 4th NAFOSTED Conference on Information and Computer Science, Hanoi, Vietnam, 24–25 November 2017. [Google Scholar]
  12. Rovira, C.; Codina, L.; Lopezosa, C. Language Bias in the Google Scholar Ranking Algorithm. Future Internet 2021, 13, 31. [Google Scholar] [CrossRef]
  13. Roumeliotis, K.I.; Tselikas, N.D. Search Engine Optimization Techniques: The Story of an Old-Fashioned Website. In Business Intelligence and Modelling; IC-BIM 2019, Paris, France, 12–14 September 2019; Springer Book Series in Business and Economics; Springer: Cham, Switzerland, 2019. [Google Scholar]
  14. URL Structure [2021 SEO]–Moz SEO Learning Center. Available online: https://moz.com/learn/seo/url (accessed on 20 August 2022).
  15. An Image Format for the Web|WebP|Google Developers. Available online: https://developers.google.com/speed/webp (accessed on 5 May 2022).
  16. Zhou, H.; Qin, S.; Liu, J.; Chen, J. Study on Website Search Engine Optimization. In Proceedings of the International Conference on Computer Science and Service System, Nanjing, China, 11–13 August 2012. [Google Scholar]
  17. Roumeliotis, K.I.; Tselikas, N.D. Evaluating Progressive Web App Accessibility for People with Disabilities. Network 2022, 2, 22. [Google Scholar] [CrossRef]
  18. Zhang, S.; Cabage, N. Does SEO Matter? Increasing Classroom Blog Visibility through Search Engine Optimization. In Proceedings of the 46th Hawaii International Conference on System Sciences, Wailea, HI, USA, 7–10 January 2013. [Google Scholar]
  19. All Standards and Drafts-W3C. Available online: https://www.w3.org/TR/ (accessed on 20 August 2022).
  20. Shroff, P.H.; Chaudhary, S.R. Critical rendering path optimizations to reduce the web page loading time. In Proceedings of the 2nd International Conference for Convergence in Technology (I2CT), Mumbai, India, 7–9 April 2017. [Google Scholar]
  21. Tran, H.; Tran, N.; Nguyen, S.; Nguyen, H.; Nguyen, T.N. Recovering Variable Names for Minified Code with Usage Con-texts. In Proceedings of the IEEE/ACM 41st International Conference on Software Engineering (ICSE), Montreal, QC, Canada, 25–31 May 2019. [Google Scholar]
  22. Ma, D. Offering RSS Feeds: Does It Help to Gain Competitive Advantage? In Proceedings of the 42nd Hawaii International Conference on System Sciences, Waikoloa, HI, USA, 5–8 January 2009. [Google Scholar]
  23. Gudivada, V.N.; Rao, D.; Paris, J. Understanding Search-Engine Optimization. Computer 2015, 48, 43–52. [Google Scholar] [CrossRef]
  24. Mobile-Friendly Test Tool. Available online: https://search.google.com/test/mobile-friendly (accessed on 20 August 2022).
  25. MdSaidul, H.; Abeer, A.; Angelika, M.; Prasad, P.W.C.; Amr, E. Comprehensive Search Engine Optimization Model for Commercial Websites: Surgeon’s Website in Sydney. J. Softw. 2018, 13, 43–56. [Google Scholar]
  26. Kaur, S.; Kaur, K.; Kaur, P. An Empirical Performance Evaluation of Universities Website. Int. J. Comput. Appl. 2016, 146, 10–16. [Google Scholar] [CrossRef]
  27. Google Lighthouse. Available online: https://developers.google.com/web/tools/lighthouse (accessed on 20 August 2022).
  28. Pingdom Website Speed Test. Available online: https://tools.pingdom.com/ (accessed on 20 August 2022).
  29. Google Chrome Help. Available online: https://support.google.com/chrome/answer/95617?hl=en (accessed on 20 August 2022).
  30. Jun, B.; Bustamante, F.; Whang, S.; Bischof, Z. AMP up your Mobile Web Experience: Characterizing the Impact of Google’s Accelerated Mobile Project. In Proceedings of the MobiCom ’19: The 25th Annual International Conference on Mobile Com-puting and Networking, Los Cabos, Mexico, 21–25 October 2019. [Google Scholar]
  31. Roumeliotis, K.I.; Tselikas, N.D. Accelerated Mobile Pages: A Comparative Study. In Business Intelligence and Modelling; Springer Book Series in Business and Economics; Springer: Cham, Switzerland, 2019. [Google Scholar]
  32. Start Building Websites with AMP. Available online: https://amp.dev/documentation/ (accessed on 20 August 2022).
  33. Phokeer, A. On the potential of Google AMP to promote local content in developing regions. In Proceedings of the 11th International Conference on Communication Systems & Networks (COMSNETS), Bengaluru, India, 7–11 January 2019; pp. 80–87. [Google Scholar]
  34. Welcome to Schema.org. Available online: https://schema.org/ (accessed on 20 August 2022).
  35. Guha, R.; Brickley, D.; MacBeth, S. Schema.org: Evolution of Structured Data on the Web: Big data makes common schemas even more necessary. Queue 2015, 13, 10–37. [Google Scholar] [CrossRef] [Green Version]
  36. Navarrete, R.; Lujan-Mora, S. Microdata with Schema vocabulary: Improvement search results visualization of open eductional resources. In Proceedings of the 13th Iberian Conference on Information Systems and Technologies (CISTI), Caceres, Spain, 13–16 June 2018. [Google Scholar]
  37. Navarrete, R.; Luján-Mora, S. Use of embedded markup for semantic annotations in e-government and e-education websites. In Proceedings of the Fourth International Conference on eDemocracy & eGovernment (ICEDEG), Quito, Ecuador, 19–21 April 2017. [Google Scholar]
  38. The Open Graph Protocol. Available online: https://ogp.me/ (accessed on 20 August 2022).
  39. Sergey, B.; Lawrence, P. The anatomy of a large-scale hypertextual Web search engine. Comput. Netw. ISDN Syst. 1998, 30, 107–117. [Google Scholar]
  40. Krohn, S. Organic Traffic and Why It Is Important. 2016. Available online: https://www.linkedin.com/pulse/organic-traffic-why-important-krohn-online-traffic-generation (accessed on 20 August 2022).
  41. Jeffers, J. Is Direct Traffic an Indicator of Brand Strength? 2019. Available online: https://www.portent.com/blog/analytics/is-direct-traffic-an-indicator-of-brand-strength.htm (accessed on 20 August 2022).
  42. Gokhan, E.; Coskun, B. The Role of Search Engine Optimization on Keeping the User on the Site. Procedia Comput. Sci. 2014, 36, 335–342. [Google Scholar]
  43. SEO in eCommerce: Everything You Need to Improve 2019. Available online: https://www.cloudoe.gr/en/article/seo-in-ecommerce-everything-you-need-to-improve (accessed on 20 August 2022).
  44. Organic Keywords: SEO for Beginners. Available online: https://www.semrush.com/blog/organic-keywords/ (accessed on 20 August 2022).
  45. Mozscape API. Available online: https://moz.com/products/api (accessed on 20 August 2022).
  46. Saura, J.R.; Palos-Sánchez, P.; Cerdá Suárez, L.M. Understanding the Digital Marketing Environment with KPIs and Web Analytics. Future Internet 2017, 9, 76. [Google Scholar] [CrossRef]
  47. Sakas, D.P.; Giannakopoulos, N.T. Harvesting Crowdsourcing Platforms’ Traffic in Favour of Air Forwarders’ Brand Name and Sustainability. Sustainability 2021, 13, 8222. [Google Scholar] [CrossRef]
  48. What is Organic Search? Everything You Need to Know. 2021. Available online: https://ahrefs.com/blog/organic-search/ (accessed on 20 August 2022).
  49. Current Airline Members. Available online: https://www.iata.org/en/about/members/airline-list/ (accessed on 20 August 2022).
  50. PHP-Based Software to Retrieve Data and Process Airlines’ SEO Data. Available online: https://github.com/kroumeliotis/airlines-seo (accessed on 20 August 2022).
  51. Command Line Tool and Library for Transferring Data with URLs. Available online: https://curl.se/ (accessed on 20 August 2022).
  52. Mobile-Friendly Test Tool. Available online: https://support.google.com/webmasters/answer/6352293?hl=en (accessed on 20 August 2022).
  53. Pagespeedapi Runpagespeed. Available online: https://developers.google.com/speed/docs/insights/v4/reference/pagespeedapi/runpagespeed (accessed on 20 August 2022).
  54. Ubersuggest. Available online: https://neilpatel.com/ubersuggest/ (accessed on 20 August 2022).
  55. IBM SPSS 27. Available online: https://www.ibm.com/support/pages/downloading-ibm-spss-statistics-27 (accessed on 20 August 2022).
  56. Airlines Dataset Excel. Available online: https://github.com/kroumeliotis/airlines-seo/blob/main/Airlines.xls (accessed on 20 August 2022).
  57. Ahrens, J.P. Visualization and Data Analysis at the Exascale. Web 2011 [Technical Report]. Available online: https://doi.org/10.2172/1011053 (accessed on 20 August 2022).
  58. Zikmund, W.G.; D’Alessandro, S.; Winzar, H.; Lowe, B.; Babin, B. Marketing Research: Asia-Pacific Edition, 4th ed.; Victoria Cengage Learning: South Melbourne, VIC, Australia, 2017; ISBN 9780170369824. [Google Scholar]
  59. Shrestha, S.K. Brand Loyalty of Baby Diaper Products. Manag. Dyn. 2018, 21, 79–88. [Google Scholar] [CrossRef]
  60. Tabachnick, B.G.; Fidell, L.S.; Ullman, J.B. Using Multivariate Statistics, 7th ed.; Pearson Boston: Boston, MA, USA, 2019; ISBN 9780135350904. [Google Scholar]
  61. Dukkipati, P.R.V. Probability and Statistics for Scientists and Engineers; New Academic Science Ltd.: London, UK, 2011; Volume 27, p. 1. ISBN 978-1906574833. [Google Scholar]
  62. Asadoorian, M.O.; Kantarelis, D. Essentials of Inferential Statistics, 4th ed.; University Press of America: Lanham, MD, USA, 2004; ISBN 978-0761830306. [Google Scholar]
  63. P-Value. Available online: https://www.investopedia.com/terms/p/p-value.asp (accessed on 20 August 2022).
  64. What a P-Value Tells You about Statistical Significance. Available online: https://www.simplypsychology.org/p-value.html (accessed on 20 August 2022).
  65. The Ultimate IBM SPSS Statistics Guides. Available online: https://statistics.laerd.com (accessed on 20 August 2022).
  66. Clogg, C.C.; Petkova, E.; Haritou, A. Statistical methods for comparing regression coefficients between models. Am. J. Sociol. 1995, 100, 1261–1293. [Google Scholar] [CrossRef]
  67. Standardized Beta Coefficient: Definition & Example. Available online: https://www.statisticshowto.com/standardized-beta-coefficient/ (accessed on 20 August 2022).
  68. Ceci, M.; Hollmén, J.; Todorovski, L.; Vens, C.; Džeroski, S. (Eds.) Machine Learning and Knowledge Discovery in Databases; ECML PKDD 2017; Lecture Notes in Computer Science; Springer: Chem, Switzerland, 2017. [Google Scholar]
  69. Keim, D.A.; Hao, M.C.; Dayal, U.; Janetzko, H.; Bak, P. Generalized Scatter Plots. Inf. Vis. 2010, 9, 301–311. [Google Scholar] [CrossRef]
  70. Sakas, D.P.; Giannakopoulos, N.T. Big Data Contribution in Desktop and Mobile Devices Comparison, Regarding Airlines’ Digital Brand Name Effect. Big Data Cogn. Comput. 2021, 5, 48. [Google Scholar] [CrossRef]
  71. Gray, S.A.; Gray, S.; Cox, L.J.; Henly-Shepard, S. Mental Modeler: A Fuzzy-Logic Cognitive Mapping Modeling Tool for Adaptive Environmental Management. In Proceedings of the 46th Hawaii International Conference on System Sciences, Wailea, HI, USA, 7–10 January 2013. [Google Scholar]
  72. Salmeron, J.L. Supporting Decision Makers with Fuzzy Cognitive Maps. Res.-Technol. Manag. 2009, 52, 53–59. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.