You are currently viewing a new version of our website. To view the old version click .
Land
  • Article
  • Open Access

24 July 2022

An Intelligent Homogeneous Model Based on an Enhanced Weighted Kernel Self-Organizing Map for Forecasting House Prices

and
1
Department of Information Management, National Yunlin University of Science and Technology, Douliou 640301, Taiwan
2
Department of Business Administration, I-Shou University, Kaohsiung City 840301, Taiwan
*
Author to whom correspondence should be addressed.
This article belongs to the Section Land Innovations – Data and Machine Learning

Abstract

Accurately forecasting housing prices will enable investors to attain profits, and it can provide information to stakeholders that housing prices in the community are falling, stabilizing, or rising. Previous studies on housing price forecasting mostly used hedonic pricing and weighted regression methods, which led to the lack of consideration of the nonlinear relationship model and its explanatory power. Furthermore, the attribute data of housing price forecasts are a heterogeneous study, and they are difficult to forecast accurately. Therefore, this study proposes an intelligent homogeneous model based on an enhanced weighted kernel self-organizing map (EW-KSOM) for forecasting house prices; that is, this study proposes an EW-KSOM algorithm to cluster the collected data and then applies random forest, extra tree, multilayer perception, and support vector regression to forecast the house prices of full, district, and apartment complex data. In the experimental comparison, we compare the performance of the proposed enhanced weighted kernel self-organizing map with the listing clustering methods. The results show that the best forecast algorithm is the combined EW-KSOM and random forest under the root mean square error and root-relative square error, and the proposed method can effectively improve the forecast capability of housing prices and understand the influencing factors of housing prices in full and important districts. Furthermore, we obtain that the top five key factors influencing house prices are transferred land area, house age, building transfer total area, population percentage, and the total number of floors. Lastly, the research results can provide references for investors and related organizations.

1. Introduction

Housing is a basic human need, and everyone needs a place to live. Real estate is both a consumer product and an investment product [1], and it also has a significant impact on macroeconomics. Housing allocation is increasingly important for sustainable economic development and social stability because it is related to people’s lives, and the demand for residential land is also an important issue in real estate [2]. House price appraisal is one of the most important transaction decisions that affects the national real estate policy [3]. Hence, the stakeholders use various information about a house to measure the value-to-price ratio of it. Housing prices are affected by many factors, including the market economy, industrial policies, land price, income levels in different regions, living environment, public facilities, and the needs of the people [4]. Furthermore, the changes in housing prices cover various influencing factors, and important factors can be obtained through data analysis. Stakeholders understand how real estate buyers and sellers evaluate housing prices and produce better evaluation indicators, and then the government can propose corresponding policies to maintain the stability of the real estate market, including adjusting the land supply structure and buying houses.
Many people have expressed widespread concern about housing prices, and an effective tool for forecasting future housing price fluctuations is needed. Various methods have been applied to forecast housing prices. Multivariate statistical models can consider many factors that affect housing prices. However, they require some assumptions on data distribution, and the performance of these models is easily limited by sample size and data integrity. Compared with statistical models, intelligent forecast models are now becoming mainstream because they can identify the nonlinear relationship between variables and provide higher accuracy and deeper interpretation of the behavior of market interaction. The cost and efficiency of intelligent learning from external data and economic signals are no longer an obstacle, and more and more stakeholders are adopting intelligent predictions to forecast housing prices.
The data of housing price forecasts are part of a heterogeneous study. The most obvious heterogeneity of housing prices is that relatively low-priced properties are more likely to find buyers than those with higher prices. For an inexperienced agent, for a price higher than the inferred price, the price increase leads to a greater decrease in the probability of sales compared with a price lower than the inferred price. For any price, the more experienced agents are more likely to sell. It is very difficult to forecast housing prices correctly. However, we can use a large amount of collected data to reduce the forecast error by using the method of homogenizing the data, such as how a clustering algorithm can group heterogeneous data into similar clusters. Therefore, this research proposes an enhanced weighted kernel self-organizing map (EW-KSOM) to cluster the collected attribute data of real estate transactions and to forecast housing prices by using four intelligent models. The following are the contributions of this research:
  • Collect data from the real price registration of real estate information in Taiwan and the population statistics data from the Civil Affairs Bureau of the Taichung City government. The originally collected data have a total of 16,617 records with 26 attributes. It is also found from the relevant literature that the collected data have 12 important attributes that affect the changes in housing prices.
  • Use the proposed EW-KSOM to cluster the collected data and then compare it with the listing clustering algorithms.
  • Apply random forest, extra tree, multilayer perception regression, and support vector regression to forecast the house prices of full and important districts, and we compare the performance of the proposed intelligent homogenous model with the listing methods.
  • The results can be used as references for home buyers, real estate companies, and relevant government agencies.
The remaining sections of this paper are organized as follows. Section 2 introduces the related work, including the factors and forecasts of real estate prices, clustering techniques, and intelligent forecast methods. The proposed method and computational steps are described in Section 3. Section 4 explains the dataset and experimental environment. Section 5 illustrates the experimental results and comparisons. Section 6 presents the discussions, and Section 7 presents the conclusion.

3. Proposed Model

This section introduces the proposed EW-KSOM and intelligent homogeneous model based on the EW-KSOM algorithm.

3.1. EW-KSOM Algorithm

Hung [40] applied k-means to cluster the heterogeneous data into the homogeneous data and presented a homogeneous attribute intelligent model for forecasting house prices. MacDonald and Fyfe [27] proposed a k-means-based KSOM, where each mean is described as a weighted sum of the mapping function. Based on the gradient descent method, Andras [29] proposed a kernel-Kohonen network, where the input and weights are transformed into the high-dimensional kernel Hilbert space. The Euclidean distance in the high-dimensional space is the updated weight. Another KSOM is also based on the normalized gradient descent method [28], in which the algorithm aims to minimize the mean square error of the input and the corresponding prototype in the mapping space. The SOM and KSOM are not necessarily assigned the number of clusters in advance. After training the weight coefficients by competitive learning, the center of each cluster is automatically obtained. Furthermore, Lau et al. [41] stated that these kernel extensions of the original SOM are to minimize the energy function, and a KSOM can provide slightly better performance than other methods. To enhance the clustering capability and reduce heterogeneity, this study proposes an EW-KSOM algorithm. First, we set the objective and enhanced the weighted function. Let i(X) be the index of the winning neuron with respect to the input X. Hence, the objective function is
i ( X ) = arg min E w ( j ) | | X W j | | 2
The weighted function is Ew(j) = exp(−‖XWj2/σ2), and the parameter σ = [0.1,1].
Second, the EW-KSOM algorithm is as follows:
1.
Define the input and output layer.
Assuming that an input dataset is X’ = [x1, x2, x3xn], which is an n-dimensional vector, and the number of neurons in the input layer is n. The neurons in the output layer are arranged in a matrix in a two-dimensional space, and each neuron has a weight vector.
2.
Normalize the input vector and weight:
X = X / | | X | |
w j = W j / | | W | |
where ‖X‖ and ‖W‖ are the Euclidean norms of the input and weight vector, respectively.
3.
Input the data into the neural network.
Use the dot product to calculate the normalized data and weight vector. The output neuron with the largest dot product value wins the competition.
4.
Update the neurons in the topological neighborhood of the winning neuron.
The updating weight of the winning neuron is
w j ( t + 1 ) = w j ( t ) + E w ( j ) η ( j ) h j J ( w j )
where J(wj)= ‖φ(X’) − φ(wj)‖2 = 2wj − 2φ(X’), φ is a kernel function, ∇ is gradient descent, η(j) is the learning rate function of the training step j, and the topological distance from the winning neuron is defined.
Furthermore, the winning neighborhood function is
h j = exp ( d 2 δ 2 ( j ) ) ,
where δ(t) = d × (jmaxj)/jmax is the width of the topological neighborhood function, d = ‖ri(x)rj2, rj is the location of neuron j, and ri(x) is the location of winning neuron.
5.
Update the learning rate.
Change the different learning rate η and the topological neighborhood N, and η(j) = 1/j generally takes the reciprocal of the number of iterations. When N increases, the time and the distance decrease.
6.
Determine whether to converge.
If the learning rate ηηmin or reaches the set number of iterations, then the algorithm ends.

3.2. Intelligent Homogeneous Model Based on the EW-KSOM Algorithm

The data of housing price forecasts are a heterogeneous study, and it is difficult to accurately forecast housing prices because there are many different data sources, including the category and numeric value. This study referred to Hung [40], who applied k-mean to cluster the heterogeneous data into the homogeneous data for improving housing price forecasting. Therefore, we proposed an EW-KSOM algorithm to cluster the collected data and named each cluster. Furthermore, we applied RF, ET, MLPR, and SVR to forecast the house prices of full and important districts, and we compared the performance of EW-KSOM with the listing clustering methods. The procedure of the proposed intelligent homogeneous model based on EW-KSOM for forecasting house prices is shown in Figure 1. The procedure contained five steps: data collection, data preprocessing, the proposed EW-KSOM algorithm, forecasting house prices, and evaluation and comparison. Steps 1 and 2 are described in Section 4, and steps 3 and 4 are explained in this section. Here, we only introduce the root mean square error (RMSE) and root relative squared error (RRSE) of step 5 as follows.
Figure 1. Procedure of intelligent homogeneous model based on EW-KSOM for forecasting house prices.
1.
RMSE
RMSE represents the sample standard error of the difference between the forecasted value and the observed value [42]. The RMSE is mainly used to aggregate the size of the error in the forecast with a value to show its forecast ability. The RMSE is a good measure of accuracy, but because it is related to the numerical range, it can only be used to compare the forecasted error of a specific variable between different models. The RMSE is defined as shown in Equation (6):
RMSE = i = 1 n | T ( i ) F ( i ) | 2 n
where T(i) represents the actual house prices, F(i) denotes the forecasted house prices, and n is the amount of data.
2.
RRSE
RRSE [43] takes the total square error and normalizes it by dividing the total square error of the actual values. By taking the square root of the relative square error, the error can be reduced to the same dimension as the forecasted amount. It is defined as shown in Equation (7):
i = 1 n | T ( i ) F ( i ) | 2 / i = 1 n | T ( i ) T ( i ) ¯ | 2
where T ( i ) ¯ = i = 1 n T ( i ) / n .

4. Dataset and Experimental Environment

Taichung City has a population of about 2.82 million and is the second-most populous city in Taiwan. Taichung City has 29 administrative districts, but the Heping and Daan districts have no house transaction records. Therefore, the collected data contained 27 administrative districts of Taichung City from the real price registration of real estate information in Taiwan (https://pip.moi.gov.tw/V3/Z/SCRZ0211.aspx?Mode=PN02 (accessed on 1 March 2021)), and the population statistics data were from the Civil Affairs Bureau of the Taichung City government (https://demographics.taichung.gov.tw/Demographic/index.html?s=12685860 (accessed on 1 March 2021)). The original data were collected from real estate registration data from 2019 to 2020, and we selected the residential transaction data with 26 attributes and 16,617 records. This study used the population percentage of 29 administrative districts and the average house prices (TWD/m2) in Taichung City to express the impact of community environment and economic development on housing prices, as shown in Table 2. After preprocessing, the retained attributes included the district population percentage, transfer floor, total floor, building construction type, house age, transferred land area, total area square meters of building transfer, bedroom, living room, bathroom, compartmented, management, and unit price per square meter. Then, we coded these categorical variables as digital for clustering and forecasting, and the descriptions and values of the 12 independent variables and 1 dependent variable (house prices) are shown in Table 3.
Table 2. The 29 administrative districts and average house prices in Taichung City.
Table 3. Operational definitions of the dependent and independent variables.
To visualize the data, we used the Geocoding API of the Google Maps service to obtain the latitude and longitude based on the locations (address) of the transaction houses, and then we applied the Choropleth and HeatMap of map visualizations in the Python tool to generate visual maps. Figure 2 only shows the 27 administrative districts with transaction records, and it is a map visualization of the average house prices/m2 for each district. The deeper color districts denote higher house prices. From Table 2 and Figure 2, we observe that the highest housing prices in the Taiping district were due to the new expressway (Taiwan Route 74), with convenient transportation close to the urban area, a large land area, and new residential buildings.
Figure 2. Average house price/m2 for each district.
The experimental environment used Python 3.7.4 to compute all results and visualize data on an Intel i7-7700 with a Windows 10 operating system and a 3.6-GHz CPU. Four forecasting algorithms (RF, ET, MLPR, and SVR) were used to forecast the housing prices. The parameter settings of the clustering and forecast algorithms are shown in Table 4.
Table 4. Algorithms and their parameter settings.

5. Experimental Results and Comparisons

This section illustrates the proposed EW-KSOM algorithm to cluster the collected data and then applies RF, ET, MLPR, and SVR to forecast house prices. Furthermore, we compare the performance of the proposed enhanced weighted kernel self-organizing map with KM, SOM, KKM, KSOM, and no clustering algorithms. The experimental data was partitioned into 66% for training and 34% for testing, and the average of repeated implementation 100 times was taken to present their results for finding the best algorithm. After data preprocessing, we followed steps 3–5 of the proposed procedure in Figure 1 of Section 3.2. First, this experiment used the SOM family algorithms because the SOM, KSOM, and EW-KSOM do not need to assign the number of clusters in advance, and then the best number of clusters was four from the SOM family algorithms. Second, we implemented KM family algorithms based on the best number of clusters k = 4, and then we applied RF, ET, MLPR, and SVR to forecast house prices. The comparative results of the full data are shown in Table 5, and it shows that the proposed EW-KSOM algorithm combined with RF was the best forecast result in terms of the RMSE and RRSE metrics.
Table 5. Forecast results of different clustering methods.
To further compare the performance of the proposed EW-KSOM algorithm, we chose the Beitun and Xitun districts, having the largest population and transaction volumes, to forecast house prices. In addition, the Xitun district is the location of the Taichung City government and council, being the most prosperous commercial area. From the comparative results of the full data in Table 5, we only used RF to forecast the house prices of the Beitun and Xitun districts. The results are shown in Table 6, and they show that the forecast capability of the proposed EW-KSOM was better than the listing methods (including no clustering) in terms of the RMSE and RRSE metrics. Furthermore, KKM was the worst clustering method, with results poorer than no clustering.
Table 6. District forecast results of proposed clustering and no clustering.

6. Discussions

From the experimental results, the proposed EW-KSOM clustering algorithm was better than the listing algorithms based on the minimum RMSE and RRSE metrics. However, there are some findings and discussions, which are listed below:
1.
Key Factors
This study collected 13 factors (as Table 3) to forecast house prices, among which the population percentage was applied to express the importance of the geographical location of the region and its urban development. After the experimental results, we used the RF forecast model to rank the factors’ importance of full data, as shown in Figure 3. We observed that the top four positive factors are the following: transferred land area > house age > population percentage > total number of floors. The top four negative factors are as follows: building transfer total area > bedroom > living room > compartment. Our results are consistent with some literature: (1) housing prices in geographical locations with a large population would rise sharply [3,11], (2) the values of residential buildings on higher floors were relatively higher and has a positive impact [44], and (3) the house area was highly correlated with the house price [12]. Furthermore, we used apartment complex data (building construction type 5: all floors with the elevator, as in Table 3) from the collected data and ranked the factors’ importance of the apartment complex data, as shown in Figure 4. We observed that the four positive factors were the following: population percentage > building transfer total area > total number of floors > transfer floor. The top three negative factors were the following: house age > transferred land area > bedroom.
Figure 3. Important factors of influencing house prices (full data). Note: the blue bars denote positive factors, and the red bars mean negative factors.
Figure 4. Important factors of influencing house prices (apartment complex data). Note: The blue bars denote positive factors, and the red bars mean negative factors.
2.
Homogeneous Effect
To improve forecast capabilities, this study proposes an intelligent homogeneous model based on EW-KSOM clustering to forecast house prices. The results show that the proposed EW-KSOM algorithm combined with RF had the best forecast results in the full and district data, as shown in Table 5 and Table 6. Furthermore, Kumar et al. [6] used k-means clustering to identify the locations of real estate investment based on user preferences and selected condominium and apartment complex data to rank the influencing factors of an apartment complex. Therefore, we also selected apartment complex data from the collected data and applied the proposed intelligent homogeneous model based on EW-KSOM clustering to forecast house prices. Here, we only compared the proposed algorithm with KSOM because the proposed EW-KSOM algorithm is a modified form of KSOM clustering. The results show that the proposed EW-KSOM algorithm combined with RF produced the best forecast result, as shown in Table 7. In summary, the clustering allowed the data to be homogeneous, the forecast effect was better than no clustering, and the proposed method was better than the listing methods.
Table 7. District comparison results of proposed clustering with no clustering.
3.
Ensemble Forecast Algorithm
From the experimental results, as shown in Table 5, RF was the best forecast algorithm compared with the listing methods because RF is an ensemble learning and nonlinear algorithm to reduce overfitting, and it is very stable for categorical and continuous data [30]. Furthermore, RF in regression is used to average the results of all independent trees, and the majority voting method is used to obtain the results in classification [31]. RF can rank the importance of the variables from the most relevant to the least relevant for adding value to the attribute selection in data analysis [31,32]. In the experimental data of the full, district, and apartment complexes, the results show that the proposed EW-KSOM combined with RF produced the best forecast result.

7. Conclusions

Housing is the basic condition of people’s lives. When buying a house, one must have a variety of housing information, and the most important information that affects the purchase is the house’s price [6]. The changing trend of housing prices will be considered by a number of factors. This study collected 13 factors from the relevant literature and used the collected data of the 27 administrative districts of Taichung City to forecast house prices. Previous housing price forecasting mostly used hedonic pricing and weighted regression methods, which led to a lack of consideration of the nonlinear relationship model and its explanatory power. This research used the proposed EW-KSOM algorithm to homogenize the collected data, and then we applied an ensemble learning RF to forecast house prices. The results show that the proposed intelligent homogeneous model had better forecast capabilities, as shown in Table 5, Table 6 and Table 7. From the research results, the following summarizes the significance and contributions of this study:
  • Finding the key factors of full, district, and apartment complex data using ensemble learning RF;
  • Proposing EW-KSOM to cluster the full and apartment complex data and then compare their forecast capabilities with the listing methods;
  • Applying RF, ET, MLPR, and SVR to forecast the house prices of full, district, and apartment complex data and then comparing the performance of the proposed intelligent homogenous model with the listing clustering methods;
  • Providing the results to home buyers, real estate companies, and relevant government agencies for reference.
The limitation of this study is that the housing price factor is related to the sample used, and the important factors derived from the study will vary with different community samples. Therefore, we used two different datasets to describe their positive and negative factors: one was the full dataset, and the other was the complex apartment dataset. Furthermore, we refer to the work of Kazak et al. [45] to describe the influencing housing price factors. In the full dataset with 15,955 samples, the results show that the four important positive factors are transferred land area, house age, population percentage, and the total number of floors, while the four negative factors are the building transfer total area, bedroom, living room, and compartment, as shown in Figure 3. In terms of the complex apartment dataset with 6368 samples, the four positive factors are the population percentage, building transfer total area, total number of floors, and transfer floor, while the three negative factors are the house age, transferred land area, and bedroom, as shown in Figure 4.
In future work, we can collect a larger variety of city data to find the significant factors and enhance forecasting capabilities, such as those of the top six Taiwan cities. Furthermore, we can use text mining to build a knowledge graph of buyer behaviors.

Author Contributions

Conceptualization, C.-H.C. and M.-C.T.; methodology, C.-H.C. and M.-C.T.; software, M.-C.T.; validation, C.-H.C. and M.-C.T.; formal analysis, M.-C.T.; investigation, M.-C.T.; resources, C.-H.C.; data curation, C.-H.C.; writing—original draft preparation, C.-H.C.; writing—review and editing, M.-C.T.; visualization, M.-C.T.; supervision, C.-H.C. and M.-C.T.; project administration, M.-C.T.; funding acquisition, C.-H.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The collected data are open data from the real price registration of real estate information in Taiwan (https://pip.moi.gov.tw/V3/Z/SCRZ0211.aspx?Mode=PN02 (accessed on 1 March 2021)) and from the Civil Affairs Bureau of the Taichung City government (https://demographics.taichung.gov.tw/Demographic/index.html?s=12685860 (accessed on 1 March 2021)).

Acknowledgments

The authors appreciate Duan-Zi Hung for his valuable insights and suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Das, P.; Füss, R.; Hanle, B.; Russ, I.N. The cross-over effect of irrational sentiments in housing, commercial property, and stock markets. J. Bank. Financ. 2020, 114, 105799. [Google Scholar] [CrossRef]
  2. Li, Y.; Zhu, D.; Zhao, J.; Zheng, X.; Zhang, L. Effect of the housing purchase restriction policy on the Real Estate Market: Evidence from a typical suburb of Beijing, China. Land Use Policy 2020, 94, 104528. [Google Scholar] [CrossRef]
  3. Park, B.; Bae, J.K. Using machine learning algorithms for housing price prediction: The case of Fairfax County, Virginia housing data. Expert Syst. Appl. 2015, 42, 2928–2934. [Google Scholar] [CrossRef]
  4. Liu, L.; Wu, L. Predicting housing prices in China based on modified Holt’s exponential smoothing incorporating whale optimization algorithm. Socio Econ. Plan. Sci. 2020, 72, 100916. [Google Scholar] [CrossRef]
  5. Zhuge, C.; Shao, C.; Gao, J.; Dong, C.; Zhang, H. Agent-based joint model of residential location choice and real estate price for land use and transport model. Comput. Environ. Urban Syst. 2016, 57, 93–105. [Google Scholar] [CrossRef]
  6. Kumar, S.; Talasila, V.; Pasumarthy, R. A novel architecture to identify locations for Real Estate Investment. Int. J. Inf. Manag. 2019, 56, 102012. [Google Scholar] [CrossRef]
  7. Rosen, S. Hedonic prices and implicit markets: Product differentiation in pure competition. J. Political Econ. 1974, 82, 34–55. [Google Scholar] [CrossRef]
  8. Taltavull de La Paz, P.; López, E.; Juárez, F. Ripple effect on housing prices. Evidence from tourist markets in Alicante, Spain. Int. J. Strateg. Prop. Manag. 2017, 21, 1–14. [Google Scholar] [CrossRef]
  9. Truong, Q.; Nguyen, M.; Dang, H.; Mei, B. Housing Price Prediction via Improved Machine Learning Techniques. Procedia Comput. Sci. 2020, 174, 433–442. [Google Scholar] [CrossRef]
  10. Čeh, M.; Kilibarda, M.; Lisec, A.; Bajat, B. Estimating the performance of random forest versus multiple regression for predicting prices of the apartments. ISPRS Int. J. Geo Inf. 2018, 7, 168. [Google Scholar] [CrossRef] [Green Version]
  11. Rico-Juan, J.R.; Taltavull de La Paz, P. Machine learning with explainability or spatial hedonics tools? An analysis of the asking prices in the housing market in Alicante, Spain. Expert Syst. Appl. 2021, 171, 114590. [Google Scholar] [CrossRef]
  12. Kang, Y.; Zhang, F.; Peng, W.; Gao, S.; Rao, J.; Duarte, F.; Ratti, C. Understanding house price appreciation using multi-source big geo-data and machine learning. Land Use Policy 2020, 111, 104919. [Google Scholar] [CrossRef]
  13. D’Acci, L. Quality of urban area, distance from city centre, and housing value. Case study on real estate values in Turin. Cities 2019, 91, 71–92. [Google Scholar] [CrossRef]
  14. Debrezion, G.; Pels, E.; Rietveld, P. The impact of rail transport on real estate prices: An empirical analysis of the Dutch housing market. Urban Stud. 2011, 48, 997–1015. [Google Scholar] [CrossRef]
  15. Shih, M.; Chiang, Y.H.; Chang, H.B. Where does floating TDR land? An analysis of location attributes in real estate development in Taiwan. Land Use Policy 2019, 82, 832–840. [Google Scholar] [CrossRef]
  16. Tubadji, A.; Nijkamp, P. Green Online vs Green Offline preferences on local public goods trade-offs and house prices. Socio Econ. Plan. Sci. 2017, 58, 72–86. [Google Scholar] [CrossRef]
  17. Hair, J.F.; Anderson, R.E.; Black, W.C. Multivariate Data Analysis, 7th ed.; Pearson: Harlow, UK, 2014. [Google Scholar]
  18. Sinaga, K.P.; Yang, M.-S. Unsupervised K-Means Clustering Algorithm. IEEE Access 2020, 8, 80716–80727. [Google Scholar] [CrossRef]
  19. Dhillon, I.S.; Guan, Y.; Kulis, B. Kernel k-means: Spectral clustering and normalized cuts. In Proceedings of the tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Seattle, WA, USA, 22–25 August 2004; pp. 551–556. [Google Scholar] [CrossRef]
  20. Kohonen, T. Self-Organisation and Associative Memory; Springer: Berlin/Heidelberg, Germany, 1989. [Google Scholar]
  21. Zhu, C.; Idemudia, C.U.; Feng, W. Improved logistic regression model for diabetes prediction by integrating PCA and K-means techniques. Inform. Med. Unlocked 2019, 17, 100179. [Google Scholar] [CrossRef]
  22. Forgy, E.W. Cluster analysis of multivariate data: Efficiency versus interpretability of classifications. Biometrics 1965, 21, 768–769. [Google Scholar]
  23. Ketchen, D.J.; Shook, C.L. The application of cluster analysis in strategic management research: An analysis and critique. Strateg. Manag. J. 1996, 17, 441–458. [Google Scholar] [CrossRef]
  24. Rousseeuw, P.J. Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 1987, 20, 53–65. [Google Scholar] [CrossRef] [Green Version]
  25. Kim, D.-W.; Lee, K.Y.; Lee, D.; Lee, K.H. Evaluation of the performance of clustering algorithms in kernel-induced feature space. Pattern Recognit. 2005, 38, 607–611. [Google Scholar] [CrossRef]
  26. Wang, S.; Gittens, A.; Mahoney, M.W. Scalable kernel k-means clustering with Nyström approximation: Relative-error bounds. J. Mach. Learn. Res. 2019, 20, 431–479. [Google Scholar]
  27. MacDonald, D.; Fyfe, C. The kernel self-organising map. In Proceedings of the KES’2000. Fourth International Conference on Knowledge-Based Intelligent Engineering Systems and Allied Technologies. Proceedings (Cat. No.00TH8516), Brighton, UK, 30 August–1 September 2000; Volume 1, pp. 317–320. [Google Scholar] [CrossRef]
  28. Pan, Z.S.; Chen, S.C.; Zhang, D.Q. A kernel-base SOM classifier in input space. Acta Electronica Sinica 2004, 32, 227–231. (In Chinese) [Google Scholar]
  29. Andras, P. Kernel–Kohonen networks. Int. J. Neural Syst. 2002, 12, 117–135. [Google Scholar] [CrossRef]
  30. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
  31. Callens, A.; Morichon, D.; Abadie, S.; Delpey, M.; Liquet, B. Using Random forest and Gradient boosting trees to improve wave forecast at a specific location. Appl. Ocean. Res. 2020, 104, 102339. [Google Scholar] [CrossRef]
  32. Simsekler, M.C.E.; Qazi, A.; Alalami, M.A.; Ellahham, S.; Ozonoff, A. Evaluation of patient safety culture using a random forest algorithm. Reliab. Eng. Syst. Saf. 2020, 204, 107186. [Google Scholar] [CrossRef]
  33. Geurts, P.; Ernst, D.; Wehenkel, L. Extremely randomized trees. Mach. Learn. 2006, 63, 3–42. [Google Scholar] [CrossRef] [Green Version]
  34. Sagi, O.; Rokach, L. Explainable decision forest: Transforming a decision forest into an interpretable tree. Inf. Fusion 2020, 61, 124–138. [Google Scholar] [CrossRef]
  35. Relich, M.; Świć, A. Parametric Estimation and Constraint Programming-Based Planning and Simulation of Production Cost of a New Product. Appl. Sci. 2020, 10, 6330. [Google Scholar] [CrossRef]
  36. Rosenblatt, F. Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms; Spartan Books: Washington, DC, USA, 1961. [Google Scholar]
  37. Bin, J.; Gardiner, B.; Li, E.; Liu, Z. Multi-source urban data fusion for property value assessment: A case study in Philadelphia. Neurocomputing 2020, 404, 70–83. [Google Scholar] [CrossRef]
  38. Cortes, C.; Vapnik, V. Support-vector networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
  39. Kamara, A.F.; Chen, E.; Liu, Q.; Pan, Z. A hybrid neural network for predicting Days on Market a measure of liquidity in real estate industry. Knowl. Based Syst. 2020, 208, 106417. [Google Scholar] [CrossRef]
  40. Hung, D.Z. A Homogeneous-Attribute Intelligent Model for Forecasting House Prices: Taking Taichung City as an Example. Master’s Thesis, Department of Information Management, National Yunlin University of Science and Technology, Yunlin, Taiwan, 2021. [Google Scholar]
  41. Lau, K.W.; Yin, H.; Hubbard, S. Kernel self-organising maps for classification. Neurocomputing 2006, 69, 2033–2040. [Google Scholar] [CrossRef]
  42. Hyndman, R.J.; Koehler, A.B. Another look at measures of forecast accuracy. Int. J. Forecast. 2006, 22, 679–688. [Google Scholar] [CrossRef] [Green Version]
  43. Rodrigues, É.O.; Pinheiro, V.H.A.; Liatsis, P.; Conci, A. Machine learning in the prediction of cardiac epicardial and mediastinal fat volumes. Comput. Biol. Med. 2017, 89, 520–529. [Google Scholar] [CrossRef]
  44. Yilmazer, S.; Kocaman, S. A mass appraisal assessment study using machine learning based on multiple regression and random forest. Land Use Policy 2020, 99, 104889. [Google Scholar] [CrossRef]
  45. Kazak, J.; Dziezyc, H.; Forys, I.; Szewranski, S. Indicator-based analysis of socially sensitive and territorially sustainable development in relation to household energy consumption. Eng. Rural Dev. 2018, 17, 1653–1661. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.