HELIOS Approach: Utilizing AI and LLM for Enhanced Homogeneity Identification in Real Estate Market Analysis
Abstract
:1. Introduction
- Evaluating the performance of the HELIOS system in accurately defining homogeneous property groups;
- Addressing challenges in data analysis and procedural approaches to determining homogeneity;
- Offering recommendations for improving the definition of homogeneity in real estate markets, particularly for mass valuation processes.
Definition of Homogeneity in the Real Estate Market
- ➢
- F represents the measure of homogeneity linkage between markets and property similarity;
- ➢
- H denotes the degree of market homogeneity;
- ➢
- S represents the measure of property similarity;
- ➢
- k, α, β are parameters that can be adjusted according to specific research objectives.
2. Background and Conceptual Foundations
2.1. Cognitive Decision-Making Support Systems Based on Cognition and LLMs
2.2. LLM Technologies and Their Impact
3. HELIOS Concept in the Real Estate Market—Methodology and Implementation
3.1. Methodological Overview
3.1.1. Research Paradigms
- ➢
- Cognitive Paradigm:Using language models to understand and generate text akin to human cognition enables more advanced and precise real estate market analyses.
- ➢
- Systemic Paradigm:A systemic approach to data analysis that accounts for the complexity and diversity of data sourced from various origins and their interconnections.
- ➢
- Iterative Paradigm:An iterative process of model refinement through early validation and incorporation of expert feedback, ensuring continuous improvement and adaptation of the system to the specific needs of the real estate market.
3.1.2. Research Methodology
- Application of NLP and LLMs in Real Estate Market Analysis:
- ➢
- Data Collection: Utilising LLM models to process and analyze vast amounts of data from various sources, such as descriptions of completed transactions in the real estate market under scrutiny.
- ➢
- Natural Language Processing: Interpreting property descriptions, market analyses, and forecasts to gain insights into factors influencing property prices.
- ➢
- Historical Analysis: Examining historical property price data to forecast future trends.
- Data Standardization and Unification:
- ➢
- Information Aggregation: Automatic retrieval and aggregating data from various sources to achieve a coherent view of available properties.
- ➢
- Data Extraction and Classification: Employing LLMs to extract essential information from unstructured textual data and classify it based on specified parameters.
- ➢
- Attribute Notation Standardization: Unifying different notations of attribute values to enable their joint analysis.
- Development of the HELIOS System:
- ➢
- ETL Process (Extract, Transform, Load): Processing data to achieve a format suitable for subsequent stages of analysis.
- ➢
- Early Verification: Incorporating feedback from real estate market experts to iteratively refine model accuracy.
- ➢
- Property “Fingerprint” Creation: Utilising LLM technology to generate detailed profiles of properties.
- Methodology for Classifying Homogenous Objects:
- ➢
- Text-to-Vector Transformation: Utilizing the S-BERT model to transform property descriptions into vector representations of the text context.
- ➢
- Vector Classification: Applying the self-organizing map (SOM) method to classify objects based on vectors.
- ➢
- Detailed descriptions of the above elements, enabling a deeper understanding of the methodology and techniques employed in the HELIOS concept system, are provided in the following sections.
3.2. Leveraging LLMs for Resolving Homogeneity Market Analysis Challenges
- UPDATING PRICES: The paramount challenge in real estate price updates is the quality and timeliness of data. The model’s effectiveness hinges on the accuracy and recency of available data. Only accurate and complete information can result in good forecasts. Additionally, the intricacies of real estate markets, influenced by unpredictable factors like political, economic, social, and environmental changes, need help integrating them into predictive models. In the realm of property analysis and valuation, the large language model emerges as a valuable tool for updating trends in real estate price changes as of the valuation date. The application of the LLM model enriches the analytical process by providing nuanced insights into evolving market dynamics.
- ➢
- Data Collection: LLMs can sift through and analyze vast amounts of data from various sources, such as real estate listings, articles in the real estate market, industry reports, and market data, to identify current price trends.
- ➢
- Natural Language Processing: Leveraging natural language processing capabilities, LLMs can interpret the language used in property descriptions, market analyses, and forecasts, providing insights into factors influencing property prices.
- ➢
- Historical Analysis: Through the analysis of historical price data, LLMs can assist in understanding how property prices have evolved in the past, which can be utilized for forecasting future trends.
- ➢
- Price Modeling: LLMs can support the creation of predictive models for property prices by analyzing large datasets and identifying patterns that may not be apparent through traditional analytical methods.
- ➢
- Adaptation to Local Factors: Models can be adjusted to account for local factors influencing prices, such as changes in infrastructure, regional policies, or investor interest.
- UNIFORM INFORMATION SOURCES SCARCITY: The scarcity of uniform information sources is the primary challenge within the real estate domain. The absence of consistent and comprehensive data hampers accurate analyses and impedes the development of reliable predictive models. The varying formats, incomplete datasets, and disparities in information representation across different sources contribute to the complexity of aggregating and processing real estate data.Addressing this issue, the application of LLMs emerges as a potential solution. LLMs offer assistance in various aspects of data processing and aggregation related to real estate.
- ➢
- Automatic Information Aggregation: LLMs can search diverse data sources, including listings, cadastral databases, real estate sales or rental portals, and information from social media and online forums. They analyze and aggregate this information, providing a more cohesive and integrated view of available properties.
- ➢
- Data Extraction and Classification: LLMs can extract essential information from unstructured text data, such as property descriptions. They identify relevant attributes, such as location, price, property size, number of rooms, etc., and classify listings based on these parameters.
- ➢
- Sentiment Analysis: Using LLMs allows sentiment analysis of opinions about properties posted online. This can aid potential buyers or renters in understanding the overall perception of a property or its location, especially in the absence of uniform reviews or opinions.
- ➢
- Description Generation: LLMs can automatically generate or enhance property descriptions based on given specifications and images. This standardizes the presentation of property listings, facilitating easier comparisons.
- VARIABILITY IN ATTRIBUTE NOTATION: Diverse and inconsistent methods of recording property attributes introduce complexities in data standardization, hindering comprehensive analyses and accurate modeling.Large language models offer valuable support in mitigating the challenges posed by variability in attribute notation within the real estate market.
- ➢
- Semantic Understanding: LLMs interpret and understand diverse attribute notations, facilitating a more unified representation of property features.
- ➢
- Data Harmonization: The capability to harmonize disparate attribute notations is streamlined, ensuring a standardized and consistent approach to data representation.
- ➢
- Contextual Analysis: Enabling the discernment of the nuanced meanings embedded in varying attribute notations. This contextual understanding contributes to more accurate and comprehensive data analyses.
- ➢
- Automated Standardization: Automates the standardization of attribute notations, reducing manual efforts and enhancing the efficiency of data processing in real estate analyses.
- ATTRIBUTES SEMANTIC CLARIFICATION: The real estate market encounters challenges with attributes requiring semantic clarification, such as ambiguous descriptions leading to inaccurate analyses and stakeholders’ subjective interpretations, causing inconsistencies in understanding property features.Large language models offer robust solutions to overcome challenges associated with attributes requiring semantic clarification in the real estate market.
- ➢
- Semantic Disambiguation: Excelling in disambiguating semantic nuances, these models provide a clearer understanding of attribute descriptions and minimize ambiguity in property features.
- ➢
- Standardized Interpretation: Assisting in standardizing the interpretation of attribute descriptions, these models ensure a more consistent understanding among various stakeholders.
- ➢
- Contextual Analysis: By analyzing contextual information, these models enhance the interpretation of attributes, considering, for example, factors like the location of tall trees in proximity to roads or sides of a property.
- ➢
- Automated Clarification: Automates the clarification process by extracting key details from attribute descriptions, contributing to a more precise and standardized representation of real estate features.
- DATABASE EXHIBITS DEFICIENCIES: The database grapples with challenges arising from insufficient data quality, inconsistencies, and incompleteness. These deficiencies stem from varied sources, including disparate data collection methods, outdated records, and a lack of standardized information, thereby impeding the generation of precise real estate insights and forecasts.
- ➢
- Contextual Analysis: LLMs excel in contextual analysis, providing a deeper understanding of the data context and aiding in mitigating inconsistencies and gaps.
- ➢
- Automated Clarification: Automates the clarification process, extracting key details from ambiguous data and enhancing overall data quality.
- ➢
- Mitigation of Varied Sources: Achieve this by applying a standardized and uniform approach to interpreting and clarifying data, ensuring consistency across disparate collection methods.
- ➢
- Enhanced Reliability: Enhances data reliability by actively rectifying deficiencies arising from outdated records and a lack of standardized information in the database.
- DEFICIENT ANALYTICAL METHODOLOGY: Problems in the real estate market arise from a deficient analytical methodology characterized by a lack of precision, limiting predictive capabilities, and hindering effective decision-making for stakeholders navigating property evaluations and market trends with potential inaccuracies.LLMs can address these issues as follows:
- ➢
- Advanced Data Processing: Excels in processing vast and varied datasets, providing a more comprehensive foundation for analytical models.
- ➢
- Precise Predictive Modeling: Enhances the precision of predictive models, allowing for more accurate forecasts of property prices and market trends.
- ➢
- Contextual Understanding: Offers contextual analysis, contributing to a deeper understanding of the intricate factors influencing real estate dynamics, thus refining the analytical methodology.
- ➢
- Automated Decision Support: By automating parts of the analytical process, LLMs empower stakeholders with timely and informed decision-making support, mitigating the limitations of the current methodology.
3.3. Unveiling the Helios System Concept
- -
- The “Real Estate Data Landscape Reality” component reflects the generic and unstructured data from diverse sources.
- -
- The “ETL (Extract, Transform, Load) process” is a data processing step that allows data to be obtained in the required format for further processing steps.
- -
- The “Artificial Homogeneous Property Database with Flaws” is a database developed as a verifier for results with artificially introduced flaws tailored to detect potential weaknesses in the LLM.
- -
- The “General LLM” component delivers a comprehensive set of rules that serve as the cornerstone for the system’s adept understanding and interpretation of linguistic patterns and structures.
- -
- “Early validation” provides a mechanism for incorporating expert opinions and feedback from entities within the real estate market, contributing to refining and improving the system’s accuracy and effectiveness. This component is particularly significant because it emphasizes collaboration between human expertise and machine efficiency, ensuring that human insight complements and enhances the system’s analytical capabilities, addresses discrepancies, and improves overall performance.
- -
- The “HELIOS Nexus” serves as a system component designed to enhance cognitive processes by utilizing a custom LLM asset.
- -
- The “Released LLM” comprises a tailored language model customized to align with the real estate market’s specific linguistic nuances and requirements.
- -
- The “Real Estate Fingerprint” utilizing LLM technology, represents a comprehensive approach involving the synergistic combination of standardized tokens to capture and analyze unique characteristics and features within the real estate domain. This method aims to create a distinct and refined profile of properties by leveraging linguistic rule-based processes.
- -
- “Fingerprints Matching” employs a round-robin approach to assess the magnitude of differences between each and every pair of objects through iterative analysis.
- -
- The “Non-Homogeneity Feedback” is utilized for fine-tuning, involving the adjustment of model parameters in the event of not achieving a satisfactory result at the level set by the operator.
- -
- The “Helios Homogeneity Output” provides the final approved classification of homogenous objects.
Helios System Interaction Description
3.4. Data
3.5. HELIOS Concept in Use—Empirical Example
3.6. Verification of Models Outputs
- Iterations above 4000 led to the stabilization of object migration counts between nodes;
- Sigma, set in the form of step decay (step = 0.1) and fixed at the level of 1.5, enabled the minimization of the distance between objects relative to the nodes, thereby ensuring homogeneity of the objects;
- Setting the learning rate at 1.1 in decay step mode enabled the stabilization of node vector values;
- A grid resolution fixed at the level of 6 × 6 allowed for the elimination of empty cells while simultaneously minimizing the average distance between nodes in the cell grid and their associated objects;
- The average of objects’ mean distance obtained indicates the highest possible potential for grouping objects into homogeneous groups in this experiment.
4. Discussion
- ⇒
- Detection and Correction of Anomalies: Experts review the system outputs to identify anomalies or inconsistencies that might not be evident through numerical analysis alone. For instance, a property might be grouped based on numerical similarities, but contextual knowledge may reveal these properties to be fundamentally different in significant ways.
- ⇒
- Feedback Loop for Continuous Improvement: Human feedback is essential for the continuous learning and improvement of the system. This feedback is then used to adjust the model parameters, enhancing the system’s ability to make more accurate predictions and classifications in the future.
- ⇒
- Ensuring Practical Relevance: By incorporating expert opinions, the system ensures that the results are not only numerically sound but also practically relevant. This is particularly important in the real estate market, where contextual factors and market nuances play a critical role in decision-making.
- ⇒
- Validation of Synthetic Data: The system includes an “Artificial Homogeneous Property Database with Flaws” used to test and validate the model. Experts play a crucial role in reviewing this synthetic data to ensure that it accurately reflects real-world conditions and to identify any flaws that need to be addressed.
5. Conclusions
- Homogeneous Market Definition: HELIOS leverages advanced machine learning and linguistic intelligence to define homogeneous market areas, ensuring accurate mass appraisals.
- Comparable Properties Set Definition: HELIOS accurately selects comparable properties by analysing their characteristics comprehensively.
- Data Review and Validation: HELIOS reviews and validates data, identifying inconsistencies to ensure accuracy.
- Exploratory Data Analysis: HELIOS uncovers patterns and trends, providing insights that improve the mass appraisal process.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- TEGOVA. European Valuation Standards; TEGOVA: Bruxelles, Belgium, 2020. [Google Scholar]
- IAAO. International Association of Assessing Officers. Standard of Mass Appraisal of Real Property; IAAO: Kansas City, MO, USA, 2017; Available online: www.iaao.org (accessed on 20 April 2024).
- IVSC. International Valuation Standards; IVS: London, UK, 2023. [Google Scholar]
- RICS. RICS Valuation—Global Standards; RICS: London, UK, 2021. [Google Scholar]
- McCluskey, W.J.; Borst, R.A. The Theory and Practice of Comparable Selection in Real Estate Valuation. Stud. Syst. Decis. Control. 2017, 86, 307–330. [Google Scholar] [CrossRef]
- Renigier-Biłozor, M.; Janowski, A. Human-Machine Synergy in Real Estate Similarity Concept. Real Estate Manag. Valuat. 2024, 32, 13–30. [Google Scholar] [CrossRef]
- Renigier-Biłozor, M.; Janowski, A.; Walacik, M.; Chmielewska, A. Modern Challenges of Property Market Analysis- Homogeneous Areas Determination. Land Use Policy 2022, 119, 106209. [Google Scholar] [CrossRef]
- d’Amato, M.; Kauko, T. Advances in Automated Valuation Modeling; Springer International Publishing AG: Berlin/Heidelberg, Germany, 2017; Volume 86. [Google Scholar] [CrossRef]
- Zyga, J. Connection between Similarity and Estimation Results of Property Values Obtained by Statistical Methods. Real Estate Manag. Valuat. 2016, 24, 5–15. [Google Scholar] [CrossRef]
- Doszyn, M. System Kalibracji Macierzy Wpływu Atrybutów w Szczecińskim Algorytmie Masowej Wyceny Nieruchomosci; Wydawnictwo Naukowe Uniwersytetu Szczecinskiego: Szczecin, Poland, 2020. [Google Scholar]
- Royuela, V.; Duque, J.C. HouSI: Heuristic for Delimitation of Housing Submarkets and Price Homogeneous Areas. Comput. Environ. Urban Syst. 2013, 37, 59–69. [Google Scholar] [CrossRef]
- Gnat, S. Measurement of Entropy in the Assessment of Homogeneity of Areas Valued with the Szczecin Algorithm of Real Estate Mass Appraisal. J. Econ. Manag. 2019, 38, 89–106. [Google Scholar] [CrossRef]
- Heckman, J.J.; Abbring, J.; Carneiro, P.; Durlauf, S.; Moon, S.; Navarro, S.; Srinivasan, T.N.; Trujillo, J.; Vytlacil, E.; Weil, J.; et al. Econometric Causality. NBER Work. Pap. Ser. 2008, 76, 13–30. [Google Scholar] [CrossRef]
- Available online: https://www.nar.realtor/artificial-intelligence-real-estate (accessed on 2 March 2024).
- Available online: https://www.fool.com/investing/stock-market/market-sectors/information-technology/ai-stocks/ai-in-real-estate (accessed on 2 March 2024).
- Available online: https://www.dealmachine.com/blog/ai-real-estate (accessed on 2 March 2024).
- Available online: https://likely.ai/ (accessed on 2 March 2024).
- Available online: https://www.linkedin.com/pulse/artificial-intelligence-ai-real-estate-market-izahf/ (accessed on 2 March 2024).
- Del Giudice, V.; De Paola, P. Spatial Analysis of Residential Real Estate Rental Market with Geoadditive Models. Stud. Syst. Decis. Control. 2017, 86, 155–162. [Google Scholar] [CrossRef]
- Hwang, S.; Thill, J.C. Delineating Urban Housing Submarkets with Fuzzy Clustering. Environ. Plan. B 2009, 36, 865–882. [Google Scholar] [CrossRef]
- Pavlov, A.D. Space-Varying Regression Coefficients: A Semi-Parametric Approach Applied to Real Estate Markets. Real Estate Econ. 2000, 28, 249–283. [Google Scholar] [CrossRef]
- Kauko, T.; D’Amato, M. Mass Appraisal Methods: An International Perspective for Property Valuers; John Wiley & Sons: Hoboken, NJ, USA, 2008; p. 332. [Google Scholar]
- Ćetković, J.; Lakić, S.; Lazarevska, M.; Žarković, M.; Vujošević, S.; Cvijović, J.; Gogić, M. Assessment of the Real Estate Market Value in the European Market by Artificial Neural Networks Application. Complexity 2018, 2018, 1472957. [Google Scholar] [CrossRef]
- Choy, L.H.T.; Ho, W.K.O. The Use of Machine Learning in Real Estate Research. Land 2023, 12, 740. [Google Scholar] [CrossRef]
- Lee, H.; Han, H.; Pettit, C.; Gao, Q.; Shi, V. Machine Learning Approach to Residential Valuation: A Convolutional Neural Network Model for Geographic Variation. Ann. Reg. Sci. 2023, 72, 579–599. [Google Scholar] [CrossRef]
- McCluskey, W.J.; Zulkarnain Daud, D.; Kamarudin, N. Boosted Regression Trees: An Application for the Mass Appraisal of Residential Property in Malaysia. J. Financ. Manag. Prop. Constr. 2014, 19, 152–167. [Google Scholar] [CrossRef]
- Zhou, G.; Ji, Y.; Chen, X.; Zhang, F. Artificial Neural Networks and the Mass Appraisal of Real Estate. Int. J. Online Biomed. Eng. 2018, 14, 180–187. [Google Scholar] [CrossRef]
- Xu, K.; Nguyen, H. Predicting housing prices and analyzing real estate market in the Chicago suburbs using Machine Learning. arXiv 2022, arXiv:2210.06261. [Google Scholar] [CrossRef]
- Mayer, M.; Bourassa, S.C.; Hoesli, M.; Scognamiglio, D. Machine Learning Applications to Land and Structure Valuation. J. Risk Financ. Manag. 2022, 15, 193. [Google Scholar] [CrossRef]
- Lee, C. Designing an optimal neural network architecture: An application to property valuation. Prop. Manag. 2022; ahead of print. [Google Scholar]
- Lee, C. Training and Interpreting Machine Learning Models: Application in Property Tax Assessment. Real Estate Manag. Valuat. 2022, 30, 13–22. [Google Scholar] [CrossRef]
- Carbon, C.C. Understanding Human Perception by Human-Made Illusions. Front. Hum. Neurosci. 2014, 8, 103880. [Google Scholar] [CrossRef]
- Ning, Y.; Liu, J.; Yan, L. Uncertain Aggregate Production Planning. Soft Comput. 2013, 17, 617–624. [Google Scholar] [CrossRef]
- Miller, G.A. The Cognitive Revolution: A Historical Perspective. Trends Cogn. Sci. 2003, 7, 141–144. [Google Scholar] [CrossRef] [PubMed]
- Ogiela, L. Cognitive Information Systems in Management Sciences; Academic Press: Cambridge, MA, USA, 2017; pp. 1–130. [Google Scholar]
- Grossberg, S. Adaptive Resonance Theory: How a Brain Learns to Consciously Attend, Learn, and Recognize a Changing World. Neural Netw. 2013, 37, 1–47. [Google Scholar] [CrossRef]
- Makridakis, S.; Petropoulos, F.; Kang, Y. Large Language Models: Their Success and Impact. Forecasting 2023, 5, 536–549. [Google Scholar] [CrossRef]
- The Next Generation of Artificial Intelligence (Part 2). Available online: https://www.forbes.com/sites/robtoews/2020/10/29/the-next-generation-of-artificial-intelligence-part-2/?sh=6b79401e7a30 (accessed on 27 March 2024).
- Huang, J.; Gu, S.S.; Hou, L.; Wu, Y.; Wang, X.; Yu, H.; Han, J. Large Language Models Can Self-Improve. In Proceedings of the EMNLP 2023–2023 Conference on Empirical Methods in Natural Language Processing, Singapore, 6–10 December 2022; pp. 1051–1068. [Google Scholar] [CrossRef]
- Understanding Explainable AI and Interpretable AI—MarkTechPost. Available online: https://www.marktechpost.com/2023/07/06/understanding-explainable-ai-and-interpretable-ai/ (accessed on 27 March 2024).
- Bestvater, S.E.; Monroe, B.L. Sentiment Is Not Stance: Target-Aware Opinion Classification for Political Text Analysis. Political Anal. 2023, 31, 235–256. [Google Scholar] [CrossRef]
- Wang, Y. Topic Classification for Political Texts with Pretrained Language Models. Political Anal. 2023, 31, 662–668. [Google Scholar] [CrossRef]
- Häffner, S.; Hofer, M.; Nagl, M.; Walterskirchen, J. Introducing an Interpretable Deep Learning Approach to Domain-Specific Dictionary Creation: A Use Case for Conflict Prediction. Political Anal. 2023, 31, 481–499. [Google Scholar] [CrossRef]
- Muchlinski, D.; Siroky, D.; He, J.; Kocher, M. Comparing Random Forest with Logistic Regression for Predicting Class-Imbalanced Civil War Onset Data. Political Anal. 2016, 24, 87–103. [Google Scholar] [CrossRef]
- Houlsby, N.; Giurgiu, A.; Jastrzçbski, S.; Morrone, B.; de Laroussilhe, Q.; Gesmundo, A.; Attariyan, M.; Gelly, S. Parameter-Efficient Transfer Learning for NLP. In Proceedings of the 36th International Conference on Machine Learning, ICML 2019, Long Beach, CA, USA, 9–15 June 2019; pp. 4944–4953. [Google Scholar]
- Trott, S.; Jones, C.; Chang, T.; Michaelov, J.; Bergen, B. Do Large Language Models Know What Humans Know? Cogn. Sci. 2022, 47, e13309. [Google Scholar] [CrossRef] [PubMed]
- Speech and Language Processing. Available online: https://web.stanford.edu/~jurafsky/slp3/ (accessed on 27 March 2024).
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention Is All You Need. Adv. Neural Inf. Process. Syst. 2017, 2017, 5999–6009. [Google Scholar]
- Boonstra, M.J.; Weissenbacher, D.; Moore, J.H.; Gonzalez-Hernandez, G.; Asselbergs, F.W. Artificial Intelligence: Revolutionizing Cardiology with Large Language Models. Eur. Heart J. 2024, 45, 332–345. [Google Scholar] [CrossRef]
- Zhang, X.; Yang, Q. XuanYuan 2.0: A Large Chinese Financial Chat Model with Hundreds of Billions Parameters. In Proceedings of the International Conference on Information and Knowledge Management, New York, NY, USA, 21–25 October 2023; pp. 4435–4439. [Google Scholar] [CrossRef]
- Mashrur, A.; Luo, W.; Zaidi, N.A.; Robles-Kelly, A. Machine Learning for Financial Risk Management: A Survey. IEEE Access 2020, 8, 203203–203223. [Google Scholar] [CrossRef]
- Pagliaro, C.; Mehta, D.; Shiao, H.T.; Wang, S.; Xiong, L. Investor Behavior Modeling by Analyzing Financial Advisor Notes: A Machine Learning Perspective. In Proceedings of the ICAIF 2021—2nd ACM International Conference on AI in Finance, New York, NY, USA, 3–5 November 2021. [Google Scholar] [CrossRef]
- Shah, A.; Raj, P.; Pushpam Kumar, S.P.; Asha, H.V. FinAID, A Financial Advisor Application Using AI. Int. J. Recent Technol. Eng. 2020, 9, 2282–2286. [Google Scholar] [CrossRef]
- Li, Y.; Wang, S.; Ding, H.; Chen, H. Large Language Models in Finance: A Survey. In Proceedings of the ICAIF 2023—4th ACM International Conference on AI in Finance, New York, NY, USA, 27–29 November 2023; pp. 374–382. [Google Scholar] [CrossRef]
- Wang, Y. On Finetuning Large Language Models. Political Anal. 2023, 1–5. [Google Scholar] [CrossRef]
Parameters | The Lower Value | The Highest Value | Optimal Value |
---|---|---|---|
Iterations | 1000 | 10,000 | 4000 |
Sigma | 3.1 | 0.1 | 1.5 |
Learning rate | 3.1 | 0.1 | 1.1 |
Average of mean distance | 31,841 | 8653 | 10,887 |
Grid resolution | 2 × 2 | 10 × 10 | 6 × 6 |
Node X | Node Y | Mean Distance | Quantity Vectors |
---|---|---|---|
0 | 0 | 7047 | 142 |
0 | 1 | 1362 | 95 |
0 | 2 | 3691 | 139 |
0 | 3 | 10,008 | 155 |
0 | 4 | 12,754 | 179 |
0 | 5 | 15,113 | 50 |
1 | 0 | 6076 | 99 |
1 | 1 | 9219 | 195 |
1 | 2 | 3447 | 81 |
1 | 3 | 13,874 | 57 |
1 | 4 | 10,171 | 208 |
1 | 5 | 13,292 | 131 |
2 | 0 | 12,400 | 388 |
2 | 1 | 6089 | 178 |
2 | 2 | 14,112 | 85 |
2 | 3 | 14,801 | 61 |
2 | 4 | 12,870 | 40 |
2 | 5 | 14,899 | 62 |
3 | 0 | 5907 | 85 |
3 | 1 | 12,342 | 118 |
3 | 2 | 14,164 | 109 |
3 | 3 | 14,780 | 162 |
3 | 4 | 13,627 | 128 |
3 | 5 | 12,202 | 129 |
4 | 0 | 15,658 | 350 |
4 | 1 | 13,937 | 37 |
4 | 2 | 13,263 | 105 |
4 | 3 | 1225 | 52 |
4 | 4 | 13,186 | 98 |
4 | 5 | 14,883 | 54 |
5 | 0 | 8542 | 113 |
5 | 1 | 11,567 | 216 |
5 | 2 | 3087 | 25 |
5 | 3 | 14,855 | 58 |
5 | 4 | 13,784 | 134 |
5 | 5 | 13,690 | 54 |
Average Mean Distance 10,887 | |||
Sum 4372 |
Number of Transactions | Node X | Node Y | Real Estate Fingerprint Characteristic |
---|---|---|---|
388 (the largest group) | 2 | 0 | transaction date: fourteen of November, two thousand twenty years, price per square meter: six thousand, two hundred and seventy-one, associated premises area: no data, building construction: other than brick, type of rights: co-ownership, seller: legal entity, market type: primary, precinct: Osowa |
350 (the second largest group) | 4 | 0 | price per square meter: five thousand, eight hundred and eighty associated premises area: no data, precinct: Jasien, share in joint area: unknow, type of rights: ownership, type of object: ud, seller: individual, market type: secondary, |
99 (the average group) | 1 | 0 | price per square meter: five thousand, three hundred and twenty-six, transaction date: ten of April, two thousand nineteen year. address: ul. Krzysztofa Komedy twenty-six, share: one hundred and twenty-eight out of ten thousand, premises area in square meter: forty-five, elevator: no elevator, storey: fourth, year of construction: two thousand and eighteen |
37 (the smallest group) | 4 | 1 | price per square meter: six thousand and forty-six, transaction date: seventeen of July, two thousand twenty year, premises area in square meter: two hundred fifteen storey: fifth, year of construction: two thousand and nineteen, address: ul. Eugeniusza Wegrzyna, function: three-unit building, building construction: brick |
No Group | Quantity | Mean Distance | No Group | Quantity | Mean Distance |
---|---|---|---|---|---|
1 | 1 | 0 | 151 | 5 | 913 |
2 | 1 | 0 | 152 | 2 | 972 |
3 | 1 | 0 | 153 | 2 | 1038 |
4 | 1 | 0 | 154 | 2 | 1058 |
5 | 1 | 0 | 155 | 2 | 1134 |
6 | 1 | 0 | 156 | 9 | 1085 |
7 | 1 | 0 | 157 | 2 | 1232 |
8 | 1 | 0 | 158 | 2 | 1250 |
9 | 1 | 0 | 159 | 10 | 15,161 |
10 | 1 | 0 | 160 | 3 | 1427 |
11 | 1 | 0 | 161 | 3 | 1325 |
12 | 1 | 0 | 162 | 6 | 1483 |
13 | 1 | 0 | 163 | 3 | 1694 |
14 | 1 | 0 | 164 | 82 | 150,178 |
15 | 1 | 0 | 165 | 18 | 180,860 |
16 | 1 | 0 | 166 | 5 | 1958 |
17 | 1 | 0 | 167 | 2 | 1655 |
18 | 1 | 0 | 168 | 3 | 1794 |
19 | 1 | 0 | 169 | 3 | 1751 |
20 | 1 | 0 | 170 | 4 | 1641 |
21 | 1 | 0 | 171 | 12 | 1933 |
22 | 1 | 0 | 172 | 9 | 1997 |
23 | 1 | 0 | 173 | 4 | 2021 |
24 | 1 | 0 | 174 | 22 | 1972 |
25 | 1 | 0 | 175 | 2 | 1727 |
26 | 1 | 0 | 176 | 2 | 1751 |
27 | 1 | 0 | 177 | 2 | 1754 |
28 | 1 | 0 | 178 | 3 | 1730 |
29 | 1 | 0 | 179 | 7 | 1947 |
30 | 1 | 0 | 180 | 2 | 1788 |
31 | 1 | 0 | 181 | 13 | 1780 |
32 | 1 | 0 | 182 | 2 | 1817 |
33 | 1 | 0 | 183 | 2 | 1821 |
34 | 1 | 0 | 184 | 2 | 1834 |
35 | 1 | 0 | 185 | 6 | 1837 |
36 | 1 | 0 | 186 | 2 | 1843 |
37 | 1 | 0 | 187 | 3 | 1672 |
38 | 1 | 0 | 188 | 17 | 2070 |
39 | 1 | 0 | 189 | 2 | 1870 |
40 | 1 | 0 | 190 | 2 | 1886 |
41 | 1 | 0 | 191 | 2 | 1917 |
42 | 1 | 0 | 192 | 19 | 2425 |
43 | 1 | 0 | 193 | 22 | 2183 |
44 | 1 | 0 | 194 | 3 | 2284 |
45 | 1 | 0 | 195 | 2 | 1945 |
46 | 1 | 0 | 196 | 2 | 1947 |
47 | 1 | 0 | 197 | 3 | 1822 |
48 | 1 | 0 | 198 | 5 | 1961 |
49 | 1 | 0 | 199 | 4 | 2108 |
50 | 1 | 0 | 200 | 2 | 2019 |
51 | 1 | 0 | 201 | 41 | 1846 |
52 | 1 | 0 | 202 | 10 | 2251 |
53 | 1 | 0 | 203 | 9 | 2227 |
54 | 1 | 0 | 204 | 10 | 2286 |
55 | 1 | 0 | 205 | 3 | 2190 |
56 | 1 | 0 | 206 | 15 | 2277 |
57 | 1 | 0 | 207 | 3 | 1896 |
58 | 1 | 0 | 208 | 2 | 2078 |
59 | 1 | 0 | 209 | 6 | 2337 |
60 | 1 | 0 | 210 | 2 | 2096 |
61 | 1 | 0 | 211 | 11 | 1430 |
62 | 1 | 0 | 212 | 2 | 2115 |
63 | 1 | 0 | 213 | 2 | 2117 |
64 | 1 | 0 | 214 | 4 | 2105 |
65 | 1 | 0 | 215 | 4 | 2160 |
66 | 1 | 0 | 216 | 2 | 2127 |
67 | 1 | 0 | 217 | 2 | 2129 |
68 | 1 | 0 | 218 | 2 | 2132 |
69 | 1 | 0 | 219 | 30 | 2173 |
70 | 1 | 0 | 220 | 7 | 2392 |
71 | 1 | 0 | 221 | 2 | 2167 |
72 | 1 | 0 | 222 | 3 | 1814 |
73 | 1 | 0 | 223 | 115 | 1851 |
74 | 1 | 0 | 224 | 2 | 2177 |
75 | 1 | 0 | 225 | 5 | 2390 |
76 | 1 | 0 | 226 | 2 | 2184 |
77 | 1 | 0 | 227 | 3 | 2593 |
78 | 1 | 0 | 228 | 27 | 1913 |
79 | 1 | 0 | 229 | 34 | 2696 |
80 | 1 | 0 | 230 | 5 | 2240 |
81 | 1 | 0 | 231 | 3 | 2222 |
82 | 1 | 0 | 232 | 2 | 2260 |
83 | 1 | 0 | 233 | 6 | 2092 |
84 | 1 | 0 | 234 | 25 | 2529 |
85 | 1 | 0 | 235 | 4 | 2172 |
86 | 1 | 0 | 236 | 10 | 3090 |
87 | 1 | 0 | 237 | 7 | 2973 |
88 | 1 | 0 | 238 | 3 | 2359 |
89 | 1 | 0 | 239 | 5 | 2238 |
90 | 1 | 0 | 240 | 3 | 2246 |
91 | 1 | 0 | 241 | 5 | 2107 |
92 | 1 | 0 | 242 | 4 | 2251 |
93 | 1 | 0 | 243 | 622 | 2885 |
94 | 1 | 0 | 244 | 2 | 2373 |
95 | 1 | 0 | 245 | 3 | 2454 |
96 | 1 | 0 | 246 | 23 | 3073 |
97 | 1 | 0 | 247 | 11 | 276,378 |
98 | 1 | 0 | 248 | 5 | 2440 |
99 | 1 | 0 | 249 | 2 | 2407 |
100 | 1 | 0 | 250 | 8 | 2478 |
101 | 1 | 0 | 251 | 471 | 2,539,523 |
102 | 1 | 0 | 252 | 4 | 12376 |
103 | 1 | 0 | 253 | 3 | 2115 |
104 | 1 | 0 | 254 | 10 | 2387 |
105 | 1 | 0 | 255 | 30 | 1986 |
106 | 1 | 0 | 256 | 4 | 2338 |
107 | 1 | 0 | 257 | 74 | 22,375 |
108 | 1 | 0 | 258 | 3 | 2257 |
109 | 1 | 0 | 259 | 4 | 2486 |
110 | 1 | 0 | 260 | 5 | 2806 |
111 | 1 | 0 | 261 | 6 | 2177 |
112 | 1 | 0 | 262 | 5 | 2418 |
113 | 1 | 0 | 263 | 2 | 2495 |
114 | 1 | 0 | 264 | 3 | 2598 |
115 | 1 | 0 | 265 | 29 | 42,408 |
116 | 1 | 0 | 266 | 6 | 2512 |
117 | 1 | 0 | 267 | 10 | 2346 |
118 | 1 | 0 | 268 | 13 | 1768 |
119 | 1 | 0 | 269 | 2 | 2551 |
120 | 1 | 0 | 270 | 2 | 2557 |
121 | 1 | 0 | 271 | 3 | 42,761 |
122 | 1 | 0 | 272 | 14 | 3225 |
123 | 1 | 0 | 273 | 5 | 2624 |
124 | 1 | 0 | 274 | 33 | 123,164 |
125 | 1 | 0 | 275 | 2 | 2590 |
126 | 1 | 0 | 276 | 7 | 2353 |
127 | 1 | 0 | 277 | 14 | 152,547 |
128 | 1 | 0 | 278 | 4 | 2587 |
129 | 1 | 0 | 279 | 3 | 2426 |
130 | 1 | 0 | 280 | 2 | 2646 |
131 | 1 | 0 | 281 | 5 | 2668 |
132 | 1 | 0 | 282 | 4 | 2479 |
133 | 1 | 0 | 283 | 118 | 263,739 |
134 | 1 | 0 | 284 | 16 | 2777 |
135 | 1 | 0 | 285 | 2 | 2685 |
136 | 1 | 0 | 286 | 19 | 2647 |
137 | 1 | 0 | 287 | 7 | 2643 |
138 | 1 | 0 | 288 | 23 | 3311 |
139 | 1 | 0 | 289 | 4 | 2897 |
140 | 1 | 0 | 290 | 286 | 153,680 |
141 | 1 | 0 | 291 | 20 | 2942 |
142 | 1 | 0 | 292 | 23 | 3167 |
143 | 1 | 0 | 293 | 15 | 3138 |
144 | 1 | 0 | 294 | 5 | 2700 |
145 | 1 | 0 | 295 | 2 | 2752 |
146 | 1 | 0 | 296 | 80 | 142,999 |
147 | 1 | 0 | 297 | 71 | 183,785 |
148 | 1 | 0 | 298 | 22 | 122,785 |
149 | 1 | 0 | 299 | 91 | 453,967 |
150 | 1 | 0 | 300 | 2 | 2794 |
301 | 3 | 2317 | |||
302 | 2 | 2801 | |||
303 | 18 | 3454 | |||
304 | 15 | 2299 | |||
305 | 30 | 2451 | |||
306 | 38 | 3529 | |||
307 | 2 | 2812 | |||
308 | 2 | 2812 | |||
309 | 216 | 14,180 | |||
310 | 24 | 12,960 | |||
311 | 5 | 2803 | |||
312 | 16 | 2472 | |||
313 | 47 | 4490 | |||
314 | 243 | 13,019 | |||
315 | 4 | 3115 | |||
316 | 296 | 142,724 | |||
317 | 198 | 204,622 | |||
318 | 2 | 2851 | |||
319 | 7 | 2943 | |||
Average Mean Distance | 33,108 | ||||
Sum | 4372 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Janowski, A.; Renigier-Bilozor, M. HELIOS Approach: Utilizing AI and LLM for Enhanced Homogeneity Identification in Real Estate Market Analysis. Appl. Sci. 2024, 14, 6135. https://doi.org/10.3390/app14146135
Janowski A, Renigier-Bilozor M. HELIOS Approach: Utilizing AI and LLM for Enhanced Homogeneity Identification in Real Estate Market Analysis. Applied Sciences. 2024; 14(14):6135. https://doi.org/10.3390/app14146135
Chicago/Turabian StyleJanowski, Artur, and Malgorzata Renigier-Bilozor. 2024. "HELIOS Approach: Utilizing AI and LLM for Enhanced Homogeneity Identification in Real Estate Market Analysis" Applied Sciences 14, no. 14: 6135. https://doi.org/10.3390/app14146135
APA StyleJanowski, A., & Renigier-Bilozor, M. (2024). HELIOS Approach: Utilizing AI and LLM for Enhanced Homogeneity Identification in Real Estate Market Analysis. Applied Sciences, 14(14), 6135. https://doi.org/10.3390/app14146135