IDBspRS: An Interior Design-Built Service Package Recommendation System Using Artificial Intelligence

Karmaakar, Pranabanti; Jarwar, Muhammad Aslam; Wahid, Junaid Abdul; Hasan, Najam Ul

doi:10.3390/su18073605

Open AccessArticle

IDBspRS: An Interior Design-Built Service Package Recommendation System Using Artificial Intelligence

¹

School of Computing and Digital Technologies, Sheffield Hallam University, Sheffield S1 1WB, UK

²

School of Computer Science and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China

^*

Author to whom correspondence should be addressed.

Sustainability 2026, 18(7), 3605; https://doi.org/10.3390/su18073605

Submission received: 19 February 2026 / Revised: 30 March 2026 / Accepted: 3 April 2026 / Published: 7 April 2026

(This article belongs to the Special Issue AI and ML Applications for a Sustainable Future)

Download

Browse Figures

Versions Notes

Abstract

Digital transformation in the interior design industry has opened new opportunities for innovation; however, many cost-conscious homeowners still face difficulties in selecting and customizing design packages that achieve a balance between overall cost and sustainable quality. Existing interior design platforms lack seamless support and often require homeowners to invest considerable time and effort to tailor services to their needs while staying within budget. To address these challenges, this paper explores the use of machine learning to build a predictive modelling framework that supports personalized and value-driven interior design recommendations. The proposed approach uses a hybrid recommendation system that combines content-based and collaborative filtering. It also incorporates lightweight techniques such as TF–IDF (Term Frequency–Inverse Document Frequency) and logistic regression to more effectively capture user preferences, budget limits, and several interior-design service categories. Primary data was collected from small to medium-sized interior design companies. To demonstrate the proposed approach, a user-friendly web application tool is developed to integrate machine learning-enabled recommendation services. The resulting solution provides access to professional interior design services, enhancing customization and customer satisfaction while reducing the time and effort required from homeowners. To validate and compare the performance of the proposed approach, several machine learning models including Random Forest, XGBoost and KNN (K-Nearest Neighbors) were tested using standard metrics such as accuracy, precision, recall, and ROC-AUC (Receiver Operating Characteristic-Area Under the Curve). The proposed logistic regression hybrid model achieved the strongest overall results, with an accuracy of 83.62%. These findings demonstrate the significant contribution of this work to enhancing personalization and accessibility in the interior design sector via machine learning-enabled recommendation systems. The proposed approach bridges the gap between expert-level services and financial limits, making it a practical choice for cost-conscious homeowners.

Keywords:

applied machine learning; interior design; recommendation system; logistic regression

1. Introduction

The interior construction industry is undergoing a transformative shift driven by rapid urbanization, increased disposable income, and evolving consumer lifestyles [1]. According to Mordor Intelligence [2], India’s interior design market size reached US $31.5 billion in 2023 and is expected to grow to US $67.4 billion by 2032, with a CAGR (Compound Annual Growth Rate) of 8.81% from 2024 to 2032 (Figure 1). This progress is driven by the expanding demand for designs that optimize space, the expanding real estate industry, and rapid urbanization. Historically reliant on conventional consultation and design practices, the companies are now adopting online platforms with a view to addressing mounting demands from homeowners. As more individuals turn to online avenues for their interior design requirements with growing emphasis on affordability and sustainability, the demand for creative, budget-friendly, and customized solutions has significantly increased [3]. Yet there remains a considerable gap in meeting demands by price-conscious homeowners who require top-notch design services at non-premium prices. Conventional methods tend to require extensive manual labor and high expenses, thus rendering them less attainable for the expanding middle-class segment keen on adopting digital innovations. This paper proposes an AI-driven recommendation system to address these issues, which would enable a platform that integrates machine learning algorithms to offer personalized, budget-friendly interior design solutions. By considering in tandem key factors such as types of services, space requirements, and budget limitations, the proposed system offers a significant advantage over conventional practices, which are time-consuming and expensive. The proposed solution aims to allow homeowners to access and assist professional interior design service packages that improve their living spaces using digital technologies.

The study begins by analyzing the pain points of homeowners in finding affordable and customizable solutions on existing platforms. Many existing systems lack adaptability and fail to cater to diverse customer preferences. Leveraging predictive models, the proposed approach uses machine learning models to analyze customer data and provide personalized recommendations. The proposed approach will form the foundation of an interior design prototype platform that ensures ease of use and accurate recommendations. The system’s effectiveness has been evaluated by assessing its ability to reduce costs, time, and effort, while enhancing customer satisfaction and accessibility.

Machine learning-based recommendation systems are widely used in online platforms to enhance user experiences through personalized suggestions, but their application in service-based industries such as interior design remains underexplored. Unlike product-focused recommendations, service-based systems must integrate multiple services while addressing budgetary constraints and varied user preferences. Existing research highlights certain limitations. Afoudi Y. et al. [5] showed that combining collaborative filtering, content-based methods, and self-organizing map neural networks can improve recommendation accuracy. However, their model does not address the cold-start issue and requires considerable computational resources. Their evaluation was also limited to movie datasets, which makes it harder to apply the approach to new users, products, or services, especially when budget constraints come into play. On the other hand, Han et al. [6] explored how users, product bundles, individual items, and pricing information interact to better capture the purchasing behavior of budget-minded consumers. Their method used a graph neural network to identify suitable bundle recommendations and was evaluated across three datasets. However, the usefulness of such budget-aware recommendation systems may be limited when users are reluctant to share their financial constraints. The authors in [7,8] introduced explainable NLP (Natural Language Processing) methods for transparency in recommendations, but these approaches were limited to single-product contexts and failed to account for dependencies between multiple services. Singh et al. (2020) [9] investigated reinforcement learning architectures with adaptability according to user feedback, but they are too resource-demanding to be realistic on constrained-budget platforms.

This paper aims to address these gaps by presenting a hybrid recommendation model that integrates explainable artificial intelligence and emphasizes the fusion of multiple service bundles. Machine learning algorithms form the core of recommendation systems, enabling the analysis of user behaviors and preferences. Deep learning and hybrid models are capable of handling complex data, yet their use within the interior design industry remains under-explored. Khanal et al. (2019) [10] mentioned the usage of these models for e-learning but did not consider their extension to service-based contexts. Likewise, refs. [11,12,13] explored the use of generative AI and explainable NLP for user-specific recommendations, but these approaches still struggle with scalability and bias, especially when dealing with high-dimensional service data and diverse user groups.

The TF–IDF algorithm, commonly employed in feature extraction [14], does not support evolving user preferences effectively because of its fixed weighting method. Category-based filtering, investigated in [15], alleviates computational complexity in recommendation systems; however, it encounters difficulty supporting cross-category recommendation, an important functionality for the interior design industry where merchandise embraces various categories such as furniture, design, and installation. To address this limitation, the proposed method extends category-based filtering with cross-category relevance scores, to more effectively recommend sets of multiservice packages.

Neural networks, especially convolutional neural networks (CNNs) and recurrent neural networks (RNNs) have exhibited excellent potential in the field of recommendation systems. Alamdari et al. (2022) [16] demonstrated the effectiveness of CNNs for visually centric recommendations; however, their research confined the application to scenarios involving individual products. The current research endeavors to expand the usage of CNNs in the area of interior design to enable the combination of visual and textual data for end-to-end recommendations.

This study provides an end-to-end system of interior design personalized recommendations based on hybrid machine learning algorithms like TF–IDF, cosine similarity, and cross-category filtering. The proposed approach aims to fill the gap between affordability and user-centric solutions, thereby allowing homeowners to avail themselves of professional-quality services as per their personal requirements within their budget. The proposed system will also prioritize scalability so that homeowners with modest budgets can also use the system while being flexible across various categories of services.

This approach presents a novel perspective to the growing field of recommendation systems, here applied to the Indian interior design market. While previous works have been more focused on product-based domains and single-service recommendations, this research emphasizes the importance of integrating multiple services with a focus on accessibility and affordability. In addressing these shortcomings, the study hopes to transform the experience of interior design for homeowners, and by extension, make high-quality services more accessible and affordable. The incorporation of high-end algorithms and hybrid models into the platform is a giant leap towards addressing the digital divide in the interior construction sector, specifically concerning the expanding middle-class demography in India. Furthermore, the key contributions of this paper are summarized below:

A research gap is identified in existing interior design recommendation systems, which fail to provide customization and streamline service package selection for budget-conscious homeowners.
The study introduces a primary dataset derived from invoices collected from small and medium-sized interior design firms.
A hybrid recommendation approach is proposed that combines content-based filtering and collaborative filtering with lightweight methods like TF–IDF, logistic regression.
The proposed approach is empirically validated using several machine-learning methods including KNN, Random Forest, and XGBoost, along with standard performance metrics such as accuracy, precision, recall and ROC-AUC on a dataset collected from Indian interior design organizations.

The rest of the paper is structured as follows. Section 2 reviews related work on AI-enabled interior design recommendation systems. Section 3 outlines the research questions and hypotheses. Section 4 presents the proposed hybrid recommendation framework. Section 5 details the system implementation, along with the evaluation methodology and results. Section 6 discusses findings and future work, and Section 7 concludes the paper.

2. Related Work

For budget-conscious homeowners, choosing and personalizing an all-inclusive interior package that includes services like flooring, painting, plumbing and electrical work can be a complicated and daunting task. A significant gap exists in e-commerce platforms that cater specifically to consumers seeking integrated interior design solutions. Existing platforms often require extensive manual adjustments to align specifications with financial constraints, making the process time-consuming and inefficient. To establish a strong theoretical foundation for this study, the literature review is structured into four key thematic areas listed below and illustrated in Figure 2.

Overview of interior design-built service industry.
Value engineering in the interior-design and construction sector.
AI-enabled recommendation systems in interior design e-commerce.
Machine learning approaches and algorithms for recommendation system development for interior design.

2.1. Interior Design-Built Industry

The interior design sector has undergone significant digitization. Numerous e-commerce companies, such as Livspace [17] and Homelane [18], have introduced technology-driven design services. Furthermore, technological advancements have streamlined the construction process, significantly enhancing customer convenience. Online platforms have played a key role in democratizing the industry, making design services more accessible to a wider range of customers [19]. Addressing the vast interior design market in India requires prioritizing consumer convenience and transparency. The integration of value-engineered interior service packages into digital platforms has the potential to be transformative for urban homeowners.

2.2. Value Engineering in Interior Design and Construction Sector

Value Engineering (VE) is a systematic approach that analyzes the functions of a program or system to improve performance, quality, safety, reliability, and lifecycle costs. The underlying philosophy of VE is to achieve the “best value” meaning to accomplish essential functions at the lowest lifecycle cost. In building, VE not only conserves money but also encourages environmentally sustainable and energy-saving practices. In addition to saving costs, its use also results in tight project schedules, increased quality, and early identification of design faults [20]. In the case of the construction sector, project value is enhanced using VE by expanding functionality while reducing costs. Construction firms are made more competitive by effectively making use of regional resources, reducing costs, and optimizing bid costs. But contract procurement is not always a matter of cost-effectiveness alone; overall value of the project, considering quality, durability, usability, feasibility, compliance with regulation, and managerial effectiveness, is critical [20,21]. VE principles are particularly applicable in the area of digital interior design services, where it is difficult and time-consuming to construct detailed interior packages within the stipulated budget and time frame. Customizing such packages to specific homeowner preferences reinforces the need for streamlined technology-based solutions that offer greater efficiency and accessibility in the field.

2.3. AI-Based Recommendation Systems in Interior Design

Machine learning (ML) and data mining techniques are used in AI-enabled recommendation systems to analyze user buying patterns and generate customized suggestions. The rapid advancement of ML and deep learning (DL) has drastically improved the accuracy and personalization of these recommendations. Numerous types of recommendation systems exist, each using distinct methodologies to process user input and produce recommendations.

2.3.1. Content-Based Filtering

The content-based filtering approach suggests items based on user interaction history with the content and their attributes [22]. It is particularly helpful in avoiding cold-start issues using item attributes rather than collaborative user data. However, one of the significant limitations of content-based filtering is overspecialization, wherein the system continues to recommend items that are too much alike, thereby lacking diversity in recommendations [23]. Recent studies have suggested the implementation of hybrid solutions that merge content-based and collaborative filtering techniques to resolve these issues. For example, work by Elahi et al. [24] explores the application of item-based stereotypes for the purpose of augmenting recommendations at the cold-start stage.

2.3.2. Collaborative Filtering

Collaborative filtering is one of the most widely used techniques, analyzing user behavior based on the preferences of similar users to predict individual preferences. User-based collaborative filtering is particularly noted for its intuitive implementation [25,26]. Nevertheless, both item-to-item and user-to-user collaborative filtering have several challenges, including sensitivity to temporal shifts in item popularity and the cold-start problem, where newly added items lack sufficient user interaction data [27]. Lin et al. [28] addressed these issues by proposing a distributionally robust and temporally aware optimization framework for cold-start recommendations, aiming to enhance the adaptability of collaborating filtering models to temporal feature shifts. Similarly, Zhou et al. [29] introduced a contrastive collaborative filtering framework to solve the cold-start problem by leveraging co-occurrence collaborative signals during the training process.

2.3.3. Hybrid Filtering

In order to transcend the limitations usually inherent in traditional content-based filtering and collaborative filtering techniques, hybrid recommendation systems were developed as a new approach. Hybrid techniques integrate various types of recommendation schemes to improve aggregate performance and surmount the weaknesses inherent in each scheme when used independently. Recent research has documented significant effectiveness of hybrid recommendation systems in an extremely broad range of applications. For instance, ref. [30] proposed a hybrid technique that integrates collaborative filtering with matrix factorization and neural network methods, achieving improved recommendation accuracy and coverage. Similarly, ref. [5] introduced a hybrid method consisting of content-based and collaborative filtering methods through the use of artificial neural networks. In spite of these developments, e-commerce recommendation systems still face major challenges, particularly around data quality, data availability, and budget constraints. Moreover, the cold-start problem remains a significant issue, since new user or product registration does not normally have sufficient historical data for making pertinent recommendations [31]. Recent research suggests that machine learning-based approaches can mitigate such difficulties through increased flexibility and robustness of recommendation algorithms [32].

2.4. Interior Design Recommendation Systems Using AI

In unsupervised machine learning technique matrix factorization (MF) is widely used in recommendation systems; scalability, ease of implementation, and flexibility are its advantages [33]. In contrast to content-based methods involving human effort and data labelling, MF correctly produces recommendations without any human effort using consumer buying history along with product ratings as input parameters [34]. Bayesian methods employ probabilistic models for the representation of uncertainty and incorporating prior knowledge into recommendations, hence offering flexibility across different kinds of data as well as robustness in the treatment of uncertainty [35]. Bayesian Personalized Ranking (BPR) is a recommender system that employs Bayesian inference to generate personalized ranking of items. It concentrates on pairwise relationships and user–item interactions and is thus particularly suitable for implicit feedback data [36]. BPR can solve cold-start issues by learning the latent representations from user–item interactions, resulting in scalability using stochastic gradient descent for efficient large-scale recommendation systems. Yet, its prominence on rating accuracy can come at the expense of interpretability by prioritizing recommendations derived from learned patterns over explicit explanations. K-Nearest Neighbors (KNN), a machine that learns from examples, generates predictions based on item or user similarity. Commonly used for collaborative filtering, KNN is straightforward and intuitive to comprehend without requiring a traditional training process. The KNN algorithm, while valued for its simplicity, encounters numerous challenges when it is applied to high-dimensional and large-scale datasets. With increasing dimensionality, the effect known as the “curse of dimensionality” begins to come into play, leading to sparsity of data and reduced efficiency of distance measures. With higher dimensions, the calculations become more complicated, thereby increasing the computational demands [37]. Furthermore, the hybrid logistic regression model employed in the proposed approach continues to learn from input variables, which are convenient for making predictions about user ratings; however, they may be prone to overfitting if it is not regularized appropriately. Empirical research has established that the use of regularization methods, i.e., Lasso (L1) and Ridge (L2), improves the performance of a model by alleviating issues related to overfitting and multicollinearity [38]. In addition, research has validated that regularization improves accuracy significantly, especially for scenarios with small training datasets [39].

3. Research Questions and Hypotheses

Based on the aims of this study and the research gaps identified in the literature review, the following research questions and hypotheses are formulated.

3.1. Research Question (RQ1)

Does integrating budget constraints into the recommendation model lead to improving financially appropriate service recommendations for cost-conscious homeowners?

Hypothesis (H1).

Including price and category information in the recommendation model will help the system match user budgets more accurately than models that do not use price-related features.

3.2. Research Question (RQ2)

Can the proposed approach provide a diverse as well as context-appropriate set of service recommendations that match a user’s selected design specifications or service categories?

Hypothesis (H2).

Using hybrid filtering with both text-based and category-based features will produce recommendations that are more diverse and better aligned with user design choices than content-only methods.

4. Proposed System Approach

This paper uses a hybrid approach that blends positivism and pragmatism to address gaps in the interior design industry by using machine learning (ML) for personalized, budget friendly recommendations. The dataset was sourced through interviews, questionnaires, and invoices from interior design companies in Kolkata, focusing on property sizes, service types, specifications, costs, and products. The data were then preprocessed to ensure data quality through cleaning, imputation, and normalization.

A hybrid recommendation model integrates TF–IDF-based content filtering for textual features and collaborative filtering for price and category alignment. Logistic Regression, combined with training and testing splits, evaluates the proposed approach’s performance via accuracy, precision, recall, and ROC-AUC. Class imbalances in the dataset are handled through up-sampling to ensure fairer predictions.

User feedback refines relevance scores generated from logistic regression probabilities, ensuring personalized suggestions. Additional ablation studies and stress tests are carried out to understand how well the model performs under different conditions and heavier workloads. This approach combines historical data with real-time user inputs collected through the developed web interface, enabling accurate, adaptable, and user-centric recommendations while also addressing accessibility gaps in India’s interior design landscape.

4.1. System Architecture

The Interior Design-Built services application has been developed to provide a user-friendly platform for integrating the recommendation system and its associated web services. This application streamlines the process of planning and budgeting interior design projects. It achieves this by seamlessly combining a dynamic frontend, robust backend, and advanced recommendation algorithms, offering users an efficient and personalized experience.

The system architecture (Figure 3) consists of three layers: the frontend, backend, and recommendation system. The frontend, built with ReactJS and styled using Tailwind CSS, serves as the user interface, enabling service browsing, cart management, and the display of recommendations. It manages user interactions with state-handling tools like useState and useEffect. The backend, developed using Express.js, acts as a middleware, handling user authentication, cart management, and interactions with the database. A PostgreSQL database, accessed via Prisma ORM, stores user, service, and cart data. The web part of recommendation system, implemented in Flask, leverages content-based and collaborative filtering using TF–IDF vectorization to generate tailored service recommendations.

The system’s data flow begins with users browsing and selecting services, which are added to the cart and sent to the backend via API requests. The backend updates the cart in the database and communicates with the recommendation system, which processes selected services to generate and return five sets of recommendations. These are displayed on the frontend for user exploration. The proposed system architecture ensures a seamless and engaging user experience, combining efficient data handling with accurate and personalized recommendations. The app’s user interface is designed to be intuitive, facilitating user inputs, displaying recommendations, and providing transparent explanations to build trust. The machine learning-based recommendation module processes user preferences using natural language processing (NLP) to structure inputs and a recommendation engine to generate initial suggestions. The database stores all relevant information, including user profiles, service details, and interaction history, guaranteeing scalability and security while supporting the recommendation system.

4.2. System Preliminary—TF–IDF (Term Frequency Inverse Document Frequency)

TF–IDF is a statistical measure technique which is used to evaluate the significance of a word in a document relative to a collection of corpus (documents). TF–IDF is a popular approach in natural language processing (NLP) and information retrieval tasks, including content filtering. In the experiments, TF–IDF is chosen due to several reasons. For instance, it provides recommendations based on intrinsic content, effective for detailed service specifications like material types and finishes, independent of extensive user interaction history. Equation (1) was used to calculate the frequency and relevance of terms that appear in each invoice.

T F (t, d) = \frac{f_{t, d}}{N_{d}}

(1)

Here, t is an item in the invoice, d refers to the invoice itself,

f_{t, d}

denotes the number of times item t appears in invoice d, and

N_{d}

is the total number of items listed in that invoice. It is also checked how frequently each item or specification appears across all invoices by applying the inverse document frequency, as shown in Equation (2).

I D F (t, D) = log (\frac{N}{n_{t}})

(2)

where N is the total number of invoices and quotations used to create the dataset, and

n_{t}

is the number of invoices and quotations that contain the same item. Finally, we used the weighted TF–IDF (Equation (3)) to transform invoice documents into vectorized form.

T F - I D F (t, d, D) = T F (t, d) \times I D F (t, D)

(3)

The weighting scheme of TF–IDF ensures that items appearing frequently in a single invoice but rarely across other quotations receive higher weight, while items appearing in many quotations receive lower weight.

4.3. System Preliminary Overview—Hybrid Model

The recommendation module uses a hybrid model that integrates content-based and collaborative filtering. Content-based filtering uses TF–IDF vectorization to transform textual features into numerical representations, enabling the system to analyze detailed service specifications. Collaborative filtering incorporates price ranges and categories, ensuring that recommendations align with user budgets and service preferences. This hybrid approach balances nuanced content analysis with practical considerations, generating personalized and budget-friendly suggestions.

4.3.1. Content-Based Component

Content-based filtering uses text information taken from service titles and specifications. These are combined into a single feature representation, as shown in Equation (4).

C_{i} = t_{i} + s_{i}

(4)

TF–IDF vectorization is then applied to

C_{i}

for all services, producing a numerical representation that captures term importance based on Equations (1)–(3). This enables the system to identify services with similar descriptions, materials, finishes, and functional properties. The similarity between any two services i and j is determined using cosine similarity by using Equation (5).

sim (i, j) = \frac{{SV}_{i} \cdot {SV}_{j}}{∥ {SV}_{i} ∥ ∥ {SV}_{j} ∥}

(5)

where

sim (i, j)

is the cosine similarity score between service i and service j;

{SV}_{i}

and

{SV}_{j}

are the TF–IDF feature vectors for the respective services;

{SV}_{i} \cdot {SV}_{j}

denotes the dot product representing the overlap of terms; and

∥ {SV}_{i} ∥

,

∥ {SV}_{j} ∥

are the Euclidean norms used for length normalization.

4.3.2. Collaborative Component

The collaborative filtering component focuses on latent preference patterns derived from service prices, categories, user–item interactions, and user similarity. These factors reflect typical homeowner constraints and selection behaviors. Each service acquired through the collaboration component is represented as shown in Equation (6).

{SV}_{j} = [c_{j, 1}, c_{j, 2}, \dots, c_{j, k}]

(6)

where c denotes the service, which includes price and category. These attributes enable the model to learn approximate neighborhoods of financially and thematically similar services. This component ensures that recommendations align with user budgets and comparable service groups.

4.3.3. Hybrid Integration

The hybrid model combines the output of the TF–IDF-based relevance classifier (content-based) with price–category alignment (collaborative filtering). For each service, a final hybrid score is computed through a weighted fusion of the content relevance probability

{\hat{p}}_{i}

and a collaborative similarity score

s_{i}^{CF}

, as shown in Equation (7).

S_{i} (α) = α s_{i}^{CF} + (1 - α) {\hat{p}}_{i}, α \in [0, 1] .

(7)

The collaborative score

s_{i}^{CF}

reflects similarity based on price ranges and service categories, while

{\hat{p}}_{i}

captures textual relevance. Only services within the user’s acceptable budget range

B

are considered, enforced using a budget constraint (Equation (8)).

{\tilde{S}}_{i} = S_{i} (α) \cdot ⊮ {p_{i} \in B}

(8)

The top-ranked services according to

{\tilde{S}}_{i}

form the recommended set. This integration leverages the strengths of both filtering approaches’ descriptiveness and budget-aware similarity, resulting in personalized, contextually aligned, and financially appropriate service recommendations.

Next, TF–IDF vectorization is applied on

{\tilde{S}}_{i}

to transform textual features into vector representations. As the collected dataset was not labelled, a set of random services was selected for labelling. Let

S^{*}

be the list of randomly selected service IDs. We assign labels as shown in Equation (9).

y_{i} = \{\begin{matrix} 1, & if {\tilde{S}}_{i} \in S * \\ 0, & otherwise \end{matrix}

(9)

Therefore, the resulting labelled dataset is represented as

(y_{i})

. To address the class imbalance, we applied SMOTE to upsample the training dataset (see Section 4.4).

The Logistic Regression classifier serves as the core machine learning algorithm for relevance prediction in the recommendation system. It leverages features transformed by the TF–IDF (Term Frequency-Inverse Document Frequency) method to determine the likelihood that a service matches a user’s preferences. TF–IDF is used to convert textual data such as service titles and specifications into numerical vectors that represent the importance of each term relative to the entire dataset. These transformed features serve as inputs to the Logistic Regression model, which then predicts the probability of each service being relevant to the user. Logistic Regression is particularly suited for this task due to its light weight and its ability to handle binary classification problems, such as predicting whether a service is relevant (1) or not relevant (0). By training the model on a labelled dataset of services with known relevance, the algorithm learns to associate patterns in the feature data with the corresponding relevance labels. Once trained, the model can accurately predict the relevance of new, unseen services based on the learned relationships between the features and the target variable. To train the logistic regression classifier with balanced weights, Equation (10) has been used.

y^{'} = σ (w^{T} X + b)

(10)

where

σ

is a non-linear activation function; in this case, the sigmoid has been used, which is

σ (Z) = \frac{1}{1 + e^{- Z}} .

In this context, w and b are the regression model parameters. In the proposed hybrid model, the binary cross-entropy loss function is applied, also known as log loss. It is defined in Equation (11).

Loss = - \frac{1}{N} \sum_{i = 1}^{N} [y_{i} log (p_{i}) + (1 - y_{i}) log (1 - p_{i})],

(11)

where N is the number of services in the dataset used to compute the loss,

y_{i}

is the true label for services, and

p_{i}

is the predicted probability.

This combination of logistic regression and TF–IDF for classification and feature extraction, respectively, allows the system to provide accurate and personalized service recommendation output, which could be tailored to each user’s specific preferences. The application of these methods helps the system stay efficient, scalable and interpretable all at the same time, which keeps the model’s complexity at a minimum.

While neural network models could have been alternatives to the current approach, the proposed approach prioritizes simplicity and interpretability. The rationale to utilize TF–IDF was driven by its efficiency in analyzing textual data and compatibility with developer-friendly libraries like scikit-learn. While complicated models may have higher predictive accuracy, TF–IDF provides a balance between performance and resource consumption, which made it the most pragmatic solution for this recommendation system. Additionally, TF–IDF and logistic regression are computationally lightweight compared to BERT (Bidirectional Encoder Representations from Transformers)/LLMs (Large Language Models). They require significantly less memory and processing power, making them suitable for environments with limited computational resources. TF–IDF + logistic regression can perform well on small to medium-sized datasets where deep learning models like BERT may overfit due to their large number of parameters. As the primary collected dataset is very small, BERT/LLMs typically require large amounts of labelled data to generalize well, whereas TF–IDF + logistic regression can work effectively with smaller datasets.

The second part of the architecture (as shown in Figure 3) utilizes the previously trained logistic regression classifier to predict the relevance of interior design services based on user-selected features. This part is implemented on the Interior Design-Built Services Calculator application, which is integrated into a web-based interface. Users can select different service characteristics, such as the type of service (e.g., painting, woodwork), specifications (e.g., material type, number of layers), and budget constraints. Then, the Logistic Regression model, trained with TF–IDF-transformed features, processes these inputs to determine whether the selected services are relevant to the user’s preferences.

A loop is executed to iterate through the selected services and predict relevance scores for all services within the specified budget range and category. The model continuously processes the input features, calculating the probability of each service being relevant, until it identifies a set of services with high relevance scores (class “YES” or 1). Once a suitable set of services is found, the system presents these recommendations to the user, enabling them to explore budget-friendly service packages. The workflow of the proposed hybrid model is shown in Figure 4.

4.4. Dataset Description and Preprocessing

The primary dataset was derived from invoices and quotations provided by homeowners and interior design firms in Kolkata. All instances were anonymized to maintain privacy (Figure 5 shows a sample of raw data). After careful evaluation, key features were selected, and the data was organized into a tabular format in an Excel file (Figure 6). The characteristics of the dataset include identifiers for services, titles, specifications, prices, and categories, and these form basic information for the services offered. The structured dataset enables proper preprocessing, feature engineering, and the subsequent utilization of machine learning and recommendation algorithms (Figure 4). The text entries were normalized for uniformity, and missing values were also handled through the utilization of imputation techniques such as K-Nearest Neighbors (KNN) imputation. A combined feature column was created by merging service titles and specifications, enabling content-based similarity analysis.

The initial primary dataset consisted of 235 instances representing interior design services and their associated attributes. From this dataset, 9 samples were manually labelled by two authors with the help of domain experts as either relevant (is_relevant = 1) or non-relevant (is_relevant = 0), based on users’ budget constraints and preferences. Inter-rater reliability between the two experts was assessed using Cohen’s Kappa, which produced a value of kappa = 0.82, indicating a strong level of agreement. Because the dataset was relatively small, additional training samples were generated using the SMOTE (Synthetic Minority Over-sampling Technique) to address class imbalance and support more reliable training of the model. SMOTE uses existing data instances and generates new synthetic minority samples. In this paper, the Python library imblearn was used for SMOTE with the parameters k_neighbors = 5, sampling_strategy = ’auto’, and random_state = 42. The algorithm works by selecting a minority class sample and forming a new synthetic example by interpolating between that sample and one of its nearest neighbors in the feature space. This method helps address class imbalance and reduces the risk of the model becoming biased toward the majority class.

Although the dataset represents real market conditions, its small size is still a limitation. Even so, the combination of expert labelling, synthetic data generation, and feature engineering allows the model to be evaluated realistically and supports its ability to generalize to similar service-recommendation tasks.

5. System Implementation, Results and Evaluations

The proposed interior service package recommendation system has been evaluated by using well-known machine learning evaluation metrics such as accuracy, precision, recall, and ROC-AUC. Precision measures the percentage of correctly predicted relevant services, while recall assesses the proportion of actual relevant services identified. The ROC–AUC further quantifies the model’s capability to discriminate between relevant and non-relevant services. The proposed system also calculates average relevance ratings on collections of recommendations, which reflect the general quality of the suggestions. User feedback further strengthens the system by gathering data on recommendation accuracy and relevance to enable continuous improvement.

5.1. Integration of the Recommendation System with the Estimation Application

The proposed developed system connects to a PostgreSQL database managed via Prisma ORM, thus ensuring efficient data storage and retrieval. This relational design supports both content-based and collaborative filtering approaches, hence ensuring scalability for potential expansion. Machine learning libraries like Scikit-learn ensure that there are no complex algorithms except TF–IDF vectorization and supervised machine learning approaches. These tools ensure that implementation and optimization processes are simplified, allowing the system to adapt to modified user needs. The developed recommendation system solves some key interior design dilemmas, including an overwhelming choice overload, constrained budgets, and decision fatigue. The developed system provides the functionality to create personalized service packages on the fly enabled through machine learning, thus simplifying decision making and untangling complexity. It provides valuable market insight into user needs, enabling businesses to optimize inventory and services. The budget-based recommendation model is innovative, offering a simplified, effective, and enriching experience for users and service providers alike.

Briefly, the Interior Design-Built Services Application uses lightweight machine learning algorithms and processes data efficiently. The human-centered design principles were followed to create an innovative application for homeowners. By focusing budget limitations and simplifying the service selection process, the proposed application breaks through traditional boundaries in the interior design field; it allows users to achieve their desired living spaces more efficiently and affordably. The runtime recommendation’s ability of the system and its integrative support ensure its compatibility with future needs, thus making it a change catalyst in digital interior design.

5.2. System Configuration and Results

The recommendation system was developed and executed on a Windows 11 machine with the following specifications: processor: 12th Gen Intel^® Core™ i7-12650H; installed RAM: 16.0 GB (15.6 GB usable); 64-bit operating system, x64-based processor. The experimental setup consists of the computational environment, collected dataset, and preprocessing pipeline. This environment provides a significantly sufficient computational capability, as the proposed machine learning approach is lightweight and sustainable. From experiments, we established that the chosen system configuration has ample processing power and memory to handle moderately complex machine learning models and data preprocessing tasks efficiently. Still, given these specifications, optimization was essential to support seamless operation in intensive data operations. Several methods were implemented to optimize the preprocessing of data and model training to allow for resource-conservative computations without loss of the system’s accuracy and reliability. Pre-processed datasets were maintained in a PostgreSQL database (run in a Docker container) for effective querying and access to data. PostgreSQL was adopted due to its scalability and reliability features, thus being appropriate for managing large volumes of data as well as catering to complex queries. The solution employs Prisma ORM for managing database operations, thereby integrating conveniently with the Node.js backend. This environment strikes a balance between performance and data access in catering to both the recommendation engine and ecommerce site. The complete code along with the dataset can be accessed from the GitHub page [40].

The Interior Design-Built Service Package Recommendation System (IDBspRS) performed well in initial trials; it effectively generates relevant recommendations based on user input (Figure 7). The IDBspRS used content-based and collaborative filtering methods to successfully identify appropriate suggestions and to provide contextually fitting options. The combination of these techniques yielded moderate success, with potential for future refinement.

In Figure 8, the cart webpage presents the items selected by the user. This section provides an overview of the services and products added to the cart, allowing users to review their selections before proceeding to checkout. The interface provides options to users to modify their choices by adding or removing items as needed. Furthermore, to display the cart items, the cart page also presents the first three recommendation sets based on the user’s selected interior design services as shown in Figure 9a. The proposed model generates these recommendations by using a hybrid recommendation approach that considers pricing alignment, relevance and diversity. The recommendations help users to explore additional services or products that complement their current selections. The cart webpage further extends the recommendations by displaying the last two recommendation sets tailored to the user’s selections as shown in Figure 9b. These additional recommendations allow users to have a comprehensive view of available service bundles, which potentially assist in their purchasing decisions by providing more personalized suggestions.

The interior design recommendations generated by the proposed model were evaluated based on pricing alignment, relevance, diversity and accuracy. For example, an extensive examination of how to evaluate recommendation systems focuses on relevance, diversity, and accuracy but does not clearly address price alignment [41]. But that has been incorporated into the system, as the application is a budget-driven recommendation system. Relevance measures how closely the recommended packages match user preferences. The relevance score for each recommendation is derived from the predicted probabilities generated by the trained Logistic Regression classifier. Specifically, the classifier assigns a probability to each service, representing the likelihood of its relevance based on the input features, which are constructed using a TF–IDF vectorizer with combined content and collaborative features. A higher relevance score indicates a greater likelihood that the service aligns with the user’s preferences or requirements. The relevance scores ranged from 0.6 to 0.85, indicating that the system consistently provided relevant suggestions. For example, selecting “gypsum board false ceiling with GI channel” resulted in similar but appropriate recommendations like “Gypsum board ceiling with thermal insulation.” Diversity ensured that users were presented with a variety of options rather than repetitive suggestions. The price alignment was in proper shape as the system suggested services that were within ±50% of the chosen service price. Briefly, while the system excelled in these areas, enhancements in metadata, diversification strategies, and price sensitivity would be able to render it even more effective and easier to use in general. While the system offered some variety, it occasionally suggested similar packages due to limited metadata, highlighting the need for an advanced diversification dataset and techniques. Accuracy was obtained indirectly by using precision and recall, due to class imbalance. Given the disproportionate distribution of class labels (e.g., a high prevalence of non-relevant items), a model could achieve high accuracy by predominantly predicting the majority class (non-relevant) while failing to effectively identify relevant services [42].

The proposed hybrid model, built on logistic regression with Stratified K-Fold cross-validation, achieved an accuracy of 83.62%, recall of 86.24%, and precision of 76.52%. These results show that integrating budget-related features enhances the model’s ability to identify financially appropriate services, directly supporting H1and answering RQ1 by showing that price and category information improve the accuracy of budget-aware recommendations. Although these results show better predictive capability, particularly in identifying all relevant cases, the relatively lower precision indicates a tendency toward false positives, suggesting that additional refinement is needed to improve classification performance. In order to do the ablation experiments, the TF–IDF was excluded, and the logistic regression model was trained without specification and price features; performance declined, with an accuracy of 81.23%. This supports H2 and answers RQ2, showing that hybrid filtering produces richer and more design-aligned recommendations than content-only approaches. For comparison, the proposed approach was compared with the KNN, Random Forest, and XGBoost models, where XGBoost perform better, achieving an accuracy of 80.50%, a precision of 72.32%, and a recall of 82.80%. Moreover, a summary of these results is presented in Table 1.

For comparison the ROC–AUC results are also shown in Figure 10. The Receiver Operating Characteristic (ROC) analysis indicates strong performance across all models, with Logistic Regression achieving the highest AUC score of 92.15%, showing excellent ability to distinguish between relevant and non-relevant services. XGBoost (AUC = 89.78%) and Random Forest (AUC = 87.10%) also perform well, showing their strength in capturing non-linear patterns in the hybrid feature space. KNN achieves a lower AUC of 85.25%.

In machine learning, a confusion matrix is a primary metric used to measure the performance of a model in comparing the actual labels of a test dataset to the predicted labels. For the implementation in this research of the proposed hybrid model, the confusion matrix is an important metric for ascertaining whether the proposed approach could classify relevant and non-relevant services accurately or not. The confusion matrix for a binary classification problem contains four elements: true positive (TP), false positive (FP), true negative (TN), and false negative (FN). The true positive (TP) cell contains the number of instances that are correctly identified as relevant services. The false positive (FP) cell, on the other hand, contains instances that are wrongly marked as relevant services but are actually non-relevant. Similarly, the true negative (TN) cell represents the number of services correctly labelled as not relevant, and the false negative (FN) cell is for the cases incorrectly labelled as non-relevant but actually were relevant.

Figure 11 shows a comparative confusion matrix analysis of logistic regression, KNN, Random Forest, and XGBoost models. The confusion matrix for logistic regression shows that the model correctly predicted (true positive) relevant services, i.e., 326, and 450 instances were correctly predicted as non-relevant services (true negative). Similarly, the model wrongly predicted (false positive) relevant services, where the actual services were non-relevant. This occurred 100 times. The model wrongly predicted (false negative) non-relevant services, where the actual services were relevant. This occurred 52 times.

Overall, the logistic regression hybrid model performed best, showing a strong balance between detecting relevant and non-relevant cases. KNN was most effective at recognizing non-relevant items and produced the fewest false positives, although it missed many relevant cases. Random Forest showed balanced but moderate performance, with more false positives than the other models. XGBoost detected the most relevant items by producing the fewest false negatives, but this also resulted in the highest number of false positives.

6. Discussion and Future Work

6.1. Discussion on Results

The IDBspRS proved to be highly effective in presenting personalized and relevant recommendations to users, solving key problems of budget suitability while offering varied design options. Through the use of a synergistic methodology that combines content-based filtering with collaborative filtering methods, the system is able to effectively map users to services that are highly compatible with individual preferences. With content-based filtering, the system filtered metadata like service names and specifications to ensure that recommended services were extremely relevant. Collaborative filtering, however, detected user behavior patterns to recommend services most frequently chosen by users with the same tastes. The blending of the two methods enabled the system to provide an integrated and individualized recommendation experience, which benefited niche need users, where traditional systems tend to fail.

As for the quality of the recommendations, the system performed well and the relevance in the recommendations varied from 60% to 85% for individual services, which established that the system consistently suggested a good set of recommendations. Additionally, the system’s diversity was moderate; it offers users a variety of options within their preferred service categories. Yet, in a few cases, some redundancy occurred where limited metadata was available, evidencing the need for more data expansion and diverse service descriptions.

Precision and recall measures of the system represented another significant strength. By having 86.24% recall (Regression-Hybrid, with TF–IDF), the model accurately labelled most services of interest, which is vital in recommendation systems when the aim is not to miss any service that might have been helpful. Nonetheless, precision being 76.52% means that certain recommendations did not meet the expectations of the user to some extent, but in many cases, such a compromise is taken for achieving maximum recall. In general, the IDBspRS has been a practical solution for homeowners who seek affordable and pertinent interior design recommendations. The analysis also identifies some possibilities for future improvement, i.e., enriching the metadata to a larger degree and refining price recommendations to come up with more accurate recommendations.

6.2. Future Work

Although the IDBspRS, as developed, has demonstrated applicability, several areas need further development to better meet the needs of a broader user base and reflect real-world application scenarios. One of the primary suggestions for future research is the diversification and enlargement of the dataset. At present, the system is developed using a limited range of service types and user profiles. Expanding the dataset to include more user types and service categories would make the recommendations more robust and widely applicable. With a more diverse range of users included, the system will be in a better position to accommodate differences in preferences, thereby rendering the recommendations more meaningful for an extended spectrum of users.

Another critical area for improvement regards the incorporation of more sophisticated feedback mechanisms. Presently, the system depends on basic feedback mechanisms, i.e., affirmations and negations; however, incorporating more complex feedback alternatives, e.g., determining the accuracy of recommendations or making elaborate explanations of service categories, would render the predictability of output generated by the system more achievable.

The enhancement would enable the recommendation engine to refine its quality over time based on the analysis of actual user context, thereby making more accurate and personalized recommendations. Control and customization by the user are valuable in developing a more personalized experience. The users should have options to modify the weight given to different categories, change their priority in terms of design alternatives, or state their budget constraints. With increased control over the recommendation process, the users will be shown proposals that more closely match their individual project needs, leading to increased user satisfaction. Furthermore, the system’s adaptive nature can be augmented through the integration of a more advanced recommendation generator that builds on both instantaneous feedback and an ongoing history of user behavior. This would make the system stay relevant to the user as their needs evolve with time. This approach would cultivate enhanced trust within the system and users would be in a position to make more informed decisions, consequently enhancing engagement and user satisfaction. With these future developments implemented, the IDBspRS can be even more powerful, more user-friendly, and more flexible, further advancing its role as a means for transforming homeowners’ interior design and home remodelling activities.

7. Conclusions

In conclusion, the results show that the proposed approach, the Interior Design-Built Service Package Recommendation System (IDBspRS), is a lightweight and effective solution for meeting the challenges of homeowners when selecting and managing interior design services under budget constraint situations. The proposed hybrid approach combines both collaborative and content-based filtering techniques. With an accuracy of 83.62% and an ROC–AUC of 92.15%, the system delivers personalized, relevant, and contextually appropriate recommendations that align with users’ preferences. The integration of machine learning with real-time user feedback, captured through the developed web-based tool, enables the system to continually refine its outputs, allowing users to receive tailored service packages that meet their design requirements and budget.

Although IDBspRS is effective, there are several aspects that need to be improved upon, ranging from the diversification of the dataset to the offering of personalized services and increased autonomy for the user. The system’s capacity to provide good recommendations, combined with openness and inclusiveness of user contribution, provides a good platform for further improvement. Incorporating a larger dataset, along with other advanced feedback mechanisms and customized features for particular users, would make it more adaptable and viable to a larger number of users.

In summary, the IDBspRS is a significant step forward in the interior design recommendation system area, enabled with lightweight logistic regression and TF–IDF approaches. It provides a user-oriented and scalable approach to interior design and home renovation projects and demonstrates great potential for further development and extended application.

Author Contributions

Conceptualization: P.K. and M.A.J.; Dataset collection: P.K.; Experiments: P.K. and M.A.J.; Supervision: M.A.J.; Manuscript writing and proofreading: P.K., M.A.J., N.U.H. and J.A.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Ethics Committee of Sheffield Hallam University (protocol code ER93394124) and 29 November 2024 of approval.

Informed Consent Statement

Informed consent was obtained from all participants involved in the study.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

For the purpose of open access, the author has applied a Creative Commons Attribution (CC BY) license to any Author Accepted Manuscript version of this paper arising from this submission. This paper is based on the MSc computing dissertation completed by the first author. Ethical approval for the study was granted under the UREC 2–Low Risk Research with Human Participants category, as approved by the candidate’s dissertation supervisor. During the preparation of this manuscript/study, the authors used M365 Copilot, GPT-5 model for the purposes of grammar checking, language improvement, and proofreading. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Urbanisation, Rural Consumption and Omnichannel Retail to Drive India’s Home and Household Market to US$237 Billion by 2030: Deloitte Report. Available online: https://www.deloitte.com/in/en/about/press-room/urbanisation-omnichannel-retail-drive-indias-household-market.html (accessed on 19 January 2026).
Interior Design Industry in India—Size, Statistics & Industry Growth. Available online: https://www.mordorintelligence.com/industry-reports/india-interior-design-market (accessed on 19 January 2026).
India Retailing. Powering Consumption Growth: India’s Home and Household Market by Deloitte. Available online: https://www.indiaretailing.com/2024/10/22/powering-consumption-growth-indias-home-and-household-market-by-deloitte/ (accessed on 15 February 2026).
BlueWeave. India Interior Design Market Size, Share and Trend 2029. Available online: https://www.blueweaveconsulting.com/report/india-interior-design-market (accessed on 15 February 2026).
Afoudi, Y.; Lazaar, M.; Achhab, M.A. Hybrid recommendation system combined content-based filtering and collaborative prediction using artificial neural network. Simul. Model. Pract. Theory 2021, 113, 102375. [Google Scholar] [CrossRef]
Han, G.; Feng, Z.; Xu, Y. Bundle Recommendation for Budget-Conscious Consumers. Tsinghua Sci. Technol. 2026, 31, 1111–1123. [Google Scholar] [CrossRef]
Mohammadi, H.; Bagheri, A.; Giachanou, A.; Oberski, D.L. Explainability in Practice: A Survey of Explainable NLP Across Various Domains. arXiv 2025, arXiv:2502.00837. [Google Scholar] [CrossRef]
Ramos, J.; Rahmani, H.A.; Wang, X.; Fu, X.; Lipani, A. Transparent and Scrutable Recommendations Using Natural Language User Profiles. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers); Association for Computational Linguistics: Bangkok, Thailand, 2024; pp. 13971–13984. [Google Scholar] [CrossRef]
Afsar, M.M.; Crump, T.; Far, B. Reinforcement learning based recommender systems: A survey. ACM Comput. Surv. 2022, 55, 1–38. [Google Scholar] [CrossRef]
Khanal, S.S.; Prasad, P.W.C.; Alsadoon, A.; Maag, A. A systematic review: Machine learning based recommendation systems for e-learning. Educ. Inf. Technol. 2020, 25, 2635–2664. [Google Scholar] [CrossRef]
Said, A. On explaining recommendations with Large Language Models: A review. Front. Big Data 2025, 7, 1505284. [Google Scholar] [CrossRef] [PubMed]
Silva, Í.; Marinho, L.; Said, A.; Willemsen, M.C. Leveraging ChatGPT for Automated Human-centered Explanations in Recommender Systems. ACM Int. Conf. Proc. Ser. 2024, 12, 597–608. [Google Scholar] [CrossRef]
Ren, Q.; Ren, X.; Huang, C. XRec: Large language models for explainable recommendation. arXiv 2024, arXiv:2406.02377. [Google Scholar] [CrossRef]
Yahav, I.; Shehory, O.; Schwartz, D. Comments Mining With TF-IDF: The Inherent Bias and Its Removal. IEEE Trans. Knowl. Data Eng. 2019, 31, 437–450. [Google Scholar] [CrossRef]
Dandu, M.M.K.; Vijayabaskar, S.; Voola, P.K.; Agarwal, R.; Goel, O. Cross Category Recommendations Using LLMs. Darpan Int. Res. Anal. 2024, 12, 80–107. [Google Scholar] [CrossRef]
Alamdari, P.M.; Navimipour, N.J.; Hosseinzadeh, M.; Safaei, A.A.; Darwesh, A. Image-based Product Recommendation Method for E-commerce Applications Using Convolutional Neural Networks. Acta Inform. Pragensia 2022, 11, 15–35. [Google Scholar] [CrossRef]
Livspace. Best Interior Designers Company in India. Available online: https://www.livspace.com/in (accessed on 16 February 2026).
HomeLane. Available online: https://www.homelane.com/ (accessed on 16 February 2026).
StartMotionMedia. The Market Trends and Insights of the Indian Interior Designing Industry. Available online: https://www.startmotionmedia.com/the-market-trends-and-insights-of-the-indian-interior-designing-industry/ (accessed on 5 February 2026).
Atabay, S.; Galipogullari, N. Application of Value Engineering in Construction Projects. J. Traffic Transp. Eng. 2013, 1, 12. [Google Scholar] [CrossRef][Green Version]
Procore. Value Engineering in Construction: 6 Steps to Increase Project Value. Available online: https://www.procore.com/library/value-engineering (accessed on 5 February 2026).
Pazzani, M.J.; Billsus, D. Content-Based Recommendation Systems. In The Adaptive Web; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2007; Volume 4321, pp. 325–341. [Google Scholar]
Liu, J.; Dolan, P.; Pedersen, E.R. Personalized news recommendation based on click behavior. In IUI ’10: Proceedings of the 15th International Conference on Intelligent USER Interfaces; Association for Computing Machinery: New York, NY, USA, 2010; pp. 31–40. [Google Scholar] [CrossRef]
AlRossais, N.; Kudenko, D.; Yuan, T. Improving cold-start recommendations using item-based stereotypes. User Model. User-Adapt. Interact. 2021, 31, 867–905. [Google Scholar] [CrossRef]
Sarwar, B.; Karypis, G.; Konstan, J.; Riedl, J. Item-based collaborative filtering recommendation algorithms. In WWW ’01: Proceedings of the 10th International Conference on World Wide Web; Association for Computing Machinery: New York, NY, USA, 2001; pp. 285–295. [Google Scholar] [CrossRef]
Su, X.; Khoshgoftaar, T.M. A survey of collaborative filtering techniques. Adv. Artif. Intell. 2009, 2009, 421425. [Google Scholar] [CrossRef]
Abdalla, N.; Forthomme, D. Probabilistic Approach for Recommendation Systems. In Proceedings of the 2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA), Nassau, Bahamas, 12–14 December 2022; pp. 629–634. [Google Scholar] [CrossRef]
Lin, X.; Wang, W.; Zhao, J.; Li, Y.; Feng, F.; Chua, T.-S. Temporally and distributionally robust optimization for cold-start recommendation. Proc. AAAI Conf. Artif. Intell. 2024, 38, 8750–8758. [Google Scholar] [CrossRef]
Zhou, Z.; Zhang, L.; Yang, N. Contrastive Collaborative Filtering for Cold-Start Item Recommendation. In WWW ’23: Proceedings of the ACM Web Conference 2023; Association for Computing Machinery: New York, NY, USA, 2001; pp. 928–937. [Google Scholar] [CrossRef]
Anelli, V.W.; Bellogín, A.; Noia, T.D.; Pomo, C. Reenvisioning the comparison between neural collaborative filtering and matrix factorization. In RecSys ’21: Proceedings of the 15th ACM Conference on Recommender Systems; Association for Computing Machinery: New York, NY, USA, 2021; pp. 521–529. [Google Scholar] [CrossRef]
Nguyen, D.N.; Nguyen, V.-H.; Trinh, T.; Ho, T.; Le, H.-S. A personalized product recommendation model in e-commerce based on retrieval strategy. J. Open Innov. Technol. Complex. 2024, 10, 100303. [Google Scholar] [CrossRef]
Thomas, B.; John, A.K. Machine Learning Techniques for Recommender Systems—A Comparative Case Analysis. IOP Conf. Ser. Mater. Sci. Eng. 2021, 1085, 012011. [Google Scholar] [CrossRef]
Xue, H.-J.; Dai, X.-Y.; Zhang, J.; Huang, S.; Chen, J. Deep matrix factorization models for recommender systems. In IJCAI’17: Proceedings of the 26th International Joint Conference on Artificial Intelligence; AAAI Press: Washington, DC, USA, 2017; pp. 3203–3209. Available online: https://dl.acm.org/doi/10.5555/3172077.3172336 (accessed on 1 June 2025).
Koren, Y.; Bell, R.; Volinsky, C. Matrix Factorization Techniques for Recommender Systems. Computer 2009, 42, 30–37. [Google Scholar] [CrossRef]
Machine Learning Textbook. Available online: https://www.cs.ubc.ca/~murphyk/MLbook/ (accessed on 17 March 2025).
Ko, H.; Lee, S.; Park, Y.; Choi, A. A Survey of Recommendation Systems: Recommendation Models, Techniques, and Application Fields. Electronics 2022, 11, 141. [Google Scholar] [CrossRef]
Halder, R.K.; Uddin, M.N.; Uddin, M.A.; Aryal, S.; Khraisat, A. Enhancing K-nearest neighbor algorithm: A comprehensive review. J. Big Data 2024, 11, 113. [Google Scholar] [CrossRef]
Pillonetto, G.; Chen, T.; Chiuso, A.; Nicolao, G.D.; Ljung, L. Regularization of Linear Regression Models. In Regularized System Identification; Springer: Cham, Switzerland, 2022. [Google Scholar]
Salehin, I.; Kang, D.K. A Review on Dropout Regularization Approaches for Deep Neural Networks. Electronics 2023, 12, 3106. [Google Scholar] [CrossRef]
GitHub. PranabantiKarmaakar99/Dissertaion. Available online: https://github.com/PranabantiKarmaakar99/Dissertaion/tree/master (accessed on 17 February 2026).
Jadon, A.; Patil, A. A Comprehensive Survey of Evaluation Techniques for Recommendation Systems. In Computation of Artificial Intelligence and Machine Learning. ICCAIML 2024; Springer: Cham, Switzerland, 2025; pp. 1–12. [Google Scholar]
Owusu-Adjei, M.; Ben Hayfron-Acquah, J.; Frimpong, T.; Abdul-Salaam, G. Imbalanced class distribution and performance evaluation metrics: A systematic review. PLOS Digit. Health 2023, 2, e0000290. [Google Scholar] [CrossRef] [PubMed]

Figure 1. India interior design-build industry market trends [4].

Figure 2. Themes in the related work.

Figure 3. System architecture of the interior design build estimator.

Figure 4. Proposed system working logic flow.

Figure 5. Quotation received from interior design build shop.

Figure 6. Dataset generated from the invoices and quotations.

Figure 7. The recommendation services, where a user can select interior design services.

Figure 8. The cart page where the cart chosen by the user is displayed.

Figure 9. A set of recommendations for the services selected by the user.

Figure 10. ROC-AUC analysis of logistic regression, KNN, Random Forest, and XGBoost models.

Figure 11. Comparative confusion matrix analysis of logistic regression, KNN, Random Forest, and XGBoost models.

Table 1. Model performance comparison and ablation results (sorted by accuracy).

Model	Accuracy (%)	Precision (%)	Recall (%)	ROC-AUC (%)
KNN (with TF–IDF)	78.56	74.65	71.69	85.25
Random Forest (with TF–IDF)	78.56	72.32	76.72	87.10
XGBoost (with TF–IDF)	80.05	72.96	82.80	89.78
Logistic Regression (without TF–IDF)	81.23	72.26	78.56	88.35
Proposed Logistic Regression (Hybrid, with TF–IDF)	83.62	76.52	86.24	92.15

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Karmaakar, P.; Jarwar, M.A.; Wahid, J.A.; Hasan, N.U. IDBspRS: An Interior Design-Built Service Package Recommendation System Using Artificial Intelligence. Sustainability 2026, 18, 3605. https://doi.org/10.3390/su18073605

AMA Style

Karmaakar P, Jarwar MA, Wahid JA, Hasan NU. IDBspRS: An Interior Design-Built Service Package Recommendation System Using Artificial Intelligence. Sustainability. 2026; 18(7):3605. https://doi.org/10.3390/su18073605

Chicago/Turabian Style

Karmaakar, Pranabanti, Muhammad Aslam Jarwar, Junaid Abdul Wahid, and Najam Ul Hasan. 2026. "IDBspRS: An Interior Design-Built Service Package Recommendation System Using Artificial Intelligence" Sustainability 18, no. 7: 3605. https://doi.org/10.3390/su18073605

APA Style

Karmaakar, P., Jarwar, M. A., Wahid, J. A., & Hasan, N. U. (2026). IDBspRS: An Interior Design-Built Service Package Recommendation System Using Artificial Intelligence. Sustainability, 18(7), 3605. https://doi.org/10.3390/su18073605

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

IDBspRS: An Interior Design-Built Service Package Recommendation System Using Artificial Intelligence

Abstract

1. Introduction

2. Related Work

2.1. Interior Design-Built Industry

2.2. Value Engineering in Interior Design and Construction Sector

2.3. AI-Based Recommendation Systems in Interior Design

2.3.1. Content-Based Filtering

2.3.2. Collaborative Filtering

2.3.3. Hybrid Filtering

2.4. Interior Design Recommendation Systems Using AI

3. Research Questions and Hypotheses

3.1. Research Question (RQ1)

3.2. Research Question (RQ2)

4. Proposed System Approach

4.1. System Architecture

4.2. System Preliminary—TF–IDF (Term Frequency Inverse Document Frequency)

4.3. System Preliminary Overview—Hybrid Model

4.3.1. Content-Based Component

4.3.2. Collaborative Component

4.3.3. Hybrid Integration

4.4. Dataset Description and Preprocessing

5. System Implementation, Results and Evaluations

5.1. Integration of the Recommendation System with the Estimation Application

5.2. System Configuration and Results

6. Discussion and Future Work

6.1. Discussion on Results

6.2. Future Work

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI