Previous Article in Journal
Federated Learning Frameworks for Intelligent Transportation Systems: A Comparative Adaptation Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Lightweight LLM-Based Semantic–Spatial Inference Framework for Fine-Grained Urban POI Analysis

1
School of Architecture, South China University of Technology, Guangzhou 510641, China
2
State Key Laboratory of Subtropical Building and Urban Science, South China University of Technology, Guangzhou 510641, China
3
College of Computer Science and Technology, Xidian University, Xi’an 710071, China
*
Author to whom correspondence should be addressed.
Smart Cities 2026, 9(1), 13; https://doi.org/10.3390/smartcities9010013
Submission received: 27 October 2025 / Revised: 22 December 2025 / Accepted: 7 January 2026 / Published: 16 January 2026

Highlights

What are the main findings?
  • A fine-tuned open large language model (Qwen3-8B + LoRA) achieves superior fine-grained classification of unstructured POI texts (macro-F1 = 0.843), outperforming BERT and commercial LLM baselines while running on consumer-grade hardware.
  • The framework expands usable fine-category POI labels by ≈14–15× in Guangzhou and Shenzhen, enabling robust 500 m grid residual–hotspot analyses that reveal structural cultural preferences and urban diversity patterns.
What is the implication of the main finding?
  • The pipeline transforms unstructured urban text into interpretable spatial evidence, providing a reproducible and resource-efficient method for service-equity assessment and cultural-space analysis in planning.
  • Its domain-agnostic, lightweight design can be extended to other urban functions (e.g., healthcare, education), offering a scalable tool for fine-resolution spatial governance in smart-city contexts.

Abstract

Unstructured POI name texts are widely used in fine-grained urban analysis, yet missing labels and semantic ambiguity often limit their value for spatial inference. This study proposes a large language model-based semantic–spatial inference framework (LLM-SSIF), a lightweight semantic–spatial pipeline that translates POI texts into interpretable, fine-grained spatial evidence through an end-to-end workflow that couples scalable label expansion with scale-controlled spatial diagnostics at a 500 m resolution. A key advantage of LLM-SSIF is its deployability: LoRA-based parameter-efficient fine-tuning of an open LLM enables lightweight adaptation under limited compute while scaling fine-label coverage. Trained on a nationwide cuisine-labeled dataset (~220,000 records), the model achieves strong multi-class short-text recognition (macro-F1 = 0.843) and, in the Guangzhou–Shenzhen demonstration, expands usable fine-category labels by ~14–15× to support grid-level inference under long-tail sparsity. The spatial module then isolates cuisine-specific over/under-representation beyond overall restaurant intensity, revealing contrasting cultural configurations between Guangzhou and Shenzhen. Overall, LLM-SSIF provides a reproducible and transferable way to translate unstructured POI texts into spatial–statistical evidence for comparative urban analysis.

1. Introduction

With the rapid development of big data and artificial intelligence, urban research increasingly relies on multi-source heterogeneous data to analyze complex spatial structures and social phenomena [1,2]. Among these sources, points of interest (POIs) provide high-coverage, fine-location records of urban functions such as dining, retail, healthcare, and education [3]. These records embed rich patterns related to activity rhythms, functional zoning, and cultural representation [4,5], patterns that are core to foundational theories of urban spatial structure and cultural embeddedness. POI names are not only clues for facility identification but also reflect layers of local culture, consumer preferences, and population mobility [6,7]. In this sense, extracting information from unstructured POI texts is a prerequisite for turning “urban semantics” into interpretable spatial evidence for planning and geography research [8].
A substantial body of research has sought to enrich POI semantics for fine-grained urban analysis, and existing approaches can be grouped into three streams. The first stream develops POI classification and semantic enrichment using keyword heuristics, topic models, embeddings, and supervised text classifiers [9,10,11], but it often degrades on real-world POI names due to ambiguity, naming conventions, and class imbalance [12,13,14], especially for long-tail categories [15]. The second stream uses enriched POI categories for urban function mapping and interpretation of consumption and cultural landscapes [9,15], yet grid-or community-scale inference is frequently limited by missing labels and inconsistent taxonomies across providers [16,17,18,19], undermining high-resolution and cross-city comparability [20,21]. The third stream applies LLMs to urban text understanding and short-text disambiguation [22,23,24], but practical deployment remains constrained by reliance on costly, opaque commercial APIs and the computational burden of full-parameter tuning for open models [25,26,27]. Consequently, a lightweight, open, and scalable pathway remains lacking for expanding fine-grained POI labels and converting semantic outputs into interpretable spatial evidence under realistic resource constraints [28,29].
To address this gap, this study proposes an LLM-based semantic–spatial inference framework (LLM-SSIF) for unstructured POI text. A PEFT (LoRA)-based semantic expansion module enables scalable fine-grained label expansion on an open LLM under practical compute budgets [30,31]. A spatial inference module combines scale-controllable residualization with local clustering detection to translate expanded labels into spatial evidence and distinguish structural cultural signals from density-driven scale effects [32,33]. Conceptually, the framework follows an inferential chain from POI text semantics to fine-grained labels and then to scale-adjusted spatial deviations that reveal patterns of urban cultural differentiation. More broadly, by transforming short POI names into structured signals suitable for spatial–statistical inference [34], the framework enables “urban semantics” to enter mainstream spatial analytic workflows and supports more comparable evidence on cultural embeddedness and polycentric differentiation [7,35].
Accordingly, this study addresses the following three interrelated questions: (1) How can lightweight PEFT enable accurate, fine-grained labeling of unstructured POI texts under limited compute [30]? (2) Can the framework scale labels under incomplete platform categorization and limited annotation while remaining robust for long-tail cuisines and transferable across cities? (3) How can LLM-derived labels be translated into interpretable spatial evidence of cultural differentiation [36]?
To address these questions, the Tongyi Qianwen 3-8B (Qwen3-8B) model is fine-tuned with LoRA on consumer-grade hardware and evaluated on a nationwide restaurant POI dataset (≈220,000 labeled records), followed by cross-city case studies in Guangzhou and Shenzhen. Beyond classification performance, a downstream spatial inference module is implemented to generate 500 m grid-level spatial evidence through GLM residualization (with an offset for total counts) and Getis–Ord Gi* hotspot detection.
The methodological contributions are threefold. First, the study explicitly links LLM-based label expansion with scale-controlled spatial inference, bridging the gap between semantic enrichment and theory-relevant urban interpretation. Second, a lightweight and reproducible PEFT (LoRA) strategy is demonstrated as a practical alternative to closed commercial APIs and computationally intensive full-parameter tuning, enabling fine-grained POI text recognition under typical research resources. Third, the framework operationalizes a scale-controlled “residual–hotspot” workflow that converts fine-grained labels into comparable spatial evidence for assessing cross-city cultural differentiation.
The structure of the paper is as follows: Section 2 introduces the research methods and data; Section 3 presents classification results and performance evaluation; Section 4 conducts spatial analysis based on case studies of Guangzhou and Shenzhen, revealing structural distributions of cuisines and their cultural implications; and Section 5 concludes with a discussion and future directions.

2. Methodology

2.1. Research Framework

To bridge unstructured POI text with interpretable urban spatial evidence, this study proposes an LLM-based semantic–spatial inference framework (LLM-SSIF) and operationalizes the cross-disciplinary goal of “unstructured text → urban spatial evidence” as the following four-stage pipeline (see Figure 1): (i) data acquisition, taxonomy harmonization, and cleaning; (ii) supervised PEFT (LoRA) of an open LLM; (iii) model evaluation via macro-averaged metrics and error profiling; and (iv) aggregation of labels to 500 m grids and coupling GLM residuals with Gi* hotspots to yield interpretable spatial evidence. The design prioritizes deployability and interpretability, stably expanding fine-label coverage under limited compute.
First, this study compiles approximately 220,000 cuisine-labeled dining POIs from across China. After data cleaning and multi-class taxonomy harmonization, the dataset is split into training, validation, and test sets. The nationwide coverage increases category and regional diversity, strengthening generalization and providing a robust basis for subsequent city-level spatial analyses and case studies.
Second, to achieve accurate fine-grained classification within limited computational resources, this study adopts the open LLM Qwen3-8B and fine-tunes it with PEFT (LoRA). This enables lightweight training and efficient inference on consumer-grade GPUs, achieving a practical performance–compute–deployability trade-off.
Third, the model’s effectiveness is evaluated through a focus on robustness and coverage. This study prioritizes macro-averaged precision, recall, and F1-score to ensure balanced assessment across imbalanced cuisine classes, supplemented by the Valid Coverage Rate (VCR) to quantify the expansion of analytically usable labels. A detailed examination of the confusion matrix offers further insight into model behavior, particularly for long-tail and semantically ambiguous categories.
Fourth, to transform the expanded labels into evidence of cultural spatial differentiation, this study connects the classification output to spatial econometric analysis. For our case studies in Guangzhou and Shenzhen, classification outputs are aggregated to 500 m × 500 m grid cells and analyzed using descriptive mapping, offset-GLM residualization (with total count as offset), and Getis–Ord Gi* hotspot detection. This reveals structural preferences across cuisines and the spatial manifestations of urban cultural differences.
In sum, LLM-SSIF couples PEFT-tuned LLM labeling with offset-GLM residualization and Gi* hotspot detection on 500 m grids, converting unstructured POI names into analysis-ready inputs and interpretable city-scale spatial evidence. The cross-city case studies demonstrate transfer with minimal adaptation while maintaining robust macro-level performance.

2.2. Data and Preprocessing

This study uses a nationwide dataset of restaurant points of interest (POIs) in China for 2023 and selects Guangzhou and Shenzhen as case cities for the spatial analysis. Restaurant POIs were collected from the Amap (Gaode Map) Open Platform via Python 3.7 scripts using its Web Service API (POI Search, v3). After acquisition, the raw POI records were systematically cleaned and organized. Duplicate entries were identified and removed by jointly applying fuzzy matching on POI names and a spatial proximity constraint (within 50 m), retaining only the most information-complete record for each duplicate group. This procedure provided a relatively stable data foundation for constructing the classification taxonomy.
Although Amap’s official POI categories offer relatively fine-grained delineation of food-service venues, cuisine-specific labels are frequently missing in practice. In our dataset of approximately 5.10 million restaurant POIs, only about 220,000 records contain cuisine annotations—roughly 4.3% of the total—rendering the labeled subset insufficient for fine-grained spatial analyses of culinary–cultural heterogeneity.
To support cuisine identification from restaurant POIs, we develop an LLM-based classification label system. To ensure analytical reliability and interpretability, we focus on 15 cuisine categories with both strong identification performance and adequate sample sizes (see Table 1).
As shown in Figure 2a, cuisine-labeled POIs cover almost all prefecture-level cities, ensuring nationwide representativeness. Figure 2b shows that most names are 3–8 characters, reflecting typical naming but also creating ambiguity in short texts. At the categorical level (Figure 2c), Sichuan, Hunan, and Cantonese dominate, yet even the smallest categories contain over a thousand samples, providing sufficient data to support model training across 15 cuisines. Meanwhile, to further verify the effectiveness of the LLM-SSIF, Guangzhou and Shenzhen are selected as case studies for in-depth exploration.
The final dataset is split into training, validation, and test sets in a 64%:16%:20% ratio—first dividing the data 80:20 into training and test sets, and then randomly sampling 20% from the training set for validation. Class balance is maintained across all subsets to ensure consistency during model training and evaluation.

2.3. Model and Lightweight Fine-Tuning

This study adopts Parameter-Efficient Fine-Tuning (PEFT) to enable efficient model adaptation under constrained hardware resources (see Figure 3). By updating only a small subset of parameters in a pre-trained model, PEFT substantially reduces computational and GPU memory costs relative to full-parameter fine-tuning [31]. Among available PEFT strategies, we employ Low-Rank Adaptation (LoRA) [30,37], which injects trainable low-rank matrices into Transformer attention layers. This design achieves effective task adaptation with a modest parameter increase and, in practice, can attain performance close to full fine-tuning.
Hyperparameter choices are guided by LoRA’s low-rank adaptation mechanism and training stability. We set the rank to 8, providing sufficient adaptation capacity while keeping the parameter increment manageable. The scaling factor (α) is set to 16 to balance the numerical scale between the frozen pre-trained weights and the low-rank updates, thereby improving training stability [30]. For regularization, we set LoRA dropout to 0, motivated by a combined strategy of “structural constraint + training-time monitoring.” Specifically, LoRA introduces only a limited number of trainable parameters while the backbone weights remain frozen, which already restricts model degrees of freedom; in addition, we mitigate overfitting through validation-based monitoring and early stopping. For our short-text multi-class classification task, additional dropout may weaken the effective update signal in the low-rank branch and is unlikely to yield substantial generalization gains.
We optimize the model using AdamW (learning rate: 5 × 10−5) with a cosine annealing schedule to balance convergence speed and stability, and apply gradient clipping (max norm = 1.0) to reduce instability risks under mixed-precision training. The per-device batch size is 16, combined with gradient accumulation to increase the effective batch size, alleviating memory constraints and reducing gradient noise. Training is conducted in bfloat16 precision with a maximum sequence length of 512 tokens, which covers the primary semantic content in POI names and prompt templates. All configurations are summarized in Table 2.
The maximum number of epochs is set to 10, with early stopping enabled [38]. The training loss decreases rapidly in the early stage and then stabilizes (Figure 4). The validation loss reaches its minimum at approximately 1300 steps (≈4.7 epochs; loss ≈ 0.0724) and subsequently increases (Figure 5); accordingly, we select the best checkpoint between epochs 4 and 5 as the final model. All lightweight fine-tuning experiments are conducted on consumer-grade hardware (NVIDIA RTX 4090 GPU, 24 GB VRAM, designed by NVIDIA Corporation, Santa Clara, USA), indicating that effective training and deployment for fine-grained text classification can be achieved in this setting without access to high-performance computing infrastructure.

2.4. Evaluation Metrics

To comprehensively evaluate the effectiveness and deployability of the proposed framework for classifying unstructured urban POI text, this study constructs a multi-dimensional evaluation system. The main metrics include overall accuracy, macro-averaged precision, macro-averaged recall, macro-averaged F1-score, and per-class F1 scores [39].
Due to the significant imbalance in POI category distribution (e.g., Cantonese and Hunan cuisine samples far outnumber those of niche cuisines), relying solely on overall accuracy or weighted metrics can lead to large-category dominance and obscure performance on underrepresented classes [40]. Therefore, macro-averaged metrics are used, where precision, recall, and F1-score are computed for each class and then averaged arithmetically, giving each category equal weight in the evaluation [41]. Compared to micro- or weighted averages, macro-averaging is better suited for assessing model performance across diverse classes in imbalanced short-text classification tasks [42].
Additionally, to reflect the model’s effective coverage of the full dataset, this study introduces Valid Coverage Rate (VCR) as a supplementary metric.
Overall accuracy is defined as follows:
A c c u r a c y = T P + T N T P + T N + F P + F N
where T P (True Positives) refers to correctly predicted positive samples, T N (True Negatives) to correctly predicted negative samples, F P (False Positives) to samples incorrectly predicted as positive, and F N (False Negatives) to samples incorrectly predicted as negative.
For multi-class classification, the precision, recall, and F1-score of class i are defined as follows:
P r e c i s i o n i = T P i T P i + F P i
R e c a l l i = T P i T P i + F N i
F 1 i = 2 P r e c i s i o n i × R e c a l l i P r e c i s i o n i + R e c a l l i
where T P i , F P i , and F N i are the true positives, false positives, and false negatives for class i , respectively. Precision measures the model’s ability to avoid false positives, recall measures its ability to identify all relevant instances, and F1-score balances both.
Macro-averaged metrics are calculated as the arithmetic mean across all classes, as follows:
M a c r o P r e c i s i o n = 1 N i = 1 N P r e c i s i o n i
M a c r o R e c a l l = 1 N i = 1 N R e c a l l i
M a c r o F 1 = 1 N i = 1 N F 1 i
where N is the total number of classes (excluding the Other category). Macro-averaging mitigates the bias introduced by class imbalance and better reflects model performance across different classes.
The Valid Coverage Rate (VCR) is defined as follows:
V C R = N v a l i d N t o t a l
where N v a l i d denotes the number of samples assigned to any fine-grained class (i.e., predicted as non-Other) and N t o t a l is the total number of samples. Thus, VCR quantifies the proportion of the dataset that receives a usable non-Other label from the model.
Overall, macro-averaging (excluding “Other”) curbs head-class dominance and reflects performance on long-tail categories. Valid Coverage Rate (VCR) measures the proportion of samples assigned non-Other labels, i.e., the effective sample density for 500 m grid econometrics. In practice, macro-F1 captures label reliability, whereas VCR captures analytical coverage; together, they determine the robustness of fine-scale spatial evidence.

2.5. Residuals and Spatial Analysis

In order to transform the expanded labels into interpretable evidence of structural cultural preferences and to distinguish these from patterns driven solely by general commercial density, this study employs a two-step spatial econometric approach. The core analytical challenge lies in determining whether the clustering of a cuisine stems from a locally embedded cultural preference or merely mirrors the overall restaurant supply.
After obtaining cuisine distribution data classified by the LLM, this study further computes standardized residuals at the 500 m × 500 m grid level to reveal deviations of different cuisines in urban space. Specifically, let the observed count of cuisine c in grid j be as follows:
y j c = o b s e r v e d   c o u n t   o f   c u i s i n e   c   i n   g r i d   j
and the total number of restaurants in grid j be as follows:
n j = t o t a l   n u m b e r   o f   r e s t a u r a n t s   i n   g r i d   j
Let the overall proportion of cuisine c in the entire city be p c . Under scale control, the expected count is as follows:
μ j c = n j p c
To control for the scale effect dominated by the total number of restaurants n j , this study employs a Generalized Linear Model (GLM) to estimate the expected count μ j c of cuisine c in grid j . The model incorporates log n j as a fixed offset (not an estimated variable) [43]. The core purpose of using an offset is to remove the ‘expected’ component due to variations in commercial density from the observed counts, thereby isolating the relative over- or under-concentration specific to a cuisine. The model is fitted as a Poisson or Negative Binomial regression with only an intercept, as follows:
log ( μ j c ) = β 0 c + log ( n j ) ,
μ j c = n j e β 0 c
If the Negative Binomial model converges, its variance function is V a r ( μ j c ) = μ j c + κ μ j c 2 . If it fails to converge, it degenerates to a Poisson model: V a r ( μ j c ) = μ j c .
Based on the expected value and variance, the standardized Pearson residual is calculated as follows:
r j c = y j c μ j c V a r ( μ j c )
This residual measures the relative deviation between local observations and citywide expectations. If r j c > 0 , the grid has more instances of a cuisine than expected, suggesting possible spatial clustering; if r j c < 0 , the cuisine is underrepresented, indicating possible spatial insufficiency. Larger absolute values of r j c indicate greater deviation. Residuals for cuisines with extremely low citywide frequencies or insufficient variance are set to zero to avoid estimation bias.
The purpose of residualization is to control for grid-level scale/exposure effects, thereby preventing mechanical influences of overall restaurant volume on cuisine counts and shifting the analytical focus from “spatial agglomeration of total restaurants” to “structural deviations relative to a citywide baseline.” Accordingly, Pearson residuals capture the relative over- or under-representation of a given cuisine within local grids compared with its mean citywide share, rather than differences in absolute counts. Importantly, residualization is not intended to eliminate spatial structure. On the contrary, if spatial autocorrelation persists after removing scale effects, this remaining dependence constitutes the key signal of interest in this study, pointing to structural spatial organization potentially shaped by embedded cultural preferences. This rationale is consistent with the core design logic of residual analysis in spatial point processes proposed by Baddeley et al. (2005) [32]: residuals are meant to “remove model-explained effects and reveal systematic departures not captured by the model,” rather than to eradicate all spatial association.
To assess whether residualization effectively attenuates scale-driven spatial dependence, we compute Moran’s I for both the raw counts and the residuals (Table 3). The results show that, for representative cuisines in Guangzhou and Shenzhen, Moran’s I for the residuals is substantially lower than that for the raw counts (all p < 0.001), indicating that the GLM with an offset term effectively reduces spatial autocorrelation induced by overall density/scale effects. At the same time, the residuals remain significantly positively autocorrelated, suggesting that identifiable structural spatial patterns persist even after controlling for scale.
Residualization controls for grid-level exposure effects so that cuisine patterns are evaluated as structural deviations from a citywide baseline, rather than as a mechanical by-product of total restaurant volume. Pearson residuals, therefore, quantify the relative over- or under-representation of each cuisine in a grid compared with its expected citywide share. Crucially, residualization is designed to remove model-explained scale effects while retaining any remaining spatial dependence as the key signal of interest, consistent with residual diagnostics in spatial point processes [32].
This study applies the local Getis–Ord Gi* statistic to the residual surface to localize the structural signal. Gi* detects statistically significant clustering of over- or under-representation relative to the citywide baseline; negative Gi* values indicate systematically lower-than-expected shares after controlling for exposure/scale, rather than “disadvantage.”
G i = j w i j x j X ¯ j w i j S n j w i j 2 j w i j 2 n 1
where G i denotes the local Getis–Ord Gi* statistic for grid cell i . Here, x j represents the standardized residual r j c of cuisine i in grid j ;   w i j is the spatial weight between grids i and j ; X ¯ ¯ and S are, respectively, the mean and standard deviation of the residuals across all grids in the city; and n is the total number of grid cells. ArcGIS 10.7 was employed to automatically determines the optimal spatial scale and performs Monte Carlo testing, outputting hot and coldspots at 90%, 95%, and 99% confidence levels.
Through this residual–hotspot hybrid approach, the study identifies local “excess” or “deficiency” clustering patterns of various cuisines at the 500 m × 500 m grid level, while controlling for total restaurant volume. This allows for a more accurate interpretation of the structural differences in urban culinary cultures.

2.6. Key Methodological Assumptions

To directly address our three scientific questions, the proposed LLM-based semantic–spatial inference framework (LLM-SSIF) relies on three methodological assumptions. First, short and unstructured POI name texts contain sufficient semantic cues for fine-grained cuisine recognition, such that supervised PEFT (LoRA) can learn stable category signals under limited computational resources. Second, nationwide training coverage (≈220,000 cuisine-labeled POIs) provides sufficient diversity in naming conventions and regional expressions to support practical cross-city generalization under incomplete platform categorization and limited manual annotation. Third, in spatial inference, GLM residualization with total restaurant counts as an offset can adequately control scale effects, so standardized Pearson residuals reflect relative over-/under-representation beyond citywide expectations and can be meaningfully coupled with Getis–Ord Gi* hotspot detection on 500 m grids to yield interpretable evidence of cultural spatial differentiation.

3. Results

3.1. Overall Classification Performance

Table 4 presents a comparative evaluation of five models on the cuisine classification task for dining POIs. Among them, the PEFT (LoRA)-tuned Qwen3-8B achieves the highest scores across key metrics (see Figure 6), including macro-averaged Precision (0.835), Recall (0.853), and F1-score (0.843), as well as overall accuracy (0.893). These results indicate that domain-specific fine-tuning significantly enhances performance on multi-class short-text classification tasks, achieving a well-balanced trade-off between precision and recall while maintaining high classification coverage and minimizing misclassification rates.
By comparison, commercial API models such as GPT-4o and Doubao-1.5-pro report similar or slightly higher Recall values (0.838 and 0.880, respectively), but their lower Precision scores (~0.777) result in reduced F1-scores (0.801 and 0.813). This suggests that while they achieve broader coverage, they are more susceptible to misclassification in categories with fuzzy boundaries or high inter-class similarity. DeepSeek-R1 achieves the highest Recall (0.887) among all models and outperforms both commercial APIs in Precision (0.790), but still lags behind Qwen3-8B in overall balance, with a final F1-score of 0.827. In contrast, the traditional BERT model records the lowest scores across all metrics, including macro Precision (0.753), Recall (0.678), F1-score (0.704), and accuracy (0.817), underscoring its limitations in complex, imbalanced, and semantically rich classification tasks.
In terms of Valid Coverage Rate (VCR), Qwen3-8B achieves 95.82%, slightly below BERT’s 97.02%. However, the model delivers a substantially higher classification accuracy, indicating its capacity to maintain high coverage while improving label precision. These results collectively validate Qwen3-8B as the best-performing model across both predictive and coverage metrics and reinforce the effectiveness and scalability of the proposed LLM-based label-expansion approach for large-scale POI applications.

3.2. Per-Class Performance

Table 5 presents the per-class F1-score results for major cuisine types. It can be observed that Qwen3-8B achieves the highest or near-highest F1-scores across most categories, demonstrating its robustness in multi-class short-text classification tasks. The model performs especially well in highly standardized categories such as Sichuan cuisine (0.920), Japanese cuisine (0.931), and Korean cuisine (0.916). For less represented and more heterogeneous categories like Jiangsu cuisine, the F1-score (0.659) is relatively lower but still significantly outperforms BERT (0.299), showing robustness in low-resource and semantically ambiguous scenarios. Overall, Qwen3-8B maintains a leading position in mainstream cuisines and exhibits strong adaptability in niche and complex categories.
Commercial LLMs perform comparably or slightly better in some mainstream categories (e.g., Japanese cuisine and Western cuisine), but overall lack the stability of Qwen3-8B. DeepSeek-R1 performs similarly to Qwen3-8B in categories like Hunan, Cantonese, and Western cuisine, but shows greater fluctuation in niche classes. BERT significantly lags behind in most categories, especially in cross-cultural or ambiguously named categories (e.g., Hubei and Fujian cuisine), where the gap is more pronounced. In summary, Qwen3-8B’s fine-grained classification results provide strong empirical support for the framework’s generalizability and robustness.

3.3. Confusion Analysis

To assess performance in high-complexity contexts, this study uses a row-normalized confusion matrix (i.e., per-class recall) to evaluate class-wise discrimination (see Figure 7). Qwen3-8B attains over 90% per-class recall in several categories—Sichuan (91.4%), Huizhou (92.5%), and Japanese (92.6%)—which aligns with their per-class F1-scores and indicates that the model captures nuanced cues in short POI names.
Importantly, Qwen3-8B distinguishes well between closely related classes such as Sichuan and Hunan cuisines, achieving 91.4% and 88.3% per-class recall, respectively. This discriminative capacity is particularly valuable for identifying spatial patterns of culinary diversity in urban settings.
In contrast, Jiangsu cuisine (F1 = 0.659) and Hubei cuisine (F1 = 0.751) are more prone to confusion (see Figure 7) because their restaurant names provide weaker name-level cues. Sichuan and Hunan names often contain explicit markers, such as regional abbreviations “川/蜀” (Sichuan) and “湘” (Hunan) and flavor cues like “麻辣” (mala; numbing–spicy). By contrast, Jiangsu names more frequently use cuisine-agnostic generic expressions, including common suffix patterns such as “X记/家” (X as a placeholder name) and broad terms like “龙虾” (lobster), “土菜馆” (home-style eatery), and “小厨” (small kitchen). When distinctive markers are absent, predictions become more sensitive to inter-class semantic overlap and co-occurrence in the training data, leading to more frequent shifts toward high-co-occurrence categories such as Sichuan cuisine.
Overall, the confusion matrix exhibits strong diagonal dominance, with per-class recall exceeding 80% for most categories. These results confirm the model’s stability across both high-frequency and long-tail classes and highlight its capacity for fine-grained labeling.

4. Case Study: Guangzhou and Shenzhen

4.1. Data Expansion and Classification Scaling

Although the platform provides ≈220,000 fine-grained cuisine-labeled samples nationwide (about 4.3% of all restaurant POIs), labels become rapidly fragmented in the Guangzhou and Shenzhen case studies when the unit of analysis is refined to a “single city–single cuisine–500 m grid.” Beyond a few mainstream cuisines, many long-tail categories within a city contain only dozens or even single-digit labeled records (Table 6), limiting high-resolution spatial inference. To examine cross-regional and cross-cultural transferability, we further conducted an additional case analysis in Chengdu (a Sichuan-cuisine–dominant context distinct from Cantonese cuisine); due to space constraints, the full results are reported in Appendix A.
To mitigate this sparsity, the semantic expansion module of LLM-SSIF is applied. Specifically, a PEFT (LoRA)-tuned Qwen3-8B model assigns cuisine labels to previously unlabeled restaurant POIs, improving within-city coverage and category representativeness. In Guangzhou and Shenzhen, the number of labeled records increased from 5949 and 6562 to 88,565 and 90,539—about 14.89 and 13.80 times, respectively. Importantly, the gains are not confined to dominant categories; long-tail cuisines also exhibit substantial lift ratios (see Table 6 and Figure 8), indicating robust generalization across both frequent and rare labels and enabling grid-level spatial inference on a materially less sparse dataset.

4.2. City-Level Composition and Spatial Patterns

At a macro-level, Guangzhou and Shenzhen share structural similarities in their foodscapes while also revealing distinctive differences. In both cities, local cuisine (i.e., the city’s native cuisine—Cantonese in both cases) constitutes the largest segment, underscoring the foundational role of regional culinary culture (see Figure 9). The degree of dominance, however, diverges. In Guangzhou, local cuisine accounts for 51.1% of all dining establishments (45,267 restaurants), reinforcing the city’s status as the birthplace of Cantonese cuisine and a historic gastronomic hub. In Shenzhen, the local share is 42.1% (38,162 restaurants), while domestic (non-local Chinese) cuisines reach 37.3% (33,741 restaurants), substantially higher than Guangzhou’s 25.7% (22,804 restaurants).
This divergence aligns with the cities’ socio-demographic profiles: Guangzhou’s composition reflects rootedness in traditional local culture, whereas Shenzhen, often characterized as a migrant city, shows greater interregional diversity and demand heterogeneity. Foreign (non-Chinese) cuisines are also well represented in both cities, with Guangzhou at 23.1% (20,494 restaurants) and Shenzhen at 20.6% (18,636 restaurants), indicating that globalized consumption patterns are embedded across both urban contexts.
Beyond composition, the grid-level distribution of total restaurants at a 500 m × 500 m resolution reveals contrasting exposure landscapes (Figure 10). In Guangzhou, restaurant counts concentrate in the historic core and taper toward surrounding districts, approximating a largely monocentric intensity pattern. In Shenzhen, multiple high-intensity clusters emerge across core districts and extend into several peripheral districts, reflecting a more polycentric spatial structure.
Taken together, the city-level composition (Figure 9) and grid-level exposure patterns (Figure 10) derived from the LLM-SSIF-classified POI set indicate systematic cross-city differences between Guangzhou and Shenzhen, providing a coherent baseline for subsequent within-city structural analysis.

4.3. Cultural Embedding Through Hotspot and Residual Analysis

Building on the macro baselines, this study first applies conventional hotspot analysis to assess whether the LLM-expanded dataset reproduces real-world clustering. As shown in Figure 11, Getis–Ord Gi* on raw counts identifies significant clusters across core urban areas in both cities, consistent with central advantages in density, accessibility, and commercial activity. This indicates that the LLM-classified data can reliably reproduce macro-level agglomeration.
At the same time, most central districts appear as hotspots regardless of cuisine type, reflecting scale effects from overall restaurant density and making cuisine-specific spatial preferences hard to see. To address this, this study implements a residual-based hotspot approach: expected counts are estimated via a GLM with total restaurant count as an offset, and Gi* is applied to standardized residuals (Figure 12). This isolates relative over-/under-representation, revealing structural cultural preferences beyond aggregate intensity.
The residual hotspot analysis reveals distinct cuisine-specific patterns across the three representative cuisines. After scale control, Cantonese shifts from raw-count hotspots in high-intensity central areas to relative coldspots in several central commercial zones, while emerging as hotspots in historically rooted neighborhoods and selected peripheral contexts. Hunan shows limited differentiation in the raw-count analysis but displays multiple peripheral residual hotspots, suggesting non-local clustering beyond overall supply intensity. Western remains strongly core-oriented under both analyses, forming residual hotspots in central areas and systematic under-representation toward the periphery.
These residual patterns can be interpreted as cuisine-specific “spatial anchoring” in relative share after controlling for overall restaurant intensity. From the perspective of polycentric urban structure and service distribution, cuisine-specific residual hotspots indicate localized service specialization anchored to different centers and their surrounding consumption catchments, rather than a simple reflection of overall supply intensity. For Cantonese, residual hotspots indicate a systematically higher-than-expected local share, consistent with cultural path dependence and locally anchored daily consumption, whereas the central core’s higher cuisine diversity can dilute its relative representation despite high absolute supply. For Hunan, peripheral residual hotspots plausibly reflect migrant-linked culinary clustering: non-local food practices become locally anchored and reproduced where interregional migrants and their everyday consumption networks concentrate, rather than implying a citywide shift in taste. For Western cuisine, persistent central residual hotspots point to dependence on central consumption environments associated with higher purchasing power and service-economy activity, thereby sustaining a durable core–periphery gradient even after exposure control.
Overall, the residual–hotspot evidence demonstrates how LLM-expanded labels can be translated into interpretable, scale-controlled spatial signals suitable for cross-city comparison within the LLM-SSIF pipeline.

5. Discussion

The primary contribution of this study is the proposed LLM-based semantic–spatial inference framework (LLM-SSIF), a unified pipeline that translates unstructured POI name text into interpretable, fine-grained spatial evidence. This framework distinguishes itself from existing work in two key aspects. First, it prioritizes a practical trade-off for urban researchers by employing PEFT (LoRA) on open LLMs, making high-quality adaptation feasible on consumer hardware—contrasting with reliance on black-box commercial APIs or computationally prohibitive full fine-tuning. Second, and more critically, it is end-to-end oriented to spatial inference, embedding classification outputs within a scale-controlled residual–hotspot spatial analytic workflow to reveal structural cultural preferences beyond raw density patterns.
Empirically, LLM-SSIF is instantiated via a PEFT (LoRA) label-expansion module trained on consumer-grade GPUs [30], achieving a favorable performance–compute–deployability trade-off while retaining data control [44]. In the Guangzhou–Shenzhen cuisine case, the approach enlarges usable fine-category coverage by ~14–15× and surpasses strong baselines on macro-averaged metrics, showing improved robustness for long-tail and semantically similar classes [45]. This scaling is crucial at the “single city–single cuisine–500 m grid” resolution, where platform labels otherwise become too fragmented to support stable high-resolution analysis.
Beyond classification, the pipeline links labels to scale-controlled spatial analytic inference to produce interpretable evidence. Naïve diagnostics (intensity maps and raw-count hotspot detection) capture clustering but are dominated by scale effects. By coupling labels with GLM residualization (using total count as an offset) and Getis–Ord Gi* on 500 m × 500 m grids, this study reveals cuisine-specific structural differentiation in relative share that intensity maps alone obscure: Cantonese is over-represented in historically rooted and selected peripheral contexts but less distinctive in high-intensity central commercial areas, Hunan forms multiple peripheral migrant-linked clusters, and Western remains persistently core-oriented with under-representation toward the periphery. These patterns align with each city’s historical–cultural context and migrant dynamics, illustrating how the label-expansion → residual–hotspot chain yields interpretable urban evidence.
Building on this methodological contribution, the framework also demonstrates clear practical value. The expanded labels enable fine-grained monitoring of service content and cultural differentiation at a 500 m resolution without requiring high-performance computing infrastructure, and the modular design lowers adoption barriers in planning and operations. Although the study does not pursue causal identification [46], the observed spatial tendencies are consistent with established mechanisms in urban cultural geography [47], reinforcing the interpretability of the methodological evidence [48].
Substantively, the Guangzhou–Shenzhen results provide a consumption-content lens on urban structure: Guangzhou exhibits a more historically embedded, core-weighted culinary pattern, whereas Shenzhen shows a more polycentric, migration-linked configuration. By separating cuisine-specific deviations from overall supply intensity, the residual–hotspot maps shift interpretation from where restaurants concentrate to where specific cultural contents are structurally anchored, and the two cities serve primarily as demonstrations of the LLM-SSIF inferential chain (text → fine labels → scale-controlled deviations → localized structural signals), rather than exhaustive urban narratives.
Although illustrated with restaurants, the pipeline is domain-agnostic. Any unstructured POI dataset—clinics, schools, or retail—can be transformed into fine-grained spatial evidence through the same label-expansion and residual–hotspot chain. This extensibility highlights its value for service-structure evaluation and comparative urban profiling beyond dining.
Naturally, this study has limitations. Because LLM-SSIF is designed to translate unstructured POI texts into scale-controlled spatial–statistical evidence, rather than to identify causal effects, the underlying mechanisms are not causally established: the current analysis does not yet incorporate multi-source explanatory data (e.g., rent levels or population flows) to test and disentangle potential drivers behind the observed patterns [49]. In addition, bias assessment remains limited and should be strengthened by quantifying label noise and ambiguity through targeted spot-checks and confusion-pair diagnostics.
Nevertheless, the framework is grounded on a nationwide training dataset of more than 220,000 cuisine-labeled POIs, ensuring exposure to diverse naming conventions and cultural variations. This provides an inherent basis for cross-city generalization. The Guangzhou–Shenzhen cases are, therefore, presented as representative demonstrations of how the framework translates expanded labels into spatial evidence, rather than as the only test beds. Future work could adopt explicit cross-regional evaluations to further quantify robustness under extreme heterogeneity.

6. Conclusions

This study proposes LLM-SSIF, a resource-efficient and transferable pipeline that links LLM-based label expansion with scale-controlled residual–hotspot spatial analysis at 500 m resolution, bridging unstructured POI text with interpretable urban evidence. The framework substantially enlarges fine-category coverage and outperforms baselines on macro metrics, alleviating long-tail sparsity and enabling robust spatial analytic inference via offset-GLM residualization and Getis–Ord Gi*. The Guangzhou–Shenzhen case demonstrates how expanded labels uncover structural cultural preferences beyond raw intensity patterns, supporting planning practice with fine-grained evidence. Future work should test mechanisms using multi-source data and causal or quasi-causal designs, while extending the pipeline to other urban functions and planning-oriented tools for routine monitoring and equity assessment.

Author Contributions

Conceptualization, Z.H. and M.Z.; methodology, Z.H. and S.H.; software, Y.G.; validation, Z.H., Y.G. and S.H.; formal analysis, Z.H. and S.H.; investigation, Y.G.; re-sources, M.Z.; data curation, Y.G.; writing—original draft preparation, Z.H. and Y.G.; writing—review and editing, Z.H.; visualization, Z.H. and Y.G.; supervision, M.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Major Project of the National Social Science Foundation of China [21&ZD107].

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Acknowledgments

During the preparation of this manuscript, the authors used ChatGPT-5 for language polishing. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
GLMGeneralized Linear Model
Gi*Getis–Ord Gi* statistic
LLMLarge Language Model
LLM-SSIFLLM-based semantic–spatial inference framework
LoRALow-Rank Adaptation
NLPNatural Language Processing
PEFTParameter-Efficient Fine-Tuning
POIPoint of Interest
SFTSupervised Fine-Tuning
VCRValid Coverage Rate

Appendix A. Cross-Regional Urban Case Study: Chengdu

Appendix A.1. Motivation and Case Selection

To further examine the applicability and robustness of the proposed LLM-based semantic–spatial inference framework across different regional and cultural contexts, this appendix introduces Chengdu as an additional cross-regional case study. Compared with Guangzhou, Chengdu differs substantially in its linguistic environment, culinary traditions, and regional historical background. Its foodscape is characterized by a strong dominance of local cuisine and deeply rooted cultural identity, making it a representative case for evaluating the framework under culturally distinct urban conditions.
It should be emphasized that the Chengdu case is not intended to introduce new research questions or to alter the main empirical focus of the study. Rather, it serves as a supplementary validation exercise at the methodological level, aimed at assessing whether the proposed label-expansion and residual–hotspot analysis pipeline can consistently produce interpretable and stable spatial evidence when applied to cities with markedly different cultural settings and urban structures.

Appendix A.2. Label Expansion and Overall Cuisine Structure

Applying the proposed LLM-based label expansion framework to Chengdu substantially alleviates the sparsity of fine-grained cuisine labels. The number of usable cuisine-labeled POIs increases from approximately 5000 platform-annotated records to more than 110,000 LLM-classified records, corresponding to an expansion ratio exceeding 20× (see Table A1 and Figure A1). This expansion provides a stable data foundation for grid-level spatial analysis.
Table A1. Labeled vs. LLM-classified counts and lift ratios (Chengdu).
Table A1. Labeled vs. LLM-classified counts and lift ratios (Chengdu).
Cuisine NameLabel CountLLM CountLift Ratio
NE Chinese65147822.74
Yunnan–Guizhou54135925.17
Sichuan364784,80823.25
Huizhou39531.67
Japanese50417843.54
Thai–Viet401393.48
Hunan3753814.54
Cantonese176556931.64
Jiangsu9535.89
NW Chinese95159216.76
Western35311,29832.01
Hubei68614.33
Fujian2238417.45
Korean26422418.49
Shandong519539.00
Total5280111,61921.14
Figure A1. Cuisine-specific lift ratios in Chengdu. The lift ratio is defined as the number of restaurants after LLM-based label expansion divided by the count with original platform labels. Ratios greater than 1 indicate expansion, with the largest relative gains observed in long-tail cuisines.
Figure A1. Cuisine-specific lift ratios in Chengdu. The lift ratio is defined as the number of restaurants after LLM-based label expansion divided by the count with original platform labels. Ratios greater than 1 indicate expansion, with the largest relative gains observed in long-tail cuisines.
Smartcities 09 00013 g0a1
In terms of overall composition, Chengdu’s restaurant system is strongly dominated by local cuisine (see Figure A2). Sichuan cuisine accounts for the vast majority of classified restaurants, far exceeding all other cuisine types. This highly concentrated structure reflects the deep cultural embedding of local food traditions in everyday consumption. Domestic non-local Chinese cuisines and foreign cuisines together form a relatively small long-tail segment, contrasting sharply with the more balanced and diversified composition observed in Guangzhou and Shenzhen.
Figure A2. Composition of local, domestic, and foreign cuisines in Chengdu, based on the LLM-classified (label-expanded) POI set. Local denotes Sichuan cuisine, domestic includes all other Chinese cuisines, and foreign represents non-Chinese cuisines. Chengdu is dominated by local cuisine (76.0%).
Figure A2. Composition of local, domestic, and foreign cuisines in Chengdu, based on the LLM-classified (label-expanded) POI set. Local denotes Sichuan cuisine, domestic includes all other Chinese cuisines, and foreign represents non-Chinese cuisines. Chengdu is dominated by local cuisine (76.0%).
Smartcities 09 00013 g0a2

Appendix A.3. Grid-Level Distribution of Restaurant Density

At the 500 m × 500 m grid scale, restaurant density in Chengdu exhibits a compact, center-oriented structure with several secondary nodes (see Figure A3). The highest densities are concentrated in the central districts—primarily Jinjiang, Qingyang, Wuhou, Jinniu, and Chenghua—forming a continuous and dense core area. Additional but less pronounced clusters are distributed across selected peripheral zones.
Compared with the largely monocentric pattern identified in Guangzhou, Chengdu’s restaurant distribution appears more compact and spatially continuous, highlighting the dominant role of the central urban area while accommodating several functionally complementary subcenters.
Figure A3. Grid-level distribution of restaurants in Chengdu. Dining POIs from the LLM-classified dataset are aggregated to 500 m × 500 m grid cells to indicate absolute intensity.
Figure A3. Grid-level distribution of restaurants in Chengdu. Dining POIs from the LLM-classified dataset are aggregated to 500 m × 500 m grid cells to indicate absolute intensity.
Smartcities 09 00013 g0a3

Appendix A.4. Cultural Embedding Through Hotspot and Residual Analysis

This section briefly summarizes the spatial patterns without repeating the full theoretical interpretation provided in the main text.
Based on the LLM-expanded restaurant POI dataset, this section examines spatial clustering patterns of representative cuisines in Chengdu at a 500 m × 500 m grid resolution, using Getis–Ord Gi* statistics under the following two settings: conventional hotspot analysis based on raw counts and residual-based hotspot analysis after controlling for total restaurant intensity via GLMs.
Conventional Gi* results (Figure A4a–c) show highly similar clustering patterns across cuisines, with significant hotspots concentrated in the central urban area and major commercial nodes. This consistency indicates that, without accounting for scale effects, hotspot detection is largely driven by the overall concentration of dining establishments rather than cuisine-specific spatial organization.
After controlling for total restaurant supply, residual-based hotspot analysis (Figure A4d–f) reveals more differentiated spatial structures. For Sichuan cuisine, central hotspots observed in the raw-count analysis are substantially weakened, with only scattered positive residual clusters remaining in selected residential areas. Yunnan–Guizhou cuisine exhibits localized positive residual hotspots in residential or secondary commercial zones, while lacking continuous clustering in the city core, suggesting a dependence on specific everyday consumption contexts. Western cuisine, by contrast, retains concentrated positive residual hotspots in central business districts and systematic coldspots in peripheral areas, indicating a stable center-oriented spatial pattern even after removing scale effects.
Overall, the Chengdu case illustrates that conventional hotspot analysis primarily reflects macro-scale dining concentration, whereas residual-based analysis isolates cuisine-specific structural preferences after controlling for density. This two-tier approach reveals spatial differentiation mechanisms that are otherwise masked by aggregate intensity, providing more interpretable evidence of how different cuisines are organized within urban space.
Figure A4. Conventional and residual-based hotspot patterns of cuisines in Chengdu at a 500 m × 500 m grid resolution. Panels (ac) show Getis–Ord Gi* results based on raw restaurant counts, highlighting clustering driven by overall dining density. Panels (df) present hotspot results based on standardized residuals from generalized linear models (GLMs) with total restaurant count included as an offset. Red areas indicate over-represented hotspots and blue areas indicate under-represented coldspots; darker shades denote higher statistical significance (90%, 95%, and 99%). By controlling for scale effects, the residual maps reveal structural spatial preferences beyond aggregate restaurant intensity.
Figure A4. Conventional and residual-based hotspot patterns of cuisines in Chengdu at a 500 m × 500 m grid resolution. Panels (ac) show Getis–Ord Gi* results based on raw restaurant counts, highlighting clustering driven by overall dining density. Panels (df) present hotspot results based on standardized residuals from generalized linear models (GLMs) with total restaurant count included as an offset. Red areas indicate over-represented hotspots and blue areas indicate under-represented coldspots; darker shades denote higher statistical significance (90%, 95%, and 99%). By controlling for scale effects, the residual maps reveal structural spatial preferences beyond aggregate restaurant intensity.
Smartcities 09 00013 g0a4

Appendix A.5. Summary of the Chengdu Case

Overall, the Chengdu case illustrates a restaurant system characterized by strong local dominance, compact spatial organization, and clear functional differentiation across cuisine types. While Sichuan cuisine occupies a central position in the citywide restaurant system, its relative share within the core weakens once scale effects are controlled. Non-local and foreign cuisines remain spatially selective and node-dependent.
As a cross-regional validation, the Chengdu case complements the Guangzhou analysis by demonstrating that the proposed LLM-SSIF remains effective under distinct cultural and structural conditions. Thus, this appendix supports the robustness and transferability of the framework without altering the main methodological or empirical conclusions of the study.

References

  1. Milias, V.; Psyllidis, A. Assessing the Influence of Point-of-Interest Features on the Classification of Place Categories. Comput. Environ. Urban Syst. 2021, 86, 101597. [Google Scholar] [CrossRef]
  2. Wang, X.; Cheng, T.; Law, S.; Zeng, Z.; Yin, L.; Liu, J. Multi-Modal Contrastive Learning of Urban Space Representations from POI Data. Comput. Environ. Urban Syst. 2025, 120, 102299. [Google Scholar] [CrossRef]
  3. Yang, X.; Bo, S.; Wang, J. Classifying Urban Functional Zones by Integrating the Homogeneity and Structural Similarity of POIs. J. Urban Plan. Dev. 2024, 150, 04024052. [Google Scholar] [CrossRef]
  4. Andrade, R.; Alves, A.; Bento, C. POI Mining for Land Use Classification: A Case Study. ISPRS Int. J. Geo-Inf. 2020, 9, 493. [Google Scholar] [CrossRef]
  5. Jiang, S.; Alves, A.; Rodrigues, F.; Ferreira, J.; Pereira, F.C. Mining Point-of-Interest Data from Social Networks for Urban Land Use Classification and Disaggregation. Comput. Environ. Urban Syst. 2015, 53, 36–46. [Google Scholar] [CrossRef]
  6. Dong, L.; Ratti, C.; Zheng, S. Predicting Neighborhoods’ Socioeconomic Attributes Using Restaurant Data. Proc. Natl. Acad. Sci. USA 2019, 116, 15447–15452. [Google Scholar] [CrossRef]
  7. Liu, Y.; Kuai, C.; Liao, X.; Ma, H.; He, B.Y.; Ma, J. Semantic Trajectory Data Mining with LLM-Informed POI Classification. In Proceedings of the 2024 IEEE 27th International Conference on Intelligent Transportation Systems (ITSC), Edmonton, AB, Canada, 24–27 September 2024; pp. 207–213. [Google Scholar]
  8. Zhai, W.; Bai, X.; Shi, Y.; Han, Y.; Peng, Z.-R.; Gu, C. Beyond Word2vec: An Approach for Urban Functional Region Extraction and Identification by Combining Place2vec and POIs. Comput. Environ. Urban Syst. 2019, 74, 1–12. [Google Scholar] [CrossRef]
  9. Hu, S.; He, Z.; Wu, L.; Yin, L.; Xu, Y.; Cui, H. A Framework for Extracting Urban Functional Regions Based on Multiprototype Word Embeddings Using Points-of-Interest Data. Comput. Environ. Urban Syst. 2020, 80, 101442. [Google Scholar] [CrossRef]
  10. Niu, H.; Silva, E.A. Delineating Urban Functional Use from Points of Interest Data with Neural Network Embedding: A Case Study in Greater London. Comput. Environ. Urban Syst. 2021, 88, 101651. [Google Scholar] [CrossRef]
  11. Yan, B.; Janowicz, K.; Mai, G.; Gao, S. From ITDL to Place2Vec: Reasoning About Place Type Similarity and Relatedness by Learning Embeddings from Augmented Spatial Contexts. In Proceedings of the 25th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Redondo Beach, CA, USA, 7–10 November 2017; pp. 1–10. [Google Scholar]
  12. Liu, K.; Yin, L.; Lu, F.; Mou, N. Visualizing and Exploring POI Configurations of Urban Regions on POI-Type Semantic Space. Cities 2020, 99, 102610. [Google Scholar] [CrossRef]
  13. Zhang, H.; Du, Q.; Zhang, S.; Yang, R. A Semantically Enhanced Label Prediction Method for Imbalanced POI Data Category Distribution. ISPRS Int. J. Geo-Inf. 2024, 13, 364. [Google Scholar] [CrossRef]
  14. Yu, W.; Chen, J.; Wei, C. A Hierarchical Learning Model for Inferring the Labels of Points of Interest with Unbalanced Data Distribution. Int. J. Appl. Earth Obs. Geoinf. 2022, 108, 102751. [Google Scholar] [CrossRef]
  15. Yeow, L.W.; Low, R.; Tan, Y.X.; Cheah, L.; Yeow, L.W.; Low, R.; Tan, Y.X.; Cheah, L. Point-of-Interest (POI) Data Validation Methods: An Urban Case Study. ISPRS Int. J. Geo-Inf. 2021, 10, 735. [Google Scholar] [CrossRef]
  16. Tang, W.; Chakeri, A.; Krim, H. Discovering Urban Functional Zones from Biased and Sparse Points of Interests and Sparse Human Activities. Expert Syst. Appl. 2022, 207, 118062. [Google Scholar] [CrossRef]
  17. Amap POI Classification Table. Available online: https://lbs.amap.com/api/webservice/download (accessed on 12 September 2025).
  18. Baidu POI Classification System. Available online: https://lbsyun.baidu.com/index.php?title=open/poitags (accessed on 12 September 2025).
  19. Sun, K.; Hu, Y.; Ma, Y.; Zhou, R.Z.; Zhu, Y. Conflating Point of Interest (POI) Data: A Systematic Review of Matching Methods. Comput. Environ. Urban Syst. 2023, 103, 101977. [Google Scholar] [CrossRef]
  20. Xu, X.; Wang, Z.; Gao, Q.; Zhong, T.; Hui, B.; Zhou, F.; Trajcevski, G. Spatial-Temporal Contrasting for Fine-Grained Urban Flow Inference. IEEE Trans. Big Data 2023, 9, 1711–1725. [Google Scholar] [CrossRef]
  21. Zhang, B.; Xia, C. The Effects of Sample Size and Sample Prevalence on Cellular Automata Simulation of Urban Growth. Int. J. Geogr. Inf. Sci. 2022, 36, 158–187. [Google Scholar] [CrossRef]
  22. Kim, H.; Lee, S. POI GPT: Extracting POI Information from Social Media Text Data. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2024, XLVIII-4/W10-2024, 113–118. [Google Scholar] [CrossRef]
  23. Li, Z.; Xia, L.; Ren, X.; Tang, J.; Chen, T.; Xu, Y.; Huang, C. Urban Computing in the Era of Large Language Models. ACM Trans. Intell. Syst. Technol. 2025, 16, 146. [Google Scholar] [CrossRef]
  24. Zheng, Y.; Xu, F.; Lin, Y.; Santi, P.; Ratti, C.; Wang, Q.R.; Li, Y. Urban Planning in the Era of Large Language Models. Nat. Comput. Sci. 2025, 5, 727–736. [Google Scholar] [CrossRef]
  25. Carammia, M.; Iacus, S.M.; Porro, G. Rethinking Scale: The Efficacy of Fine-Tuned Open-Source LLMs in Large-Scale Reproducible Social Science Research. arXiv 2024, arXiv:2411.00890. [Google Scholar]
  26. Zhang, Y.; Li, J.; Wang, Z.; He, Z.; Guan, Q.; Lin, J.; Yu, W. Geospatial Large Language Model Trained with a Simulated Environment for Generating Tool-Use Chains Autonomously. Int. J. Appl. Earth Obs. Geoinf. 2025, 136, 104312. [Google Scholar] [CrossRef]
  27. Aryan, A.; Nain, A.K.; McMahon, A.; Meyer, L.A.; Sahota, H.S. The Costly Dilemma: Generalization, Evaluation and Cost-Optimal Deployment of Large Language Models. arXiv 2023, arXiv:2308.08061. [Google Scholar] [CrossRef]
  28. Alizadeh, M.; Kubli, M.; Samei, Z.; Dehghani, S.; Zahedivafa, M.; Bermeo, J.D.; Korobeynikova, M.; Gilardi, F. Open-Source LLMs for Text Annotation: A Practical Guide for Model Setting and Fine-Tuning. J. Comput. Soc. Sci. 2024, 8, 17. [Google Scholar] [CrossRef] [PubMed]
  29. Zheng, Y.; Zhang, R.; Zhang, J.; Ye, Y.; Luo, Z.; Feng, Z.; Ma, Y. LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models. arXiv 2024, arXiv:2403.13372. [Google Scholar]
  30. Hu, E.J.; Shen, Y.; Wallis, P.; Allen-Zhu, Z.; Li, Y.; Wang, S.; Wang, L.; Chen, W. LoRA: Low-Rank Adaptation of Large Language Models. arXiv 2021, arXiv:2106.09685. [Google Scholar]
  31. Ding, N.; Qin, Y.; Yang, G.; Wei, F.; Yang, Z.; Su, Y.; Hu, S.; Chen, Y.; Chan, C.-M.; Chen, W.; et al. Parameter-Efficient Fine-Tuning of Large-Scale Pre-Trained Language Models. Nat. Mach. Intell. 2023, 5, 220–235. [Google Scholar] [CrossRef]
  32. Baddeley, A.; Turner, R.; Møller, J.; Hazelton, M. Residual Analysis for Spatial Point Processes (with Discussion). J. R. Stat. Soc. Ser. B. Stat. Methodol. 2005, 67, 617–666. [Google Scholar] [CrossRef]
  33. Anselin, L. Local Indicators of Spatial Association—LISA. Geogr. Anal. 1995, 27, 93–115. [Google Scholar] [CrossRef]
  34. Reades, J.; Hu, Y.; Tranos, E.; Delmelle, E. The City as Text. Nat. Cities 2025, 2, 794–800. [Google Scholar] [CrossRef]
  35. Cheng, J.; Wang, J.; Zhang, Y.; Ji, J.; Zhu, Y.; Zhang, Z.; Zhao, X. POI-Enhancer: An LLM-Based Semantic Enhancement Framework for POI Representation Learning. Proc. AAAI Conf. Artif. Intell. 2025, 39, 11509–11517. [Google Scholar] [CrossRef]
  36. Jiang, S.; Zhang, H.; Wang, H.; Zhou, L.; Tang, G. Using Restaurant POI Data to Explore Regional Structure of Food Culture Based on Cuisine Preference. ISPRS Int. J. Geo-Inf. 2021, 10, 38. [Google Scholar] [CrossRef]
  37. Gao, D.; Ma, Y.; Liu, S.; Song, M.; Jin, L.; Jiang, W.; Wang, X.; Ning, W.; Yu, S.; Xuan, Q.; et al. FashionGPT: LLM Instruction Fine-Tuning with Multiple LoRA-Adapter Fusion. Knowl.-Based Syst. 2024, 299, 112043. [Google Scholar] [CrossRef]
  38. Mathav Raj, J.; Kushala, V.; Warrier, H.; Gupta, Y. Fine Tuning LLM for Enterprise: Practical Guidelines and Recommendations. arXiv 2024, arXiv:2404.10779. [Google Scholar] [CrossRef]
  39. Saito, T.; Rehmsmeier, M. The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets. PLoS ONE 2015, 10, e0118432. [Google Scholar] [CrossRef]
  40. Farhadpour, S.; Warner, T.A.; Maxwell, A.E. Selecting and Interpreting Multiclass Loss and Accuracy Assessment Metrics for Classifications with Class Imbalance: Guidance and Best Practices. Remote Sens. 2024, 16, 533. [Google Scholar] [CrossRef]
  41. Feng, Y.; Zhou, M.; Tong, X. Imbalanced Classification: A Paradigm-Based Review. Stat. Anal. Data Min. ASA Data Sci. J. 2021, 14, 383–406. [Google Scholar] [CrossRef]
  42. Opitz, J. A Closer Look at Classification Evaluation Metrics and a Critical Reflection of Common Evaluation Practice. Trans. Assoc. Comput. Linguist. 2024, 12, 820–836. [Google Scholar] [CrossRef]
  43. Myers, R.H.; Montgomery, D.C. A Tutorial on Generalized Linear Models. J. Qual. Technol. 1997, 29, 274–291. [Google Scholar] [CrossRef]
  44. Zhao, J.; Wang, T.; Abid, W.; Angus, G.; Garg, A.; Kinnison, J.; Sherstinsky, A.; Molino, P.; Addair, T.; Rishi, D. LoRA Land: 310 Fine-Tuned LLMs That Rival GPT-4, A Technical Report. arXiv 2024, arXiv:2405.00732. [Google Scholar]
  45. Rahman, K.; Ghani, A.; Alzahrani, A.; Tariq, M.U.; Rahman, A.U. Pre-Trained Model-Based NFR Classification: Overcoming Limited Data Challenges. IEEE Access 2023, 11, 81787–81802. [Google Scholar] [CrossRef]
  46. Rahman, I.; Grunwald, A.; Saha, S. Access to Cultural Ecosystem Services and How Urban Green Spaces Marginalize Underprivileged Groups. npj Urban Sustain. 2025, 5, 36. [Google Scholar] [CrossRef]
  47. Arvidsson, M.; Lovsjö, N.; Keuschnigg, M. Urban Scaling Laws Arise from Within-City Inequalities. Nat. Hum. Behav. 2023, 7, 365–374. [Google Scholar] [CrossRef]
  48. Ma, W.; Zhang, W.; Chen, Q.; Jiang, H.; Zhao, K.; Song, C.; Yin, Y. Understanding the Regional Dynamics and Determinants of Urban Cultural Facilities in the Yangtze River Economic Belt, China. Humanit. Soc. Sci. Commun. 2025, 12, 547. [Google Scholar] [CrossRef]
  49. Adamiak, C.; Marjavaara, R. Airbnb and Urban Population Change: An Empirical Analysis of the Case of Stockholm, Sweden. Urban Res. Pract. 2024, 17, 654–680. [Google Scholar] [CrossRef]
Figure 1. LLM-SSIF research framework.
Figure 1. LLM-SSIF research framework.
Smartcities 09 00013 g001
Figure 2. Distribution of labeled restaurant POIs and sample characteristics. (a) Spatial distribution of cuisine-labeled restaurant POIs, showing concentration in eastern coastal provinces and major metropolitan areas. (b) Distribution of POI name lengths, with most names clustered between 3 and 8 characters. (c) Labeled-sample counts by cuisine, dominated by Sichuan, Hunan, and Cantonese cuisines, with Japanese, Western, and Korean also being sizable.
Figure 2. Distribution of labeled restaurant POIs and sample characteristics. (a) Spatial distribution of cuisine-labeled restaurant POIs, showing concentration in eastern coastal provinces and major metropolitan areas. (b) Distribution of POI name lengths, with most names clustered between 3 and 8 characters. (c) Labeled-sample counts by cuisine, dominated by Sichuan, Hunan, and Cantonese cuisines, with Japanese, Western, and Korean also being sizable.
Smartcities 09 00013 g002
Figure 3. Workflow of the LLM-based label-expansion framework for fine-grained spatial analysis of urban POIs. The framework integrates POI labels and instructions into a supervised fine-tuning process, where a frozen LLM is adapted with trainable LoRA modules. The best LoRA checkpoint is deployed as a fine-tuned model for inference, enabling the generation of classified POI labels.
Figure 3. Workflow of the LLM-based label-expansion framework for fine-grained spatial analysis of urban POIs. The framework integrates POI labels and instructions into a supervised fine-tuning process, where a frozen LLM is adapted with trainable LoRA modules. The best LoRA checkpoint is deployed as a fine-tuned model for inference, enabling the generation of classified POI labels.
Smartcities 09 00013 g003
Figure 4. Training loss during Qwen3-8B fine-tuning with PEFT (LoRA). Loss is plotted by optimization step (epoch ticks on the top axis). The inset zooms in after the first 100 steps. The curve shows a rapid initial drop followed by stabilization, indicating convergence under the chosen hyperparameters.
Figure 4. Training loss during Qwen3-8B fine-tuning with PEFT (LoRA). Loss is plotted by optimization step (epoch ticks on the top axis). The inset zooms in after the first 100 steps. The curve shows a rapid initial drop followed by stabilization, indicating convergence under the chosen hyperparameters.
Smartcities 09 00013 g004
Figure 5. Validation loss during Qwen3-8B fine-tuning with PEFT (LoRA). Loss is evaluated on the validation set by step (epoch ticks on the top axis). The minimum validation loss (≈0.0724) occurs around step 1300 (~epoch 4.66); this checkpoint is selected for subsequent experiments via early stopping.
Figure 5. Validation loss during Qwen3-8B fine-tuning with PEFT (LoRA). Loss is evaluated on the validation set by step (epoch ticks on the top axis). The minimum validation loss (≈0.0724) occurs around step 1300 (~epoch 4.66); this checkpoint is selected for subsequent experiments via early stopping.
Smartcities 09 00013 g005
Figure 6. Macro-average F1-score comparison across models. Models: Qwen3-8B (PEFT (LoRA)), GPT-4o, Doubao-1.5-pro, DeepSeek-R1, and BERT (baseline). Values shown on the bars are the scores (higher is better). Macro-averaged metrics exclude the Other class.
Figure 6. Macro-average F1-score comparison across models. Models: Qwen3-8B (PEFT (LoRA)), GPT-4o, Doubao-1.5-pro, DeepSeek-R1, and BERT (baseline). Values shown on the bars are the scores (higher is better). Macro-averaged metrics exclude the Other class.
Smartcities 09 00013 g006
Figure 7. Row-normalized confusion matrix for cuisine classification (%). Computed on the test set for the PEFT (LoRA)-tuned Qwen3-8B model. Cell values denote per-class recall (rows sum to 100%); the diagonal shows correct classifications, and off-diagonal cells indicate confusions between cuisines. Darker shades indicate higher percentages.
Figure 7. Row-normalized confusion matrix for cuisine classification (%). Computed on the test set for the PEFT (LoRA)-tuned Qwen3-8B model. Cell values denote per-class recall (rows sum to 100%); the diagonal shows correct classifications, and off-diagonal cells indicate confusions between cuisines. Darker shades indicate higher percentages.
Smartcities 09 00013 g007
Figure 8. Cuisine-specific lift ratios in Guangzhou and Shenzhen. The lift ratio is defined as the number of restaurants after LLM-based label expansion divided by the count with original platform labels. Ratios greater than 1 indicate expansion, with the largest relative gains observed in long-tail cuisines.
Figure 8. Cuisine-specific lift ratios in Guangzhou and Shenzhen. The lift ratio is defined as the number of restaurants after LLM-based label expansion divided by the count with original platform labels. Ratios greater than 1 indicate expansion, with the largest relative gains observed in long-tail cuisines.
Smartcities 09 00013 g008
Figure 9. Composition of local, domestic, and foreign cuisines in Guangzhou and Shenzhen, based on the LLM-classified (label-expanded) POI set. Local denotes Cantonese cuisine, domestic includes all other Chinese cuisines, and foreign represents non-Chinese cuisines. Guangzhou is dominated by local cuisine (51.1%), whereas Shenzhen shows greater domestic diversity.
Figure 9. Composition of local, domestic, and foreign cuisines in Guangzhou and Shenzhen, based on the LLM-classified (label-expanded) POI set. Local denotes Cantonese cuisine, domestic includes all other Chinese cuisines, and foreign represents non-Chinese cuisines. Guangzhou is dominated by local cuisine (51.1%), whereas Shenzhen shows greater domestic diversity.
Smartcities 09 00013 g009
Figure 10. Grid-level distribution of restaurants in Guangzhou and Shenzhen. Dining POIs from the LLM-classified dataset are aggregated to 500 m × 500 m grid cells to indicate absolute intensity. (a) Guangzhou and (b) Shenzhen. Darker shades correspond to higher restaurant counts, highlighting monocentric clustering in Guangzhou and a polycentric structure in Shenzhen.
Figure 10. Grid-level distribution of restaurants in Guangzhou and Shenzhen. Dining POIs from the LLM-classified dataset are aggregated to 500 m × 500 m grid cells to indicate absolute intensity. (a) Guangzhou and (b) Shenzhen. Darker shades correspond to higher restaurant counts, highlighting monocentric clustering in Guangzhou and a polycentric structure in Shenzhen.
Smartcities 09 00013 g010
Figure 11. Raw-count hotspots (Getis–Ord Gi*) at 500 m × 500 m resolution. Panels (ac): Guangzhou; panels (df): Shenzhen. Red = hotspots, Blue = coldspots; significance is reported at 90%, 95%, and 99% confidence levels (darker shades indicate higher confidence). Because the statistic is computed on raw grid counts, patterns largely reflect overall restaurant supply.
Figure 11. Raw-count hotspots (Getis–Ord Gi*) at 500 m × 500 m resolution. Panels (ac): Guangzhou; panels (df): Shenzhen. Red = hotspots, Blue = coldspots; significance is reported at 90%, 95%, and 99% confidence levels (darker shades indicate higher confidence). Because the statistic is computed on raw grid counts, patterns largely reflect overall restaurant supply.
Smartcities 09 00013 g011
Figure 12. Residual hotspots (GLM residuals + Getis–Ord Gi*) at 500 m × 500 m resolution. Standardized residuals are computed from GLMs with total restaurant count as an offset. Panels (ac): Guangzhou; panels (df): Shenzhen. Because the statistic is computed on residuals (observed—expected), the maps highlight structural preferences beyond overall supply.
Figure 12. Residual hotspots (GLM residuals + Getis–Ord Gi*) at 500 m × 500 m resolution. Standardized residuals are computed from GLMs with total restaurant count as an offset. Panels (ac): Guangzhou; panels (df): Shenzhen. Because the statistic is computed on residuals (observed—expected), the maps highlight structural preferences beyond overall supply.
Smartcities 09 00013 g012
Table 1. Cuisine classification with abbreviations and description.
Table 1. Cuisine classification with abbreviations and description.
IDCuisine NameAbbreviationDescription
1Northeastern Chinese CuisineNE ChineseHearty dishes, wheat-based staples
2Yunnan–Guizhou CuisineYunnan–GuizhouSpicy, sour, ethnic minority flavors
3Sichuan CuisineSichuanFamous for chili, numbing spices
4Huizhou CuisineHuizhouAnhui region, stews, umami flavors
5Japanese CuisineJapaneseSushi, ramen, seafood
6Thai–Vietnamese CuisineThai–VietSpicy, sour, fresh herbs
7Hunan CuisineHunanHot, sour, bold flavors
8Cantonese CuisineCantoneseDim sum, mild seasoning, seafood
9Jiangsu CuisineJiangsuFreshwater fish, light sweet taste
10Northwestern Chinese CuisineNW ChineseLamb, wheaten foods, hearty flavors
11Western CuisineWesternEuropean and American styles, grilled meats, dairy
12Hubei CuisineHubeiFreshwater fish, noodles
13Fujian CuisineFujianSeafood, soups, mild sweetness
14Korean CuisineKoreanKimchi, barbecue, rice dishes
15Shandong CuisineShandongSeafood, wheat, light salty flavors
Table 2. Fine-tuning hyperparameters.
Table 2. Fine-tuning hyperparameters.
HyperparameterValue
ModelQwen3-8B
Fine-tuning methodLoRA
LoRA rank (r)8
LoRA α16
LoRA dropout0
LoRA targetAll attention modules
Trainable parameters21,823,488 (≈0.27% of 8.21B backbone)
OptimizerAdamW
Learning rate5 × 10−5
SchedulerCosine decay
Precisionbfloat16
Batch size16
Max sequence length512 tokens
Epochsmax 10 (early stopping at ~4.66)
Training duration (wall-clock)~13.25 h
Dataset splitTrain:Val:Test = 64%:16%:20%
Table 3. Global Moran’s I diagnostics for observed cuisine counts and GLM residuals at the 500 m grid level.
Table 3. Global Moran’s I diagnostics for observed cuisine counts and GLM residuals at the 500 m grid level.
CityCuisineMoran’s I
(Raw)
Moran’s I
(Residual)
ΔI
(Residual—Raw)
GuangzhouCantonese0.24710.0615−0.1856
Hunan0.12550.0848−0.0407
Western0.24050.0187−0.2218
ShenzhenCantonese0.11740.0862−0.0312
Hunan0.10200.0943−0.0077
Western0.18850.0177−0.1708
Note: Moran’s I values for both the raw counts and residuals are statistically significant (all p < 0.001).
Table 4. Overall model performance comparison.
Table 4. Overall model performance comparison.
ModelQwen3-8B
PEFT (LoRA)
GPT-4oDoubao-1.5-ProDeepSeek-R1BERT
Accuracy0.8930.8610.8750.8810.817
Precision0.8350.7770.7770.7900.753
Recall0.8530.8380.8800.8870.678
F1-score0.8430.8010.8130.8270.704
VCR (%)95.82%90.80%86.90%90.54%97.02%
Note: Macro-averaged metrics are computed excluding the Other class. VCR is the proportion of all samples predicted as non-Other (usable fine-grained labels).
Table 5. Per-class F1 by cuisine.
Table 5. Per-class F1 by cuisine.
Cuisine NameQwen3-8B
PEFT (LoRA)
GPT-4oDoubao-1.5-ProDeepSeek-R1BERT
NE Chinese0.8840.7860.8660.8340.798
Yunnan–Guizhou0.8300.7450.8300.8000.672
Sichuan0.9200.9020.9150.9130.856
Huizhou0.9080.8380.8460.9000.625
Japanese0.9310.9350.9410.9390.918
Thai–Viet0.8240.8250.8010.8480.825
Hunan0.8990.8850.8990.9040.815
Cantonese0.8800.8400.8550.8850.790
Jiangsu0.6590.6180.5700.6440.299
NW Chinese0.8000.7360.7220.6900.778
Western0.8850.8710.8910.8880.867
Hubei0.7510.7470.7140.7740.362
Fujian0.7320.6400.6160.6610.555
Korean0.9160.9040.9140.9150.913
Shandong0.8320.7450.8090.8120.484
Table 6. Labeled vs. LLM-classified counts and lift ratios (Guangzhou, Shenzhen).
Table 6. Labeled vs. LLM-classified counts and lift ratios (Guangzhou, Shenzhen).
Cuisine NameGuangzhouShenzhen
Label CountLLM CountLift RatioLabel CountLLM CountLift Ratio
NE Chinese105181117.25146255717.51
Yunnan–Guizhou3071823.933887623.05
Sichuan879943710.74114513,38111.69
Huizhou69115.171414310.21
Japanese61130244.9549325625.20
Thai–Viet462034.41301324.40
Hunan121358434.82178499265.56
Cantonese220345,26720.55208438,16218.31
Jiangsu36120.3389111.38
NW Chinese72182425.33123269821.93
Western47614,92631.3635912,98336.16
Hubei5090618.1287137215.77
Fujian46190041.3069233133.78
Korean202234111.59169295917.51
Shandong721330.431336628.15
Total594988,56514.89656290,53913.80
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Huang, Z.; Guo, Y.; Huang, S.; Zhao, M. A Lightweight LLM-Based Semantic–Spatial Inference Framework for Fine-Grained Urban POI Analysis. Smart Cities 2026, 9, 13. https://doi.org/10.3390/smartcities9010013

AMA Style

Huang Z, Guo Y, Huang S, Zhao M. A Lightweight LLM-Based Semantic–Spatial Inference Framework for Fine-Grained Urban POI Analysis. Smart Cities. 2026; 9(1):13. https://doi.org/10.3390/smartcities9010013

Chicago/Turabian Style

Huang, Zhuo, Yixing Guo, Shuo Huang, and Miaoxi Zhao. 2026. "A Lightweight LLM-Based Semantic–Spatial Inference Framework for Fine-Grained Urban POI Analysis" Smart Cities 9, no. 1: 13. https://doi.org/10.3390/smartcities9010013

APA Style

Huang, Z., Guo, Y., Huang, S., & Zhao, M. (2026). A Lightweight LLM-Based Semantic–Spatial Inference Framework for Fine-Grained Urban POI Analysis. Smart Cities, 9(1), 13. https://doi.org/10.3390/smartcities9010013

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop