A Novel Approach for Improving XML Querying over Wireless Broadcast Channels
Abstract
1. Introduction
2. Related Work
3. Proposed Work
- Read the XML data and the query.
- Divide the XML data stream into several partitions based on criteria, such as document size (small/large), document type (role/schema), or document content (key characteristics/metadata).
- Assign each partition to a different wireless broadcast channel, ensuring that the load on each channel is balanced.
- Broadcast the XML data partitions over their respective channels.
- Implement a query routing mechanism that can route the queries to the appropriate channel or combination of channels that contain the relevant XML data partition.
- Forward the query result to the mobile device.
3.1. Architecture Diagram for Proposed XML Querying Scheme
1. Reading the XML Data Stream: The system starts by reading the input XML data stream.
2. Choose Partitioning Criterion: Select the criterion for partitioning, such as document size, document type, or document content.
3. For Each Document in the XML Data Stream: Iterate through each document in the XML data stream.
4. Determine the Value of the Chosen Criterion: For each document, determine the value of the chosen partitioning criterion.
5. Create Partition for Each Unique Criterion Value: Create a separate partition for each unique value of the chosen criterion.
6. Assign Document to Corresponding Partition: Assign the document to the partition that corresponds to its criterion value.
7. Repeat Steps 3–6 for All Documents: Continue processing each document in the XML data stream according to the above steps.
8. Output Partitions as Separate XML Data Streams: Output the resulting partitions as separate XML data streams.
 
- Divide the XML Data Stream:
  - XML Data Source: The origin of the XML data stream.
  - Partitioning Process: The data stream is divided into several partitions based on predefined criteria such as document size, type, or content.
- Assign Partitions to Wireless Broadcast Channels: Each partition is assigned to a different wireless broadcast channel, ensuring balanced load distribution across all channels.
- Broadcast the XML Data Partitions:
  - Wireless Broadcast Channels: Each channel broadcasts its assigned XML data partition.
- Implement Query Routing Mechanism:
  - Query Router: Directs user queries to the appropriate channels.
  - Query Interface: Where users input their queries.
  - Routing Logic: Routes the query to the relevant channel or channels based on the XML data partitions they contain.
  - Result Collection and Combination: Collects results from each channel and combines them to generate the final query result.
- Result Delivery: Returns the combined query results to the user.
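
The partitioning steps listed above (choose a criterion, evaluate it per document, group documents by its value) can be sketched as follows. This is a minimal illustration rather than the authors' implementation: the `Document` record, its fields, and the 1 MB threshold in `size_class` are assumptions introduced only for the example.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Callable, Dict, Hashable, Iterable, List


@dataclass
class Document:
    """Hypothetical stand-in for one XML document in the stream."""
    name: str
    doc_type: str      # e.g. the schema or role of the document
    size_bytes: int


def partition_stream(
    documents: Iterable[Document],
    criterion: Callable[[Document], Hashable],
) -> Dict[Hashable, List[Document]]:
    """Group documents by the value of the chosen partitioning criterion."""
    partitions: Dict[Hashable, List[Document]] = defaultdict(list)
    for doc in documents:              # iterate through each document
        key = criterion(doc)           # determine the criterion value
        partitions[key].append(doc)    # create the partition if new, then assign
    return dict(partitions)


def size_class(doc: Document) -> str:
    # Assumed threshold: documents under 1 MB count as "small".
    return "small" if doc.size_bytes < 1_000_000 else "large"


stream = [
    Document("a.xml", "catalog", 200_000),
    Document("b.xml", "log", 5_000_000),
    Document("c.xml", "catalog", 800_000),
]
by_type = partition_stream(stream, lambda d: d.doc_type)
by_size = partition_stream(stream, size_class)
```

Passing the criterion as a function keeps the loop identical for size-, type-, and content-based partitioning; only the key function changes.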
3.2. Detailed Description of the Architecture Components
- (i) Mobile Device: On the mobile device, users enter queries into a Query Interface; the Query Router then routes these queries to the appropriate channels or retrieves cached results if available. The Query Decomposer divides complex queries into sub-queries and consults the cache to check for existing results or to replace them with new ones.
  - Query Interface: Where users input their queries.
  - Query Router: Routes queries to the appropriate channels or cached results.
  - Query Decomposer/Cache: Decomposes complex queries into sub-queries and checks/updates the cache.
- (ii) XML Data Partitions: The XML data are divided into segments based on specific criteria such as document size, type, or content, and each partition is assigned to a separate wireless broadcast channel. This approach distributes the load and allows each channel to broadcast its assigned XML data segment efficiently.
  - Document Size: Document size partitioning groups XML documents of comparable sizes together. For instance, one partition might include smaller documents, while another might contain larger documents. Because smaller files may be stored and retrieved faster, this method can improve response times for simpler queries in networks where smaller files are accessed more frequently. Larger documents, on the other hand, can be streamed independently so that they do not consume resources required for smaller, faster-access files.
  - Document Type: Document type partitioning arranges the XML data according to the role or schema structure of the document. “Customer records,” “product catalogs,” and “transaction logs” are a few examples. Sorting by type directs particular query types to the appropriate channel and eliminates the need to search through irrelevant kinds of data. This method works well when each type has a different access pattern or when queries are type-specific.
  - Document Content: Content-based partitioning divides documents according to specific important attributes or metadata found in the XML content itself. XML documents could be divided, for instance, by date, topic, or geographic location. Because the partitions reflect the data being queried, queries aimed at particular content categories can be routed effectively to the right channels.
- (iii) Wireless Broadcast Channels: Each partition is assigned to a different channel to balance the load and broadcast the data.
- (iv) XML Query Results Aggregator: After the queries are processed, the mobile device’s XML Query Results Aggregator combines the results from the various channels or cached data to create the final query result, which is then displayed to the user.
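
The text does not specify how the load is balanced when partitions are assigned to channels. One common choice, shown here purely as an assumption, is a greedy heuristic that places the largest remaining partition on the currently least-loaded channel. The function name `assign_partitions` and the partition labels are hypothetical.

```python
import heapq
from typing import Dict, List, Tuple


def assign_partitions(
    partition_sizes: Dict[str, float], num_channels: int
) -> Dict[int, List[str]]:
    """Greedy assignment: largest partition first, to the least-loaded channel."""
    if num_channels < 1:
        raise ValueError("num_channels must be at least 1")
    # Min-heap of (current load in MB, channel id).
    heap: List[Tuple[float, int]] = [(0.0, ch) for ch in range(num_channels)]
    heapq.heapify(heap)
    assignment: Dict[int, List[str]] = {ch: [] for ch in range(num_channels)}
    # Sort by descending size; ties are broken by name for determinism.
    ordered = sorted(partition_sizes.items(), key=lambda kv: (-kv[1], kv[0]))
    for name, size in ordered:
        load, ch = heapq.heappop(heap)
        assignment[ch].append(name)
        heapq.heappush(heap, (load + size, ch))
    return assignment


# Five equal 2 MB partitions on five channels: one partition per channel.
sizes_mb = {"p1": 2.0, "p2": 2.0, "p3": 2.0, "p4": 2.0, "p5": 2.0}
channels = assign_partitions(sizes_mb, num_channels=5)

# Fewer channels than partitions: loads are evened out instead.
uneven = assign_partitions({"a": 5.0, "b": 3.0, "c": 2.0}, num_channels=2)
```

When the number of partitions equals the number of channels, this reduces to the one-partition-per-channel assignment described above.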
3.3. Workflow of the Proposed XML Querying Scheme
- (i) Partition the XML data: The proposed method for improving XML querying over wireless broadcast channels begins by dividing the XML data into partitions.
- (ii) Assign partitions to channels: Each partition is then assigned to, and broadcast over, a specific channel.
- (iii) Query routing: When a query is received, it is routed via the Query Router.
  - Query Decomposition: If necessary, the query is split into sub-queries and routed to the relevant partitions.
  - Query Caching: The system checks whether the results are already cached; if not, the query is executed, and the results are cached for later use.
  - Query Broadcasting: Queries can be broadcast to all channels if required.
- (iv) Aggregate results: The results from the partitions or cache are aggregated to form the final result.
- (v) Return query results: The aggregated results are returned to the user.
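
The workflow above (decompose, check the cache, route to channels, aggregate) can be illustrated with a toy router. Everything in this sketch is an assumption made for illustration: the `QueryRouter` class, the representation of a query as a list of partition keys, and the `key:value` record format are not taken from the paper.

```python
from typing import Dict, List, Tuple


class QueryRouter:
    """Toy router: one sub-query per partition key, with a result cache."""

    def __init__(self, channel_of: Dict[str, int],
                 channel_data: Dict[int, List[str]]):
        self.channel_of = channel_of        # partition key -> channel id
        self.channel_data = channel_data    # channel id -> broadcast records
        self.cache: Dict[Tuple[str, ...], List[str]] = {}
        self.channel_reads = 0              # how often a channel was listened to

    def decompose(self, keys: List[str]) -> List[str]:
        # One sub-query per distinct partition key, in a stable order.
        return sorted(set(keys))

    def query(self, keys: List[str]) -> List[str]:
        sub_queries = self.decompose(keys)
        cache_key = tuple(sub_queries)
        if cache_key in self.cache:         # cache hit: no channel access
            return self.cache[cache_key]
        results: List[str] = []
        for key in sub_queries:
            channel = self.channel_of.get(key)
            if channel is None:
                # Unknown partition: fall back to listening on all channels.
                targets = sorted(self.channel_data)
            else:
                targets = [channel]
            for ch in targets:
                self.channel_reads += 1
                results.extend(
                    r for r in self.channel_data[ch] if r.startswith(key + ":")
                )
        self.cache[cache_key] = results     # keep for identical later queries
        return results


router = QueryRouter(
    channel_of={"customer": 0, "product": 1},
    channel_data={0: ["customer:alice", "customer:bob"], 1: ["product:lamp"]},
)
first = router.query(["product", "customer"])
second = router.query(["customer", "product"])  # same sub-queries, from cache
```

The second call touches no channel because both queries decompose to the same sub-queries, which is the saving that query caching is meant to provide.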
3.4. Proposed Algorithm
| Algorithm 1: An algorithm for the proposed scheme for improving XML querying over wireless broadcast channels | 
| Inputs: XML data stream, wireless broadcast channels, query Outputs: Query results 
 | 
| Algorithm 2: An algorithm to partition an XML data stream into several partitions based on criteria | 
| Inputs: XML data stream Outputs: Partitions of XML data stream 
 | 
3.5. Implementation
4. Analysis
4.1. Description of the Parameters of Concern
- (i) Partitioning the XML data stream: The time required to partition an XML data stream is determined by the partitioning criteria and the data stream’s size and complexity. In general, partitioning takes linear time, so the access time can be estimated as O(n), where n is the size of the XML data stream.
- (ii) Assigning partitions to wireless broadcast channels: The time required for this step varies according to the number of wireless broadcast channels and the load-balancing algorithm used. In general, load balancing can be performed in either constant or logarithmic time, so access time can be estimated as O(1) or O(log n), where n is the number of wireless broadcast channels.
- (iii) Broadcasting XML data partitions: The time required to broadcast the XML data partitions varies according to network bandwidth and partition size. Broadcasting is typically performed in linear time, so the access time can be estimated as O(m), where m is the total size of the XML data partitions.
- (iv) Query routing mechanism:
  - Query decomposition: The time required for query decomposition varies according to the complexity of the query and the number of partitions involved. In general, query decomposition can take either linear or logarithmic time, so access time can be estimated as O(k log n) or O(k), where k is the number of partitions involved.
  - Query caching: The time required for query caching is determined by the cache lookup time and network latency. If the query and its results are already cached, the estimated access time is O(1). If the query is not cached, the access time can be calculated as O(log n) or O(1), depending on the query routing algorithm used.
  - Query broadcasting: The time required to broadcast a query is determined by the network bandwidth and the query’s size. In general, broadcasting can be performed in linear time, so the access time can be estimated as O(p), where p is the query size.
- (v) Combining query results: The time required to combine query results is determined by the number of partitions used and the complexity of the result aggregation algorithm. In general, result aggregation can take either linear or logarithmic time, so access time can be estimated as O(k log n) or O(k), where k is the number of partitions involved.
- (vi) Returning query results: The time required to return query results is determined by the network latency and the size of the result set. Returning results typically takes linear time, so the access time can be estimated as O(r), where r is the size of the result set.
- Tpartition is the time required to partition the XML data stream.
- Ttransmission is the time needed to transmit the data over the wireless broadcast channels.
- Trouting is the time taken to route the query to the appropriate channel(s).
- Tprocessing is the time required to process the query and retrieve the relevant data.
- Partitioning Time (Tpartition)
- Transmission Time (Ttransmission)
- Routing Time (Trouting)
- Processing Time (Tprocessing)
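
The four components listed above appear to combine additively: in the comparison table for the proposed scheme, the component times 0.4 s, 2 s, 1 s, and 2 s sum to the reported total of 5.4 s. Under that reading, the total access time can be written as below. The symbol $T_{\text{access}}$ is a label introduced here for the total and is an assumption; the component symbols follow the definitions above.

```latex
T_{\text{access}} = T_{\text{partition}} + T_{\text{transmission}} + T_{\text{routing}} + T_{\text{processing}}
```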
4.2. Sample Data
- (i) XML Data Size and Complexity:
  - Size of XML data (sizexml): 10 MB
  - Complexity of XML data (complexityxml): Medium (2)
- (ii) Data Partitioning:
  - Number of partitions: 5
  - Time to partition each MB of XML data: 0.1 s per MB
  - Partitioning time for each partition (Tpartition): sizexml × 0.1 × complexityxml/numberpartitions
- (iii) Transmission:
  - Size of each partition (sizepartition): 2 MB (since 10 MB/5 partitions)
  - Bandwidth of each channel (bandwidthchannel): 1 MBps
  - Number of channels (numberchannels): 5
  - Transmission time for each partition
- (iv) Routing:
  - Complexity of routing (complexityrouting): Low (1)
  - Routing time (Trouting): 0.5 s
- (v) Query Processing:
  - Complexity of query (complexityquery): High (3)
  - Efficiency of processing (efficiencyprocessing): Medium (processing time is 1 s per query complexity unit)
  - Processing time (Tprocessing): complexityquery × efficiencyprocessing
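
As a worked example, the sample values above can be plugged into the stated formulas. The sketch below makes two assumptions that the list does not spell out: the transmission time of a partition is taken as its size divided by the channel bandwidth, and the total access time is taken as the sum of the four components. The function and parameter names are hypothetical.

```python
from typing import Dict


def access_time(
    size_xml_mb: float,
    complexity_xml: int,
    num_partitions: int,
    bandwidth_mbps: float,
    routing_time_s: float,
    complexity_query: int,
    processing_s_per_unit: float,
    partition_s_per_mb: float = 0.1,
) -> Dict[str, float]:
    """Return each time component and their sum, all in seconds."""
    # Stated formula: size x 0.1 s/MB x complexity / number of partitions.
    t_partition = (
        size_xml_mb * partition_s_per_mb * complexity_xml / num_partitions
    )
    # Assumption: transmission time = partition size / channel bandwidth.
    size_partition_mb = size_xml_mb / num_partitions
    t_transmission = size_partition_mb / bandwidth_mbps
    # Stated formula: query complexity x processing time per complexity unit.
    t_processing = complexity_query * processing_s_per_unit
    # Assumption: the total is the plain sum of the four components.
    total = t_partition + t_transmission + routing_time_s + t_processing
    return {
        "partition": t_partition,
        "transmission": t_transmission,
        "routing": routing_time_s,
        "processing": t_processing,
        "total": total,
    }


sample = access_time(
    size_xml_mb=10, complexity_xml=2, num_partitions=5, bandwidth_mbps=1,
    routing_time_s=0.5, complexity_query=3, processing_s_per_unit=1,
)
print({name: round(value, 2) for name, value in sample.items()})
```

With these inputs the components come to 0.4 s for partitioning, 2 s for transmission, 0.5 s for routing, and 3 s for processing, for a total of 5.9 s under the stated assumptions.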
 
4.3. Sample Data Ranges
- Size of XML data (sizexml): 5 MB, 10 MB, 15 MB
- Complexity of XML data (complexityxml): Low (1), Medium (2), High (3)
- Number of partitions: 3, 5, 7
- Bandwidth of each channel (bandwidthchannel): 1 MBps, 2 MBps
- Number of channels (numberchannels): 3, 5, 7
- Complexity of routing (complexityrouting): Low (1), Medium (2), High (3)
- Complexity of query (complexityquery): Low (1), Medium (2), High (3)
- Efficiency of processing (efficiencyprocessing): Low (0.5 s per unit), Medium (1 s per unit), High (1.5 s per unit)
- Low: 0.5 s
- Medium: 1.0 s
- High: 1.5 s
5. Comparative Study
5.1. Comparison of Access Time of the Proposed Scheme with Various Data Placement Schemes
- (i) The proposed scheme implements data partitioning, load balancing, and query routing.
- (ii) The broadcast disk method rotates data into predefined segments [17].
- (iii) Index-based methods use indexing to improve query performance [18].
- (iv) Sequential broadcast means broadcasting all data sequentially [19].
- Size of XML data (sizexml): 10 MB
- Complexity of XML data (complexityxml): Medium (2)
- Number of partitions (for Proposed Scheme): 5
- Bandwidth of each channel (bandwidthchannel): 1 MBps
- Number of channels (numberchannels): 5
- Routing Complexity (for Proposed Scheme): Medium (2)
- Query Complexity: Medium (2)
- Processing Efficiency: Medium (1 s per unit)
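
To make the comparison easier to check, the short script below sums the per-component times for each scheme. The component values are copied from the access-time comparison table for this section; treating the total as a plain sum is an assumption, although it matches the totals reported there. The dictionary layout and variable names are illustrative only.

```python
# Component times in seconds, copied from the access-time comparison table.
components = {
    "Proposed Scheme": {
        "partitioning": 0.4, "transmission": 2, "routing": 1, "processing": 2,
    },
    "Broadcast Disk Method": {
        "partitioning": 0, "transmission": 2, "routing": 1, "processing": 2,
    },
    "Index-Based Method": {
        "partitioning": 0, "transmission": 2, "routing": 1, "processing": 2,
    },
    "Sequential Broadcast": {
        "partitioning": 0, "transmission": 10, "routing": 5, "processing": 2,
    },
}

# Total access time per scheme, assuming the components simply add up.
totals = {
    scheme: round(sum(parts.values()), 2)
    for scheme, parts in components.items()
}

# Schemes ordered from fastest to slowest total access time.
ranked = sorted(totals, key=totals.get)

# Ratio of sequential broadcast time to the proposed scheme's time.
ratio_vs_sequential = totals["Sequential Broadcast"] / totals["Proposed Scheme"]

for scheme in ranked:
    print(f"{scheme}: {totals[scheme]} s")
```

On these figures, the difference between the proposed scheme and sequential broadcast comes almost entirely from the transmission and routing components, while the partitioning step adds 0.4 s that the other schemes do not incur.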
5.2. Comparison of Various Performance Metrics of the Proposed Scheme with Various XML Data Placement Schemes
- Query Response Time (QRT): The time it takes to receive results after issuing a query.
- Network Traffic (NT): The total amount of data sent across the network during the querying process, including any overhead.
- Resource Utilization (RU): The extent to which system resources (CPU, memory, and bandwidth) are utilized, calculated as the percentage of each resource used during the query.
Sample Data and Assumptions
- Size of XML data (sizexml): 10 MB
- Number of Partitions: 5
- Bandwidth of each channel (bandwidthchannel): 1 MBps
- Query Complexity: Medium (2 units)
- Processing Efficiency: Medium (1 s per unit)
- Routing Complexity for Proposed Scheme: Medium (1 s)
5.3. Comparison of Access Time of the Proposed Scheme with Centralized Querying and Unpartitioned Broadcasting XML Data Placement Schemes
Assumptions and Sample Data
- Size of XML data (sizexml): 10 MB
- Complexity of XML data (complexityxml): Medium (2)
- Number of partitions (for Proposed Scheme): 5
- Bandwidth of each channel (bandwidthchannel): 1 MBps
- Number of channels (numberchannels): 5
- Routing Complexity for Proposed Scheme: Medium (1 s)
- Query Complexity: Medium (2 units)
- Processing Efficiency: Medium (1 s per unit)
- Centralized Querying Complexity: Additional 2 s for centralized processing
5.4. Comparison of Various Performance Metrics of the Proposed Scheme with Centralized Querying and Unpartitioned Broadcasting XML Data Placement Schemes
- (i) Query Response Time (QRT): Time taken from issuing a query to receiving the results.
- (ii) Network Traffic (NT): Amount of data transmitted over the network during the query process.
- (iii) Resource Utilization (RU): Utilization of system resources (CPU, memory, bandwidth) during the query process.
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Imielinski, T.; Badrinath, B.R. Data management for mobile computing. ACM SIGMOD Rec. 1993, 22, 34–39.
- Goel, V.; Panwar, G.; Ahlawat, A.K. Energy efficient air indexing schemes for single and multi-level wireless channels. In Proceedings of the 2013 3rd IEEE International Advance Computing Conference (IACC), Ghaziabad, India, 22–23 February 2013; pp. 525–530.
- Hu, Q.; Lee, D.L.; Lee, W.C. A comparison of indexing methods for data broadcast on the air. In Proceedings of the Twelfth International Conference on Information Networking (ICOIN-12), Tokyo, Japan, 21–23 January 1998; pp. 656–659.
- Chung, Y.D.; Kim, M.H. An index replication scheme for wireless data broadcasting. J. Syst. Softw. 2000, 51, 191–199.
- Chung, Y.D. An indexing scheme for energy-efficient processing of content-based retrieval queries on a wireless data stream. Inf. Sci. 2007, 177, 525–542.
- Jing, J.; Elmagarmid, A.; Helal, A.; Alonso, R. Bit-sequences: An adaptive cache invalidation method in mobile client/server environments. Mob. Netw. Appl. 1997, 2, 115–127.
- Lee, C.C.; Leu, Y. Efficient data broadcast schemes for mobile computing environments with data missing. Inf. Sci. 2005, 172, 335–359.
- Sheu, T.L.; Wu, Y.J. A preemptive channel allocation scheme for multimedia traffic in mobile wireless networks. Inf. Sci. 2006, 176, 217–236.
- Chung, Y.D.; Lee, J.Y. An indexing method for wireless broadcast XML data. Inf. Sci. 2007, 177, 1931–1953.
- Shekarriz, M.; Babamir, S.M.; Mirabi, M. Query processing optimization in broadcasting XML data in mobile communications. J. Supercomput. 2021, 77, 5354–5380.
- Prabhavathy, P.; Bose, S.; Kannan, A. Energy efficient xpath query processing on wireless xml streaming data. Comput. Inform. 2015, 34, 1289–1308.
- Goel, V.; Gautam, D.; Gupta, A.; Kumar, S. An improvised indexing technique for XML data over multiple channels in wireless environment: (1, Xm) method. Int. J. Commun. Syst. 2019, 32, e4122.
- Kumar, D.; Jha, V.K. A review on recent trends in query processing and optimization in big data. Wirel. Pers. Commun. 2022, 124, 633–654.
- Liu, X. Wireless network communication in the XML metadata storage of Wushu historical archives. Wirel. Commun. Mob. Comput. 2021, 2021, 5171713.
- Vats, P.; Aalam, Z.; Kaur, S.; Kaur, A.; Gehlot, N. A hybrid approach for retrieving geographic information in wireless environment using indexing technique. In Proceedings of the ICT Analysis and Applications 2021, Goa, India, 5–6 August 2021; Springer: Singapore, 2022; pp. 145–155.
- Reimers, U. Digital Video Broadcasting (DVB): The International Standard for Digital Television; Springer Science & Business Media: Berlin, Germany, 9 March 2013.
- Acharya, S.; Franklin, M.; Zdonik, S. Dissemination-based data delivery using broadcast disks. IEEE Pers. Commun. 1995, 2, 50–60.
- Imielinski, T.; Viswanathan, S.; Badrinath, B.R. Data on air: Organization and access. IEEE Trans. Knowl. Data Eng. 1997, 9, 353–372.
- Hu, Q.; Lee, W.C.; Lee, D.L. A hybrid index technique for power efficient data broadcast. Distrib. Parallel Databases 2001, 9, 151–177.
- Sumari, P.; Darus, R.M.; Rahiman, A.R. A Broadcast Disk scheme for mobile information system. J. Comput. Sci. Technol. 2010, 10, 8–13.
- Javani, M. MIRABIMD An Efficient Index and Data Distribution Scheme for XML Data Broadcast in Mobile Wireless Networks. J. Inf. Sci. Eng. 2017, 33.
- Qin, Y.; Sheng, Q.Z.; Wang, H.; Falkner, N.J. Organizing XML data in a wireless broadcast system by exploiting structural similarity. Wirel. Pers. Commun. 2018, 98, 1299–1329.
- Subramaniam, S.; Haw, S.C.; Soon, L.K. Improved centralized XML query processing using distributed query workload. IEEE Access 2021, 9, 29127–29142.
- Fathi Ozonbolagh, S.; Mirabi, M. Efficient XML data placement schemes over multiple mobile wireless broadcast channels. J. Supercomput. 2022, 78, 168–199.






| Size of XML (MB) | XML Complexity | Partitions | Bandwidth (MBps) | Channels | Routing Complexity | Query Complexity | Processing Efficiency (s/unit) | Access Time (s) | 
|---|---|---|---|---|---|---|---|---|
| 5 | Low (1) | 3 | 1 | 3 | Low (1) | Low (1) | 0.5 | 3.17 | 
| 10 | Medium (2) | 5 | 1 | 5 | Medium (2) | Medium (2) | 1 | 6.9 | 
| 15 | High (3) | 7 | 2 | 7 | High (3) | High (3) | 1.5 | 9.64 | 
| 10 | Low (1) | 5 | 2 | 5 | Low (1) | High (3) | 0.5 | 5.9 | 
| 5 | High (3) | 3 | 1 | 3 | Medium (2) | Medium (2) | 1 | 6.67 | 
| 15 | Medium (2) | 7 | 1 | 7 | Low (1) | Low (1) | 1.5 | 7.5 | 
| Parameter | Proposed Scheme | Broadcast Disk Method [20] | Index-Based Method [21] | Sequential Broadcast [22] |
|---|---|---|---|---|
| Partitioning Time (seconds) | 0.4 | 0 | 0 | 0 |
| Transmission Time (seconds) | 2 | 2 | 2 | 10 |
| Routing Time (seconds) | 1 | 1 | 1 | 5 |
| Processing Time (seconds) | 2 | 2 | 2 | 2 |
| Total Access Time (seconds) | 5.4 | 5 | 5 | 17 |
| Metric | Proposed Scheme | Broadcast Disk Method [20] | Index-Based Method [21] | Sequential Broadcast [22] | 
|---|---|---|---|---|
| Query Response Time (seconds) | 5.4 | 5 | 5 | 17 | 
| Network Traffic (MB) | 10 | 10 | 11 | 10 | 
| CPU Utilization (%) | 50 | 30 | 60 | 70 | 
| Memory Utilization (%) | 40 | 30 | 50 | 60 | 
| Bandwidth Utilization (%) | 60 | 50 | 50 | 80 | 
| Parameter | Proposed Scheme | Centralized Querying [23] | Unpartitioned Broadcasting [24] |
|---|---|---|---|
| Partitioning Time (seconds) | 0.4 | NA | NA |
| Transmission Time (seconds) | 2 | 10 | 10 |
| Routing Time (seconds) | 1 | 2 | 2 |
| Processing Time (seconds) | 2 | 2 | 2 |
| Total Access Time (seconds) | 5.4 | 14 | 14 |
| Metric | Proposed Scheme | Centralized Querying [23] | Unpartitioned Broadcasting [24] | 
|---|---|---|---|
| Query Response Time (seconds) | 5.4 | 24 | 17 | 
| Network Traffic (MB) | 10 | 20 | 10 | 
| CPU Utilization (%) | 50 | 70 | 60 | 
| Memory Utilization (%) | 40 | 60 | 50 | 
| Bandwidth Utilization (%) | 60 | 80 | 70 | 
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ahlawat, V.K.; Agarwal, G.; Goel, V.; Sanghi, A.; Choi, S.Y.; Hui, K.L.; Sain, M. A Novel Approach for Improving XML Querying over Wireless Broadcast Channels. Sensors 2024, 24, 7206. https://doi.org/10.3390/s24227206