Pre- and Post-Processing Algorithms with Deep Learning Classifier for Wi-Fi Fingerprint-Based Indoor Positioning

Haider, Amir; Wei, Yiqiao; Liu, Shuzhi; Hwang, Seung-Hoon

doi:10.3390/electronics8020195

Open AccessFeature PaperArticle

Pre- and Post-Processing Algorithms with Deep Learning Classifier for Wi-Fi Fingerprint-Based Indoor Positioning

Division of Electronics and Electrical Engineering, Dongguk University-Seoul, Seoul 04620, Korea

^*

Author to whom correspondence should be addressed.

Electronics 2019, 8(2), 195; https://doi.org/10.3390/electronics8020195

Submission received: 28 November 2018 / Revised: 3 February 2019 / Accepted: 4 February 2019 / Published: 8 February 2019

(This article belongs to the Special Issue Challenges and Opportunities of IoT Deployments—Avoiding the Internet of Junk)

Download

Browse Figures

Versions Notes

Abstract

:

To accommodate the rapidly increasing demand for connected infrastructure, automation for industrial sites and building smart cities, the development of Internet of Things (IoT)-based solutions is considered one of the major trends in modern day industrial revolution. In particular, providing high precision indoor positioning services for such applications is a key challenge. Wi-Fi fingerprint-based indoor positioning systems have been adapted as promising candidates for such applications. The performance of such indoor positioning systems degrade drastically due to several impairments like noisy datasets, high variation in Wi-Fi signals over time, fading of Wi-Fi signals due to multipath propagation caused by hurdles, people walking in the area under consideration and the addition/removal of Wi-Fi access points (APs). In this paper, we propose data pre- and post-processing algorithms with deep learning classifiers for Wi-Fi fingerprint-based indoor positioning, in order to provide immunity against limitations in the database and the indoor environment. In addition, we investigate the performance of the proposed system through simulation as well as extensive experiments. The results demonstrate that the pre-processing algorithm can efficiently fill in the missing Wi-Fi received signal strength fingerprints in the database, resulting in a success rate of 88.96% in simulation and 86.61% in a real-time experiment. The post-processing algorithm can improve the results from 9.05–10.94% for the conducted experiments, providing the highest success rate of 95.94% with a precision of 4 m for Wi-Fi fingerprint-based indoor positioning.

Keywords:

Wi-Fi fingerprint; indoor positioning; IoT; data pre- and post-processing; deep learning

1. Introduction

In the past decade, indoor location-based services (LBSs) have attracted lots of attention, driving the development of indoor positioning technologies to fulfil the technological requirements of modern day communications services. One of the key research issues is the development of a precise and reliable indoor positioning system to improve the inadequate performance of current positioning systems in indoor environment [1]. While global positioning systems (GPS) are unavailable in indoor environments due to the low penetration power of microwaves in indoor locations, several other candidates such as radio waves, acoustic signals, magnetic field, Wi-Fi, and other sensors information from mobile devices [2,3,4] can serve the purpose of determining the user location in indoor environments. Among the mentioned technologies, Wi-Fi signals-based indoor positioning is the most popular due to its wide deployment and the low cost of Wi-Fi networks [5]. Wi-Fi is considered more suitable in complex environments. Wi-Fi-based positioning technology can be classified into two types, time-space attribute methods and received signal strength (RSS)-based methods [6]. The latter is also known as fingerprinting and is one of the most popular indoor positioning technologies. Unlike measurement-based methods, fingerprint-based methods can be easily implemented without additional hardware. Generally, the process of Wi-Fi fingerprint-based indoor positioning consists of two phases: offline and online. In the offline phase, the radio map or fingerprint database is constructed by measuring and recording the RSS of Wi-Fi signals from different access points (APs). In the online phase, the real-time RSS measurements from the user are matched and compared with the fingerprints in the database via classification algorithms to estimate the position of the user [7]. The fingerprint database can be constructed using an IoT device or smartphone. This database is then used to train the classifier to be able to predict user position in a real indoor environment with desired target locations. A high-performance positioning server is in charge of database storage and position prediction (classification task). Machine learning and deep learning-based classifiers can be employed to handle this classification task at the positioning server [8,9,10,11]. Figure 1 shows the working procedure for a Wi-Fi fingerprint-based indoor positioning system.

In the literature, several Wi-Fi fingerprint-based indoor positioning systems have been proposed which intend to improve specific location prediction and the overall system success rate. An efficient indoor positioning technique is presented in [12,13] and employs Wi-Fi fingerprinting along with trilateration. The trilateration technique employs knowledge of distance for three known APs at every target location while constructing the RSS fingerprint database. Similarly, a contour-based trilateration technique was presented in [14] where the RPs with same signal levels were combined to form a contour and the location was estimated. This approach highly depends on the position and orientation of the desired APs. A novel indoor positioning scheme which operates robustly by extracting the multipath propagation delay profile of Wi-Fi signals was presented in [15]. With this approach, changes in the indoor environment like multipath propagation can be efficiently compensated to improve system robustness. An experimental evaluation of location methods based on RSS measurements was presented in [16]. This work focused on VHF frequencies and outdoor positioning system and did not take Wi-Fi RSS fingerprint-based indoor positioning into consideration. To improve the fingerprinting database construction, an algorithm termed “Slide” is presented in [17], where the RSS fingerprint is recorded using a smartphone. This algorithm is limited only to linear positioning, which is not the case in most complex-structured indoor environments. As mentioned earlier, the time-varying nature of Wi-Fi signals and time consumed in the development of the RSS fingerprint database are major constraints in adopting Wi-Fi RSS measurements for indoor positioning. To address this issue, a gradient-based approach to stabilize the RSS gradient of Wi-Fi signals was proposed in [18], but an accuracy of 5.6 m was reported, which is not very practical when the indoor environment is considered. The RSS measurements are prone to environmental dynamics such as weather conditions, which can result in degradation of accuracy of distance estimation. Considering this issue, the rain attenuation effect was included in the pathloss model in order to estimate the impact of precipitation in the outdoor environment [19]. To address time consumption in the construction of the RSS fingerprint database, an autonomous crowdsourcing approach using multiple smartphones is described in [20]. However, autonomous behaviour for database collection is implemented by employing an existing trusted portable navigator in the target location, which is also immune to variations in the indoor environment, resulting in an estimation error greater than 5.7 m, close to that of the above mentioned gradient-based approach. Another approach to reduce time consumption in RSS fingerprint database construction is to analytically estimate the RSS distribution instead of manual data collection. To estimate the RSS distribution for APs, a probabilistic approach is described in [21], where a single- and double-peak Gaussian distribution is employed for the AP signals. The extensive efforts to improve the accuracy of RSS-based centroid localization algorithm in indoor environment were presented in [22]. However, the focus was on wireless sensor network nodes and the indoor area for the experiment was limited to a single room of 6 × 7 square meters. The analysis cannot be generally applied for large area. An extensive overview of the offline evaluation of smartphone-based indoor positioning techniques demonstrated at International Conference on Indoor Positioning and Indoor Navigation (IPIN) competition 2017 was presented in [23]. The authors concluded that the database for fingerprint algorithm would be recorded in a static manner or at very slow continuous motion such as 0.2 m/s. Additionally, the issue of training database density was addressed in [24], where the fingerprint-based approaches were preferred with a dense database.

Both machine learning and deep learning-based classifiers are popular for handling the matching and classification tasks. Recently, the deep learning gains more attention because of its high accuracy with huge amount of trained data [25]. Compared to the machine learning techniques, the deep learning attempts to learn high-level features from data in an incremental manner, which can eliminate the need of domain expertise and hard core feature extraction [26]. Regarding the training time, the deep learning algorithms take longer time in the offline phase due to the large number of parameters. However, the scenario is totally reversed in the online phase where the deep learning takes much less time to run [27]. That is, the deep learning outperforms in the case of large data size. Keeping the above-mentioned aspects in minds, we employ the deep learning-based classifier in this work.

Several classifiers have been introduced to indoor localization to handle the matching task, which gives rise to a complicated multi-classes classification problem. To address the challenge of developing an efficient classifier for the positioning server, [28] presents several machine learning approaches including kNN, a rules-based classifier (JRip), and random forest, employing them for Wi-Fi fingerprint-based indoor positioning systems. The results indicate that the Random Forest classifier offers the best performance. Random forest is one of the most powerful machine learning algorithms available today [29]. It can handle both classification and regression tasks and has been widely used in various applications such as Internet traffic interception, voice and image classification. However, it has been rarely utilized in indoor positioning. An on-device learning approach is depicted in [30], where a mobile application allows users to build and offline train their own RSS maps. A deep learning-based classifier for indoor position is presented in [31]; however, it employs channel state information (CSI) instead of the Wi-Fi RSS value, which is our focus in this work. To the best of our knowledge, the only reported work on deep learning-based Wi-Fi fingerprinting is done by [32]; however, database collection is not addressed and the existing UJIIndoorLoc dataset [33] is employed.

The key issues with the offline phase of such Wi-Fi fingerprint-based indoor positioning systems are the high variations in Wi-Fi signals over time, fading signals due to multipath propagation caused by hurdles, people walking in the area under consideration and addition/removal of Wi-Fi APs [34,35,36]. Additionally, the signal strength and resource allocation by the APs is dependent on the number of connected users. As most Wi-Fi-based positioning technologies use the existing infrastructure of the Wi-Fi network in the indoor environment, these problems cannot be avoided as they require control of the infrastructure. It is important to mention that the performance of the positioning server highly depends on the quality of the RSS fingerprint database used to train the server for position prediction [37]. Most real world data is composed of inaccurate, noisy, inconsistent, and missing data. The reasons for the existence of such data could be technological problems stemming from gadgets that gather data, a human mistake during data entry and much more. The collected data cannot be used directly for training the server. Some specified machine learning and deep learning models for the server need information in a specified format, for example, the random forest algorithm does not support null values, and therefore, execution of the random forest algorithm null values has to be managed from the original raw dataset. To solve this problem, data pre-processing is performed. Data pre-processing is an important part of data science [38]. It includes the two concepts “data cleaning” and “feature engineering”. These two are compulsory for achieving better accuracy and performance in the machine learning and deep learning algorithms. Data pre-processing is necessary because of the presence of unformatted real-world data. Similarly, during the online phase for Wi-Fi fingerprint-based indoor positioning, there can be some errors in position estimation for users as the test data might be affected due to limitations in the indoor environment. To address this issue, another data processing scheme can be applied and this process is termed data post-processing. Data post-processing is performed on the result data. Post-processing allows for analysis of the previous result sets in a formal manner. Data pre- and post-processing algorithms are expected to improve the performance of Wi-Fi fingerprint-based indoor positioning systems [39]. A pre-processing approach that involves removing useless APs and their respective RSS from the fingerprint database was presented in [40]. This can help in reducing the computational overhead during position estimation and improves the system performance. A data pre-processing technique was presented in [41] to remove noisy data from the RSS fingerprint database by combining Wi-Fi and geographical information system (GIS). To accommodate the heterogeneity of RSS measuring devices, a data pre-processing algorithm was presented in [42], which can scale the RSS values from various devices to a uniform range. Unlike the abovementioned server-based pre-processing algorithms, a user-based data pre-processing library employing RSS filtering was presented for android-based devices in [43]. To the best of our knowledge, there is no data pre-processing algorithm developed for filling in missing values in the Wi-Fi RSS fingerprint database, which is expected to improve the system performance. Similarly, a brief survey on post-processing algorithms for machine learning and data mining was presented in [44]. However, to the best of our knowledge, there is no post-processing algorithm available for deep learning-based classifiers for Wi-Fi fingerprint-based indoor positioning.

In order to solve the above problems with Wi-Fi fingerprinting, much research has been conducted on algorithms for data pre-processing to remove noisy and missing data, and many techniques for improving the classification server have been developed. However, conventional techniques that have already been developed still have limitations in securing high accuracy, reliable success percentage and compensation for environmental changes that cause performance degradation. To the best of our knowledge, data post-processing has not been described for Wi-Fi fingerprint-based indoor positioning in the literature. There is a need for the development of an algorithm which can efficiently tackle the above-mentioned challenges in real-time using the existing infrastructure of the indoor environment.

In this work, we present a Wi-Fi fingerprint-based indoor positioning system using a deep learning classifier. The main contributions of this work are as follows:

A data pre-processing algorithm to compensate for impairments in the collected RSS fingerprint database;
A data post-processing algorithm to enhance the system performance by limiting the effects of the indoor environment on the experimental phase of the proposed system; and
Investigation of the performance of a deep learning-based classifier at the server in charge of the RSS fingerprint database storage and position prediction.

The rest of the paper is organized as follows: The proposed system model is described in Section 2. The simulation and experimental results for the proposed system are discussed in Section 3. Finally, Section 4 presents the conclusion of our work and future directions.

2. Proposed System

In this section, an overview of the proposed system is first presented. The environment and setup for simulation and experiment along with RSS fingerprint database construction is described in Section 2.2. Description of the data pre-processing algorithm is presented in Section 2.3. The deep learning classifier and data post-processing algorithm are described in Section 2.4 and 2.5, respectively.

2.1. Overview of the Proposed System

In this work, we propose a Wi-Fi fingerprint-based indoor positioning system equipped with data pre- and post-processing algorithms. Figure 2 depicts the actual two-phase indoor positioning service delivered by the proposed system. The proposed system is a server-based and user-active indoor positioning system. A high-performance positioning server is in charge of database storage and position prediction (classification task) by employing deep learning.

The proposed system consists of two phases: an offline and an online phase. Firstly, in the offline phase, the database is constructed by mapping the RSS measurements at each reference point (RP) through a smartphone. The database in the offline phase of Figure 2 consists of two kinds of database. One is training and another is trial database. The training database is used in the offline phase for training the deep learning classifier. The trial database is used to perform the offline test simulation to evaluate the performance of the deep learning classifier. Both training and trial databases are pre-processed using the proposed pre-processing algorithm. The pre-processed databases are then used to train the deep learning classifier, which turns to be the trained classifier in the online phase. Secondly, in the online phase of Figure 2, five real-time measurements at unknown user positions are pre-processed to fill the missing values. The pre-processed measurements are then passed to the trained classifier to determine the user location. The trained classifier returns five decisions for the user’s position which are then passed to the data-post processing algorithm. The post-processing algorithm selects the most frequent decision for user position based on the “Majority-rule”. Finally, the decision on user location is passed to the user’s smartphone.

2.2. Environment and Setup

To implement the proposed system model, RSS fingerprint database collection and experimental verification of the simulation data are both performed on the 7th floor of the New Engineering Building at Dongguk University, Seoul, Korea. As shown in Figure 3, the 52 × 32 square meters target area is divided into 74 RPs with an interval of 2 m between two consecutive RPs. The pre-installed anonymous APs on the 7th floor are used in this work. In Figure 3, the location of the APs is unknown to the positioning server. The main motive is to use the pre-installed infrastructure of Wi-Fi APs in the building for indoor localization. The positioning server is a Dell Alienware model P31e (Alienware, hardware subsidiary of Dell, Miami, FL, USA). A Samsung SM-E310K smartphone is used to measure the RSS value from all available APs at each RP.

On the software side, construction of the RSS fingerprint database, data pre-processing, deep learning classifier, data post-processing, and online experiment program are all performed in Python and the tensorflow platform.

To collect the RSS fingerprint database for the above environment, a smartphone is used to measure the RSS value from all available APs at each RP. Each measurement contains the RP label, time, date, number of available APs, MAC address and respective RSS value for each AP at every RP on the floor map. This measurement, termed the fingerprint, is then transmitted to the positioning server. Five RSS fingerprints are collected in forward as well as in backward directions at each RP. The sampling time for each RSS measurement is 5 s, which means 25 s for total five measurements. For 74 reference points, the total time consumed 31 min for each direction and 62 min for both directions. For the training data, the measurements are taken in the morning, afternoon and evening for seven days, which results in 22 h approximately. For the trial data, the measurements are taken for two days in the same manner. Therefore, the time consumption is 6.5 h, approximately. All these fingerprints are saved as training and trial files in .csv format in the positioning server, as the deep learning classifier at the server is designed to use .csv format files as input. At each RP, several APs are present with unique MAC addresses. In the .csv file, the MAC addresses are added on the first row at the header of the file. If the MAC address is already present, then the RSS value is added to the corresponding column. If the MAC address is not present, the value is added to a new column. For 74 RPs, the number of columns can, therefore, be more or less than 256. The number 256 would be the maximum corresponding to the input size of the deep learning classifier. Therefore, the training and trial .csv files must contain 257 columns, where 256 columns are used to save the RSS fingerprint while 1 column is used to indicate the RP label. After generating the database, the number of the column in both training and trail .csv files are checked. If the number is less than 256, the remaining columns are filled with “0 s”. Otherwise, extra columns over 256 are discarded, starting from the AP with the minimum number of RSS fingerprints. This collected RSS fingerprint database is then passed to the data pre-processing section as shown in Figure 2.

2.3. Pre-Processing Algorithm

As the collected dataset contains RSS fingerprints recorded at 74 RPs on the floor map, 256 APs are present in the RSS fingerprint database. However, for each individual RP measurement, the number of APs is around 20 on average, so the remaining spaces are filled with "0 s” while the database is prepared. As discussed earlier, some algorithms like random forest cannot perform well with null values, so zero values should be removed. To accomplish this task, a data pre-processing algorithm is proposed in this work. The flow chart for the proposed data pre-processing algorithm is depicted in Figure 4.

The pre-processing algorithm is designed to fill in the missing values previously recorded as “0 s” in the RSS fingerprint database for each RP measurement with the average values for each AP at 74 RPs. It is important to mention here that the average is taken only for the non-zero RSS values for each AP at 74 RPs. The steps followed in the pre-processing algorithm are below:

Step 1: Calculate the average of all non-zero values in each column (representing one AP) of the Training RSS fingerprint database (Training.csv);
Step 2: Check the training data file for "0” values;
Step 3: Replace zero with the average value from step1 at the corresponding row and column in the Training RSS fingerprint database; and
Step 4: Repeat step 1 and step 3 for the trial fingerprint data (Trial.csv). (Note: the average of the training RSS fingerprint database is replaced for "0” values in Trial.csv).

The motive behind the pre-processing is to replace the missing values (“0 s”) in the database with a certain computed value which is relatively closer to the RSS fingerprints available in the database. It would be helpful for the deep learning classifier to operate with certain RSS values rather than zeros. Since the number of RSS fingerprints of 15,540 in the Training.csv is larger than 60 in the Trial.csv, the average value of the Training.csv replaces the missing values (“0s”) in the Trail.csv. In this manner, the missing values in the RSS fingerprint database are replaced with a computed average value and the database is ready to be used for training the deep learning classifier.

2.4. Deep Learning Classifier

In this work, we employed a deep learning-based classifier to predict user position. To deal with the variant and unpredictable Wi-Fi signals, the positioning is cast in a convolutional neural network (CNN) structure that is capable of learning reliable features from a large set of noisy samples. The database required for the simulation and experiments is collected from the real world on different days and times to mimic the actual environment. The proposed positioning system is summarized in Figure 2. For the offline CNN training, a four-layer deep neural network (DNN)-based coarse localizer is trained to extract reliable features from a massive noisy RSS fingerprint database. The deep learning classifier consists of three convolutional layers (CL) and one fully-connected (FC) layer. A SoftMax layer is used for multi-class classification. The architecture for the deep learning classifier is depicted in Figure 5. To reduce the training time, the number of neurons decreases as we go deeper in the network. The .csv file is converted to input image of size 16 × 16 by using tf.reshape in tensorflow [45]. First, the CNN learns reliable high-level features automatically from a large set of widely fluctuating RSS fingerprints and avoids hand-engineering. Second, this structure is capable of learning useful features directly from labelled data. Third, the CNN-based estimator is more robust as it predicts the position utilizing the extracted high level features. Moreover, the structure has an advantage in handling massive RSS fingerprints since the prediction at the positioning stage does not rely on any searches in sample space but requires only the forward evaluation of a trained feed-forward neural network. For the online positioning, the pre-processed real-time RSS measurements at target RPs are fed into the classifier to obtain a final position estimate. The parameters for the deep learning classifier are summarized in Table 1.

2.5. Post-Processing Algorithm

After analysis of the output data, one other data processing scheme can be performed, which is termed data post-processing. The output data for the Wi-Fi fingerprint-based indoor positioning system needs to be post-processed as it might contain wrong decisions regarding the user location. For example, the output data for a Wi-Fi fingerprint-based indoor positioning system may contain the wrong target location as the environmental conditions or the signal strength of the APs may vary over time. This can cause the classifier of the Wi-Fi fingerprint-based indoor position system to select the wrong user location, hence reducing the success percentage and performance of the system. The post-processing algorithm can identify such variations in the output data of the Wi-Fi fingerprint-based indoor positioning system and compensate for imperfections in the indoor environment, such as variations in the signal strength level of Wi-Fi signals over time, lower levels of the RSS caused by multipath fading, changes in the Wi-Fi infrastructure due to addition or removal of Wi-Fi APs and hindrance created in the signal path due to people present in the vicinity of the user location. Post-processing eliminates wrong decision outputs from the indoor positioning system resulting from the above impairments in the indoor environment.

A data post-processing algorithm is proposed in this work to perform this task on the output data for the Wi-Fi fingerprint-based indoor positioning system. Figure 6 shows the steps used to implement the proposed post-processing algorithm. The proposed post-processing algorithm allows analysis of the classifier’s result sets in a formal manner. The algorithm employs “majority rule” to select the most frequent decision on the user’s position in the result data for the deep learning classifier. Following are the steps for the post-processing algorithm:

Step 1: At every RP, five test RSS measurements are taken with the smartphone in forward as well as backward directions. The positioning server returns 5 predicted results for each RP.
Step 2: The algorithm for post-processing computes the most frequent value among the five predictions by employing the “Majority rule”.
Step 3: The most frequent value found during Step 2 is then declared as the final decision for the user location as RP number. The flow graph for the post processing algorithm is shown in Figure 6.

3. Numerical Results

3.1. Simulation Results

The performance of the proposed system is quantified in terms of the training accuracy and success percentage. Training accuracy is the percentage of correct RP predictions without any margin of error when RSS database samples are reused as input to the trained classifier model [7]. The training accuracy for the positioning classifier is given as:

Training accuracy = \frac{N o . o f c o r r e c t p r e d i c t i o n s}{T o t a l n o . o f p r e d i c t i o n s} \times 100

(1)

Success is the percentage of correct position prediction with a margin of error of 2 RPs when the trial database is used as input [46]. As there are 74 RPs in the database, therefore, the total number of samples is 74. The success count is incremented with every correct prediction. The success percentage for the classifier is given as:

Success percentage = \frac{S u c c e s s c o u n t}{T o t a l n o . o f s a m p l e s} \times 100

(2)

As mentioned earlier, the interval between two consecutive RPs is 2 m, so the precision of success is 4 m in this work. The simulation for the proposed Wi-Fi fingerprint-based indoor positioning system is carried out for the raw database as well as the pre-processed database.

To stress the need for data pre-processing, Figure 7a shows five RSS fingerprints for 20 APs taken at RP-1. It can be seen clearly that for several APs, there is missing data filled with "0 s” at the RSS fingerprint database construction stage. This results in a non-uniform RSS fingerprint at every RP. To solve this issue, the pre-processing algorithm is employed which can replace the "0 s” in the RSS fingerprint database with computed average values for each AP. Figure 7b shows the impact of data pre-processing on the RSS fingerprint database. It can be seen clearly that by filling in the missing values using the pre-processing algorithm, the RSS fingerprints at RP-1 become more uniform and, thus, the probability of error at the training stage of the classifier, as well as in simulation and experimental results can be significantly reduced. The simulation results for the raw database and pre-processed database are presented in Table 2. With pre-processing, a gain of 1.5% can be achieved using the data pre-processing algorithm. The simulation results are depicted in Figure 8. The data points used to generate the graphs of Figure 7 and Figure 8 are available at [47]. Concerning the computation overhead for pre-processing, it takes 6.5 s for the training and trial database to be pre-processed for the simulation. Even if such large overhead occurs due to the database size, the system performance is not affected since it only happens in the offline phase.

It is important to note that by employing the pre-processing algorithm, there is not only a gain in the success percentage of the system, but also the success percentage remains constant as the number of training epochs increase. Such consistency in success percentage is vital for the experiment as the trained classifier .meta file is used in the experiment. For the raw database, the success percentage decreases as the number of epochs increase. This degradation can be explained by looking at the training accuracy. As the training accuracy increases, the phenomenon of overfitting occurs. Overfitting happens when a model learns the detail and noise in the training database to the extent that it negatively impacts the performance of the model on trial data. It can be seen clearly in Figure 8 that the training accuracy of the pre-processed database is lower than that of the raw database. This helps to avoid overfitting of training data and, thus, results in a consistent success rate with an increasing number of training epochs.

3.2. Experiment Results

Since the pre-processed RSS fingerprint database shows better performance in the simulation, the trained model .meta file is saved for the epoch with the highest success percentage and used for the online real-time experiment. In the experiment, a smartphone carried by the user captures the real-time RSSs for available APs and sends them with MAC addresses to the positioning server. The server pre-processes the RSS fingerprint and employs the trained classifier model to predict the user position with the received RSS fingerprint. In the online phase, the computation overhead is negligible about 0.4 ms for each real-time RSS measurement. For five real-time RSS measurements, a total computation overhead of 2 ms is needed. To verify the simulation results, we conducted 10 experiments at different times and days to include the effect of environment change on the proposed system. In each experiment, the user carried a smartphone from RP-1 to RP-74, termed the “forward direction” and then from RP-74 to RP-1, termed the “backward direction”. At each RP, the RSS fingerprint measurement and position prediction are both performed five times continuously. The post-processing algorithm is applied to the five predictions received from the positioning server to determine the most frequent value by employing the “majority rule” to predict the user position. The post-processing introduces a negligible computation overhead of 0.03 ms with the majority rule. The detailed experiment results are presented in Table 3.

The experiment results obtained without employing the post-processing algorithm shows the highest success rate of 86.61%, which is 2.35% lower than the simulation results. One reason for this difference is that the changing of environments can result in different propagation paths for Wi-Fi signals and hence greatly affects the RSS fingerprints of APs at each RP. The real-time experiment was performed months after the RSS fingerprint database construction, and the test database used for simulation is split from the training database. To solve this issue, we employed the post-processing algorithm described in this work. The post-processing algorithm can mitigate the effects of environment changes by discarding incorrect decisions regarding user position and selecting only the most frequent decisions. As shown in Figure 9, the post-processing algorithm can improve the results from 9.05–10.94% for the conducted experiments. Furthermore, variations between multiple experiments are reduced from 5.69% to 4.73% by employing the post-processing algorithm. Other approaches to solve such environment variations will be included in future research.

4. Conclusions

In this work, we propose a server-based user-active Wi-Fi fingerprint-based indoor positioning system using a deep learning classifier. The proposed system is equipped with data pre- and post-processing algorithms to reduce the degradation in performance caused due to limitations in the RSS fingerprint database and the indoor environment. Programs for fingerprint database construction using smartphones, data pre-processing, deep learning classifier, data post-processing, and online real-time experiment are developed. The proposed system is implemented and examined in a real-time indoor environment with 74 target locations. The simulation results show that the pre-processing algorithm can efficiently fill in missing RSS fingerprints in the database, resulting in a success rate of 88.96% with the error margin of 2 RPs. The proposed system equipped with a data post-processing algorithm is further verified with online real-time experiments. The proposed system is able to provide an indoor positioning precision of 4 m with a 95.94% success rate. Future work will focus on finding the reasons for the gap between the simulation and real-time experiment, and methods to increase the positioning performance of the proposed system.

Author Contributions

A.H., Y.W., and S.-H.H. contributed to the main idea of this research work. A.H. performed the simulations, experiments and database collection. S.L. performed experiments for the work. The research activity was planned and executed under the supervision of S.-H.H. A.H., and S.-H.H. contributed to the writing of this article.

Funding

This work was supported by the Technology Innovation Program (or Industrial Strategic Technology Development Program-Industrial site core technology Early development project) (10084909, Development of improved RTLS engine and IoT tag using machine learning technique reflecting real environments) funded by the Ministry of Trade, Industry, and Energy (MOTIE, Korea)

Acknowledgments

The authors would like to offer their sincere gratitude to Mr. Sang Moon Lee, CTO of JMP Systems, Korea, for providing the equipment for database collection and setting up the experimental environment.

Conflicts of Interest

The authors declare no conflicts of interest regarding the publication of this article.

References

Basri, C.; El Khadimi, A. Survey on indoor localization system and recent advances of WiFi fingerprinting technique. In Proceedings of the 5th International Conference on Multimedia Computing and Systems (ICMCS), Marrakech, Morocco, 29 September–1 October2016; pp. 253–259. [Google Scholar]
Zhuang, Y.; Li, Y.; Qi, L.; Lan, H.; Yang, J.; El-Sheimy, N. A two-filter integration of MEMS sensors and Wi-Fi fingerprinting for indoor positioning. IEEE Sens. J. 2016, 13, 5125–5126. [Google Scholar] [CrossRef]
Zou, H.; Chen, Z.; Jiang, H.; Xie, L.; Spanos, C. Accurate indoor localization and tracking using mobile phone inertial sensors, Wi-Fi and iBeacon. In Proceedings of the IEEE International Symposium on Inertial Sensors and Systems (INERTIAL), Kauai, HI, USA, 27–30 March 2017; pp. 1–4. [Google Scholar]
Zhang, P.; Zhao, Q.; Li, Y.; Niu, X.; Zhuang, Y.; Liu, J. Collaborative WiFi fingerprinting using sensor-based navigation on smartphones. Sensors 2015, 7, 17534–17557. [Google Scholar] [CrossRef]
He, S.; CHAN, S.-H.G. Wi-Fi fingerprint-based indoor positioning: Recent advances and comparisons. IEEE Commun. Surv. Tutorials 2016, 18, 466–490. [Google Scholar] [CrossRef]
Hantoush, R. Evaluating Wi-Fi indoor positioning approaches in a real world environment. Ph.D. Thesis, Universidade NOVA de Lisboa, Lisbon, Portugal, 2016. [Google Scholar]
Xia, S.; Liu, Y.; Yuan, G.; Zhu, M.; Wang, Z. Indoor fingerprint positioning based on Wi-Fi: An overview. ISPRS Int. J. Geo-Inf. 2017, 6, 135. [Google Scholar] [CrossRef]
Bozkurt, S.; Elibol, G.; Gunal, S.; Yayan, U. A comparative study on machine learning algorithms for indoor positioning. In Proceedings of the International Symposium on Innovations in Intelligent Systems and Applications (INISTA), Madrid, Spain, 2–4 September 2015; pp. 1–8. [Google Scholar]
Mascharka, D.; Manley, E. Machine learning for indoor localization using mobile phone-based sensors. Available online: https://arxiv.org/abs/1505.06125 (accessed on 25 November 2018).
Zou, H.; Lu, X.; Jiang, H.; Xie, L. A fast and precise indoor localization algorithm based on an online sequential extreme learning machine. Sensors 2015, 1, 1804–1824. [Google Scholar] [CrossRef] [PubMed]
Adege, A.; Lin, H.P.; Tarekegn, G.; Jeng, S.S. Applying Deep Neural Network (DNN) for Robust Indoor Localization in Multi-Building Environment. Appl. Sci. 2018, 7, 1062. [Google Scholar] [CrossRef]
Kodippili, N.S.; Dileeka, D. Integration of fingerprinting and trilateration techniques for improved indoor localization. In Proceedings of the Seventh International Conference on Wireless and Optical Communication Networks (WOCN), Colombo, Sri lanka, 6–8 September 2010; pp. 1–6. [Google Scholar]
Chan, S.; Sohn, G. Indoor localization using Wi-Fi based fingerprinting and trilateration techniques for LBS applications. In Proceedings of the 7th International Conference on 3D Geoinformation, Quebec, QC, Canada, 16–17 May 2012. [Google Scholar]
Suining, H.; Tianyang, H.; Chan, S.-H.G. Contour-based trilateration for indoor fingerprinting localization. In Proceedings of the 13th ACM Conference on Embedded Networked Sensor Systems, Seoul, South Korea, 1–4 November 2015; pp. 225–238. [Google Scholar]
Zayets, A.; Steinbach, E. Robust Wi-Fi-based indoor localization using multipath component analysis. In Proceedings of the IEEE International Conference on Indoor Positioning and Indoor Navigation (IPIN), Sapporo, Japan, 18–21 September 2017; pp. 1–8. [Google Scholar]
Laitinen, H.; Juurakko, S.; Lahti, T.; Korhonen, R.; Lahteenmaki, J. Experimental Evaluation of Location Methods Based on Signal-Strength Measurements. IEEE Trans. Veh. Technol. 2007, 56, 287–296. [Google Scholar]
Chen, K.; Wang, C.; Yin, Z.; Jiang, H.; Tan, G. Slide: Towards fast and accurate mobile fingerprinting for Wi-Fi indoor positioning systems. IEEE Sens. J. 2018, 3, 1213–1223. [Google Scholar] [CrossRef]
Shu, Y.; Huang, Y.; Zhang, J.; Coué, P.; Cheng, P.; Chen, J.; Shin, K.G. Gradient-based fingerprinting for indoor localization and tracking. IEEE Trans. Ind. Electron. 2016, 63, 2424–2433. [Google Scholar] [CrossRef]
Fang, S.H.; Cheng, Y.C.; Chien, Y.R. Exploiting Sensed Radio Strength and Precipitation for Improved Distance Estimation. IEEE Sens. J. 2018, 18, 6863–6873. [Google Scholar] [CrossRef]
Zhuang, Y.; Syed, Z.; Li, Y.; El-Sheimy, N. Evaluation of two WiFi positioning systems based on autonomous crowdsourcing of handheld devices for indoor navigation. IEEE Trans. Mob. Comput. 2016, 15, 1982–1995. [Google Scholar] [CrossRef]
Chen, L.; Li, B.; Zhao, K.; Rizos, C.; Zheng, Z. An improved algorithm to generate a Wi-Fi fingerprint database for indoor positioning. Sensors 2013, 8, 11085–11096. [Google Scholar] [CrossRef] [PubMed]
Pivato, P.; Palopoli, L.; Petri, D. Accuracy of RSS-Based Centroid Localization Algorithms in an Indoor Environment. IEEE Trans. Instrum. Meas. 2011, 60, 11085–11096. [Google Scholar] [CrossRef]
Joaquin, T.S.; Antonio, R.J.; Adriano, M.; Tomás, L.; Lu, M.C.; Knauth, S.; Germán, M.M.S.; Fernando, S.; Antoni, P.N.; Maria, J.N.; et al. Off-Line Evaluation of Mobile-Centric Indoor Positioning Systems: The Experiences from the 2017 IPIN Competition. Sensors 2018, 18, 1–27. [Google Scholar]
Liqun, L.; Jacky, S.; Chunshui, Z.; Thomas, M.; Lin, J.H.; Feng, Z. Experiencing and handling the diversity in data density and environmental locality in an indoor positioning service. In Proceedings of the 20th Annual International Conference on Mobile Computing and Networking (Mobicon), Maui, HI, USA, 7–11 September 2014; pp. 459–470. [Google Scholar]
Ling, G.; Lei, G.; Nour, E.D.; Chengwu, L. Statistical Machine Learning vs Deep Learning in Information Fusion: Competition or Collaboration? In Proceedings of the IEEE Conference on Multimedia Information Processing and Retrieval (MIPR), Miami, FL, USA, 10–12 April 2018; pp. 251–256. [Google Scholar]
Zheng, Y. Methodologies for Cross-Domain Data Fusion: An Overview. IEEE Trans. Big Data 2015, 1, 16–34. [Google Scholar] [CrossRef]
Qingchen, Z.; Laurence, T.Y.; Zhikui, C. Deep Computation Model for Unsupervised Feature Learning on Big Data. IEEE Trans. Serv. Comput. 2016, 9, 161–171. [Google Scholar]
Jedari, E.; Wu, Z.; Rashidzadeh, R.; Saif, M. Wi-Fi based indoor location positioning employing random forest classifier. In Proceedings of the IEEE International Conference on Indoor Positioning and Indoor Navigation (IPIN), Banff, AB, Canada, 13–16 October 2015; pp. 1–5. [Google Scholar]
Gomes, R.; Ahsan, M.; Denton, A. Random Forest Classifier in SDN Framework for User-Based Indoor Localization. In Proceedings of the IEEE International Conference on Electro/Information Technology (EIT), Rochester, MI, USA, 3–5 May 2018; pp. 537–542. [Google Scholar]
Nuño-Maganda, M.; Herrera-Rivas, H.; Torres-Huitzil, C.; Marisol Marín-Castro, H.; Coronado-Pérez, Y. On-Device Learning of Indoor Location for Wi-Fi Fingerprint Approach. Sensors 2018, 7, 2202. [Google Scholar] [CrossRef]
Wang, X.; Gao, L.; Mao, S.; Pandey, S. DeepFi: Deep learning for indoor fingerprinting using channel state information. In Proceedings of the IEEE Wireless Communications and Networking Conference (WCNC), New Orleans, LA, USA, 9–12 March 2015; pp. 1666–1671. [Google Scholar]
Nowicki, M.; Wietrzykowski, J. Low-Effort Place Recognition with WiFi Fingerprints Using Deep Learnin; Springer: Cham, Switzerland, 2017; pp. 575–584. [Google Scholar]
Torres-Sospedra, J.; Montoliu, R.; Martínez-Usó, A.; Avariento, J.P.; Arnau, T.J.; Benedito-Bordonau, M.; Huerta, J. UJIIndoorLoc: A new multi-building and multi-floor database for WLAN fingerprint-based indoor localization problems. In Proceedings of the IEEE International Conference on Indoor Positioning and Indoor Navigation (IPIN), Busan, South Korea, 27–30 October 2014; pp. 261–270. [Google Scholar]
Kudo, T.; Miura, J. Utilizing Wi-Fi signals for improving SLAM and person localization. In Proceedings of the IEEE/SICE International Symposium on System Integration (SII), Taipei, Taiwan, 11–14 December 2017; pp. 487–493. [Google Scholar]
Depatla, S.; Muralidharan, A.; Mostofi, Y. Occupancy estimation using only WiFi power measurements. IEEE J. Sel. Areas Commun. 2015, 33, 1381–1393. [Google Scholar] [CrossRef]
Cai, X.; Li, X.; Yuan, R.; Hei, Y. Identification and mitigation of NLOS based on channel state information for indoor Wi-Fi localization. In Proceedings of the IEEE International Conference on Wireless Communications & Signal Processing (WCSP), Nanjing, China, 15–17 October 2015; pp. 1–5. [Google Scholar]
Dey, S.; Chugg, K.M.; Beerel, P.A. Morse Code Datasets for Machine Learning. In Proceedings of the 9th International Conference on Computing, Communication and Networking Technologies (ICCCNT), Banglore, India, 10–12 July 2018; pp. 1–7. [Google Scholar]
Gharatkar, S.; Ingle, A.; Naik, T.; Save, A. Review pre-processing using data cleaning and stemming technique. In Proceedings of the IEEE International Conference on Innovations in Information, Embedded and Communication Systems (ICIIECS), Coimbatore, India, 17–18 March 2017; pp. 1–4. [Google Scholar]
Gu, Y.; Lo, A.; Niemegeers, I. A survey of indoor positioning systems for wireless personal networks. IEEE Commun. Surv. Tutor. 2009, 11, 13–32. [Google Scholar] [CrossRef]
Eisa, S.; Peixoto, J.; Meneses, F.; Moreira, A. Removing useless APs and fingerprints from WiFi indoor positioning radio maps. In Proceedings of the IEEE International Conference on Indoor Positioning and Indoor Navigation (IPIN), Montbeliard-Belfort, France, 28–31 October 2013; pp. 1–7. [Google Scholar]
JoshvaDevadas, T.; Seelammal, C.; Sadasivam, S. On data cleaning with intelligent agents to improve the accuracy of wi-fi positioning system using gis. Asian J. Sci. Res. 2013, 6, 53–66. [Google Scholar] [CrossRef]
Song, C.; Wang, J. WLAN Fingerprint Indoor Positioning Strategy Based on Implicit Crowdsourcing and Semi-Supervised Learning. ISPRS Int. J. Geo-Info. 2017, 6, 356. [Google Scholar] [CrossRef]
Bogdándy, B. Wi-Fi RSSI pre-processing library for Android. In Proceedings of the 19th International Carpathian Control Conference (ICCC), Szilvasvarad, Hungary, 28–31 March 2018; pp. 649–654. [Google Scholar]
Bruha, I.; Famili, A. Postprocessing in machine learning and data mining. ACM SIGKDD Explor. Newslett. 2000, 2, 110–114. [Google Scholar] [CrossRef]
Tf.reshape.Tensorflow.org. Available online: https://www.tensorflow.org/api_docs/python/tf/reshape (accessed on 28 December 2018).
Liu, H.; Darabi, H.; Nanerjee, P.; Liu, J. Survey of wireless indoor positioning techniques and systems. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 2007, 37, 1067–1080. [Google Scholar] [CrossRef]
Hwang, S.H. Pre- and Post-Processing Algorithms with Deep Learning Classifier for Wi-Fi Fingerprint-Based Indoor Positioning. Available online: https://zenodo.org/record/2556161#.XFa40JCP600 (accessed on 3 February 2019).

Figure 1. Illustration of Wi-Fi fingerprint-based indoor positioning system.

Figure 2. Proposed Wi-Fi fingerprint-based indoor positioning system equipped with data pre- and post-processing modules.

Figure 3. Environment setup for floor map with 74 reference points.

Figure 4. Flow graph for the proposed pre-processing algorithm.

Figure 5. Network architecture for the deep learning classifier.

Figure 6. Flow graph for the proposed post-processing algorithm.

Figure 7. Five RSS fingerprints for 20 APs at RP-1 (a) raw database without pre-processing (b) database with pre-processing.

Figure 8. Simulation results for RSS fingerprint database with and without pre-processing.

Figure 9. Comparison between the experimental results with and without post-processing.

Table 1. Summary of parameters used in the deep learning classifier.

Parameters	Value
No. of layers	4
Type of layers	3-CL
Type of layers	1-FC
Input image size	16 × 16 = 256
No. of Neurons	Layer1: 8192
	Layer2: 4096
	Layer3: 2048
	Layer4: 1024
Batch size	1000
Learning rate	0.001
Training Epoch	2000
Training probability	0.5
Inner-width	1024
Dropout rate	0.5
Location tolerance	2
Regularization (β)	0.01
nb_classes (SoftMax size)	74

Table 2. Summary of simulation results.

Simulation Type	Training Accuracy	Success Percentage
Without Pre-processing	83.98	87.46
With Pre-processing	60.96	88.96

Table 3. Summary of experimental results.

Experiment Index	Time	Direction	Success % Without Post-Processing	Success % With Post-Processing	Gain %
1	Morning	Forward	88.64	97.29
		Backward	84.59	94.59
		Average	86.61	95.94	9.33
2	Afternoon	Forward	86.75	95.94
		Backward	83.78	94.54
		Average	85.26	95.24	9.98
3	Evening	Forward	84.05	94.59
		Backward	86.75	94.59
		Average	85.40	94.59	9.19
4	Morning	Forward	82.43	93.24
		Backward	84.05	94.59
		Average	83.24	93.91	10.67
5	Afternoon	Forward	84.83	94.59
		Backward	82.47	94.59
		Average	83.65	94.59	10.94
6	Evening	Forward	84.05	93.24
		Backward	85.68	94.59
		Average	84.86	93.91	9.05
7	Morning	Forward	82.61	91.89
		Backward	81.74	93.24
		Average	82.17	92.56	10.39
8	Afternoon	Forward	81.58	90.54
		Backward	82.33	91.89
		Average	81.95	91.21	9.26
9	Evening	Forward	80.54	90.54
		Backward	81.31	91.89
		Average	80.92	91.21	10.29
10	Morning	Forward	83.59	93.24
		Backward	84.55	94.59
		Average	84.07	93.91	9.84

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Haider, A.; Wei, Y.; Liu, S.; Hwang, S.-H. Pre- and Post-Processing Algorithms with Deep Learning Classifier for Wi-Fi Fingerprint-Based Indoor Positioning. Electronics 2019, 8, 195. https://doi.org/10.3390/electronics8020195

AMA Style

Haider A, Wei Y, Liu S, Hwang S-H. Pre- and Post-Processing Algorithms with Deep Learning Classifier for Wi-Fi Fingerprint-Based Indoor Positioning. Electronics. 2019; 8(2):195. https://doi.org/10.3390/electronics8020195

Chicago/Turabian Style

Haider, Amir, Yiqiao Wei, Shuzhi Liu, and Seung-Hoon Hwang. 2019. "Pre- and Post-Processing Algorithms with Deep Learning Classifier for Wi-Fi Fingerprint-Based Indoor Positioning" Electronics 8, no. 2: 195. https://doi.org/10.3390/electronics8020195

APA Style

Haider, A., Wei, Y., Liu, S., & Hwang, S.-H. (2019). Pre- and Post-Processing Algorithms with Deep Learning Classifier for Wi-Fi Fingerprint-Based Indoor Positioning. Electronics, 8(2), 195. https://doi.org/10.3390/electronics8020195

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Pre- and Post-Processing Algorithms with Deep Learning Classifier for Wi-Fi Fingerprint-Based Indoor Positioning

Abstract

1. Introduction

2. Proposed System

2.1. Overview of the Proposed System

2.2. Environment and Setup

2.3. Pre-Processing Algorithm

2.4. Deep Learning Classifier

2.5. Post-Processing Algorithm

3. Numerical Results

3.1. Simulation Results

3.2. Experiment Results

4. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI