Discriminative Learning-based Smartphone Indoor Localization

SUBMITTED T O IEEE TRANSACTIONS ON MOBILE COMPUTING 1 Discr iminativ e Lear ning-based Smar tphone Indoor Localization Jos ´ e Luis Carrera V ., Member , IEEE, Zhongliang Zhao , Member , IEEE, T orsten Braun, Member , IEEE Haiyong Luo , Member , IEEE, F ang Zhao Abstract —Due to the growing area of ubiquitous mobile applications , indoor localization of smar tphones has become an interesting research topic. Most of the current indoor localization systems rely on intensive site surve y to achie ve high accur acy . In this w ork, we propose an efﬁcient smar tphones indoor localization system that is able to reduce the site surve y eff or t while still achie ving high localization accuracy . Our system is built b y fusing a variety of signals, such as Wi-Fi receiv ed signal strength indicator, magnetic ﬁeld and ﬂoor plan information in an enhanced par ticle ﬁlter . T o achiev e high and stable performance, we ﬁrst apply discriminativ e learning models to integrate Wi-Fi and magnetic ﬁeld readings to achie ve room lev el landmar k detection. Further, we integ rate landmark detection, range-based localization models, with a g raph-based discretized system state representation. Because our approach requires only discriminative learning-based room lev el landmark detections, the time spent in the learning phase is signiﬁcantly reduced compared to traditional Wi-Fi ﬁngerpr inting or landmark-based approaches. W e conduct e xperimental studies to ev aluate our system in an ofﬁce-like indoor en vironment. Experiment results show that our system can signiﬁcantly reduce the learning efforts, and the localization method can achiev e performance with an aver age localization error of 1.55 meters. Index T erms —Discriminative learning, likelihood estimation, par ticle ﬁlter , indoor localization, landmar k detection. F 1 I N T R O D U C T I O N N O wadays, location information has become a transcen- dent part of the daily life of mobile users. The gr owing area of ubiquitous computing technology has increased the demand of location-based services. The main goal of indoor localization is to supply to a mobile device a mechanism to locate itself and provide this information to users. However , contrary to Global Positioning System (GPS) for outdoor environments, currently there is not a simple and accurate solution for indoor localization. T ypically , indoor position- based services requir e higher localization accuracy than outdoor services. Additionally , the often limited compu- tation and power resour ces on mobile devices put strong constraints on the algorithmic complexity design of solu- tions. Thus, indoor localization is still considered an open challenging pr oblem. W ith the availability of more embedded sensors on mobile devices, numer ous indoor localization techniques (i.e, ultrasonic, radio frequency tags (RFID), magnetic ﬁeld, W i-Fi, Bluetooth, etc.) have been proposed in recent years. However , due to the wide availability of W i-Fi signals in indoor environments, W i-Fi radio-based localization has attracted most attention among differ ent localization ap- proaches. W i-Fi radio-based localization r elies on measur ed radio parameters, such as signal power to estimate the absolute positioning of targets [2]. W i-Fi received signal strength indicator (RSSI) is the most widely used parameter • J. Carrera, Z. Zhao, and T . Braun are with the Institute of Computer Science, University of Bern, Switzerland; H. Luo is with the Institute of Computing T echnology , Chinese Academy of Science, China; F . Zhao is with the School of Software Engineering, Beijing University of Post and T elecommunications, China. • Corresponding author: Z. Zhao (zhao@inf.unibe.ch) for indoor localization. Radio-based indoor localization can be classiﬁed as range-based and range-free methods. Range is deﬁned as the propagation distance from the target to Anchor Nodes (ANs). Fingerprinting is a range-free approach, which is often used because of its robustness to multi-path propaga- tion. In W i-Fi ﬁngerprinting techniques, a training or learn- ing phase is needed to build the RSSI signatur e database of f- line. In the positioning or on-line phase, the tar get location is estimated according to the results of comparing RSSI readings and the signature database. Indoor envir onments pr ovide many dif ferent ambient ra- dio signals, such as W i-Fi, Bluetooth, magnetic ﬁeld, sound, etc [10]. The set of observed ambient radio signals provides unique radio signal signatures for many points in indoor environments. These radio signal signatures can be used as landmarks to detect unique locations in the environment. Thus, the radio signal signatures are often used as the ﬁn- gerprint of the related location. However , building the ﬁn- gerprinting database of radio signals is a labor -intensive and time-consuming task. Moreover , the ﬁngerprint database is prone to environment changes. This means whenever there is a modiﬁcation of the physical indoor envir onment, the previous database will be outdated, and a new one has to be built. Therefor e, the traditional ﬁngerprinting approach is not suitable for lar ge-scale scenarios. Some localization methods have been pr oposed considering magnetic ﬁeld measurements. By using some advanced data fusion mech- anisms (i.e., particle ﬁlters), some works propose to use magnetic ﬁeld r eadings as alternative [11] or as complement to W i-Fi signatures [26] to further improve the positioning accuracy . However , due to the intrinsic disturbances caused by electrical appliances and building materials, the change of the magnetic ﬁeld with location is quite signiﬁcant in SUBMITTED T O IEEE TRANSACTIONS ON MOBILE COMPUTING 2 indoor environments. Moreover , magnetic ﬁeld ﬁngerprints can not be unique, which yields to localization errors. Indoor localization can be seen as a likelihood estima- tion problem. Ther eby , locating a target is the problem of estimating the location of a target given a set of available ob- servations [16]. T o achieve high accuracy , Kalman ﬁlters or particle ﬁlters are often used to fuse this set of observations to derive accurate localization estimations. However , ﬁlter- based data fusion normally requires intensive computation resour ces, which makes the real-time localization a challeng- ing task for smartphones with limited resources. Therefor e, optimization of the data fusion algorithm is key for efﬁcient smartphone indoor localization systems. In this work, we propose a discriminative learning-based smartphone indoor localization system aiming at reducing the site survey efforts while attaining high localization ac- curacy . Our appr oach provides localization by fusing W i- Fi radio readings, magnetic ﬁeld readings and ﬂoor plan information in an enhanced particle ﬁlter . Our approach is able to exploit historical environmental information from a learning phase as well as on-line information to provide high and stable localization accuracy . Our approach inte- grates discriminative learning-based landmark detection, range-based localization models, and a graph-based ﬂoor plan representation in an enhanced particle ﬁlter . Moreover , since our localization method exploits landmark detection with room level accuracy , we r educe the effort and time in building the learning phase compared to traditional ﬁngerprinting-based and landmark-based localization ap- proaches. Our landmark ﬁngerprint database is built by taking W i-Fi and magnetic ﬁeld measurements while walk- ing randomly through the envir onment, which requires only room labeled samples in a very short time period. W e pr ototype our appr oach on commodity smartphones, with all the localization algorithms run on the smartphone itself. T o validate our localization system, we conduct ex- tensive experiments in a complex ofﬁce-like envir onment. Evaluation results show that our approach can achieve an average localization error of 1 . 55 m with standard deviation of 0 . 73 m with signiﬁcantly reduced site survey efforts, which makes it suitable for large-scale deployments. The main contributions of this work are as follows. • W e propose an efﬁcient indoor localization system for smartphones that is able to signiﬁcantly reduce the site survey effort while still achieving high lo- calization accuracy . Since our approach r equires only room level landmark detection (i.e., room recogni- tion), the landmark ﬁngerprint database is built by collecting W i-Fi and magnetic ﬁeld measurements by simply walking randomly through the envir on- ment. Thus, we reduce cost, manpower , effort, com- plexity and survey time in building the learning phase compared to traditional ﬁngerprinting-based and landmark-based localization approaches. • W e propose an enhanced particle ﬁlter to integrate machine learning-based landmark detections, range- based localization models and a graph-based ﬂoor plan representation to pr ovide highly accurate and stable localization results. Our approach is able to fuse historical envir onmental information from a learning phase and perform on-line localization with stable localization accuracy . • W e perform a set of experiments and analysis to validate the performance of our localization method. Several conﬁguration parameters, such as the num- ber of particles as well as the number and position of W i-Fi access points are tested to show that our system can signiﬁcantly reduce the site survey ef forts while preserving high accuracy . The rest of the paper is organized as follows. In Section II we present some related work. The localization approach is reviewed in Section III. Section IV presents the implemen- tation of the terminal-based system and the experiments’ setup. Section V presents the performance evaluation results of our approach. Section VI concludes the paper . 2 R E L AT E D W O R K The current commercial and social importance of indoor location-based applications has attracted signiﬁcant atten- tion in r ecent years. Indeed, indoor localization has been investigated lately and many solutions have been presented. Radio frequency (RF) based approaches include technolo- gies such as GPS, wireless local area networks (WLAN) and RFID localization. Non-RF-based solutions include laser , au- dio, visual, ultrasonic and infrar ed sensors [8]. In [13], a non- RF-based localization approach is pr oposed. This solution estimates the user location based on surveillance cameras. Any wearable device is needed by the user . However , the calibration process for the system becomes complex. In [15] the localization system is deployed using special infrastruc- tures such as infrar ed and acoustic sensors. Despite the fact that the localization system can achieve high accuracy , building these kinds of infrastructure is very costly . Due to the high vulnerability to envir onmental disruptions, most of the non-RF-based localization approaches require demand- ing calibration processes [8]. Among RF-based indoor localization approaches, trilat- eration has been regarded as a popular solution [22] [23]. T rilateration determines the position of the target based on the distances to some anchor nodes. However , unlike outdoor localization, trilateration does not work well in indoor environments because of the presence of obstacles and room partitions [9]. Therefore, as an alternative to trilateration, the process of signal collection and association with indoor locations has become a promising approach for indoor environments. This process is called ﬁngerprinting. Fingerprinting approaches ar e conducted in two phases: the learning phase is performed off-line and the query phase that is executed on-line. The off-line phase is conducted to build or update a < signature, location > database, which consists of a set of refer ence points with known coordinates and the signatur es collected fr om available sensors. Then, the on-line phase is aimed to ﬁnd the closest match between the features of the measured signatur es and those stored in the database. [7]. RADAR [24] was the ﬁrst work that utilized W i-Fi ﬁngerprinting. In RADAR, the interest area is divided into a grid of 1x1 m. RSS measur ements are taken at each cell intersection to create the radio map database. Then in the on-line phase, RSS received is compared to the radio map database to estimate the target location. SUBMITTED T O IEEE TRANSACTIONS ON MOBILE COMPUTING 3 In addition to RF signals, the earth’s magnetic ﬁeld M F g eo and magnetic ﬁeld ﬂuctuations in indoor environ- ments can also be potentially leveraged for indoor local- ization. There are several works that show the feasibility of using anomalies of the magnetic ﬁeld to provide indoor localization [11] [26] [21] [27]. However , due to ferromag- netic material and electrical objects, magnetic signatures have many ambiguities in indoor environments (i.e., similar magnetic ﬁeld value in different locations). Therefor e, MF measurements should be fused with other indoor sensor measurements to derive accurate locations. Recently , landmark-based approaches have been proposed. Similar to ﬁngerprinting-based approaches, landmark-based approaches utilize smartphone sensors to detect unique positions in the indoor environment [10] [36]. In [10], two types of landmarks are deﬁned: seed landmarks and organic landmarks. Seed landmarks are positions that can be associated with speciﬁc objects in the environment such as stairs and elevators and are used to bootstrap the system. Organic landmarks are detected based on their unique signatur e on the sensors, and are identiﬁed by ambient radio signatures such as magnetic ﬁeld or W i-Fi. The deﬁned landmarks are used to calibrate the localization errors, which means that a car efully-designed landmark selection procedur e is the key to high performance. Although, landmark-based localization approaches can achieve high accuracy , the localization pr ocess relies heavily on the presence of numerous predeﬁned landmarks in the environment (e.g., stairs, escalators, elevators, doors). Thus, building the landmark ﬁngerprint database becomes an intensive effort demanding process. V arious machine learning-based approaches have been proposed that uses ﬁngerprinting to estimate user indoor locations. Machine learning-based indoor localization can be classiﬁed into generative or discriminative methods, which builds the model using a joint probability or con- ditional probability respectively [14] [30]. K-nearest neigh- bors (KNN) is the most basic and popular discriminative technique. Based on a similarity measure such as a dis- tance function, the KNN algorithm determines the k closest matches in the signal space to the target. Then, the loca- tion of the tar get can be estimated as the average of the coordinates of the k neighbors [25]. Generative localization methods apply statistical approaches, e.g., Hidden Markov Model [28], Bayesian Infer ence [31], Gaussian Pr ocesses [29], on the ﬁngerprint database. Thus, the accuracy can obviously be improved by adding more measurements. In [29] for instance, Gaussian Processes are used to estimate the signal propagation model through an indoor environment. There is a limited number of works that have focused in reducing off-line efforts in learning-based approaches for indoor localization [3] [4] [5]. These approaches reduce the off-line effort by r educing either the number of samples collected at each survey point or the number of survey points or both of them (i.e., reducing number of collected samples and number of survey points). Then, a generative model is applied to reinfor ce the sample collection data. In [3] for instance, a linear interpolation method is used. In [4], a Bayesian model is applied. In [5], authors propose an propagation method to generate data from collected sam- ples. In [30], authors combine characteristics of generative Figure 1: Indoor Localization System Architecture. and discriminative models in a hybrid model. Although this hybrid model reduces of ﬂine efforts, it still r elies on a number of samples collected from ﬁxed survey points (i.e., labeled samples) along the environment. Therefor e, to maintain high accuracy , the number of survey points shall be increased in larger environments. Thus, collecting samples from numer ous survey points will become a de- manding process, which makes the system unsuitable to large environments. Although generative approaches can handle the miss- ing value problem, discriminative approaches often achieve better performance. Generally speaking, all the machine learning-based localization methods using ﬁngerprinting can achieve good accuracy , if a large number of labeled samples are available. However , such a sample collection process could take severals hours or days for small or big areas, which is very labor expensive and time consuming. Therefor e, it is essential to r educe the ef forts in ofﬂine sample collection pr ocedures while still maintaining high localization accuracy . Therefor e, this work proposes to combine radio signal transmission features along with magnetic ﬁeld disturbance to reduce ambiguity conﬂicts. Moreover , our approach in- corporates a room level landmark detection in the localiza- tion process to accurately converge to the actual position. Differ ent from previous landmark-based approaches, we need only few landmarks, which signiﬁcantly reduces site survey efforts. W e only deﬁne subareas in the environment as landmarks (e.g. rooms). Then, the target is ﬁrst located at room level, and then applying our ranging model and particle ﬁlter algorithm, its location inside the room is derived. 3 S Y S T E M O V E R V I E W This section pr esents the design details of the proposed indoor localization problem. Figure 1 summarizes the struc- ture of our proposed approach, which includes four key subcomponents: a discrete system state space model, a discriminative learning-based room landmark recognition module, a W i-Fi ranging model, and an enhanced particle ﬁler module. Details of each subcomponent ar e described below . SUBMITTED T O IEEE TRANSACTIONS ON MOBILE COMPUTING 4 Figure 2: Discrete System State Space. Nodes (circles) are interconnected by edges. Edges deﬁne the transition model nodes. 3.1 Discrete System State Space In order to minimize computational overhead, our system deﬁnes a discrete structure to replace the conventional ﬂoor map. Thus, all the system states (i.e., indoor positions) can be repr esented by a discrete set of locations by converting from a continuous state space to a discrete state space. Figure 2 shows an example of a graphical representation of the discrete state space of our system. Nodes represent positions in the physical environment, wher eas edges deﬁne the transition model between nodes. For instance, in Figure 2 e 1 deﬁnes a potential transition from node v 1 to node v 2 and vise-versa. Thus, the essential connectivity and accessi- bility of a complex indoor environment is repr esented as an undirected graph. W e consider the physical environment G as a spatial data structur e that deﬁnes space as an array of nodes arranged in rows and columns. G can be deﬁned as follows: G = (Υ , I , ξ ) , (1) where Υ is a set of nodes. I is a set of features deﬁning each node υ in Υ . ξ is a set of edges e deﬁning connections between nodes υ . The grid extraction can be carried out in a few simple steps as follows: • Specifying the areas where it is permissible to walk in rooms, corridors, etc. • Building a grid of nodes for each permitted ar ea. Nodes are separated by 0 . 25 m fr om each other in the room and column. Each node corresponds to an element υ in Υ . • Generating the subset of features i ∈ I for each υ ∈ Υ . Each subset i contains three elements related to a υ element: coordinates ( X , Y ) and an identiﬁer of the room to which υ belongs. • Generating set ξ of node connections. Each node is connected with their immediate neighbors in the grid. 3.2 Discriminative Learning-based Landmark Detec- tion W e pr opose to use a discriminative learning-based appr oach to provide landmark recognition with r oom level accu- racy . For the purpose of indoor localization, we consider a landmark as a location point where readings of at least one sensor show a stable and discriminative pattern. This pattern is referred as the signatur e of the landmark. W e deﬁne each ar ea separated by walls as an independent r oom landmark in the indoor envir onment. In this manner , the process of building the landmark ﬁngerprint database is simpliﬁed to collect some ﬁngerprints by walking randomly through each predeﬁned room. Thereby , the required effort and time to accomplish this task is signiﬁcantly reduced compared to traditional ﬁngerprinting-based and landmark- based localization approaches. Most of the indoor environments offer some ubiquitous radio signatures in different sensing domains [10]. In this work, we propose to use W i-Fi radio and magnetic ﬁeld signatures to provide room landmark detection. Thus, land- mark ﬁngerprints are composed by W i-Fi and magnetic ﬁeld signatures. The tuple < f ing er print, l ocation > deﬁnes the instances in the landmark ﬁngerprint database. W i-Fi-based landmark detection is one of the most widely used techniques, since it can make use of already deployed W i-Fi access points (APs) or anchor nodes (ANs). Additionally , it can provide landmark detection without any knowledge of the access point location or the signal propagation model [7]. However , physical conﬁguration of the indoor environment can inﬂuence accuracy . Unlike most of the existing landmark-based approaches that deﬁne many landmarks to correct the localization err or , we use landmarks for a different purpose. In our approach, we only deﬁne landmarks at room level, and the landmark detection is applied to recognize which room a target is. Afterwards, we use this information and apply our ranging model and particle ﬁlter algorithm to further derive more accurate locations within the detected room. Therefore, we do not need to measure a lot of landmarks, which is a labor intensive task. Indoor magnetic ﬁeld data combine the geomagnetic ﬁeld M F g eo of the Earth and the magnetic ﬁeld from fer- romagnetic objects M F env . Magnetic ﬁeld readings change signiﬁcantly with locations, but the magnetic ﬁeld is stable over time [11]. However , distortions of the magnetic ﬁeld produced by ferromagnetic material and electrical objects lead to ambiguities in the magnetic ﬁeld in indoor environ- ments. Thus, using magnetic ﬁeld ﬁngerprints alone is not accurate. [21]. A smartphone’s magnetometer measures the indoor magnetic ﬁeld vector M F p . Equation 2 shows the relation- ship among M F p , M F g eo and M F env in the absence of noise. M F p = M F g eo + M F env , (2) where M F p = ( M F x , M F y , M F z ) is a three- dimensional vector repr esenting the magnetic ﬁeld value on the phone local coordinate system. Ther efore, M F p becomes error prone due to its phone orientation dependence. As an alternative and similar to [11], we propose to use the gravity sensor to extract the vertical M F v and horizontal M F h component of the magnetic ﬁeld. Thus, the magnetic ﬁeld signature is deﬁned by the vector M F = ( M F v , M F h ) . According to the No Fr ee Lunch theorem [35], ther e is no single learning algorithm that universally performs SUBMITTED T O IEEE TRANSACTIONS ON MOBILE COMPUTING 5 best across all application domains. Therefore, for indoor localization problems, a number of classiﬁers should be tested to validate their performance under a speciﬁc indoor environment. In ﬁngerprinting-based landmark detection approaches, the k-nearest Neighbors (k-NN) method is a widely used technique. However , k-NN provides poor es- timation performance due to the large variability of radio signals [8]. An alternative approach is to consider the lo- calization problem as an approximation function problem. Therefor e, discriminative learning models such as KStar , Multilayer Perceptr on (MLP), J48 and Support V ector Ma- chine (SVM) can be applied for ﬁngerprinting localization purposes. W e setup the room landmark detection method based on some experimental result analysis. Experiments were performed with KStar , Multilayer Perceptron (MLP), J48 and Support V ector Machine (SVM). Results show that in our speciﬁc setups, KStar overcomes MLP , J48, SVM in terms of landmark detection accuracy . Moreover , KStar is able to provide high r oom landmark detection accuracy . Thereby , we set up KStar as discriminative algorithm to provide room landmark r ecognition in our localization approach. KStar is an instance-based learner . It means that classiﬁcation is performed based on pre-classiﬁed samples. The funda- mental assumption is that similar instances should have similar classiﬁcations [32]. In KStar , similarity is established by a distance function. Thus, distance between instances is deﬁned as the complexity of transforming one instance into another . A detailed evaluation of the proposed landmark detection model and performance comparison of differ ent machine learning algorithms can be found in subsection 5.1. 3.3 Ranging Method W ith the derived room information, we focus on how to use the ranging model to derive more accurate location within that r oom. Range is deﬁned as the propagation distance between the target and the ANs. Ranges can be derived by using signal parameters such as RSSI. In theory , RSSI monotonically decreases with increasing propagation distance [1]. Some models to relate RSSI to the propagation distance have been proposed e.g., Log Distance Path Loss (LDPL) [12], Nonlinear Regression Model (NLR) [1]. These models are called radio propagation models. The radio propagation model that describes the LDPL can be described by Equation 3: P ( r ) = P ( r 0 ) − 10 · γ · log 10 ( r r 0 ) , (3) where P ( r ) is the r eceived signal power in a certain location, P ( r 0 ) refers to the power loss in a fr ee space. γ is the path loss efﬁcient. Therefore, r can be calculated as follows: r = 10 ( P ( r 0 ) − P ( r ) 10 · γ ) , (4) The NLR model is deﬁned by Equation 5: r = α · e ( β · P ( r )) , (5) where d is the distance between the transmitter and the receiver . Both α and β are environmental variables. In indoor environments the accuracy of the radio prop- agation models is af fected by multipath and Non Line of Sight (NLOS) pr opagation [1]. Thus, to reduce ranging errors introduced by NLOS and multipath propagation, we propose a new propagation model by combining LDPL and NLR. In LDPL, P ( r 0 ) is deﬁned under Line of Sight (LOS) propagation (i.e., free space), wher eas, NLR parameters are estimated by considering LOS and NLOS propagation. Thereby , it is expected that LDPL will perform better under LOS and short distances, whereas NLR will achieve better performance under NLOS and lar ge distances. Hence, based on these observations, we deﬁne our ranging model as follows: r = ( 10 ( P ( r 0 ) − P ( r ) 10 · γ ) if P(r) > P ( r 5 ) dB α · e ( β · P ( r )) if P ( r ) ≤ P ( r 5 ) dB (6) where P ( r 5 ) is the received signal power under LOS condi- tions (i.e., in short distance, usually 5 m ). 3.4 Enhanced Particle Filter Filtering is the problem of estimating the state of a system when a set of observations becomes available on-line [16]. W e consider indoor localization as a ﬁltering pr oblem, in which the conver gence results predict the position of the target from a set of environmental measurements. Hence, the key component of our localization approach is the enhanced particle ﬁlter we propose. Particle ﬁlter is a pow- erful tool that is able to represent a probability distribution function (PDF) over the target area that r epresents the in- door environment [16]. Indoor localization can be modeled as estimating system state by processing a sequence of noisy measurements. Particle ﬁlters estimate the posterior distribution of the state system based on some conditioned measurements Z t at time t [2]. Thus, we deﬁne the system state vector X t at time t as follows: X t = [ x t , y t ] , (7) where ( x t , y t ) ar e the Cartesian coordinates of the target object. The set of N weighted particles can be deﬁned as: P t = [ X i t , W i t ] , i = 1 , ..., N , (8) where X i t is the state vector with W i t of the i th particle at time t . 3.4.1 P ar ticle Initialization and Sampling Sampling function decides the location of particles. Algo- rithm 1 describes the sampling function S . At the beginning of the localization process, N particles are generated uni- formly distributed over the whole system state space. Equal weight W i t = 1 N , i ∈ [1 , N ] is assigned to each particle in P t . After initialization, all particles move according to the sampling function S . Function S introduces variety based on P t as follows: • N 0 % of the particles are spread uniformly distributed over the whole System State Space G . N 0 is the percentage of particles with lower weights. N 0 is an empirically chosen constant. • The position of particles with higher weights is up- dated based on a random function. Ther e are two SUBMITTED T O IEEE TRANSACTIONS ON MOBILE COMPUTING 6 Algorithm 1: Sampling Function Input : P t − 1 = [ X i t − 1 , W i t − 1 ] ,State Space, N 0 % Output: P t = [ X i t , W i t ] 1 Order X i t − 1 ascending according w i t − 1 2 Sampling step 3 foreach n=1,... N 0 · N / 100 do 4 Position x t − 1 = draw uniformly distributed 5 end 6 foreach n=1,... N 0 · N / 100 ,... N do 7 Position x t − 1 = randomly chosen from immediate neighbor from Υ 8 end 9 Update P t from P t − 1 10 foreach X t − 1 do 11 Set x t = x t − 1 12 end possibilities, either the particle stays on the same position v ∈ Υ or it is updated to one immediate neighbor on Υ . 3.4.2 W eight Update and Resampling The associated weights W i t of the propagated particles must be corrected after updating their positions. The associated weight should be updated based on the likelihood of the ob- servations conditioned on each particle p ( Z t | X i t ) at time t . The observation vector is deﬁned by the estimated ranges to differ ent ANs and the result of the machine learning-based room landmark detection approach. Thus, the range obser- vation vector at time t is deﬁned as Z r t = [ d j t ] , j = 1 ...N AN , where N AN is the number of ANs. The r oom landmark observation vector at time t is deﬁned as Z l d t . W e can assume that the ranging and the landmark detection process are independent fr om each other . Thus, the pr obability p ( Z t | X i t ) can be determined as follows: p ( Z t | X i t ) = p ( Z r t | X i t ) · p ( Z l d t | X i t ) = p ( d j t | X i t ) · p ( Z l d t | X i t ) (9) In this phase, the associated weight W i t of each particle is given by the ranging and r oom landmark information. There, at time t , small weight values will be assigned to particles with low probability to observe Z r t and Z ld t in their positions. Particles with large associated weights will have a stronger contribution in the determination of the state belief of the system. In order to avoid confusion between dif ferent likelihoods used in this work, hereafter we refer to p ( d t | X i t ) as the ranging likelihood, p ( Z l d t | X i t ) as the room landmark likelihood and p ( Z t | X i t ) as the overall likelihood. W e can assume that the ranges to dif ferent ANs ar e inde- pendent from each other . Therefore, the ranging likelihood can be deﬁned as follows: p ( Z r t | X i t ) = N Y j =1 p ( ˆ d j,t | X i t ) , (10) where ˆ d j,t is the measured distance to the AN j at time t . Hereafter , p ( ˆ d j,t | X i t ) will be referred as the individual likelihood. Each individual likelihood can be written as: p ( ˆ d j,t | X i t ) = 1 σ j √ 2 π exp − [ ˆ d j,t − √ ( x i − x j ) 2 +( y i − y j ) 2 ] 2 2 σ 2 j , (11) where ( x j , y j ) ar e the coordinates of the j th AN. In complex indoor envir onments W i-Fi signals suffer from some random variations during transmission. This random behaviour is produced by the presence of multiple obstacles such as walls, furnitur e, ceiling, etc. Obstacles introduce a mixed transmission between Line of Sight (LOS) and Non-LOS (NLOS) conditions. NLOS propagation in- troduces a signiﬁcant bias in power-based ranging [17]. T o mitigate the inﬂuence of ranging errors on the deﬁnition of the ranging likelihood p ( ˆ d k | X i t ) , we propose to adopt the same weighting technique used in our previous work [18]. The weighting technique magniﬁes the contribution of the individual likelihood with smaller errors and suppr esses the contribution of larger ranging errors. Therefore, the weight- ing technique is deﬁned on each individual likelihood as follows: p ( Z t | X i t ) = N Y j =1 p ( ˆ d j | X i t ) m j , (12) where m j is the exponential weight for the individual likelihood of the j th AN. In general, estimation of larger distances introduce more errors than small distances. Thus, the exponential weight m j can be deﬁned as being inversely proportional to the estimated range outputs [18]: m j = 1 d j P N ap n =1 1 d n , (13) where N ap is the number of ANs. Room landmark likelihood p ( Z ld t | X i t ) is provided by the machine learning-based room landmark detection approach. In section 3.2 we analyze the performance of some machine learning algorithms to detect landmarks in our indoor environment. These experiments show that the KStar algorithm achieves the best landmark detection per- formance. Thus, KStar is selected as machine learning land- mark detection algorithm for our proposed localization ap- proach. The KStar [32] classiﬁer calculates the entropy-based distance sums over all possible transformations between two instances. Then, the probability function is deﬁned as the probability of all transformational paths from instance a to b . The r esampling step is a crucial but computationally expensive component of a particle ﬁlter approach. This phase is adopted to eliminate particles with small associated weights by replacing them by particles with lar ge associated weights. Thus, after updating each particle in the prediction phase, we perform a resampling process in a systematic manner . The resampling process in this phase relies on the individual likelihood p ( d t | X i t ) and the room landmark likelihood p ( Z l d t | X i t ) . It means that the associated weight of each particle is calculated by using observations related to range and room landmark estimations. Afterwards, the SUBMITTED T O IEEE TRANSACTIONS ON MOBILE COMPUTING 7 Figure 3: System Implementation Overview . The localization algorithms are deployed in a smartphone. weighted center given by all the particles is calculated as the estimated position of the target object. 4 I M P L E M E N TA T I O N A N D E X P E R I M E N T S 4.1 Implementation details W e have implemented a terminal-based system for accurate indoor localization. The system comprises two main compo- nents: a mobile target and many W i-Fi Anchor Nodes (ANs). The proposed localization algorithms are running on the mobile target. Figure 3 presents the overview of the system. ANs are some commercial W i-Fi access points deployed at known or unknown locations along the area of interest. Positions of the known location ANs are chosen to provide the maximum coverage inside the area of interest. Thus, the locations of ANs are deﬁned on the boundary corners and the boundary itself. W e have adopted D-Link D-635 and D-Link DAP-2553 as W i-Fi ANs in this work. The beacon period is conﬁgured to 100 ms for ANs. The mobile target can be any commer cial Android smart- phone, which supports W i-Fi RSSI readings and magnetic ﬁeld sensors readings. W e have implemented the localiza- tion algorithms in a Motorola Nexus 6 smartphone. Here- after , we refer to Motorola Nexus 6 as mobile target (MT). In order to save resour ces in the smartphone, we set the sampling rate of magnetic ﬁeld sensor to 14Hz. However , the W i-Fi sampling frequency is much lower , i.e., 3Hz. T able 1 shows the main characteristics of the mobile target used in this work. Additionally , it is necessary to know the ﬂoor plan of the area of interest. The system requir es information related with r estricted areas such as sizes of walls and corridors. Particles are not allowed to be spread over restricted areas. The system reports the location of the target in real time. W e have conducted a set of experiments in ofﬁce-like indoor environments to evaluate the performance of our proposed localization system. Experiments were conducted in the building of the Institute of Computer Science (INF) at the University of Bern. A part of the third ﬂoor with an area of 288 m 2 ( 18 m x 16 m ) was chosen to deploy the localization T able 1: Mobile T arget Speciﬁcations Model Platform MT Model: Motorola Nexus 6; OS: Android 5.1.1 CPU: Quad-core 2.7 GHz; RAM: 3GB WLAN: W i-Fi a/b/g/n Accelerometer: Resolution:0.039 Range:19.613 Magnetometer: Resolution: 0.150 Range:9830 system. T o evaluate the performance of each subcompo- nent in the proposed architectur e, we perform individual experiments to validate the performance of the discrimina- tive learning-based r oom landmark detection module and differ ent signal propagation models in the ranging module respectively . 4.2 Room Landmark Detection Experiment Setups Considering that the landmark detection accuracy can be inﬂuenced by some parameters of the environment, we conduct some experiments to determine how environmen- tal parameters such as position of AP or number of APs inﬂuence the accuracy of the W i-Fi-based ﬁngerprinting landmark detection approach. Additionally , we perform experiments to show how the accuracy is impr oved by integrating magnetic ﬁeld (MF) values and W i-Fi readings to establish the ﬁngerprinting data. As shown in Figure 4, we deﬁne 9 wall separated areas in our environment. Her eafter , we refer to these areas as rooms. In order to study the effect of different environmental conﬁgurations, we deﬁne two scenarios by varying some environmental parameters: the ﬁrst scenario contains 5 ANs, the second scenario includes 8 ANs. In scenario 1 the positions of the 5 ANs are known, whereas in scenario 2 the positions of AN7 and AN8 are unknown. Figure 4 presents the physical distribution of ANs in both scenarios. Internal parameters of learning-based algorithms are op- timized from training data. Additionally , certain algorithms also have parameters that are not optimized during the training process. These parameters are called hyperparame- ters of the learning-based algorithm. Due to hyperparam- eters have signiﬁcant impact on the performance of the learning-based algorithm, we use a nested cross validation technique to adjust them [34]. The nested cross validation technique deﬁnes an inner and outer cross validation. The inner cross validation is intended to select the model with optimized hyperparameters, whereas outer cross validation is used to obtain an estimation of the generalization error . T en-fold cross validation was applied on both inner and outer cross validation. The classiﬁers were optimized over a key of hyperparameters. For KStar , we optimized global blend percentage ratio hyperparameter , kernel type func- tion, c and γ for SVM, the conﬁdence factor for J48, number of hidden layers and neur ons per layer for MLP . Based on the parameter optimization process, we established the optimal hyperparameter values for the classiﬁers as follows: global percent ratio of 30% for KStar , single order polyno- mial kernel, c = 1 , γ = 0 . 0 for SVM, single hidden layer with 10 neurons for MLP , and conﬁdence factor of 0.25 for J48. SUBMITTED T O IEEE TRANSACTIONS ON MOBILE COMPUTING 8 AN 1 AN 2 AN 3 AN 4 AN 5 1 2 3 4 5 6 7 8 9 X Axis (meters) 0 2 4 6 8 10 12 14 16 18 Y Axis (meters) 0 2 4 6 8 10 12 14 16 (a) Scenario 1 (5 ANs) AN 1 AN 2 AN 3 AN 4 AN 5 AN 6 AN 7 AN 8 1 2 3 4 5 6 7 8 9 X Axis (meters) 0 2 4 6 8 10 12 14 16 18 Y Axis (meters) 0 2 4 6 8 10 12 14 16 (b) Scenario 2 (8 ANs) Figure 4: Anchor Nodes distribution and room deﬁnition (Square Points: Anchor Nodes; blue numbers: room identiﬁer). 5 ANs are considered in scenario 1, whereas 8 ANs are deployed in scenario 2. T o test the room landmark detection performance, we performed three experiments. The ﬁrst experiment tests room recognition performance in scenario 1. For this exper- iment, the landmark ﬁngerprinting database for landmark detection contains 3712 instances, which were collected from the whole indoor environment. Additionally , we test two approaches regarding the structure of the ﬁngerprint: W i-Fi readings alone and W i-Fi with magnetic ﬁeld. The second experiment tests the inﬂuence of the position of the ANs over the room recognition accuracy . In this experiment we use the same number of ANs (AN1,AN2,AN3,AN4,AN5) as in experiment 1. Positions of AN3 and AN4 change with respect to scenario 1. Distance between AN4 and AN5 is incr eased. AN3 is placed on the right down corner of room 8. Figure 4b shows the physical positions of the ANs. The landmark ﬁngerprint database contains 3700 instances, which wer e collected over the whole environment. The thir d experiment is intended to test the inﬂuence of the number of W i-Fi ANs that are used to deﬁne the ﬁngerprint instance. This experiment was performed in scenario 2. The results of the landmark detection experiments can be found in subsection 5.1. T o build the landmark ﬁngerprint database we ask a per - son to walk randomly around each room holding the phone in the hand. Instances must be collected equally distributed along the whole area in each room. Instance collection rate is only constrained by computational capabilities of the W iFi sensor of the smartphone. Thus, in our experiments each instance was collected at a rate of 3 instances/second. Be- cause our approach doest not need to predeﬁne any survey point, the time needed to build the landmark ﬁngerprint database is proportional to the number of collected instances multiplied by the instance collection rate. T able 8 shows the time r equired to build the landmark ﬁngerprint database for the landmark detection method. 4.3 Ranging Experiment Setups The ﬁrst step in ranging estimation is to take the initial measurements, which are aimed to train the environmental parameters α and β requir ed for the NLR model. Similar to our previous work [2], we estimate α and β values based on several stationary points. These reference points are spread over the whole ﬂoor plan. Please ﬁnd details about the NLR ranging method in our previous work [2]. T o analyze the performance of LDPL and NLR over our indoor environment, we deﬁne 40 test points over the whole scenario 2. Ranges between AN6 and the 40 test points are calculated with NLR, LDPL and PFML ranging models. The differ ence between ranges reported by the propagation model and ground truth ranges are considered as the ranging error . 4.4 Localization Experiment Setups In order to determine the localization errors, 20 testing positions were deﬁned in the interested area. The MT was held by a person who was asked to stand still at each testing point during the localization process. The Euclidean distance between positions reported by the localization sys- tem and ground truth positions is considered as localization error . Red points in Figure 4b represent the evaluated posi- tions for experiments. All the system parameters were determined beforehand. T able 2 shows the environmental parameters α , β deﬁned for the NLR model and γ , P ( r 0 ) parameters deﬁned for the LDPL model. Parameter N 0 of the sampling function is set to 10%. The experiment includes two phases: an off- line learning phase, and an on-line query phase. During the learning phase, a person holding the smartphone continu- ously walks through the indoor areas and collects the ﬁnger- prints and other correlated data. As mentioned before, our approach only requir es room landmark detection. Thereby , the off-line learning efforts and implementation complexity SUBMITTED T O IEEE TRANSACTIONS ON MOBILE COMPUTING 9 T able 2: Environmental parameters for NLR ranging model AN α β γ P ( r 0 ) AN1 1.264 -0.03614 2.7 -30 AN2 1.278 -0.03711 2.7 -32 AN3 0.3701 -0.05153 2.7 -30 AN4 0.5663 -0.05153 2.7 -31 AN5 0.3496 -0.05328 2.7 -30 AN6 0.9739 -0.03578 2.7 -30 Discriminative learning method Experiment 2 Experiment 1 Correctly classified instances (%) 60 65 70 75 80 85 90 95 100 KStar (WiFi and MF) KStar (WiFi) MLP (WiFi and MF) MLP (WiFi) J48 (WiFi and MF) J48 (WiFi) SVM (WiFi and MF) SVM (WiFi) Figure 5: Landmark prediction performance in Experiment 1 and Experiment 2. are signiﬁcantly reduced. During the on-line query phase, a person holds the smartphone and walks through the areas and her/his locations are presented on the smartphone in real-time. 5 E X P E R I M E N T R E S U L T S In this section, we discuss and analyze the results of each individual subcomponents, and the integrated real-time lo- calization accuracy . 5.1 Landmark Detection Accuracy This section discusses the accuracy of the landmark de- tection model when different classiﬁers are used. When comparing the performance of classiﬁers, it is impossible to deﬁne a single metric that provides a fair comparison in all possible applications. Ther efore, several measurements have been proposed in the literatur e [33]. W e focus on the metrics of prediction accuracy and model building time. Accuracy refers to the percentages of correct room detection, and the model building time refers to the time that the algorithm requir es to build the classiﬁcation model. Figure 5 shows performance validation of the selected learning-based classiﬁers obtained from Experiment 1. The best performance is reached by the KStar classiﬁer , which achieves 86% of instances correctly classiﬁed when the ﬁngerprint is composed by W i-Fi and MF r eadings. By using W i-Fi and MF readings in the room recognition process, the accuracy is improved in all tested classiﬁers. The maxi- mum improvement takes place on SVM with 6%. However , the KStar classiﬁer achieves the best overall performance with 86% of instances correctly classiﬁed. Figure 5 shows the evaluation results of the classiﬁers obtained from ex- periment 2. Percentage of corr ectly classiﬁed instances is increased for all tested classiﬁers. This is due to the fact that the similarity of signatur es in r oom 3 and room 7 decreases when the distance between AN4 and A5 increases. Therefor e, this experiment shows that the position of the ANs inﬂuences room recognition accuracy . The KStar clas- siﬁer achieves the best performance with 94.5% of instances correctly classiﬁed. T able 3 presents the classiﬁer validation results obtained from Experiment 3. KStar achieves the best performance, its percentage of instances correctly classiﬁed increases in 2.8% by incr easing the number of ANs from 5 to 8. However , the improvement is just 0.1% from 7 to 8 ANs. Therefor e, increasing the number of ANs improves the accuracy of the classiﬁer . Nevertheless, after a certain number of APs is used, the improvement of adding more APs is almost negligible in all tested classiﬁers. T able 3: Classiﬁcation performance vs number of ANs, Ex- periment 3 (scenario 2). ANs Classiﬁer Accuracy (%) T raining time (s) 6 ANs KStar 96.2 0 MLP 94.0 6.88 J48 92.3 0.04 SVM 91.6 3.16 7 ANs KStar 97.2 0 MLP 95.3 8.17 J48 93.1 0.15 SVM 94.4 12.87 8 ANs KStar 97.3 0 MLP 95.8 8.53 J48 92.3 0.23 SVM 95.5 12.93 In indoor environments, measured values of RSSI vary according to locations. However , it is expected that these values will remain similar on nearby positions. For example, on locations close to landmark borders, high similarities will be observed on RSSI values. These similarities could lead to misclassiﬁcation problems. The KStar classiﬁer outperforms J48, SVM and MLP in terms of accuracy and building time on the three experiments. This is because KStar is an instance-based learner algorithm, which uses entropy as a distance measur e to determine how similar two instances are. Distance between instances is deﬁned as the complexity of transforming one instance into another . Thus, this method is more sensitive to slight variations upon the instance as unity . Unlike KStar , J48 builds the classiﬁcation model by parsing the entr opy of information at attribute level. It means that J48 measures entropy in the attribute domain to decide which attribute goes in a decision node. Therefore, the classiﬁcation model is prone to misclassiﬁcation in this speciﬁc landmark detection problem. Although SVM shows good performance, optimized hyperparameter values affect the smoothness of the decision boundary . It could lead to a certain degree of overﬁtting on the model. Moreover , build- ing time on SVM is the highest compar ed to other classiﬁers. MLP , by means of sigmoid function, achieves about 95.8% of accuracy when using 8 ANs. However , its building time is 99.99% and 99.98% higher than KStar and J48 respectively . Therefor e, as shown on T able 3, it is clear that the KStar classiﬁer achieves better performance than J48, SVM and MLP for the room landmark detection problem. SUBMITTED T O IEEE TRANSACTIONS ON MOBILE COMPUTING 10 Range interval (meters) [0-2) [2-4) [4-6) [6-8) [8-10) [10-13] Mean error (meters) 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6 NLR Model LDPL Model Figure 6: Propagation Models Error in Range intervals. LDPL perform better in short distances, whereas NLR per- form better in large distances. 5.2 Ranging Accuracy This section discusses the r esults of the proposed ranging model. Figure 6 shows the mean error of NLR and LDPL models considering range intervals. T able 4 summarizes the ranging mean error of NLR, LDPL and NLR-LDPL models. Figure 7 depicts the CDF of ranging errors for the signal propagation models. Considering the 40 test points, NLR achieves 1 . 48 m of mean error , whereas LDPL achieves 1 . 79 m. Therefore, NLR accomplishes better performance than LDPL. However , it is worth to notice that LDPL performs better than NLR when the range is shorter than 5 m , meaning that LDPL over comes NLR in LOS condi- tions. In this case, LDPL achieves 1 . 01 m of mean ranging error , whereas the mean error of NLR is 1 . 79 m . Ther eby , LDPL achieves better performance for short distances, and NLR achieves better performance for large distances. This is because parameters of the LDPL model are deﬁned in LOS conditions, whereas parameters of NLR ar e deﬁned considering LOS and NLOS conditions. As shown in T able4, our proposed NLR-LDPL ranging model outperforms NLR and LDPL. This is because NLR-LDPL considers LOS and NLOS conditions to apply the most suitable ranging method between NLR and LDPL. T able 4: NLR and LDPL ranging errors. Model Error(all points) Error( < 5m points) NLR 1.48 m. 1.79 m. LDPL 3.16 m. 1.01 m. NLR-LDPL 1.31 m. 1.01 m. 5.3 Localization Accuracy In this section we focus on the performance of the whole localization approach. A performance comparison between our approach and state of the art ﬁngerprinting-based and landmark-based appr oaches would be biased. This is be- cause most of those approaches are environmentally de- pendent (i.e., they rely on the presence of numerous land- marks in the environment). Ther efore, their performance Error(in meters) 0 1 2 3 4 5 6 7 8 CDF(%) 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 NLR-LDPL NLR NLPD Figure 7: CDF ranging methods will vary on different environment conditions. Moreover , unlike afor ementioned approaches, our localization solution only requires room landmark detection. However , to show the performance superiority of our solution, we compare the performance of our method with the Nonlinear Least Squares trilateration (NLST) algorithm, K-nearest neigh- bors (KNN) ﬁngerprinting and Kalman Filter-based (KF) localization methods. Hereafter , we r efer to our particle ﬁlter machine learning localization approach as the PFML localization method. For evaluation, we collect the metrics of CDF (Cumu- lative Distribution Function) of localization errors, mean tracking error , and standard deviation of localization errors. 5.3.1 P erf or mance vs Number of P ar ticles This set of experiments was conducted in scenario 2 with 6 ANs for ranging estimation and 8 ANs for landmark recognition. Further details about the environment conﬁguration can be seen in Figure 4b. The number of particles are 600, 1500, 2500. Figure 8a depicts the conﬁdence intervals resulting from this set of experiments. Figure 8b shows the CDF of localization err ors for PFML localization methods. T able 7 summarizes the average of localization err ors, standar d deviation and 90% accuracy . PFML achieves the minimum mean error of 1 . 55 m with 2500 particles, which spend approximately 455 milliseconds of processing time. It is worth to notice that this conﬁguration also yields the lowest standard deviation: 0 . 73 m . Our localization appr oach achieves the best performance by using 2500 particles. However , there is not a big gap among the performance values in all tested particle numbers in this experiment. This is because PFML deﬁnes a discrete system space to repr esent the environment. Thus, most of the potential system states fr om the posterior distribution can be covered by small number of particles. More particles will signiﬁcantly increase the computation overhead, which degrades the overall system performance. 5.3.2 P erf or mance vs Number of Ranging ANs In this experiment we validate the robustness of our local- ization approach by varying the number of ranging ANs. SUBMITTED T O IEEE TRANSACTIONS ON MOBILE COMPUTING 11 Number of Particles 500 1000 1500 2000 2500 3000 Mean Error 0.5 1 1.5 2 2.5 3 (a) Conﬁdence Interval Error(in meters) 0 1 2 3 4 5 CDF(%) 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 PFML 2500 Particles PFML 1500 Particles PFML 600 Particles (b) Localization error CDF Figure 8: Localization performance vs number of particles Error(in meters) 0 1 2 3 4 5 6 7 8 9 10 11 CDF(%) 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 PFML 2500 Particles NLS-trilateration KNN (K=3) Kalman Filter Figure 9: Localization err or CDF PFML, NLST , KNN and KF Localization Methods PFML KF NLST KNN(K=3) Mean Error 0 1 2 3 4 5 6 Figure 10: Conﬁdence interval PFML,NLST ,KNN W e select the number of particles that yield the best perfor- mance of PFML. Therefor e, we setup the PFML method to 2500 particles. Mor eover , we compare the performance of the PFML approach to NLST . Figure 11a depicts the mean localization error achieved by PFML and NLST methods. Figure 11b shows the CDF of localization errors of our pro- posed localization approach. T able 5 summarizes the mean T able 5: PFML Performance vs Number of ranging ANs Conﬁguration Mean error S.D 90% Acc. PFML 6 ANs 1.55m 0.73m 2.8m PFML 5 ANs 1.7m 1.1m 2.7m PFML 4 ANs 1.95m 1.22m 3.0m PFML 3 ANs 2.09m 1.26m 2.5m NLST 6 ANs 3.05m 2.74m 7.1m T able 6: NLST Performance vs Number of ranging ANs Conﬁguration Mean error S.D NLST 6 ANs 3.05m 2.74m NLST 5 ANs 3.22m 2.48m NLST 4 ANs 3.07m 1.3m NLST 3 ANs 3.46m 1.42m localization error , standard deviation and 90% accuracy of the PFML method. T able 6 shows the mean localization error and standard deviation of NLST . Based on these results, we can highlight the following observations. First, PFML achieves more stable and higher accuracy than the NLST method in all tested cases. This is because unlike NLST , PFML determines the target localiza- tion from a Posterior Probability Distribution (PPD). This PPD is built after considering multiple information sources such as ranges and room landmark detection. Second, by using 5 ranging ANs, PFML outperforms NLST by around 47 . 2% considering mean err or . The standard deviation of PFML is 55 . 6% smaller than for NLST . Thir d, by considering 4 ranging ANs, the standard deviation of our localization approach is 6 . 15% smaller than for NLST . PFML performs better than NLST by around 36 . 5% considering the mean error . Fourth, by considering 3 ranging ANs, PFML outper- forms NLST by around 39 . 59% and 11 . 2% regarding mean error and standard deviation, respectively . Experiment results show that the accuracy of PFML increases as mor e ranging ANs are used, whereas NLST accuracy does not signiﬁcantly change. This reﬂects that unlike NLST , our localization approach is able to exploit diverse envir onment information sour ces (e.g., SUBMITTED T O IEEE TRANSACTIONS ON MOBILE COMPUTING 12 Ranging Anchor Nodes 3 ANs 4 ANs 5 ANs 6 ANs Mean error (meters) 0 0.5 1 1.5 2 2.5 3 3.5 4 PFML 2500 Ptcl. NLS-trilat. (a) Localization error vs number of ranging ANs Error(in meters) 0 1 2 3 4 5 6 CDF(%) 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 PFML(6 ANs) PFML (5 ANs) PFML (4 ANs) PFML (3 ANs) (b) PFML Localization error CDF vs number of ranging ANs Figure 11: Localization performance vs number of ranging ANs T able 7: Performance vs Number of Particles Conﬁguration Mean error S.D 90% Acc. PFML (2500 Ptc.) 1.55m 0.73m 2.8m PFML (1500 Ptc.) 1.71m 1.073m 2.62m PFML (600 Ptc.) 1.91m 1.01m 3.5m NLST 3.79m 2.52m 8.0m KNN (N=1) 3.51m 2.1m 6.2m KNN (K=3) 3.32m 1.89m 6.1m Kalman Filter 3.36m 1.11m 4.1m magnetic ﬁled and wireless signal propagation) to increase localization accuracy . 5.3.3 Comparison with Other Systems Performance comparison with existing solutions can proof the superiority of the proposed system. However , for indoor localization, it is rather hard to implement all the speciﬁc details of an existing solution and repeat the identical experiment to get the same results that were collected in another physical indoor environment. Therefore, similar to many other indoor localization systems, we compare our system with classical indoor localization solutions, such as ﬁngerprinting, Kalman ﬁlter-based (KF) and trilateration approaches. Figure 9 depicts the CDF of localization error for the best performance of PFML, NLST and the KNN method (K=3), whereas Figure 10 shows the conﬁdence intervals of the best performance of PFML, KF , NLST and KNN (K=3) methods. As shown in T able 7, NLST achieves around 8 . 0 m for 90% accuracy , the mean error is 3 . 79 m and the standard deviation is 2 . 52 m . For K = 3 , the KNN localization method achieves around 6 . 1 m for 90% accuracy , the mean error is 3 . 32 m and the standard deviation is 1 . 89 m . The KF approach achieves around 4 . 1 m for 90% accuracy , the mean error is 3 . 36 m and the standard deviation is 1 . 11 m . Our localization approach outperforms NLST by around 65% , KNN by around 54 . 1% and KF by around 31 . 7% regar ding 90% accuracy . Moreover , conﬁdence intervals of PFML are around 20% smaller than for KF and 50% smaller T able 8: Ofﬂine survey time of different approaches Localization method Instances Survey time PFML(landmark detection + ranging) 3712+20 21+28=49 min KNN 5060 460 min NLST (ranging) 20 28 min LS-SVM-W A [30] 8056 403 min than for NLST and KNN. The mean error of the PFML approach is 53 . 8% , 59 . 1% and 53 . 31% better than for KF , NLST , and KNN respectively . Standard deviation is 71 . 03% smaller than for NLST and 61 . 4% smaller than for KNN. Moreover , it is worth to mention that because PFML requir es only room landmark detection, the survey time of PFML is around 88% smaller than the survey time employed to build the KNN ﬁngerprinting database. Experiment r esults show that because of the integra- tion of machine learning-based room landmark detection, ranging models and particle ﬁlter methods, our localization method outperforms KF , NLST and KNN for accuracy and stability . Moreover , PFML can signiﬁcantly reduce the off- line survey ef fort compared to common ﬁngerprint-based and landmark-based localization approaches. Furthermore, although the ranging method for both PFML and NLST is the same, PFML is able to achieve higher localization accuracy than NLST . 5.4 Adv antage of Using PFML Localization Method As mentioned before, PFML is able to reduce the off-line phase effort while still achieving high accuracy . Unlike tra- ditional ﬁngerprinting-based indoor localization methods, PFML does not deﬁne any ﬁxed survey point to build the landmark ﬁngerprint database. Thus, the ofﬂine phase becomes a simple process. The time requir ed to build the landmark ﬁngerprint database depends only on the number of collected instances and the instance collection rate given by the W i-Fi sensor of the smartphone (i.e., 3Hz). Therefore, the time required to complete the ofﬂine phase in PFML is deﬁned as follows: T of f line,P F M L = I · S rate + T rang ing , (14) SUBMITTED T O IEEE TRANSACTIONS ON MOBILE COMPUTING 13 where I is the number of collected instances, S rate is the instance collection rate. The landmark ﬁngerprint database of the PFML local- ization method contains 3712 instances. Thus, building this database takes around 3712 / (3 · 60) = 21 minutes. Addi- tionally , to estimate α and β values for the ranging method, we deﬁne 20 r eference points. This process takes around 28 minutes. Thus, the required time to complete the ofﬂine phase in PFML is around 21 + 28 = 49 minutes. The KNN radio map database contains 5060 instances. W e deﬁne the indoor area as a grid of square cells of l = 1 m . Each cell intersection deﬁnes a survey point. Thus, 85 survey points were deﬁned in the whole area of interest. Around 60 instances were collected in each survey point. In or der to compar e KNN with our approach, we relate the requir ed time to complete the ofﬂine phase in ﬁngerprinting-based approaches to the number of survey points, the time needed to deﬁne each survey point, the time needed to go from one survey point to another during the site survey phase, number of collected instances and the instance collection rate. Therefor e, the total ofﬂine time in KNN is deﬁned as follows: T of f line,K N N = S p · ( t sp + t sw ) + I · S rate , (15) where S p is the number of survey points, t sp is the time needed to measur e the coor dinates of each survey point, t sw is the time needed to go from one survey point to another during the site survey phase, I is the number of collected instances, S rate is the instance collection rate. Empirically , we approximate t sp and t sw to 300 and 5 seconds respectively . Ther efore, the requir ed time to com- plete the ofﬂine phase in KNN is around 460 minutes. Assuming that the effort is proportional to the time spent in the ofﬂine phase, PFML reduces the ofﬂine effort by around 89% compared to ﬁngerprint-based appr oaches. In LS-SVM- W A [30], 80 labeled and 72 unlabeled samples are collected on 53 ﬁxed survey points, which takes (53 · 305 + 8056) / 60 = 403 minutes. T able 8 summarizes number of data instances that have been collected during the experiments and survey time of PFML, KNN, NLST , and LS-SVM-W A [30]. Clearly , PFML takes much less time and effort to deploy the system and perform the data measurement when compared to ﬁngerprinting-based, landmark-based appr oaches and other works that focus on reducing the calibration efforts. 6 C O N C L U S I O N S This work exploits an enhanced particle ﬁlter appr oach to fuse discriminative learning-based landmark detection, ranging methods and physical information of the envi- ronment to achieve high localization accuracy in complex indoor scenarios for smartphones. Since our approach re- quires only room landmark detection, we are able to signiﬁ- cantly reduce the effort and time requir ed in processing the off-line phase compared to traditional ﬁngerprinting-based and landmark-based approaches. Moreover , to incorporate a suitable discriminative learning-based landmark detec- tion method, we assess the room recognition performance with the most popular discriminative learning algorithms. W e conduct experiments to analyze the impacts of anchor nodes’ position on the landmark detection accuracy . Addi- tionally , based on ranging experiment r esults, we propose a composed NLR-LDPL ranging method. T o evaluate our localization system, we conducted extensive experiments in a complex of ﬁce-like environment. Experiment results show that our appr oach can achieve an average tracking error of 1 . 55 m and 90% accuracy is 2 . 8 m . Compared to ﬁngerprinting-based and landmark-based approaches, our solution r equires much less time in the of ﬂine process, while keeping the accuracy at a high level. Our solution is more accurate and stable than the commonly used Kalman ﬁlter , nonlinear least square trilateration and KNN-ﬁngerprinting methods. Furthermor e, our pr oposed appr oach enables r eal- time localization at the smartphone without assistance of any additional server . A C K N OW L E D G M E N T This work was partly supported by the Swiss National Sci- ence Foundation via the SwissSenseSynergy project under grant number 154458. R E F E R E N C E S [1] Z. Li and T . Braun and D. Dimitr ova, A Passive W i-Fi Source Localization System Based on ﬁne-grained power-based trilateration. Proceedings of the IEEE International Symposium on a W orld of W ireless, Mobile and Multimedia Networks (W oWMoM), 2015. [2] J. Carrera and Z. Li and Z. Zhao and T . Braun and A. Neto, A Real-time Indoor T racking System in Smartphones. Proceedings of the 19th ACM International Conference on Modeling, Analysis and Simulation of W ireless and Mobile Systems (MSW iM), 2016. [3] X. Chai and Q. Y ang, Reducing the calibration Effort for Probabilistic Indoor Location Estimation. IEEE T ran. Mobile Computing, vol 6, no 6, pp. 649-662, 2016. [4] D. Madigan and E. Einahrawy and R. Martin and W . Ju and P . krishnan and A. Krishnakumar , Bayesian Indoor Positioning Systems. IEEE. INFOCOM, vol 2, pp. 1217-1227, 2005. [5] S. Liu and H. Luo and S. Zou, A Low-Cost and Accurate Indoor Localization Algorithm Using Label Propagation based Semi Supervised Learning. Fifth International Conference Mobile Ad-Hoc and Sensor Networks, pp. 108-111, 2009. [6] R. Ouyang and A. W ong, Indoor Location Estimation with Reduced Cal- ibration Exploiting Unlabeled Data via Hybrid Generative/Discriminative Learning. IEEE T ransactions On Mobile Computing, vol. 11, no. 11, pp. 1613-1626, November 2012. [7] Y .Li and P . Zhang and X. Ni, Real-time Indoor Navigation using Smart- phone Sensors. International Conference on Indoor Positioning and Indoor Navigation (IPIN), 2015. [8] M.Moreno-Cano and M. Zamora-Izquierdo and J. Santa and A. Skarmeta An indoor localization system based on artiﬁcial neural net- works and particle ﬁlters applied to intelligent buildings. ELSEVIER Neurocomputing, 2013. [9] S. He and G. Chan Wi-Fi Fingerprint-Based Indoor Positioning:Recent Advances and Comparisons. IEEE COMMUNICA TIONS SUR VEYS & TUTORIALS, 2016. [10] H. Abdelnasser and R. Mohamed and A. Elgohary and M. Alzan- tot and H. W ang and S. Sen and R. Choudhury and M. Y oussef Se- manticSLAM: Using Environment Landmarks for Unsupervised Indoor Localization. IEEE T ransactions on Mobile Computing, 2016. [11] H. Xie and T . Gu and X. T ao and H. Y e and J. Lu A Reliability- Augmented Particle Filter for Magnetic Fingerprinting Based Indoor Lo- calization on Smartphone. IEEE T ransactions on Mobile Computing. [12] S. Seidel and T . Rappaport 914 MHz Path Loss Predictions Models for Indoor Wir eless Communications in Multi-ﬂoored Buildings. IEEE T ransactions on Antennas and Propagation, 1992. [13] J. Krumm and S. Harris and B. Meyers and B. Brumitt and M. Hale and S. Shafer Multi-camera multi-person tracking for EasyLiving. Third IEEE International W orkshop on V isual Surveillance, 2002. SUBMITTED T O IEEE TRANSACTIONS ON MOBILE COMPUTING 14 [14] S. He and S. Chan Wi-Fi Fingerprint-Based Indoor Positioning: Recent Advances and Comparisons. IEEE COMMUNICA TIONS SUR VEYS & TUTORIALS, 2016. [15] A. W ard and A. Jones A. Hopper A new location technique for the active ofﬁce. IEEE Personal Communications, 1997. [16] R. van der Merwe and A. Doucet and N. Freitas and E. W an The Unscented Particle Filter . Advances in Neural Information Processing System, 2001. [17] C. W u and Z. Y ang and Z. Zhou and K. Qian and Y . Y unhao and M. Liu PhaseU: Real-time LOS Identiﬁcation with Wi-Fi. IEEE Conference on Computer Communications (INFOCOM), 2015. [18] Z. Li and D. Burbano and Z. Zhao and Jose Carrera and T . Braun Fine-grained Indoor T racking by Fusing Inertial Sensor and Physical Layer Information in WLANs. IEEE International Conference on Communications, 2016. [19] T . Kavzoglu and I. Colkesen Entropic distance based K-star algorithm for remote sensing image classiﬁcation. Fresenius Environmental Bulletin, 2011. [20] J. Cleary and L. T rigg K*: An Instance-based Learner Using Entropic Distance Measure. International conference on Machine Learning. [21] J. Jung and S. Lee and H. Myung Indoor Mobile Robot Localization and Mapping Based on Ambient Magnetic Fields and Aiding Radio Sources. IEEE T ransactions on Instr umentation and Measur ements, 2015. [22] B. Li and C. Rizos and A. Demper and J. Barnes Hybrid Method for Localization Using WLAN. The Spatial Sciences Conference, 2005. [23] B. Li and J. Salter and A. Dempster and Chris Rizo Indoor Posi- tioning T echniques Based on Wir eless LAN. The IEEE international Conference on W ireless Broadband and Ultra W ideband Commu- nications, 2006. [24] V . Padmanabhan and V . Bahl RADAR: An In-building RF-based User Location and T racking System. IEEE 19th International Conference on Computer Communications, 2000. [25] B. Lakmali and W . W ijesinghe and K. de SIva and K. Liyanagama and S. Dias Design, Implementation & T esting of Positioning T echniques in Mobile Networks. The 3rd International Conference on Informa- tion and Automation for Sustainability , 2007. [26] Y . Shu and C. Bo and G. Shen and C. Zhao and L. Li and F . Zhao Magicol: Indoor Localization Using Pervasive Magnetic Field and Opportunistic Wi-Fi Sensing. IEEE Journal on Selected Areas in Communications, 2015. [27] Y . Shu and C. Bo and G. Shen and C. Zhao and L. Li and F . Zhao SignalSLAM: Simultaneous localization and mapping with mixed Wi-Fi, Bluetooth, L TE and magnetic signals. International Conference on Indoor Positioning and Indoor Navigation (IPIN), 2014. [28] Z. Ghahramani An Introduction to Hidden Markov Models and Bayesian Networks. International Journal of Pattern Recognition and Artiﬁcial Intelligence, 2001. [29] B. Ferris and D. Hahnel and D. Fox Gaussian Processes for Signal Strength-based Location Estimation. Procedur es of Robotics Science and Systems, 2006. [30] R. W . Ouyang and A. K. S. W ong and C. T . Lea and M. Chiang Indoor Location Estimation with Reduced Calibration Exploiting Un- labeled Data via Hybrid Generative/Discriminative Learning. IEEE T ransactions on Mobile Computing, 2012. [31] O. Bousquet and U. von Luxburg and G. Ratsch Bayesian Infer- ence: An Introduction to Principles and Practice in Machine Learning. Fresenius Environmental Bulletin 20(5), 2004. [32] J. Cleary and L. T rigg K*: An Instance-based Learner Using an Entropic Distance Measure. In: 12th International Conference on Machine Learning, 108-114, 1995. [33] D. Amancio and C. Comin and D. Casanova and G. T ravieso and O. Martinez and F . Rodrigues and L. Costa A Systematic Comparison of Supervised Classiﬁers. PLoS ONE 9(4): e94137. [34] K.-R. Mller and M. Krauledat and G. Dornhege and G. Curio and B. Blankertz Machine learning techniques for brain-computer interfaces. BIOMEDICAL ENGINEERING, 11-22, 2004. [35] D. W olpert and W . Macready and H. David and G. W illiam No Free Lunch Theorems for Search. T echnical Report SFI-TR-95-02-010. [36] Z.-A. Deng Deng and G. W ang and D. Qin and Z. Na and Y . Cui and J. Chen Continuous Indoor Positioning Fusing WiFi, Smartphone Sensors and Landmarks. Sensors, vol. 16, no. 9, p. 1427, Sep. 2016. Jos ´ e Luis. Carrera V . received the B.S.E. de- gree from the National Polytechnic School from Equator and the MSc. degree in computer sci- ences from the Swiss Joint Master of Science in Computer Science prog ram of the universities of Neuch ˆ atel, F ribourg and Bern in 2015. He is currently working tow ard the Ph.D . degree in the Institute of Computer Sciences of the University of Ber n. His research interests include Ar tiﬁcial Intelligence, Machine Learning, indoor localiza- tion and distributed systems. Zhongliang Zhao receiv ed the Ph.D . degree from the University of Ber n in 2014. He was a Senior Researcher with the University of Ber n since then. He was m ultiple W ork P ackage lead- ers in the EU FP7 project Mobile Cloud Net- working, a Co-PI of the Sino-Swiss Science and T echnology Cooperation project M3WSN. He is currently the T echnical Coordinator of the Swiss National Science Foundation project SwissSenseSynergy , and Orange-funded Con- text A wareness Engine industry project. T orsten Braun received the Ph.D . degree from the University of Kar lsruhe, Ger many , in 1993. Since 1998, he has been a Full Professor in computer science with the Univ ersity of Bern. He has been the Vice President of the SWITCH Foundation since 2011. He received the Best P aper A ward from the IEEE LCN 2001, WWIC 2007, EE-LSDS 2013, IFIP WMNC 2014, ARM- SCC 2014 Workshop , and the GI-K uVS Commu- nications Software A ward in 2009. Haiyong Luo received the B.S . degree from Huazhong Univ ersity of Science and T echnol- ogy , Wuhan, China, in 1989, M.S. degree from the Beijing University of Posts and T elecommu- nications, Beijing, China, in 2002, and Ph.D . de- gree from the University of Chinese Academy of Sciences (CAS), Beijing, China, in 2008. He is currently an Associate Professor at the Insti- tute of Computing T echnology , CAS. His main research interests include per v asive computing, mobile computing, and the Internet of Things. Fang Zhao received the B.S . degree from Huazhong Univ ersity of Science and T echnol- ogy , Wuhan, China, in 1990, M.S . and Ph.D . degrees from Beijing University of P osts and T elecommunications (BUPT), Beijing, China, in 2004 and 2009. She is currently a Professor in School of Software Engineering, BUPT . Her current research interests include mobile com- puting, location-based services, and computer network.

Discriminative Learning-based Smartphone Indoor Localization

Original Paper

Comments & Academic Discussion

Leave a Comment

Original Paper

Related Papers

Comments & Academic Discussion

Leave a Comment