Full Optical Fiber Link Characterization with the BSS-Lasso

F ull Optical Fib er Link Characterization with the BSS-Lasso Raphael Saa v edra, P edro T o v ar, Gusta v o C. Amaral, and Bruno F anzeres Jan uary 1, 2019 Abstract In this w ork, a nov el technique for optical ﬁber monitoring that introduces the Lasso as a signal pro cessing technique within the Baseband Sub carrier Sweep (BSS) framework, called the BSS-Lasso, is proposed. The methodology is tested in sim ulated and real-w orld environmen ts, taking into ac- coun t b oth reﬂective and non-reﬂectiv e even ts. The results show that, for ﬁb er links ranging from 2 to 15 km with up to 3 faults, o ver 80% of faults are detected within a 50 m range, and indicate that the prop osed metho dology signiﬁcantly outp erforms current state-of-the-art BSS-based super- vision tec hniques. Finally , the BSS-Lasso allo ws for precise, lo w-cost, transmitter-em b edded full c haracterization of optical ﬁb er links. 1 In tro duction As so ciety b ecomes more dep endent on fast distribution of information, robust op eration of telecomm u- nication links app ears as a top priorit y for netw ork managers, since even the shortest service outage can aﬀect up to millions of users; recently , it has b een estimated that 80% of all long-distance data traﬃc in the world is carried by optical ﬁb ers [1]. The mec hanical fragility of the ﬁb ers, ho wev er, p oses a threat to the robust operation of the optical ﬁb er links since ﬁber b ending, ﬁb er breaking, imp erfect ﬁb er splices, R. Saav edra is with the Department of Electrical Engineering and the Laboratory of Applied Mathematical Program- ing and Statistics (LAMPS), Pon tiﬁcal Catholic Univ ersit y of Rio de Janeiro (PUC-Rio), Rio de Janeiro, Brazil (e-mail: rsaav edra@ele.puc-rio.br). P . T ov ar is with the Cen ter for T elecommunications Studies, PUC-Rio (e-mail: pto v ar@opto.cetuc.puc-rio.br). G. C. Amaral is with the Cen ter for T elecommunications Studies, PUC-Rio and with QuT ec h and Ka vli Institute of Nanoscience, T echnical Univ ersity of Delft, Delft, The Netherlands (e-mail: gustav o@opto.cetuc.puc-rio.br). B. F anzeres is with the Department of Industrial Engineering and LAMPS, PUC-Rio (e-mail: bruno.santos@puc-rio.br). Copyrigh t (c) 2018 IEEE. Personal use of this material is p ermitted. How ever, p ermission to use this material for any other purp oses must b e obtained from the IEEE by sending a request to pubs-p ermissions@ieee.org. 1 and corrupted connectors aﬀect the pow er budget and, thus, transmission capacit y . With the ev olution of data transmission proto cols and netw ork architectures, the link sup ervision tec hnology must also ev olv e to meet this sough t-after robustness whilst main taining a lo w impact ov er the qualit y of data transmission [2]. The most successful and well-established supervision technique of the ph ysical lay er of optical net w orks is the Optical Time-Domain Reﬂectometry (OTDR) [3]. The main adv antage of suc h technology is the p ossibilit y to obtain accurate precision with high resolution in long distance monitoring [4, 5]. How ever, in order to fully acquire information from a ﬁb er stretch using this technique, the transmission data is generally susp ended for a non-negligible time frame since the OTDR pulse carries a considerable amount of optical p ow er spread through a broad sp ectral bandwidth. As a consequence, although technically eﬃcien t, standard OTDR monitoring is usually economically burdening (see [6] and the references therein for a wider discussion). T o tac kle this issue, several other reﬂectometry-based techniques ha v e b een prop osed in tec hnical literature [7–11]. Generally , they seek for a monitoring routine such that a narrow sp ectral channel can b e allocated, widening the sim ultaneous data traﬃc capabilit y . In other w ords, an equilibrium b etw een an eﬃcien t monitoring capacity , b oth in distance and resolution, and the co existence with data transmission is pursuit. Th us, in practice, net w ork op erators need to balance the pros and cons of diﬀerent monitoring tec hniques in order to c hoose the one that b est ﬁts their net work, both from a technical and an economical p ersp ective. F ollowing this rationale, a nov el transparent and cost-eﬀective reﬂectometry-based tec hnique has been recen tly developed [12]. Due to its c haracteristics, the technique will be henceforth dubbed as Baseband Sub carrier Sw eep (BSS). F rom a technical point of view, the main goals of the BSS monitoring method are to achiev e reasonable spatial resolution and dynamic range in optical ﬁb er monitoring, with b oth negligible a priori knowledge ab out the ﬁber and in terference on data traﬃc. Within these four goals, three ha v e b een successfully ac hiev ed, with a dynamic range limited to 7 dB b eing the ma jor hindrance of the method. F urthermore, from an implementation point of view, this tec hnique ﬁts the architecture of the so-called Mobile F ronthaul [13, 14], an ubiquitous concept for next-generation mobile netw orks, and can b e seamlessly incorp orated into the optical transmitter (a so-called transmitter-embedded technique) with minimum cost ov erhead [14]. Generally speaking, the nature of the BSS-based monitoring tec hnique is to measure the ﬁb er’s transfer function by detecting the Ra yleigh backscattered p ortion of the propagating optical signal mo dulated by a sw ept tone cov ering several frequencies within an allo cated low-frequency bandwidth. Even though this had b een previously studied in [15], it was only in [12] that the manipulation of the Rayleigh bac kscattered signal b ecame the fo cus of the monitoring solution. The resulting probing signal has a p erio dic structure in the frequency domain, which can b e represented as a linear com bination of spatial-dep endent phasors that take the fault p ositions as arguments [10]. Identifying which spatial-dep endent phasors are presen t in the monitoring signal allows one to consequently identify the set of fault p ositions present in the 2 ﬁb er. Although intuitiv e, the key dra wback of this methodology is the necessity to p erform an extensive com binatorial search in order to precisely determine the fault p ositions, an utterly non-trivial task to be conducted by naiv e optimization methods [12, 16] in reasonable computational time. In this con text, tec hniques based on high-dimensional analysis emerge as an eﬀective to ol. In this w ork, a widely-used high-dimensional signal interpreter metho d known as Least Absolute Shrink age and Selection Operator (Lasso) [17] is employ ed in order to p erform a computationally eﬃcien t fault detection [18]. F undamen tally , the Lasso performs a v ariable selection from an o v er-complete dictionary , electing the comp onents that b est ﬁt the original signal. Therefore, by designing the ov er-complete dictionary with the appropriate sinusoidal-based functions asso ciated to all p ossible fault p ositions (e.g., performing a meter-by-meter discretization of the ﬁb er length), the Lasso methodology can be used to ev aluate the optical ﬁb er link in practical time. In fact, the Lasso has already b een adapted to ﬁt in BSS-based monitoring tec hniques [10]. Never- theless, the key issue regarding this previous w ork w as the absence of a complete mathematical mo del describing the impact of reﬂectiv e and non-reﬂectiv e ev ents on the acquired signal; only non-reﬂectiv e ev ents were considered within the dictionary . Characteristics of even ts allow their classiﬁcation as reﬂec- tiv e or non-reﬂectiv e, and a correspondence with the underlying physical mec hanism can b e observ ed: ﬁb er breaking and corrupted connectors usually causing reﬂective even ts; and ﬁb er b ending and imper- fect ﬁber splices causing non-reﬂectiv e ev en ts. Since the incidence of these cannot b e predetermined, the absence of either causes the model to b e incomplete. In order to comp ensate for the unaccounted con tribution of reﬂectiv e even ts, the incomplete mo del either pro duces unreal even ts or induces shifts on the real ev ent positions, thus reducing the accuracy and robustness of the monitoring method. Therefore, the main ob jectives and contributions of this w ork are threefold: 1. T o extend the metho dology proposed in [10] to account for a hybrid reﬂective and non-reﬂectiv e mo deling framew ork, which represents most practical cases of ﬁber link sup ervision. The prop osed mo del mak es use of the signal description derived in [12] to construct the ov er-complete dictionary that feeds the Lasso. 2. T o devise an ex-p ost analysis to enhance the fault p osition estimation. By making use of the prop erties of reﬂectiv e even ts, a tailored heuristic, hereinafter referred to as BSS-L asso , is prop osed. 3. T o design and v alidate a metho dology to prop erly reconstruct the time-domain proﬁle of a ﬁb er based on the BSS-Lasso metho d. The mo del that relates the frequency- to the time-domain proﬁle allo ws precise estimation of the fault and reﬂection magnitudes, a feature that enables the full c haracterization of the ﬁb er’s proﬁle. Finally , as a minor con tribution, a large library of fault y ﬁb er proﬁles in the frequency domain is made a v ailable so they can b e used to p erform extensiv e tests and comparisons with current state-of-the-art monitoring techniques. All data of this test b ench is av ailable in [19]. This step, and the p ossibilit y of 3 recreating frequency-domain proﬁles that mimic exp erimental acquired data is of great imp ortance for future prop osals and allows one to v alidate and compare the capacit y of diﬀeren t monitoring routines. Due to the in terdisciplinary c haracter of this work, whic h in volv es optical bac kreﬂection measurements and high-dimensional data processing, an introductory background is provided. Firstly , the data acqui- sition and mathematical mo del of the fault lo cation problem are brieﬂy presen ted in Section 2, where the high-dimensional feature of the problem naturally arises from the characteristics of the sup ervision tec hnique, and its constraints are p ointed out. In Section 3, the framework of the Lasso is introduced, along with a discussion on ho w the idiosyncrasies of the stated problem allow for the tailored heuris- tic, the BSS-Lasso, to b e prop osed. With the bac kground from these tw o Sections, experimental fault lo cation results using real-world ﬁb ers are presented in Section 4. Moreov er, in Section 5, the compar- ison of the BSS-Lasso with state-of-the-art monitoring tec hniques is p erformed in the constructed test b enc h in order to provide a statistically relev ant analysis of the tec hnique’s monitoring capability . The pap er is concluded in Section 6, where the adv ances achiev ed b y the BSS-Lasso and on-going researc h are summarized. 2 Exp erimen tal Setup Sw eeping the frequency of an optical sub-carrier tone to ev aluate fault lo cations in an optical ﬁb er has b een extensively studied, and is referred to as Incoherent Optical F requency-Domain Reﬂectometry (I- OFDR) [20]. In case the bandwidth of the frequency sw eep is suﬃcien tly large (in the order of GHz) to yield a granular spatial resolution (in the order of meters), a F ourier transform can b e emplo yed to extract the desired information [21], but at the high cost of consuming a signiﬁcan t portion of the a v ailable transmission band, except in sp eciﬁc cases where the transmission sp ectrum can b e tailored to accommo date the monitoring signal [14]. Con versely , if the bandwidth of the frequency sw eep is reduced (e.g., to a few kilo Hertz), the achiev able spatial resolution through a F ourier transform is not suﬃcien t for a precise fault lo cation, but allo ws for seamless adaptation in to certain data transmission formats, with special stress to Sub-Carrier Multiplexed optical net w ork arc hitectures, where the baseband of the optical carrier is left uno ccupied [12, 14]. The BSS tec hnique falls within the latter class, making itself attractiv e from the p oint of view of adaptation to transmitters and co existence with data transmission. The data acquisition is accomplished b y mo dulating an optical carrier electric ﬁeld with a sin usoidal signal, whic h has its frequency increased in a stepwise fashion that resp ects the steady-state condition of the ﬁb er, while the amplitude and phase of the backscattered signal inside the frequency swept bandwidth are determined by a complex frequency b eat detector. 4 2.1 Data Acquisition and Mathematical Mo del The exp erimental setup for data acquisition is presented in Fig. 1. A Net w ork Analyzer (NA) generates the low-frequency step-wise sw ept tone that directly mo dulates the curren t of a laser dio de biased with a Laser Bias Source (LBS). This causes the electric ﬁeld of the optical carrier to be mo dulated accordingly , th us creating a low-frequency optical sub-carrier channel with bandwidth deﬁned b y the step-wise sweep range, whic h propagates through the ﬁb er; this channel is henceforth dubbed the monitoring channel . The optical circulator, placed immediately b efore the Fib er Under T est (FUT), allo ws one to direct an y coun ter-propagating signal, suc h as the Ra yleigh backscattered p ortion of the incoming signal, to a photo dete ctor. Due to the elastic character of Rayleigh scattering, the detected backscattered signal will conserv e the original modulation, so that a frequency analysis inside the lo w-frequency optical sub- carrier channel can be performed with the correct apparatus. In this case, the NA itself allo ws for complex (amplitude and phase) frequency b eat detection, so the resulting electrical signal is ampliﬁed and directed to the NA for frequency response analysis. Figure 1: Experimental Setup of the baseband subcarrier sw eep monitoring. Data acquisition is p erformed b y measuring the steady-state amplitude and phase v alues for each frequency step. LD: Laser Diode; NA: Net work Analyzer; PD: Photo dio de; LBS: Laser Bias Source; BEA T: complex frequency b eat detector. A key c hallenge to adapt BSS-based technologies in ﬁb er monitoring is precisely the spatial resolution limitation imp osed by the F ourier transform analysis induced by a frequency sw eep with limited range. As discussed in [12], analysis of the resulting signal directly in the frequency domain p ermits one to o vercome suc h limitation and extend the achiev ed spatial resolution. In order to mo del the backscattered signal inside the monitoring channel bandwidth, the fact that the OTDR proﬁle { P ( z ) } z ∈ [0 ,L ] of a ﬁb er with length L meters can b e suitably approximated by a linear combination of step (breaks) and p eak (reﬂections) functions with a single slop e (the ﬁb er’s atten uation co eﬃcient) is used [22]. More sp eciﬁcally , ∀ z ∈ [0 , L ], P ( z ) = e − 2 αz X b ∈B φ b h u ( z ) − u ( z − X b ) i + X r ∈R θ r h δ ( z − X r ) i ! , (1) 5 where { X b } b ∈B are the non-reﬂectiv e even ts with B the set of non-reﬂectiv e even ts indexes, and φ b are the linear combination co eﬃcients. Similarly , { X r } r ∈R are the reﬂective even ts with R ⊆ B the set of reﬂectiv e even ts indexes, and θ r are the linear combination co eﬃcients. In (1), u ( z ) and δ ( z ) denote, resp ectiv ely , the Heaviside and Dirac impulse functions. Therefore, the ﬁb er proﬁle contains the level shifts that corresp ond to non-reﬂective faults and spikes that correspond to reﬂectiv e ev ents. Note that, although P ( z ) translates the amount of p ow er b eing bac kscattered and/or reﬂected from each p osition of the ﬁb er, the co eﬃcients φ b are not directly related to the faults magnitudes as usually presented in an OTDR; this relation is giv en by ξ b = v u u u t 1 − φ b Y j ∈B | j w     β ≥ 0 o , (6) where y is the observ ed/dep endent v ariable, M stands for a matrix of explanatory/indep enden t v ari- ables, β is a decision v ector of coeﬃcients, and w and λ are p ositive vector- and scalar-size parameters, resp ectiv ely . The ` 1 -norm p enalty term (second term of the ob jective function) p erforms a regularization of the β vector around zero. In other words, it p enalizes “unnecessary” deviations from zero, acting, thus, as a v ariable selection. In this regularization, w stands for the p enalty weigh t at each co ordinate of β and λ deﬁnes the p enalty magnitude for the total regularization term. It is important to note that the standard Lasso do es not restrict the v ector β to b e p ositive. Nev ertheless, since a p ermanent p ow er loss or a momen tary p ow er p eak (induced by a fault or reﬂection, respectively) can only generate complex frequency-domain signals that, under cartesian representation, exhibit positive co eﬃcients, the p ositiv e- ness constraint ensures the ph ysical soundness of the mo del. A k ey c hallenge to apply the Lasso in practical applications is an adequate deﬁnition of the p enalt y w eights w and λ . Usually , practitioners set w = 1 and apply an information criterion (e.g., the Extended Ba yesian Information Criterion (EBIC) [25, 26]) to select, from a pre-deﬁned set Λ of p enalty v alues, the magnitude of λ . Therefore, one needs to solv e the optimization problem (6) for all λ ∈ Λ and pic k the one that results in the b est (i.e., minimal) EBIC. Algorithm 1 show cases the Lasso pro cedure as conducted in this w ork 1 . Algorithm 1 Lasso for λ in Λ do ˆ β ( λ ) ← arg min β ≥ 0 k y − M β k 2 2 + λ  β > w  end for return ˆ β ← arg min ˆ β ( λ ) EBIC( ˆ β ( λ )) 1 The Lasso can be eﬃciently executed through several op en-source pack ages, among which w e highligh t glmnet [27], that utilizes coordinate descent with F ortran subroutines and is av ailable in several languages such as Julia, Matlab and R. 9 F or the sake of simplicit y , EBIC( ˆ β ( λ )) represents the ev aluation of the Extended BIC from the solution of (6) with p enalty λ . Henceforth, Algorithm 1 will b e referred to as Lasso( y , M , w ). 3.2 Mo del design and selection stage of the BSS-Lasso In order to identify the set of fault locations on a given ﬁb er link based on its frequency-domain proﬁle, equation (4) is accomo dated in to the Lasso framework discussed in Subsection 3.1. The pro cedure starts by creating a spatial discretization of the ﬁb er length L in X = { X 1 , · · · , X q } lo cations. F or a reasonable granularit y , the fault lo cations are within a negligible distance of giv en elements in X . Additionally , a byproduct of the data acquisition arc hitecture discussed in Section 2 is a set of frequencies F = { f 1 , · · · , f m } for which the probing signal has b een precisely ev aluated. Therefore, the matrix M in (6) can be deﬁned as M = 1 L · h M B    M R i , (7) where matrices M B and M R represen t the fault and reﬂection signals as follows: M B =     Re { S B ( F , X ) } Im { S B ( F , X ) }     ; M R =     Re { S R ( F , X ) } Im { S R ( F , X ) }     . (8) In (7), M contains the real and imaginary parts of the signals S B and S R deﬁned in (5), generated by ev ery possible fault and reﬂection lo cation within the giv en ﬁber discretization in X and frequency in F . F urthermore, a normalization pro cedure (division by L in (7)) is p erformed to av oid numerical issues and an intercept with asso ciated zero p enalty can be added to accoun t for p ossible measuremen t eﬀects that oﬀset the acquired data. According to the model description, the dependent v ariable y is an instance of S ( f ), which is also decomposed in to its real and imaginary parts: y =     Re { S ( F ) } Im { S ( F ) }     . (9) Within this design, the decision v ector β measures the coeﬃcients Φ and Θ in equation (4), thus b eing decomp osed in to non-reﬂective and reﬂective co eﬃcients: β = [ { β B b } q b =1 , { β R r } q r =1 ] > . Therefore, the p osition within X in which a fault is lo cated can b e iden tiﬁed b y picking { β B b } q b =1 greater than zero after executing Algorithm 2 with M and y deﬁned in (7) and (9), resp ectiv ely . Note that, as a byproduct of the prop osed metho dology , the net work op erator can also distinguish if the fault p osition has a reﬂective ev ent, i.e., if the corresp onding β R is also greater than zero. The selection stage is explicitly presented in Algorithm 2. Algorithm 2 BSS-Lasso – Selection stage ˆ β (1) ← Lasso( y , M , 1 ) return ˆ β (1) 10 Remark 1 The monitoring te chnique discusse d in [10], c al le d SincL asso, is a p articular instanc e of the sele ction stage by dr opping matrix M R fr om the mo del, ther eby not taking r eﬂe ctive events into ac c ount. 3.3 Correction stage of the BSS-Lasso In [10], it was observed that the prop osed SincLasso monitoring sc heme regularly failed to accurately iden tify fault p ositions whenever the even t had a reﬂective comp onent. More sp eciﬁcally , signiﬁcant shifts in the estimated fault position with resp ect to the real position w ere observ ed. F urthermore, a similar pattern persisted in the BSS-Lasso selection stage, even with the explicit inclusion of reﬂectiv e ev ents in the mo deling design. Nev ertheless, it w as observed that, although the fault selection remained shifted in the presence of a reﬂection, its reﬂective counterpart w as consistently accurate. Therefore, b y making use of this empirical observ ation, a correction stage was designed to accommodate an ex-p ost analysis of the selection stage in order to impro v e the supervision of ﬁb ers with reﬂectiv e even ts. The correction stage has a s imilar motiv ation to the Adaptive Lasso [28], in whic h it utilizes Lasso selections to mo dify the p enalt y v ector w . More precisely , reﬂections only exist accompanied b y a fault at the same p osition (i.e., R ⊆ B ). Therefore, since the selection stage has shown muc h greater accuracy for the reﬂective selections, an adjustment on the weigh t vector w can b e made at the respective fault p osition, reducing its p enalt y so that Algorithm 1 can b e re-executed with the adjusted w . The correction stage is presen ted next. Algorithm 3 BSS-Lasso – Correction stage Let ˆ β (1) b e the selection stage output. Let Q = { q + 1 , ..., 2 q } ,  > 0, 0 < γ < 1. Initialize p enalty v ectors w ( k ) ← 1 , k = 2 , 3. for k = 2 to 3 do i. If max j ∈Q { ˆ β ( k − 1) j } = 0 = ⇒ return ˆ β ( k − 1) ii. Adjust p enalty vector w ( k ) : w ( k ) i − q ← γ , ∀ i ∈ Q | ˆ β ( k − 1) i >  · max j ∈Q { ˆ β ( k − 1) j } iii. Run the Lasso with p enalty w ( k ) : ˆ β ( k ) ← Lasso( y , M , w ( k ) ) end for return ˆ β (3) In Algorithm 3,  is a small num ber that acts as a sensitivit y threshold. Recall from the mo del design in Subsection 3.2 that co ordinates { 1 , ..., q } in β corresp ond to non-reﬂective even t candidates ( { β B b } q b =1 ) and { q + 1 , ..., 2 q } the reﬂective even t candidates ( { β R r } q r =1 ). Therefore, γ ∈ (0 , 1) is deﬁned to reduce the p enalty weigh t on the ﬁrst q elements of β according to the reﬂective selections of the previous iteration, such that w ( k ) has all elements equal to one, except those that represen t faults with non- negligible reﬂective selections, which receiv e a reduced penalty v alue γ . More precisely , if the selection 11 stage outputs any non-zero reﬂective selections, the BSS-Lasso enters the correction stage, where up to t wo more iterations are computed in order to address the previously men tioned shifting phenomenon. F or eac h iteration k , a reduced p enalty vector w ( k ) is created based on the selections of the previous iteration, and the Lasso is executed accordingly . The rationale b ehind the correction stage is that, if a reﬂection was found at a giv en p osition, it generally indicates the presence of a fault at that lo cation. Additionally , if Algorithm 3, due to the reduced penalty , selects at the correct lo cation a fault that was previously shifted, this often causes the algorithm to reject its previous incorrect position because of the ` 1 -norm penalty , th us correcting the shifting phenomenon. F or illustrative purp oses, Fig. 3 depicts the monitoring accuracy of the SincLasso, BSS-Lasso up to the selection stage and complete BSS-Lasso for an 8 km ﬁb er link. Note that, for the reﬂectiv e fault at 4 km, b oth the SincLasso and the BSS-Lasso up to the selection stage incurred in an error of o v er 400 m, while the complete BSS-Lasso presented a 10 m error due to the c orrection stage. Figure 3: Comparison of the normalized selections resulting from SincLasso, BSS-Lasso up to the selection stage and complete BSS-Lasso for an 8 km ﬁb er link with a reﬂective fault at 4 km and a non-reﬂectiv e fault at 8 km. Remark 2 In this work, the pr op ose d algorithm has up to thr e e over al l L asso iter ations. A lthough ad- ditional iter ations might b e justiﬁable to further impr ove the quality of the metho d, ther e is a lack of a c onsistent stopping criterion in or der to avoid cycling. In fact, two over al l iter ations ar e suﬃcient for the existenc e of a c orr e ction stage. However, it was empiric al ly observe d that a thir d one, which acts as a r eﬁnement step, signiﬁc antly impr oves the pr e cision of the metho d. F urthermor e, the b eneﬁts of addi- tional iter ations (mor e than thr e e) wer e observe d to b e mar ginal, non-existent or even ne gative. F or these r e asons, the numb er of iter ations was ﬁxe d at thr e e in Algorithm 3, which r esults in a c omputational ly eﬀe ctive pr o c e dur e. 12 3.4 T reatmen t stage of the BSS-Lasso Ultimately , Algorithm 3 outputs a 2 q -dimensional vector indicating the fault lo cations. How ever, in man y cases, the result is not presented as a straightforw ard handful of singular selections, but rather as a set, or cluster, of even t positions. Although the app earance of suc h clusters ma y ha v e its roots in the limited spatial resolution of the monitoring technique, the Lasso is known to produce clusters whenev er the ov er-complete dictionary v ariables present high levels of correlation [23]. In fact, dep ending on the ﬁb er discretization, adjacent columns of the explanatory matrix M represent close enough p ositions suc h that their induced signals are almost indistinguishable, thus strongly correlated. As a consequence, the selection and correction stages often times output multiple selections around the real fault position as a cluster (see Fig. 4 for an il lustrative example con taining an output from the correction stage and the ﬁnal BSS-Lasso result) and it migh t b e useful to reﬁne the result in order to obtain more accurate selections to b etter assist the netw ork op erator. Figure 4: Time-domain proﬁle in the top panel and BSS-Lasso selections in the b ottom panel for a 5 km ﬁb er link with a reﬂective even t at 3 km and a non-reﬂectiv e even t at 5 km. The reﬂective even t results in a single selection, while the non-reﬂective one induces a cluster of selections. A handful of tec hniques app ears in technical literature to handle correlated explanatory v ariables within the Lasso framework. The commonly used approach p erforms an a priori clustering of the cor- related explanatory v ariables b efore solving the Lasso problem [29]. How ever, such a priori clustering is not suited for the particular application of this work, since all explanatory v ariables are sequentially correlated to its adjacent neighbors with respect to the emplo yed distance grid in matrix M . T o ov ercome this issue, an alternativ e is to narro w down the clusters of selections into single selections after the Lasso pro cessing. F rom a technical p oint-of-view, a cluster of selections represent a degenerate pro jection of y onto the column space of M , i.e, one p ossible w a y to express this pro jection in terms of the column 13 v ectors of M . As a consequence, it is likely that the real fault is among the cluster p ositions and closer to the selections with higher magnitude. Therefore, one can interpret each cluster of selections as a prob- abilit y density for the resp ective true fault p osition. In this con text, several approac hes can b e applied to directly narro w down the clusters of selections. Firstly , since the clusters can be in terpreted as probability densities, a computationally eﬃcient ap- proac h is to estimate the true fault p osition b y applying a w eighted av erage of the positions within the clusters, where the weigh ts are given by the selection magnitudes. Another p ossibility is to consider all p ossible combinations of p ositions with a single selection p er cluster and choose the one which outputs the least square model ﬁtting error (minimal ` 2 norm). This approach can b e implemented b y either a com binatorial set of ordinary least squares computations or via Mixed Integer Quadratic Programming (MIQP) [30, 31]. Despite showing the b est empirical results, in theory , it is aﬀected by the curse of dimensionalit y and can b e computationally burdening, i.e., instances with wide and/or several clusters (e.g., a ﬁb er with n umerous faults) migh t b e intractable in reasonable computational time. Nevertheless, it should be emphasized that, for most practical cases, this approach can b e computed in the order of seconds. Finally , a viable alternativ e to the combinatorial least squares for ﬁb ers with m ultiple even ts is the least absolute error (minimal ` 1 norm), which can b e solved using Mixed In teger Linear Programming (MILP) algorithms. Although still a com binatorial procedure due to their in teger nature, MILP problems are widely recognized to b e more computationally eﬃcient to b e solv ed than MIQP . Due to its b etter empirical performance, in this work, the treatment of the clusters of selections is p erformed using least squares. It should b e highlighted that this choice is more consistent with the prop osed BSS-Lasso metho dology , since the Lasso already uses the ` 2 norm for the mo del ﬁtting (see the ﬁrst term of the ob jective function in (6)). In fact, this choice of ex-p ost cluster treatmen t can be seen as an instance of the Lasso’s original design [17], in which the ` 1 regularization term (second term of the ob jective function in (6)) is replaced b y the semi-norm ` 0 and written as a constraint b ounded b y 1 for each cluster. More precisely , let {C i } C i =1 b e the family of C clusters resultant from the BSS-Lasso and M =  M j } j ∈C i  C i =1 , the explanatory matrix restricted to the position indexes within each cluster. Then, the cluster treatment pro cedure can b e form ulated as the following mathematical programming problem ([30, 31] are referred to for a wider discussion and eﬃcient form ulations for this problem): min β        y − M β   2 2          β i   0 ≤ 1 , ∀ i ∈ C ; β ≥ 0 ;      , (10) where β i = { β j } j ∈C i . Therefore, in this context, the original ` 0 -Lasso is solv ed, but av oiding its high com binatorial nature since the ` 0 searc h is performed in a drastically reduced space when compared to the original problem. The treatment stage of the BSS-Lasso is th us presen ted in Algorithm 4. 14 Algorithm 4 BSS-Lasso – T reatment stage Let ˆ β (3) b e the correction stage output. i ← 1, C i ← {} , C ← i . for j = 1 to q do If ˆ β (3) j > 0 = ⇒ C i ← C i ∪ { j } Else i ← i + 1; C i ← {} ; C ← i end for Construct M =  M j } j ∈C i  C i =1 Solv e ` 0 -Lasso (10) and return its optimal decision vector F or future reference, BSS-Lasso will b e identiﬁed as the sequen tial computing of Algorithms 2 – 4. 4 Exp erimen tal V alidation V alidation of the BSS-Lasso in a real environmen t consists in comparing its estimated even ts p ositions with the reference even ts p ositions determined using a standard OTDR device for diﬀeren t ﬁb ers. Practically , the limitation on the num b er of exp erimentally tested links is determined by the a v ailability of ﬁbers with diﬀeren t lengths in the laboratory and the p ossibility of connecting these ﬁb ers to form links with diﬀeren t num b er of ev en ts and diﬀeren t lengths. Six ﬁb er links were av ailable, for which the proﬁles ha v e b een individually measured; combinations of these six ﬁbers tw o-b y-t wo, to comp ose tw o-even t links, pro duced 30 more examples. Prior to measuring the ﬁb ers, how ever, a few exp erimental parameters concerning the data acquisition and the minimum monitoring signal mo dulation pow er must b e deﬁned. A picture of the exp erimental setup as assembled in the lab oratory for all these tests is presented in Fig. 5. 4.1 Exp erimen tal parameters c haracterization As depicted in the exp erimental setup of Fig. 1 and shown in Fig. 5, the opto electric conv ersion and signal ampliﬁcation is p erformed by a photodetector, which is composed mainly by a photodio de and t wo ampliﬁers. F rom an optical p ersp ective, as long as the bac kscattered optical signal reaching the photo dio de carries p ow er greater than the NEP , the output electrical signal will carry information ab out the ﬁb er. In the same wa y , but from an electrical point of view, the signal reaching the NA must b e greater than its NEP ( N na ), so that the measured frequency proﬁle translates the ﬁb er’s characteristics. Since an imp ortan t c haracteristic of the Baseband Subcarrier Sweep monitoring tec hnique is its co existence with data transmission in a transmitter-embedded conﬁguration, as discussed in [12], it is paramount that a balance betw een monitoring signal p ow er and the capacity of meeting the previously mentioned p ow er requiremen ts, in b oth the photo detector and the NA, is found. In this subsection, this balance will b e analyzed and the exp erimental parameters used throughout the BSS-Lasso v alidation will b e presen ted. 15 Figure 5: Photo of the exp erimental setup. 1) Netw ork Analyzer; 2) Distributed F eedbac k-Laser Dio de; 3) Laser Bias Source; 4) T emp erature Controller Driver; 5) Optical Circulator; 6) Photo detector; 7) Fib er under test. Analysis of the optical signal-to-noise ratio (SNR) requires knowledge of a few parameters, all of whic h are summarized in the upp er part of T able 1. The laser diode’s bias curren t is set at 70 m A, yielding an output optical p ow er of 4 dBm; this allows for a 52 mA excursion inside its linear region. An insertion loss of approximately 1 dB in the optical circulator (p ort 1 → p ort 2) sets the input optical pow er to the ﬁber under test (FUT) to ∼ 3 dBm. Thus, considering a Rayleigh bac kscattered co eﬃcient C of -72 dB/m [4], an attenuation co eﬃcient α of 0.2 dB/km, a 6372 m optical ﬁb er, and another 1 dB of insertion loss in the optical circulator (p ort 2 → p ort 3), the optical p ow er arriving at the photo dio de in the steady-state regime would b e − 33 . 20 dBm (or 477 nW). This v alue has b een exp erimentally veriﬁed to b e − 34 . 02 dBm. The emplo y ed photo detector exhibits a NEP of 200 pW in a full bandwidth condition [32], so a comfortable 33 dB SNR at the photodio de is ensured. Note, how ev er, that this SNR represen ts the total optical p ow er arriving at the photo dio de, including the DC component. As will b e discussed presen tly , a 10% p ortion of the laser’s full modulation depth is o ccupied b y the monitoring signal, so the SNR for the signal of in terest is approximately 23 dB. Electrical signal analysis b egins right after the opto-electrical conv ersion of the photo dio de: although the tw o ampliﬁcation stages of the photo detector exhibit noise ﬁgures of their own and also amplify noise coming from the photo dio de, the optical SNR is suﬃcien tly high so that one can consider the output signal of the photo detector to maintain the same SNR. Thus, taking into account the received optical 16 p o wer of 477 nW and the photo detector parameters (resp onsivity , transimp edance gain, and the 2 nd stage voltage gain) and also the p ortion of the full modulation depth occupied by the monitoring signal, all sho wn in T able 1, it is p ossible to calculate an amplitude of 298.37 mV for the input signal en tering p ort 2 of the NA at lo w frequencies. According to the mathematical mo del and the exp erimental results, the amplitude of the electrical signal has an in v erse dep endence with the frequency and, as the latter increases, the former is exp ected to dramatically decrease. The v alue for the low-frequency measuremen t, ho wev er, has b een exp erimentally measured to b e 303.57 mV, which is far abov e N na ( − 95 dBm [33]), setting the electrical SNR at 47.32 dB and v alidating b oth the optical and electrical signal analysis. T o address the SNR reduction as the frequency increases, whic h can translate into up to 30 dB reduction at the maximum swept frequency , an av eraging pro cess is employ ed. This consists of acquiring sev eral traces inside the same in terv al and taking the arithmetic av erage, a resource already a v ailable in the emplo y ed NA. At the same time, since the frequency-domain proﬁle is determined in a steady-state regime, the intermediate frequency bandwidth of the NA frequency b eat detector is set to a lo w v alue (150 Hz) translating in to a considerable long time-constan t of 6.64 s. This long time constan t translates in to an intrinsic av eraging of the measured amplitude and phase of the input signal, whic h, by itself, diminishes noise contributions to the measurement, esp ecially at higher frequencies, and alleviates the n umber of samples for the av erage pro cedure. Therefore, in order to reach a compromise betw een faster data acquisition and negligible noise contribution, a total of 10 samples has b een set as default to the a veraging pro cedure; the total data acquisition pro cess amoun ts to less than tw o min utes. T able 1: Experimental Parameters for Data Acquisition Exp erimen tal P arameters Optical Fib er input pow er ( P 0 ) 3 dBm Fib er length 6372 m Circulator loss 1 dB Fib er attenuation ( α ) 0.2 dB/km Opto/Electric PD resp onsivity @ 1550 nm ( R ) 1 A/W Electrical PD transimp edance gain 626 V/A PD 2 nd stage voltage gain 1x10 4 V/V Laser Bias Source (LBS) 70 mA NA mo dulation pow er 0 dBm Mo dulation depth ( m ) 10% As mentioned in the previous analysis, the p ortion of the laser’s full mo dulation depth occupied b y the monitoring signal was set to 10% as the balance b etw een minimal monitoring p ow er that allo ws for accurate ﬁber c haracterization. This v alue immediately impacts on the capacity of concurrent data transmission in a direct modulation scheme and exp erimentally characterizes the BSS-Lasso technique in the context of monitoring and data co existence. T o v alidate this parameter, the signal’s amplitude 17 en tering the mo dulation p ort of the laser w as v aried from 50 mVpp to 1.6Vpp, which corresp onds, appro ximately , to 2% and 62% of the laser’s full mo dulation depth, respectively . The BSS-Lasso was tested for eac h signal amplitude in the same ﬁber testb ed (with parameters describ ed in T able 1), with the frequency-domain ﬁb er proﬁle b eing determined with the BSS setup, and errors betw een the real p ositions of even ts and the ones obtained from the tec hnique b eing ev aluated based on the output of the BSS-Lasso. The exp erimental result is presen ted in Fig. 6. Figure 6: Impact of the low-frequency monitoring signal’s amplitude – written in terms of percents of the laser’s full modulation depth – on the BSS-Lasso even t p osition estimation. The 5% mark represents a clear lo w er b ound on the amplitude of the monitoring signal so that reliable results are obtained. The results v alidate the exp erimen tal parameters presen ted in T able 1 and show that, for monitoring signals with amplitude greater than 10% of the modulation depth, position estimates are within a 30 m distance error from the actual p ositions. When the monitoring signal’s amplitude falls b elow 5% of the mo dulation depth, on the other hand, the distance error rapidly increases. This indicates that the BSS-Lasso can b e implemented to contin ually monitor a ﬁb er link while o ccupying only a tenth of the laser’s full mo dulation depth, thus making a v ailable a considerable p ortion of the laser’s electro-optical transfer function for the purpose of data transmission. F urthermore, w e note that the precision plateau attained at 10% of the full mo dulation depth is asso ciated to a limitation of the spatial resolution of the tec hnique, which is related to the maxim um monitoring frequency . All the experimental results presen ted in this paper ha v e b een acquired using the parameters describ ed in T able 1. 4.2 Exp erimen tal results In Fig. 7, the even t position estimation error of the BSS-Lasso is compared with the reference results of a standard OTDR device and presented in the form of a scatter plot. Tw o error thresholds are deﬁned in the scatter; the ﬁrst one delimits a low-error interv al, where the results are deemed to b e very close to the reference, in whic h almost 70% of the results hav e fallen; the second delimits results that are fairly close to the reference p osition, and holds ov er 93% of the total results. A few outliers can b e observ ed with errors ab ov e 100 m. The error distribution, mainly concentrated around the zero error p oin t and 18 within the 0-50 m error interv al attests the prow ess of the BSS-Lasso in characterizing the assem bled exp erimen tal ﬁb er links. Figure 7: Distribution of position estimation errors for all the experimentally measured ﬁb er proﬁles. Apart from a few outliers, the p ositions are within a 100 meters error interv al from the actual real fault p ositions. In teresting observ ations can b e made from the results of Fig. 7. First of all, it is clear that the estimation error tends to b e closer to zero for single-ﬁb er links. As will also be discussed in Section 5, the estimation is hindered by the amount of even ts present in the ﬁb er link. Secondly , it is noticeable that the presence of reﬂective ev en ts induces a better estimation of the even t’s p ositions, as all but one of them fell within a 50 m error ev en with a higher num b er of even ts (tw o non-reﬂective ev ents and one reﬂectiv e ev ent). This characteristic, which will also b e studied in Section 5, indicates that the mo deling of reﬂective ev ents paired with the correction stage increases the robustness of the estimation. Up to this point, accurate determination of the faults positions has b een demonstrated using the BSS- Lasso. How ev er, determination of the fault magnitudes is of utmost imp ortance for full characterization of the optical ﬁb er link, and can b e performed following Eq. (2). Nevertheless, even though the co eﬃcients φ b , determined as a product of the BSS-Lasso, allo w for accurate estimation of the BSS frequency-domain proﬁle, they include the bias naturally induced by the Lasso [17]. Moreo ver, the relationship b etw een the φ b and the actual fault magnitudes ξ b (Eq. (2)) is extremely non-linear, so even tual deviations in the former induce sev erely imprecise results for the latter. T o ensure correct estimation of the fault magnitudes, a r e c onstruction procedure of the time-domain proﬁle of the ﬁb er, that do es not dep end on the amplitudes determined by the BSS-Lasso, is prop osed. It is as follows: (i) utilizing the ev en t p osition estimates, ﬁber link proﬁles are created in the form of Eq. (1) for whic h the resp ectiv e non-reﬂective magnitudes ξ b are sorted within a predetermined in terv al of [0, 5] dB in a sequence of tw o steps, a coarser 0.5 dB, and then a ﬁner 0.1 dB; (ii) the reﬂective 19 magnitudes θ r are sorted within a predetermined interv al of [0, 20] dB in steps of 2 dB; (iii) from the created P ( z ), frequency-domain proﬁles S ( f ) are calculated using Eq. (3) and then compared to the estimated frequency-domain proﬁle given by the BSS-Lasso; (iv) the created S ( f ) are compared with the BSS-Lasso estimated proﬁle using the ` 2 error norm; and (v) the b est reconstructed ﬁb er proﬁle according to this metric is deﬁned as the reconstructed ﬁb er proﬁle. Presented in T able 2, for four exp erimen tal ﬁb er links, are: the real fault magnitudes, calculated using the OTDR proﬁle measured by a standard OTDR device; the magnitudes calculated using Eq. (2) and the BSS-Lasso co eﬃcien ts; and the magnitudes determined through the reconstruction pro cedure. T able 2: F ault Magnitude Comparison Real Calculated Reconstructed Magnitudes [dB] Magnitudes [dB] Magnitudes [dB] 1.9, 23.0 2.77, ∞ 1.6, 22.0 2.0, 20.7 1.79, ∞ 2.0, 22.0 2.9, 17.5 3.53, ∞ 3.0, 22.0 0.9, 22.2 1.92, ∞ 1.0, 22.0 In other words, the reconstruction pro cedure is equiv alen t to the second analysis path presen ted in Section 2.2, but using artiﬁcially created P ( z ) based on the results of the BSS-Lasso. The result, as can be p erceived from T able 2, is a muc h more accurate estimate of the fault magnitudes. Finally , an example of the reconstruction pro cedure for the ﬁrst ﬁb er link of T able 2, i.e., the artiﬁcially generated P ( z ) that translates in to S ( f ) that b est approximates the estimated frequency-domain signal given by the BSS-Lasso, is depicted in Fig. 8. 5 Sim ulation Results T able 3: Selection errors for eac h set of simulated links 1 fault 2 faults 3 faults Error [m] SincLasso BSS-1 BSS- Lasso SincLasso BSS-1 BSS- Lasso SincLasso BSS-1 BSS- Lasso [0, 50] 52.20% 52.20% 89.80% 49.60% 49.45% 81.50% 50.70% 50.77% 77.50% (50, 100] 1.60% 1.60% 1.60% 1.20% 1.45% 2.95% 2.60% 2.87% 4.03% (100, 200] 11.80% 11.90% 7.20% 4.40% 5.85% 5.65% 3.33% 4.93% 4.30% (200, ∞ ) 34.40% 34.30% 1.40% 44.80% 43.25% 9.90% 43.37% 41.43% 14.17% In order to illustrate the robustness of the proposed monitoring metho dology , in this section, extensiv e computational tests are conducted on the BSS-Lasso. T o do so, a large-scale test b ench w as created: a total of three sets of 1000 ﬁb er links each containing, resp ectively , one, t w o, and three faults w ere randomly 20 Figure 8: Time-domain proﬁle reconstruction based on the results of the BSS-Lasso. In the upp er panel, the estimated frequency-domain proﬁle of the BSS-Lasso is plotted against the S ( f ) that corresp onded to the minim um ` 2 error norm during the reconstruction step. In the low er panel, the original OTDR proﬁle acquired with a standard OTDR device is depicted along with the reconstructed proﬁle. T able 4: Con tingency table ( ± 50 m) F ault Present F ault Absent SincLasso BSS-1 BSS-Lasso SincLasso BSS-1 BSS-Lasso F ault F ound T rue Positiv es T rue Positiv es T rue Positiv es F alse Positiv es F alse Positiv es F alse Positiv es 3035 3034 4853 2808 3251 1379 F ault Neglected F alse Negatives F alse Negatives F alse Negatives T rue Negatives T rue Negatives T rue Negatives 2945 2966 1147 2959557 2958128 2957685 Measures Sensitivity Sensitivity Sensitivity Speciﬁcity Sp eciﬁcity Speciﬁcity 50.58% 50.57% 80.88% 99.91% 99.89% 99.95% generated based on the following steps: 1. Fib er lengths are sampled ev enly within L ∈ [2 , 15] km. 2. Given the ﬁb er length sampled in Step 1, the fault lo cations are also sampled evenly within [2 , L ], with a mandatory fault at the end of the ﬁb er. 3. In order to sample reﬂective ev en ts, a 50% chance of ha ving an asso ciated reﬂection is attributed to each fault sampled in Step 2. 4. The magnitudes of the even ts are then randomly c hosen, with faults ranging evenly b etw een [1 , 5] dB and reﬂections up to 20 dB. 21 5. The time-domain proﬁle P ( z ) of the ﬁber link is constructed using Eq. (1) and the parameters sampled in Steps 1–4. 6. The methodology presen ted in Section 2 is utilized to obtain the frequency-domain proﬁle S ( f ) from P ( z ), using Eq. (3) with a set of frequencies ranging within [100, 100000] Hz discretized in 100 Hz. F or repro ducibility purposes, the complete test b ench of ﬁber links used in this section is av ailable in [19]. All tests were conducted in Julia language with an In tel Core i7-490K CPU at 4.00 GHz and 32 GB of RAM memory . The sim ulation parameters used throughout the testb ench were: 10 m length discretiza- tion; reduced penalty factor γ = 0 . 5; and sensitivit y threshold  = 0 . 05. F or the sake of comparison, the p erformance of the BSS-based monitoring technology presen ted in [10], known as SincL asso , w as also ev aluated for the same test b ench. F urthermore, in order to quan tify the con tribution of the correction stage describ ed in Algorithm 3, the results of the BSS-Lasso up to the selection stage (named BSS-1 for presen tation purp oses) are also studied. The test results are presented in t wo separate tables, for more rigorous analysis. In T able 3, the errors are stratiﬁed among four distance in terv als and the resp ective percentage of position estimates within eac h in terv al is ev aluated for the three techniques. T able 4 presen ts the so-called contingency table. The idea is to p erform a binary classiﬁcation of fault/no fault even ts based on the output of the tec hniques. More precisely , in the con tingency table, T rue Positiv es are accounted b y estimations within the lo w-error in terv al, i.e., when the selection error is 50 m or low er. Similarly , F alse Positiv es are estimations outside the lo w-error in terv al. It should b e highlighted that T able 4 w as constructed b y com bining the results of the three sets of ﬁb er links, totaling 6000 ev en ts. The ﬁrst conclusion based on the analysis of both tables is the clear dominance of the BSS-Lasso o ver the other tw o tec hniques; the BSS-Lasso achiev es o ver 80% of fault estimates within the low-error in terv al while the other tec hniques barely reac h 50%. A second observ ation is that the num b er of ev en ts clearly impacts the accuracy of the methodology , as it diminishes from 89.80% for the set of ﬁb er links with a single even t to 77.50% for the set with three even ts. The increase in estimation diﬃculty as the n umber of ev ents gro ws larger is exp ected, as the metho d needs to distinguish more comp onents that are comp ounded in the signal. It is also striking that the selection stage of the BSS-Lasso has p erformance extremely similar to the SincLasso, indicating that the inclusion of the reﬂections in the dictionary of the Lasso is not suﬃcient to accurately detect reﬂectiv e even ts. F urthermore, the near 50% accuracy limit of the SincLasso shares a strong correlation with the deﬁnition of Step 3 in the test b ench creation proto col, since the probability of a reﬂectiv e ev en t w as set to 50%. This result translates the lac k of precision induced by a reﬂection and indicates that the BSS-Lasso not only deals with such reﬂectiv e even ts but uses their presence to optimize the estimation through the correction stage. The BSS-Lasso also excels when analyzing the contingency table, with a n umber of F alse Negatives and F alse Positiv es less than half of those of the other tec hniques. Even though a ﬁner analysis of the 22 sp eciﬁcit y 2 is hindered by the high num b er of p ositions that do not present a fault even t, the results are coheren t with the sparse nature of the problem itself. F urthermore, BSS-Lasso’s precision 3 of 77.87% indicates that a selection do es generally represent a true fault. This is extremely imp ortant from a practical point of view, as sc heduling an in-ﬁeld unit to repair a non-existing fault can b e costly . At the same time, although the sensitivit y 4 measure is roughly 80%, analysis of the ev en ts which were not iden tiﬁed sho ws a t w o-fold b ehavior: either the fault magnitude was low and the BSS-Lasso neglects its presence; or the magnitude of an asso ciated reﬂective ev ent was lo w enough so that it w as not identiﬁed and the shift correction was not emplo yed, th us ensuing an error of ov er 50 m. In the ﬁrst case, whic h corresp onds to the ma jority of errors, the small faults do not en tirely compromise the link’s op eration capacit y . In the second case, though represen ting a more serious defect in fault lo cation, the op erator w ould hav e knowledge of the presence of the fault but with reduced accuracy . 6 Conclusions Manipulation of the detected backscattered Rayleigh signal inside the bandwidth of a frequency sw ept optical sub-carrier propagating into an optical ﬁb er p ermits an eﬃcient lo calization of faults through a F ourier op erator. When the bandwidth is restricted, analysis in the frequency domain can o v ercome the spatial resolution limitation while also inducing a high-dimensional problem. In tro ducing the Lasso as a signal pro cessing tec hnique paired with the BSS framework, a metho dology to consisten tly ev aluate ﬁb er defects, named BSS-Lasso, is designed. The BSS-Lasso presents sev eral desired prop erties in the context of optical ﬁb er link characterization, as corrob orated by the results: (i) high speciﬁcity and sensitivit y for reﬂective and non-reﬂectiv e ev ents; (ii) successful detection of multiple faults within a low-error in terv al; and (iii) low computational burden, ha ving required less than t wo minutes for all links simulated and exp erimentally acquired in this work. Both exp erimental and sim ulation results show the eﬃcacy of the ex-p ost analysis, or correction stage, whic h besides accurately iden tifying the reﬂectiv e ev ents, also makes use of their presence to increase the precision of the fault lo cations. F urthermore, the BSS-Lasso framework allows for the reconstruction of the time-domain proﬁle of ﬁbers and the estimation of the magnitude of faults. In conclusion, the BSS-Lasso allo ws for precise, low-cost, transmitter-em b edded full characterization of optical ﬁb er links in practical computational time. W e highlight sev eral ideas that can b e explored in future works to impro v e the prop osed tec hnique. F rom a data-acquisition p oint of view: (i) the analysis of the impact of the optical carrier’s bandwidth and polarization on the measured frequency-domain proﬁle; (ii) the extension of the frequency sweep range and whether this allo ws for increased estimation accuracy; and (iii) the employmen t of a lo w er NEP detector with the goal of extending the limited dynamic range of the BSS-Lasso. In the signal 2 Speciﬁcity = #T rue Negatives / (#T rue Negatives + #F alse Positiv es) 3 Precision = #T rue Positiv es / (#T rue Positiv es + #F alse Positiv es) 4 Sensitivity = #T rue Positiv es / (#T rue Positiv es + #F alse Negatives) 23 pro cessing part: (i) determining whether the dynamic range is limited by an imp ossibilit y of acquiring signals from distant portions of the ﬁb er or by the sparse c haracteristic of the Lasso, which neglects signal contributions that are to o small; and (ii) identifying a more consisten t metho dology for the cluster analysis step. Finally , the p ossibility of em b edding the softw are into a micro-con trolled unit running Julia allied with a dedicated low-cost complex frequency beat detector w ould allow the creation of an indep enden t measurement device, suc h as a standard OTDR, and is also a viable future developmen t of this work. Ac knowledgemen ts The authors would like to thank Christiano Nascimen to and Breno P erlingeiro for technical supp ort. This study was ﬁnanced in part by the Co ordena¸ c˜ ao de Ap erfei¸ coamen to de Pessoal de N ´ ıvel Sup erior - Brasil (CAPES) - Finance Co de 001. References [1] S. Kumar and M. J. Deen, Fib er optic c ommunic ations: fundamentals and applic ations . Wiley , 2014. [2] P . J. Urban, G. V all-Llosera, E. Medeiros, and S. Dahlfort, “Fib er plant manager: An OTDR-and OTM-based PON monitoring system,” IEEE Communic ations Magazine , vol. 51, no. 2, pp. S9–S15, 2013. [3] M. K. Barnoski, M. D. Rourk e, S. M. Jensen, and R. T. Melville, “Optical time domain reﬂectome- ter,” Applie d Optics , vol. 16, no. 9, pp. 2375–2379, Sept. 1977. [4] D. Derickson, Fib er Optic - Test and Me asur ement , 1st ed. Pren tice Hall, 1998. [5] F. Liu and C. J. Zaro wski, “Even ts in ﬁb er optics given noisy OTDR data—Part I. GSR/MDL metho d,” IEEE T r ansactions on Instrumentation and Me asur ement , v ol. 50, no. 1, pp. 47–58, F eb. 2001. [6] P . Eraerds, M. Legr ´ e, J. Zhang, H. Zbinden, and N. Gisin, “Photon coun ting OTDR: adv antages and limitations,” Journal of Lightwave T e chnolo gy , vol. 28, no. 6, pp. 952–964, Mar. 2010. [7] S. Hangai and Y. T aki, “Detection of faults in short ﬁb er by the phase comp ensated reﬂectometer,” IEEE T r ansactions on Instrumentation and Me asur ement , vol. 39, no. 1, pp. 238–241, F eb. 1990. [8] X. Dong, A. W ang, J. Zhang, H. Han, T. Zhao, X. Liu, and Y. W ang, “Combined attenuation and high-resolution fault measuremen ts using chaos-OTDR,” IEEE Photonics Journal , vol. 7, no. 6, Dec. 2015. 24 [9] G. C. Amaral, J. D. Garcia, L. E. Y. Herrera, G. P . T emp orao, P . J. Urban, and J. P . von der W eid, “Automatic fault detection in WDM-PON with tunable photon counting OTDR,” Journal of Lightwave T e chnolo gy , v ol. 33, no. 24, pp. 5025–5031, Dec. 2015. [10] G. C. Amaral, J. D. Garcia, B. F anzeres, P . J. Urban, and J. P . v on der W eid, “Multiple ﬁb er fault lo cation with lo w-frequency sub-carrier tone sweep,” IEEE Photonics T e chnolo gy L etters , vol. 29, no. 13, pp. 1116–1119, Jul. 2017. [11] F. Calliari, L. E. Y. Herrera, J. P . v on der W eid, and G. C. Amaral, “High-dynamic and high- resolution automatic photon counting OTDR for optical ﬁb er net w ork monitoring,” in Pr o c e e dings of the 6th International Confer enc e on Photonics, Optics and L aser T e chnolo gy - V olume 1: PHO- TOPTICS, , INSTICC. SciT ePress, 2018, pp. 82–90. [12] G. C. Amaral, A. Baldivieso, J. D. Garcia, D. C. Villafani, R. G. Leibel, L. E. Y. Herrera, P . J. Urban, and J. P . von der W eid, “A low-frequency tone sw eep metho d for in-service fault lo cation in sub-carrier multiplexed optical ﬁb er netw orks,” Journal of Lightwave T e chnolo gy , vol. 35, no. 10, pp. 2017–2025, Ma y 2017. [13] A. Pizzinat, P . Chanclou, F. Saliou, and T. Diallo, “Things you should know about fronthaul,” Journal of Lightwave T e chnolo gy , v ol. 33, no. 5, pp. 1077–1083, Mar. 2015. [14] P . J. Urban, G. C. Amaral, and J. P . von der W eid, “Fib er monitoring using a sub-carrier band in a sub-carrier multiplexed radio-ov er-ﬁber transmission system for applications in analog mobile fron thaul,” Journal of Lightwave T e chnolo gy , vol. 34, no. 13, pp. 3118–3125, Jul. 2016. [15] J. Nak ay ama, K. Iizuk a, and J. Nielsen, “Optical ﬁb er fault lo cator b y the step frequency metho d,” Applie d optics , v ol. 26, no. 3, pp. 440–443, 1987. [16] B. K. Natara jan, “Sparse approximate solutions to linear systems,” SIAM Journal on Computing , v ol. 24, no. 2, pp. 227–234, 1995. [17] R. Tibshirani, “Regression shrink age and selection via the Lasso,” Journal of the R oyal Statistic al So ciety. Series B (Metho dolo gic al) , v ol. 58, no. 1, pp. 267–288, 1996. [18] M. P arrilla, J. Ana ya, and C. F ritsc h, “Digital signal pro cessing tec hniques for high accuracy ul- trasonic range measurements,” IEEE T r ansactions on Instrumentation and Me asur ement , vol. 40, no. 4, pp. 759–763, Aug. 1991. [19] R. Saa v edra, P . T ov ar, G. C. Amaral, and B. F anzeres, “Optical ﬁber link test b ench.” [Online]. Av ailable: http://dx.doi.org/10.21227/H2N66K [20] S. Liehr, N. N¨ other, and K. Krebb er, “Incoheren t optical frequency domain reﬂectometry and dis- tributed strain detection in p olymer optical ﬁb ers,” Me asur ement Scienc e and T e chnolo gy , vol. 21, no. 1, pp. 1–4, Nov. 2010. 25 [21] S. G. Pierce, A. MacLean, and B. Culshaw, “Optical frequency-domain reﬂectometry for microb end sensor demo dulation,” Applie d optics , vol. 39, no. 25, 2000. [22] J. P . von der W eid, M. H. Souto, J. D. Garcia, and G. C. Amaral, “Adaptive ﬁlter for automatic iden tiﬁcation of m ultiple faults in a noisy OTDR proﬁle,” Journal of Lightwave T e chnolo gy , vol. 34, no. 14, pp. 3418–3424, Jul. 2016. [23] T. Hastie, R. Tibshirani, and J. F riedman, The elements of statistic al le arning: data mining, infer- enc e, and pr e diction. , 2nd ed. Springer, 2009. [24] F. P eres and F. F ogliatto, “V ariable selection methods in m ultiv ariate statistical process control: A systematic literature review,” Computers & Industrial Engine ering , vol. 115, no. 1, pp. 603–619, Jan. 2018. [25] A. F. M. Smith and D. J. Spiegelhalter, “Bay es factors and c hoice criteria for linear mo dels,” Journal of the R oyal Statistic al So ciety. Series B (Metho dolo gic al) , v ol. 2, no. 42, pp. 213–220, 1980. [26] J. Chen and Z. Chen, “Extended BIC for small-n-large-p sparse GLM,” Statistic a Sinic a , vol. 2, no. 22, pp. 555–574, 2012. [27] J. F riedman, T. Hastie, and R. Tibshirani, “glmnet: Lasso and elastic-net regularized generalized linear mo dels,” R p ackage version , vol. 1, no. 4, 2009. [28] H. Zou, “The adaptiv e lasso and its oracle prop erties,” Journal of the Americ an Statistic al Asso ci- ation , vol. 101, no. 476, pp. 1418–1429, Jan. 2006. [29] P . B ¨ uhlmann, P . R ¨ utimann, S. v an de Geer, and C.-H. Zhang, “Correlated v ariables in regression: Clustering and sparse estimation,” Journal of Statistic al Planning and Infer enc e , vol. 11, no. 143, pp. 1835–1858, No v. 2013. [30] D. Bertsimas, A. King, and R. Mazumder, “Best subset selection via a mo dern optimization lens,” The A nnals of Statistics , vol. 44, no. 2, pp. 813–852, Apr. 2016. [31] D. Bertsimas and A. King, “OR Forum – An algorithmic approach to linear regression,” Op er ations R ese ar ch , vol. 64, no. 1, pp. 2–16, Jan.–F eb. 2016. [32] 10-MHz A djustable Balanc e d Photor e c eivers , New F o cus. [Online]. Av ailable: h ttps://www.newport. com/p/2117- FC [33] E5061B Network A nalyzer , Keysight T ec hnologies. [Online]. Av ailable: h ttps://literature.cdn. k eysight.com/lit web/pdf/5990- 4392EN.p df ?id=1790097 26

Full Optical Fiber Link Characterization with the BSS-Lasso

Original Paper

Comments & Academic Discussion

Leave a Comment

Original Paper

Related Papers

Comments & Academic Discussion

Leave a Comment