A dedicated codec for compression of Gravitational Waves Sound

A dedicated co dec for compression of Gra vitational W a v es Sound Laura Reb ollo-Nei ra Mathemat ics Department Aston Univ er sit y B4 7ET Birmingha m, UK Abstract A dedicated cod ec for compression o f gra vitational wa v es sound with high qualit y reco very is prop osed. The p erformance is tested on the a v ailable set of gravi tational sound signals th at has b een theoretic ally generated at the Massac h usetts Institute of T ec hnology (MIT). The appr oac h is based on a mo del for data reduction rend er in g high qualit y ap p ro ximation of the signals. The reduction o f dimensionalit y is ac hiev ed b y selecting ele m entary co m p onents f r om a redundant set ca lled a dictionary . Comparisons with t h e compression standard MP3 d emons trate the merit of the dedicated tec hniqu e f or compr essing this t yp e of sound. 1 In tro du ction After the celebrated ﬁrst detection of a g ra vitational w av e (GW) on Sept 2015 [1] ﬁve more detections [2 – 6] ha v e conﬁrmed Einstein’s theory of general relativit y . Predictions from the Laser Interferomete r Gra vitat io nal-W a v e Observ atory (LIGO) and Virgo Scien tiﬁc Collab oration assert that w eekly or ev en more often detections can b e expected in the near future. Scien tists en visage the prospect as the b eginning of a new era in astronomy , whereb y the ve ry early Univ erse will b e studied by the sound that w as made in its formation. Theoretically , G Ws are modelled and generated using t ec hnique s of num erical relativit y [7 – 9], from whic h audio represen tation can b e dev elop ed. In particular, the group of Prof. Hughes at MIT has pro duced and made av ailable the gravitational w av es sound (GWS) signals whic h hav e b een used in this study . The nu merically sim ulated signals are supp orted b y a n um b er of publications [11 – 15] and nice ly presen ted on a w ebsite [16 ]. This w ork uses those signals to demonstrate a dedicated sc heme for the compression of GWS with high quality reco very . The proposal falls within the usual transform co ding sch eme. It c o nsists of three main steps: 1) T ransformation of the sound signal. 2) Quan tization of the transformed data. 3) Bit- stream en tropy co ding. Ho w eve r, it diﬀers fro m the tra dit io nal compression t echniq ues from the b eginning. Instead of considering an orthogonal tra nsformation, the ﬁrst step is realized by appro ximating the signal using a redundan t dictionary from where the elemen tary comp onen ts for represen ting the signal are selected t hrough a greedy pursuit strategy . This strategy giv es r ise to what is kno wn as sparse represen tat io n of a signal. In the area o f audio pro cessing a num b er o f diﬀeren t tasks hav e b een sho wn to b eneﬁt b y the sparsit y of a represen tatio n. The relev an t literatur e is extensiv e. As a sample w e refer to [17 – 27]. Here w e focus on comp r ession of G WS signals, whic h do require a dedicated framew ork for their sparse represen tation. In part icular, the dictionary w e use for the prop osed co dec w as introduced in a recen t publicatio n as p oten tially go o d to ac hieve sparse represen tation of G WS b y partitioning the signal [28]. No w it stands as a crucial comp onen t of a co dec for hig h quality p oin t-wise reco very of the compressed signal. 1 Since t he G WS under consideration is in the range of h uman hearing, it is inte resting to realize comparisons with the p opular compression standard MP3. The results demonstrate a signiﬁcant impro v ement in compression p erformance for the equiv alen t quality of the reco v ered signal. More precisely , the signal is required to yield a Signal to Noise R atio (SNR) competitive with MP3 outcomes with respect to b oth, the whole signal and the elemen ts of the signal partition. The b eneﬁts of the prop osed dedicated fo rmat for enco ding GWS go b ey ond the compression p erformance. Certainly , the underlying represen tation of the data g enerates a reduced set whic h con tains information ab out the elemen tary comp onen ts o f the signal. Hence, in addition to r ecov ering the signal from a compressed ﬁle with high quality , one can also r ecov er its reduced represen tatio n in terms of elemen tary comp onen ts. Since the reduced represe ntation giv es rise to an accurate appro ximation of the signal, it can b e of assistance to signal pro cessing tech niques relying on a reduction of dimensionalit y a s a ﬁrst step of further pro cessing. The main goals o f the n umerical tests of Section 3 are: • T o pro duce strong evidence that high quality approximation of tw o diﬀeren tly generated types of GWS can b e achie ved as a sup erp osition o f elemen tary compo nents of diﬀeren t nature. A comp onen t of elemen ts generated b y a discrete v ersion of trigonometric functions, and a comp onen t consisting of pulses of small supp ort. • T o demonstrate that the ab o v e describ ed decomp osition can b e stored in a ﬁle whic h is signif- ican tly smaller than that obtained with the compression standard MP3. The go als are ac hieve d b y testing the metho d on t he av ailable audio r epre sentation of GWs which are group ed, according to their theoretical mo deling, as follo ws [16]: • Extreme mass ratio inspiral (EMRI). Gra vitatio na l w av es pro duced when a relativ ely lig ht compact ob ject orbits around a m uc h hea vier black hole and gra dually deca ys. • Binaries. Emitted during the merger of tw o b o dies of roughly the same mass. The ﬁrst direct detection of a G W b elongs to t his category . 2 Metho d The principal constituen t of the prop osed co dec is the mathematical mo del of the sound signal. As already men tioned, the mo del is realized by selecting elemen tar y comp onen ts of a dedicated dictionary containing atoms of diﬀeren t nature. Th us, the metho d for selecting t he suitable atoms for the appro ximation of a g iv en signal is also relev an t to the mo delling. F or this task w e use the Optimized Orthogonal Matchin g Pursuit (OOMP) metho d [30], whic h is step wise optimal in the sense o f minimizing the norm of the appro ximation error a t eac h iteration step. Throughout the description of the metho ds R and N stand for the sets of real and natural n um b ers, resp ectiv ely . Boldface letters a re used t o indicate Euclidean vec to rs a nd their corresp onding comp onen ts are represen ted using standar d mathematical fonts, e.g., f ∈ R N , N ∈ N is a ve ctor of comp onen ts f ( i ) , i = 1 , . . . , N . The inner pro duct op eration is indicated as h · , ·i . A partition of a signal f ∈ R N is realized by a set of disjoin t pieces f q ∈ R N b , q = 1 , . . . , Q , whic h for simplicity a re assumed to b e all of the same size and suc h that QN b = N , i.e., it holds t ha t f = ˆ J Q q =1 f q , where the concatenation op eration ˆ J is deﬁned as fo llows: f is a v ector in R QN b ha ving comp onen ts f ( i ) = f q ( i − ( q − 1 ) N b ) , i = ( q − 1) N b + 1 , . . . , q N b , q = 1 , . . . , Q . Hereinafter eac h elemen t of the signal partition f q will b e refereed to as a ‘blo c k’. 2 2.1 OOMP app ro ximat ion Giv en a signal f partitioned in to Q blo c ks f q ∈ R N b , q = 1 , . . . , Q , , t he k q -term approx imat io n of eac h blo c k is mo delled b y the sup erposition f k q q = k q X n =1 c q ( n ) d ℓ q n , q = 1 , . . . , Q. (1) The elemen ts d ℓ q n , n = 1 , . . . , k q in (1), called ’atoms’ are selected here from a dedicated dictionary D = { d n ∈ R N b , k d n k = 1 } M n =1 through the OO MP approach [30] whic h, for eac h blo c k q op erates a s follo ws. The algo rithm is initia lize d b y setting: r 0 q = f q , f 0 q = 0, Γ q = ∅ and k q = 0. The ﬁrst atom for the atomic decomp osition of the q -th blo c k is selected as the one corresp onding to the index ℓ q 1 suc h that ℓ q 1 = arg max n =1 ,...,M   h d n , r k q q i   2 . (2) This ﬁrst atom is used to assign w q 1 = b 1 ,q 1 = d ℓ q 1 , calculate r 1 q = f q − d ℓ q 1 h d ℓ q 1 , f q i and itera t e as prescribed b elo w. 1) Upgrade the set Γ q ← Γ q ∪ ℓ k q +1 , increase k q ← k q + 1, and select the index of a new ato m for the appro ximation as ℓ q k q +1 = arg max n =1 ,...,M n / ∈ Γ q |h d n , r k q q i| 2 1 − P k q i =1 |h d n , ˜ w q i i| 2 , with ˜ w q i = w q i k w q i k . (3) 2) Compute the correspo nding new v ector w q k q +1 as w q k q +1 = d q ℓ k q +1 − k q X n =1 w q n k w q n k 2 h w q n , d q ℓ k q +1 i . (4) including, for n umerical accuracy , t he re-orthog o nalizing step: w q k q +1 ← w q k q +1 − k q X n =1 w q n k w q n k 2 h w q n , w q k q +1 i . (5) 3) Upgrade v ectors b k q ,q n as b k q +1 ,q n = b k q ,q n − b k q +1 ,q k q +1 h d q ℓ k q +1 , b k q +1 ,q n i , n = 1 , . . . , k q , b k q +1 ,q k q +1 = w q k q +1 k w q k q +1 k 2 . (6) 4) Calculate r k q +1 q = r k q q − h w q k q +1 , f q i w q k q +1 k w q k q +1 k 2 . (7) 5) If for a give n ρ the condition k r k q +1 q k < ρ has b een met compute the co eﬃcien ts c k q ( n ) = h b k q n , f q i , n = 1 , . . . , k q . Otherwise rep eat steps 1) - 5). 3 Remark 1: F or eac h v alue of q the criterion (3) in the metho d ab o v e giv es the index minimizing the lo cal residual norm k f q − f k q q k with r esp ect to the newly selected atom. Moreo ver, t he approx- imation o f eac h blo c k q is completed at once tot a lly indep enden t of the other blo c ks. In previous publications [28, 31] we hav e show n the b eneﬁt in ranking the blo c ks for their sequen tial step wise appro ximation to minimize the total residual error k f − f K k , with K = P Q q =1 k q . Suc h a strategy is called Optimized Hierarc hical Blo c k Wise OOMP (OHBW- OOMP). How ev er, for the particular application to compression, the quan tization pro cedure plays also a role in the a ppro ximation b y mapping small co eﬃcien ts to zero and quantiz ing the others. Consequen tly , for the set of signals considered in this work the compression results rendered b y b oth metho ds are pra ctically equiv a len t. 2.2 The Dictionary The dictionary used for the approximation consists o f t w o sub-dictionaries of diﬀeren t nature. One of them is a trigonometric dictionary D T , whic h is t he union of the dictionaries D C and D S giv en b elo w. D x C = { w c ( n ) cos π (2 i − 1)( n − 1) 2 M , i = 1 , . . . , N b } M n =1 D x S = { w s ( n ) sin π (2 i − 1)( n ) 2 M , i = 1 , . . . , N b } M n =1 , where w c ( n ) and w s ( n ) , n = 1 , . . . , M are normalization f actors. In the n umerical sim ulatio ns we ha v e considered M = 2 N b and N b = 2048 . The o ther sub-dictionary is constructed by translation of the prototype atoms, p 1 , p 2 and p 3 in Fig. 1. 0 2 4 6 8 10 12 14 16 0 0.2 0.4 0.6 0.8 1 Figure 1: Protot yp e atoms p 1 , p 2 and p 3 , wh ic h generate the dictionaries D P 1 , D P 2 and D P 3 b y sequential translations of one p oin t. Denoting b y D P 1 , D P 2 and D P 3 the dictionaries ar ising b y translations of the a t oms p 1 , p 2 , a nd p 3 , resp ectiv ely , the dictionary D P is built as D P = D P 1 ∪ D P 2 ∪ D P 3 . The whole dictiona r y is then built as D = D T ∪ D P , with D T = D C ∪ D S . This dictionary w as previously in tro duced a s p oten tially suitable for a ppro ximation of GWS [28]. In this study it stands out as b eing essen tial for a c hieving high quality approx imatio n of most o f the signals whic h hav e b een compressed with t he prop osed co dec. 4 2.3 Co ding Strategy Previously to en tropy enco ding the co eﬃcien ts resulting fro m approx imating a signal b y partitioning, the real num b ers need to b e con verted in to in t egers. This op eration is kno wn as quan tizatio n. F or all the n umerical cases w e ha ve ado pt ed a simple uniform quan t izat io n tec hnique. The a bsolute v alue co eﬃcien ts | c q ( n ) | , n = 1 . . . , k q , q = 1 , . . . , Q, are con ve rted to in tegers a s follows : c ∆ q ( n ) = ⌊ | c q ( n ) | ∆ + 1 2 ⌋ , (8) where ⌊ x ⌋ indicates the largest in t eger n um b er smaller or equal to x and ∆ is the quantization parameter. The signs of the co eﬃcien ts, represen ted a s s q , q = 1 , . . . , Q , are enco ded separately using a binary alphab et. F or creating the stream of n um b ers to enco de the signal mo del we pro ceed as in a recen t w ork [2 7]. The indices of the atoms in the atomic decomp ositions of eac h blo c k f q are ﬁrst sorted in ascending order ℓ q i → ˜ ℓ q i , i = 1 , . . . , k q , whic h guaran tees that, for eac h q v alue, ˜ ℓ q i < ˜ ℓ q i +1 , i = 1 , . . . , k q − 1. This order of the indices induces an order in t he unsigned co eﬃcien ts, c ∆ q → ˜ c ∆ q and in the corr esp onding signs s q → ˜ s q . The ordered indices are stored as smaller p ositiv e num b ers by taking diﬀerences b et w een tw o consecutiv e v alues. By deﬁning δ q i = ˜ ℓ q i − ˜ ℓ q i − 1 , i = 2 , . . . , k q the follo w string stores t he indices fo r blo c k q with unique recov ery ˜ ℓ q 1 , δ q 2 , . . . , δ q k q . The n umber ‘0’ is then used t o separate the string corresp onding to diﬀerent blo c ks and entrop y co de a long string, st ind , whic h is built as st ind = [ ˜ ℓ 1 1 , . . . , δ 1 k 1 , 0 , ˜ ℓ 2 1 , . . . , δ 2 k 2 , 0 , · · · , ˜ ℓ k Q 1 , . . . , δ Q k Q ] . (9) The corresp onding quan tized mag nitude of the co eﬃcien ts are concatenated in the strings st cf as follo ws: st cf = [ ˜ c ∆ 1 (1) , . . . , ˜ c ∆ 1 ( k 1 ) , · · · , ˜ c ∆ k Q (1) , . . . , ˜ c ∆ k Q ( k Q )] . (10) Using ‘0’ to store a p ositiv e sign and ‘1’ to store negative one, the signs are placed in the string, st sg as st sg = [ ˜ s 1 (1) , . . . , ˜ s 1 ( k 1 ) , · · · , ˜ s k Q (1) , . . . , ˜ s k Q ( k Q )] . (11) The next enco ding / deco ding sc heme summarizes the ab ov e describ ed pro cedure. Enco ding • Giv en a partition f q ∈ R N b , q = 1 , . . . , Q of a signal, appro ximate eac h blo ck f q b y the a t o mic decomp ositions ( 1 ). • Quan tize, as in (8), the absolute v alue co eﬃcien ts to obtain c ∆ q ( n ) , n = 1 , . . . , k q , q = 1 , . . . , Q . • F or each q , sort the indices ℓ q 1 , . . . , ℓ k q in ascending order to hav e a new order ˜ ℓ q 1 , . . . , ˜ ℓ k q and the re-ordered sets ˜ s q (1) , . . . , ˜ s q ( k q ), and ˜ c ∆ q (1) , . . . , ˜ c ∆ q ( k q ), to create the strings: st ind , as in (9), and st cf , and st sg as in (10) and (11), resp ectiv ely . All these strings are enco ded, separately , using adaptiv e arithmetic co ding. Deco ding • Rev erse the arithmetic co ding to recov er strings st ind , st cf , st sg . • In v ert the quantization step as | ˜ c r q ( n ) | = ∆ ˜ c ∆ q ( n ) . (12) 5 • Reco v er the approximated partition thr o ugh the linear com binatio n f r ,k q q = k q X n =1 ˜ s q ( n ) | ˜ c r q ( n ) | d ˜ ℓ q n . (13) • Assem ble the reco v ered signal as f r = ˆ J Q q =1 f r ,k q q . (14) 3 Results Giv en the appro ximation f r of a signal f , the quality o f suc h a n appro ximation is assessed b y the SNR calculated as SNR = 10 log 10 k f k 2 k f − f r k 2 , (15) where k · k indicates the 2- norm. The lo cal SNR with resp ect to ev ery blo ck in the partition, whic h w e indicate as snr( q ) , q = 1 , . . . Q , is calculated as snr( q ) = 10 log 10 k f q k 2 k f q − f r ,k q q k 2 , (16) where f r ,k q q is the approxim a t ion of the blo c k f q . Both, the mean v alue ( snr) and standard deviation (std) of these lo cal quan tities a r e relev an t to comparison of p oin t-wise qualit y reco ve ry . Accordingly , w e deﬁne snr = 1 Q Q X q =1 snr( q ) , and std = v u u t 1 Q − 1 Q X q =1 (snr( q ) − snr) 2 . (17) In order t o use the SNR and snr as measures of quality fo r comparison, the MP3 signal has to b e optimized in relation to those quantities. This is carried out b y the following op erations [27]. • Shifting: Since MP3 in tro duces a shift with resp ect to the o r iginal signal, to compute the SNR that shift should b e rev ersed. Denoting b y f M the nume rical signal retriev ed from the MP3 ﬁle, the optimal time shift ˆ τ is determined to b e the time shift maximizing t he cross-correlatio n with the o riginal signal, i.e. ˆ τ = arg max τ = − N/ 2 ,...,N / 2 N X n =1 f ( n ) f M ( n + τ ) . (18) • Scaling: In order to neutralize the eﬀect of any m ultiplicativ e and/or a dditiv e constan t whic h could aﬀect the SNR v a lue, we allow for suc h tw o constan ts and adjust them to maximize the SNR as follows: Denoting by ˆ f M the MP3 signal after the shifting o peration, w e consider the linear form a ˆ f M + b and ﬁx the v alues of a and b for whic h k f − ( a ˆ f M + b ) k 2 tak es t he minim um v alue, i.e. a = h f , f M i − ( 1 N P N i =1 ˆ f M ( i ))( 1 N P N j =1 f ( j )) k ˆ f M k 2 − 1 N ( P N i =1 f M ( i )) 2 b = 1 N N X i =1 f ( i ) − a 1 N N X i =1 ˆ f M ( i ) . 6 While the additiv e constan t b is not relev ant, the scaling constan t a pro duces an imp ortant correction whic h signiﬁcan tly increases the SNR. The compression p o w er is determined b y the compression ratio (CR) deﬁned as follows CR = Size of the ﬁle with the signal Size of the compressed ﬁle . (19) In all the n umerical examples the ﬁles of the signals are giv en in W A V format. The compression is realized on a single audio channel. The mean v alue of all the signals is pra ctically zero. As discus sed b elo w, after the shifting and scaling o peration for lo w v alues of CR (e.g. CR=2 a nd CR=4) MP3 reco vers signals of high qualit y with resp ect to b oth measures, t he SNR and the snr. The prop osed co dec, henceforth to b e refereed to as ‘dictionary co dec’ (D C), is set to pro duce a n appro ximation of the orig ina l signal matc hing either the SNR or the snr v alue ac hiev ed by the MP3 signal (whatev er quantit y is the largest one). As seen in the tables b elo w, by matc hing the largest quan tity of either SNR, or snr, the other measure is guaran teed to b e larger than the v alue a ttained b y the MP3 signal. 3.1 Numerical Case I As a ﬁrst numeric a l example w e conside r the audio represen tation of a detected G W, the c hirp gw151226 [29]. This is a short signal, it consists of N = 65536 p oin ts with sampling frequency of 41 kHz. F or CR=2 the p oint-wise qualit y of the signal recov ered from the MP3 ﬁle is excellen t: SNR=74.6 dB and snr = 70 . 2 dB. These v alues should b e appreciated b y taking into account that SNR=74.6 dB corresp onds to a mean square erro r b et w een the approximation a nd the signal of order 10 − 9 . Since in t his case the SNR is greater than the snr the DC is set to pro duce the same v alue of SNR. With this restriction the resulting snr is 7 1.7dB, i.e. larg er than the v a lue yielded b y MP3. The huge diﬀerence b et w een the tw o enco ding pro cedures is the CR. Denoting by CR M the CR with resp ec t to the MP3 ﬁle a nd CR D that of the DC ﬁle, for SNR=74.6 dB one has CR M = 2 and CR D = 2 3! While for larger v alues of CR M the qualit y of the r ecov ered signal decreases, up to CR M = 10 the quality of the MP3 signal is still v ery go o d. T able 1 display s t he v alues of SNR and snr, a s w ell as the corresp onding CRs pro duced b y b oth formats. These a re diﬀeren tiated b y the notation SNR M and snr M , used to indicate the v alues pro duced b y the MP3 format, and SNR D and snr D , used to indicate the v alues pro duced by the DC format. SNR M SNR D snr M std M snr D std D CR M CR D 74.5 dB 74 .5 dB 70.2 dB 6.3 71.7 dB 5.9 2.1 23.2 73.8 dB 73 .8 dB 69.8 dB 6.4 71.1 dB 5.7 4.2 24.9 71.3 dB 71 .3 dB 66.9 dB 6.7 68.4 dB 5.8 8.4 61.2 69.4 dB 69 .4 dB 65.2 dB 6.2 66.4 dB 5.9 10.4 69.3 T able 1: Comparison of CR M and CR D , for the sound represen t a tion of the detected chirp gw151226 . The ﬁrst column giv es the v alues of SNR M pro duced b y the MP3 signal reco vered from ﬁles with CR M as listed in the 7th column. The second column are the identical v alues of SNR D pro duced b y the signal reco vere d from the DC ﬁles with CR D as listed in the last column. The 3rd and 5th columns a r e the v alues of snr M and snr D , resp ectiv ely . The 4th and 6th columns are the corresp onding standard deviations. 7 3.2 Numerical Case I I In this case the group of GWS has b een n umerically simulated at MIT [16]. The GWs b elong to the EMRI category with circular orbit. The signals are organized in tw o la rge subgroups, according to the spin of the larg er black hole: spin 99 . 8% of the maxim um v alue and spin 35 . 94% of the maxim um v alue. Eac h subgroup contains 16 signals, each of which is c har a cteriz ed by t w o angles: the orbital plane and the viewing angle. The signals corresp onding t o spin 99 . 8% are listed in the ﬁrst column T able 2 and T able 3. The signals corresp onding to spin 35 . 9 4 % are listed in the ﬁrst column of T able 4 and T able 5 . The frequency o f these signals is 8kHz and w ere compressed with MP3 a t the lo we st p ossible r ate, CR M = 2, in the ﬁrst instance. The reco ve r ed signals pro duce the v alues of snr M and SNR M giv en in the second and sixth columns of T able 2. Since in this case snr M > SNR M the DC w as set to pro duce snr D = snr M . This guaran tees that SNR D > SNR M for all signals. The actual v alues of snr M v ary with the signals, but for a ll of them the reco v ery is of go o d quality . It corresp onds to mean square errors of order of 10 − 8 . As can b e observ ed in the t a ble, for all the signals the compression p erformance of the DC format is clearly sup erior to MP3. T able 3 displa ys the equiv a len t results but corresp onding to a larger compression ratio of MP3 (CR M = 4). Signal snr M std M snr D std D SNR M SNR D CR M CR D E1 67.9 2.1 67.9 0 .2 65.8 67.8 2 6.8 E2 65.0 2.9 65.0 0 .4 58.9 64.9 2 5.3 E3 63.7 3.3 63.7 0 .5 54.4 63.6 2 5.2 E4 66.9 3.4 66.9 0 .6 55.2 66.7 2 5.6 E5 63.4 1.8 63.4 0 .2 62.4 63.4 2 6.0 E6 62.2 2.9 62.3 0 .2 58.9 62.1 2 5.1 E7 61.6 2.9 61.6 0 .2 57.9 61.6 2 4.9 E8 62.8 2.9 62.8 0 .3 59.4 62.8 2 5.2 E9 65.7 2.2 65.7 0 .3 64.5 65.7 2 6.4 E10 64.1 2.1 64.1 0 .2 62.4 64.1 2 5.6 E11 62.7 1.8 62.7 0 .2 62.2 62.7 2 5.5 E12 63.7 2.2 63.7 0 .2 63.0 63.7 2 5.8 E13 68.5 4.4 68.5 0 .4 64.3 68.5 2 6.3 E14 66.8 2.4 66.8 0 .4 65.8 66.8 2 5.6 E15 63.7 2.7 63.7 0 .6 62.7 63.6 2 5.7 E16 64.5 2.9 64.5 0 .4 62.6 63.7 2 5.9 T able 2: Comparison of CR M and CR D v alues for GWs within the EMRI category for circular orbit and spin 99 . 8% of the maxim um v alue. Eac h signal listed in the ﬁrst column corresp onds to a particular orbital plane and viewing angle. The second column shows t he snr M v alues pro duced b y the MP3 signals reco v ered from a ﬁle corresp onding to CR M = 2. The sixth column sho ws the corresp onding v alues of SNR M . All the other ﬁgures in the ta ble a rise when setting the D C to ach ieve snr D = snr M . 8 Signal snr M std M snr D std D SNR M SNR D CR M CR D E1 60.3 2.4 60.3 0 .3 58.2 60.2 4 9.7 E2 55.5 4.2 55.5 0 .4 47.9 55.4 4 7.9 E3 54.1 4.9 54.1 0 .4 45.5 54.0 4 7.6 E4 57.5 4.0 57.5 0 .6 51.5 57.4 4 8.3 E5 54.9 2.9 54.9 0 .2 52.4 54.8 4 8.8 E6 52.4 4.2 52.4 0 .2 44.9 52.3 4 7.6 E7 52.2 5.0 52.2 0 .2 39.7 52.1 4 7.1 E8 53.6 3.9 53.6 0 .3 43.0 53.6 4 7.5 E9 56.5 2.7 56.5 0 .2 55.3 56.4 4 9.6 E10 54.7 1.8 54.7 0 .2 53.8 54.6 4 8.2 E11 53.2 2.4 53.2 0 .2 51.9 53.2 4 8.1 E12 54.4 2.8 54.4 0 .2 52.0 54.4 4 8.4 E13 59.4 3.4 59.4 0 .4 57.3 59.4 4 8.8 E14 57.4 2.2 57.3 0 .3 56.7 57.4 4 7.7 E15 55.5 2.2 55.5 0 .3 51.2 55.5 4 7.7 E16 55.0 2.6 55.0 0 .5 51.2 55.0 4 8.4 T able 3: Same description as in T able 2 but fo r CR M = 4. T able 4 and T able 5 hav e equiv alent description as T able 2 and 3, resp ectiv ely , but the signals b elong t o the EMRI group with spin 3 5 . 94% of the maxim um v alue. Signal snr M std M snr D std D SNR M SNR D CR M CR D S1 71.8 3.2 71.8 0 .6 68.8 71.7 2 10.2 S2 70.2 2.4 70.2 0 .7 67.9 70.1 2 6.6 S3 69.7 2.2 69.7 0 .5 67.3 69.6 2 6.4 S4 69.5 2.4 69.5 0 .5 67.3 69.4 2 6.4 S5 70.8 2.3 70.8 0 .5 68.9 70.8 2 7.7 S6 68.8 1.9 68.8 0 .4 67.9 68.7 2 5.5 S7 65.0 3.3 65.0 0 .4 61.1 65.0 2 5.4 S8 64.8 1.9 64.8 0 .4 63.9 64.7 2 5.8 S9 69.9 2.5 69.9 0 .3 68.2 69.9 2 5.2 S10 67.1 1.7 67.1 0 .4 66.6 67.2 2 4.6 S11 63.4 2.3 63.4 0 .5 62.3 63.5 2 4.5 S12 61.7 2.0 61.7 0 .2 60.9 61.7 2 4.5 S13 66.0 3.2 66.0 0 .6 62.8 66.1 2 4.9 S14 63.0 7.2 63.0 1 .6 61.6 64.4 2 4.9 S15 59.0 8.4 59.0 1 .3 59.5 60.2 2 4.9 S16 51.8 2.0 51.8 0 .3 51.0 51.9 2 4.7 T able 4: Same description as for T able 2 but the signals b elong to the EMRI group with spin 35 . 94% of the maximum v alue. 9 Signal snr M std M snr D std D SNR M SNR D CR M CR D S1 66.3 3.2 66.3 0 .5 61.9 66.4 4 13.1 S2 62.9 2.7 62.9 0 .5 58.7 62.8 4 9.2 S3 61.4 2.9 61.4 0 .5 57.5 61.2 4 9.3 S4 61.6 2.4 61.6 0 .5 58.6 61.5 4 9.0 S5 64.1 2.9 64.1 0 .5 61.3 64.0 4 10.3 S6 61.1 2.1 61.1 0 .4 59.2 61.0 4 7.9 S7 57.3 2.5 57.3 0 .4 54.8 57.3 4 7.8 S8 56.8 2.5 56.8 0 .4 54.5 56.8 4 8.3 S9 61.3 2.6 61.3 0 .3 59.9 61.2 4 8.0 S10 58.1 2.2 58.1 0 .4 56.7 58.1 4 7.2 S11 54.2 2.6 54.2 0 .5 52.9 54.3 4 7.1 S12 53.8 2.0 53.8 0 .2 53.0 53.8 4 6.8 S13 56.8 2.9 56.8 0 .6 54.4 56.8 4 7.7 S14 55.1 7.4 55.1 1 .2 54.0 56.1 4 7.0 S15 51.0 7.5 51.0 1 .2 51.7 52.0 4 7.0 S16 44.7 2.7 44.7 0 .7 43.1 44.6 4 6.6 T able 5: Same as the description o f T able 4 but for CR M = 4. 3.3 Numerical Case I I I This group of signals has also b een sim ulated at MIT. All the signals b elong to the Binary cat ego ry , for circular syste ms, a nd are diﬀeren tiated by the mass of the larger b o dy . The ﬁrst t wo signals listed in T ables 6 and 7 corresp ond to b o dies of roughly the same mass. The ﬁrst signal (B1) is generated b y binary neutron star s each of 1.5 solar masses. The second signal (B2) is generated by binary blac k ho les , eac h of 2.5 solar masses. Signals B3 a nd B4 are pro duced b y binary blac k holes with mass ra tio 3:1. The signal B3 do es not include spin eﬀects while B4 includes rapid spinning o f b oth b o dies . The minim um p ossible v alues of CR M are giv en in the eigh th column of T able 6. In this case SNR M > snr M for all the signals. Hence, t he D C was set to ac hiev e SNR M = SNR D (second and third columns of T able 6). These v alues pro duced snr D > snr M for all the signals. The remark able p erformance of the DC emerges from the CR D ﬁgures listed in the last column of T able 6. Signal SNR M SNR D snr M std M snr D std D CR M CR D B1 70.2 70.2 66.2 8.5 67.2 8.0 2.7 51 .7 B2 70.3 70.3 65.2 10.4 66.5 9.5 2.6 34.2 B3 59.0 59.0 58.5 2.0 58.8 0.6 2.7 29 .3 B4 55.6 55.6 54.9 2.1 55.2 0.7 2.7 28 .9 T able 6: Comparison of the CR M and CR D v alues for the audible part of the signals in the Binary group. The second column give s the v alues of SNR M pro duced b y the MP3 signal recov ered f rom ﬁles with CR M as listed the 8th column. The 3rd column are the iden tical v alues of SNR D pro duced b y the signal reco vere d from the DC ﬁles with CR D as listed in the last column. The 4th and 6th columns a r e the v alues of snr M and snr D , resp ectiv ely . The 5th and 7th columns are the corresp onding standard deviations. 10 Signal SNR M SNR D snr M std M snr D std D CR M CR D B1 68.8 68.8 65.9 8.6 65.9 7.8 5.4 56 .2 B2 68.6 68.6 65.0 10.5 65.0 9.4 5.2 36.9 B3 53.4 54.1 53.9 2.9 53.9 0.4 5.5 37 .5 B4 49.3 49.6 49.7 3.4 49.7 0.4 5.5 39 .3 T able 7: Same description a s T able 6 but for double v alues of CR M . 4 Discuss ion The results o f T ables 1-7 demonstrate the remark able compression p ow er of the DC, in comparison to MP3 at equiv alen t high quality o f the reco v ered signal. F or all cases the relation CR D = γ CR M (20) holds for v alues of γ v arying from a minim um v a lue γ min = 1 . 7 for signal S12 in T able 5 to a maxim um v alue γ max = 1 9 . 2 for signal B1 in T a ble 6. T able 8 shows the mean v alue ¯ γ in eac h of the ab o ve tables. It also sho ws the corresp onding std as w ell as the v alues of γ min and γ max in eac h of the tables. T able ¯ γ std γ min γ max 1 7 .7 2.3 5.9 11.0 2 2 .8 0.3 2.4 3.4 3 2 .1 0.2 1.8 2.4 4 2 .9 0.7 2.2 5.1 5 2 .2 0.6 1.7 3.2 6 1 3.5 3.9 10.8 19.2 7 7 .7 1.4 6.8 9.7 T able 8: St a tistic of the fa ctor γ (c.f. (20)) for T able 1 -7. The second column corr esp onds to the mean v alue with resp ect to the signals in each table. The third column give s the corresp onding standard deviation. It is w o rth commenting that the comp onen t of the dictionary containing the atoms of small supp ort plays an essen tial role of ac hieving la rge v alues of CR D for high quality recov ery . This is m uc h mor e imp ortant f or the group of EMRI sound. F or example, if the compression of signal E1 in T able 2 is carried out in the same wa y but excluding the sub-dictionary with those ato ms, the v alue of CR D drops from 6.8 to 3.5. Nev ertheless, ev en if for appro ximating E1 at the quality of T able 2 t he p ercen tage of atoms of small supp ort is signiﬁcant (41%) the contribution to the signal appro ximation is minor. The norm of the signa l E1 is 207.6 and t he norm of comp onen t generated b y the ato ms of small supp ort is only 2 .1 . This comp onen t of small norm is needed to ac hiev e the high v alues of snr D and SNR D required in this study . Con trar ily , for compression of low er quality reco v ery these atoms play no role for most signals. A t low er qualit y , ho we ver, the p o we r of the DC increases substan tially . F or example, when compressing with MP3 and CR M = 8 the signal E1 the recov ered signal pro duces snr M = 4 1 . 3 dB. The DC ac hiev es the same v a lue of snr D (and SNR D > SNR M ) for CR D = 53! 11 4.1 Bey ond Compression The success of the DC in pro ducing a small ﬁle stems fro m t he ability of constructing a signal approx - imation of go o d qualit y , but inv o lving less elemen tary comp onen ts than the num b er of samples giving the signal. Let’s supp ose that to represen t at the desired qualit y the blo c k f q in the signal partition one needs k q dictionary atoms, and let’s normalize these v alues ˜ k q = k q / P Q q =1 k q for comparison purp oses. As sho wn in Fig. 2 the num b ers ˜ k q , q = 1 , . . . , Q render meaningful informat io n ab out the signal in ternal v ariations ov er time. In the low er graphs of Fig. 2 eac h of these v alues is lo cated in the horizontal axis at the cen t er of the corresp onding blo c k (of size N b = 2 0 48) and provides a condensed digital summary of the sound. The left upp er graph is the spectrogr a ms of the signal E1. The lines in the low er left graph join the v alues ˜ k q , q = 1 , . . . , 225 resulting when appro ximating this signal at t hree diﬀeren t qualities: snr D = 65, 60 and 55 dB. As observ ed in the graph, the three lines are v ery close to eac h other and accoun t for the no de in the sp ectrogram whic h o ccurs a t around 55 secs. The right upp er graph of Fig.2 is the sp ectrogram of the signal S1. This signal diﬀers fro m the signal E1 only in the spin (35 . 9 4% of the ma ximum v alue). Also in this case, the three lines connecting the v alues ˜ k q , q = 1 , . . . , 66 for approximations of snr D = 65, 60, and 55 dB are close t o eac h other and accoun t for the main feature o f the sp ectrogram, whic h is c ha r a cteriz ed b y a rise of frequency to wards the end. It is a n intere sting feature of the digital summary , giv en b y the p oints in the low er g r a phs, the small cardinalit y in relation to the length of the signal: 2 25 p oin ts for the signal E1 giv en b y N = 4608 0 0 samples a nd 66 p oin ts for the signal S1 given by N = 1 3 5168 samples. 0 10 20 30 40 50 0 0.006 0.012 0.018 0.024 Time (sec) 0 2 4 6 8 10 12 14 16 0 0.04 0.08 0.12 0.16 Time (sec) Figure 2: The up per graph s are the sp ectrograms of the signals E1 (left) and S1 (right). The lines in the low er graphs join the v alues ˜ k q = 1 , . . . , Q resulting wh en appro ximating these signals at thr ee d iﬀeren t qualities: snr D = 65, 60, and 55 dB. In the left graph eac h of the thr ee lines joins Q = 225 p oin ts. In the righ t graph Q = 66 p oin ts. 12 5 Conclus ions A dedicated co dec for compression o f gravitational sound with high quality reco v ery has b een pro - p osed. The p ow er o f the format t o store this type of sound stems from the mathematical mo del for represen t ing t he signal. The mo del is based on recursiv e selection of elemen tary comp onen ts appro x- imating the signal with accuracy . The comp onen ts, called atoms, are c hosen from a redundant set, called a dictionary , whic h is the union of t wo sub-dictionaries consisting of atoms of diﬀeren t nature. One sub-dictionary con tains trigonometric a toms. The other sub-dictionary con tains pulses of small supp ort. While the con tribution o f the atoms of small supp ort to the signal approximation is minor, they are necessary to obtain approx imatio ns of high accuracy . A t lo wer quality recov ery (less than 50 dB) this sub-dictionary could b e av oided and the compression p o w er of the prop osed co dec would b e ev en more p oten t. Nonetheless, the study rep orted here fo cuses on compression with high qualit y p oin t-wise reco very . The prop osal was tested on t he sound represen tation of the detected short c hirp gw151226 , and on a set of longer signals numerically sim ulated at MIT. Comparisons with the compression standard MP3 resulted in a signiﬁcant incremen t of compression p ow er for equiv a len t quality o f the reco v ered signal. As a b ypro duct, the co dec generates a condensed digital summary of the sound whic h could b e of assistance for iden tiﬁcation ta sk s. Note: All the results in this pap er can b e repro duced using the MA TLAB softw ar e whic h has b een made av ailable on [32]. Ac kno wle dgmen t Thanks are due to Karl Skretting for making av aila ble the Arith06 f unction [33] for arithmetic co ding, whic h has b een used at the en tropy co ding step. The author is grateful to Prof Hughes and the p eople from his group who part icipated in the sim ulation of the signals used in this study . References [1] B. P . Abb ott et al. (LIGO Scien tiﬁc Collab oration and Virgo Collab oration), “Observ ation of Gra vitat io nal W av es from a Bina r y Black Hole Merger”, Phys. R e v. L ett. 116 , 061 1 02 (201 6 ), DOI: 10.1103/Phy sRevLett.116.06 1102 [2] B. P . Abb ott et al. (LIG O Scien tiﬁc Collab oratio n and Virgo Collab oratio n), “ GW151226: Observ ation of Gravitational W av es from a 22-So lar-Mass Binar y Blac k Hole Coalescence” , Phys. R ev. L ett. , 116 , 24110 03 (2016) D OI 10.1103 /Ph ysRevLett.116.241 103. [3] B. P . Abb ott et al. (LIG O Scien tiﬁc Collab oratio n and Virgo Collab oratio n), “ GW170104: Observ ation of a 50- So lar-Mass Binary Blac k Hole Coalescence at Redshift 0 .2”, Phys. R ev. L ett. , 118 , 221101 (2017) DOI:10.1103/ Ph ysRevLett.118.2211 01. [4] B. P . Abb ott et al. (LIGO Scientiﬁc Collab oration and Virgo Colla b oratio n),“GW170814: A three-detector observ at ion of gravitational wa ve s from a binary blac k hole coalescence”, Phys. R ev. L ett. , 119 , 1411 01 (2017) DOI:10.110 3/Ph ysRevLett.119.14 1101. [5] B. P . Abb ott et al. (LIG O Scien tiﬁc Collab oratio n and Virgo Collab oratio n), “ GW170608: Observ ation of a 19- solar-mass Binary Blac k Hole Coalescenc e”, The Astr ophysic al Journal L etters , 851, 2 (2017). 13 [6] B. P . Abb ott et al. (LIGO Scien tiﬁc Collab oration and Virgo Colla bora t io n),“GW170817: Ob- serv ation of Gravitational W a ve s from a Binary Neutron Star Inspiral”, Phys. R ev . L ett. , 119, 161101 (2017). D OI:10.1103/Ph ysRevLett.119.16110 1. [7] F. Pretorius, “Ev olution of Binary Blac k-Hole Spacetimes”. Ph ys. Rev. L ett. 95 121101 (2015). doi:10.1103/ Phys R evLett.95.1 21101. [8] M. Campanelli, C. O. Lousto, P . Marronetti, Y. Zlo c ho wer, “Accurate Ev olutions of Orbiting Black - Hole Binaries without Excision”, Ph ys. R ev . Lett. 96 11110 1 ( 2006) doi:10.1103/ Phys R evLett.96.1 11101. [9] J. G . Bake r, J. Cen trella, D- Il Choi, M. Koppitz, J. v an Meter, ( 2 006). “ Gra vitational-W a ve Extraction from an Inspiraling Conﬁguration of Merging Black Holes”. Phys. R ev. L ett., 96 111102. doi:10.1103/Phys RevLett.96.11 1102. [10] L. Blanchet, Liv ing R ev. R elativ . (2014) 17: 2 . h t t ps://do i.org/10.12942/lrr- 2014-2 [11] S. A. Hughes, “The ev o lution of circular, non-equatorial orbits of Kerr bla ck ho les due to gra vitatio nal-w a v e emission”, Phys. R ev. D , 61 , 084004 ( 2000). [12] S. A. Hughes, “Ev olution of circular, non-equatorial orbits of Kerr black ho les due to gra vitatio nal-w a v e emission: I I. Inspiral tra jec to ries a nd gravitational w av eforms”, Phys. R ev. D , 64 , 0640 04 (2001 ) . [13] K. G lampedakis, S. A. Hughes, D. Kenneﬁc k, “Approximating the inspiral of test b o dies into Kerr black ho les ” , Phys. R ev. D , 66 , 064005 (200 2 ). [14] S. A. Hughes, S. Drasco, E. E. Flanagan, J. F ranklin, “Gra vitat io nal radiatio n reaction and inspiral w a veforms in the adiabatic limit, Phys. R ev. L ett. 94 , 221101 (2005). [15] S. D rasco and S. A. Hughes, “Gra vitationa l w a ve snapshots of generic extreme mass ratio inspi- rals”, Ph ys. R ev. D , 73 , ( 2 006). [16] http://gmu nu.mit.edu/sounds [17] E. Rav elli, G. Ric hard and L. D a udet, “Union of MDCT bases for a udio co ding”, IEEE T r ans- actions on Audio, Sp e e ch, and L anguage Pr o c e ssing , 1361– 1372 (2008). [18] H. Lee, P . Pham, Y. Largman, and A Y Ng, “Unsup ervise d feature learning for audio classiﬁcation using conv olutional deep b elief net w orks”, Adv a nces in Neural Information Pro cessing Systems 22, 1096–1104 (2 0 09). [19] M. D. Plum bley , T. Blumensath, L. Daudet, R. Grib onv al, and M. E. Davie s, “Sparse repre- sen tations in audio and music: F rom co ding to source separation,” Pr o c . IEEE, 98, 995–10 05 (2010). [20] J. Nam, J. Herrera, M. Slaney , and J. Smith, “Learning sparse feature represen tatio ns fo r music annotation and retriev al,” in Pro ceedings of the 13 International So ciet y for Music Information Retriev al Conference, 565–58 70 (2012). [21] H. Henaﬀ , K. Jarrett , K. K avuk cuoglu , and Y. LeCun , “ Uns up ervised learning of sparse features fo r scalable audio classiﬁcation, in Pro ceedings of the 15 t h In ternatio na l So ciet y fo r Music Information Retriev al Conference (2014), pp. 7782. 14 [22] L. F. Y u , L. Su , and Y. H. Y ang , “Sparse cepstral co des and p o w er scale for instrumen t iden tiﬁcation”, in Pro ceedings of IEEE In ternational Conference on Acoustics, Sp eec h and Signal Pro cess ing (2014) , pp. 7460–7464 . [23] Y. V aizman, B. McF ee, and G. Lanc kriet, “Co deb o ok-based audio feature represen tation for m usic informa t io n retriev al”, IEEE/ACM T r ansactions on A udio , S p e e ch an d L ang uage Pr o- c essi ng , 22 , 1483–1493 (2 014). [24] M. P a n teli, E. Benetos and S. Dixon (2016), Learning a F eature Space for W orld Music Style Similarit y . 17th International So ciet y for Music Information Retriev al Conference (ISMIR 2016), pp 53 8 -544). [25] Y. Ha n, S. Lee, J. Nam, and K. Lee, “Sparse feature learning fo r instrumen t iden tiﬁcation: Eﬀects of sampling a nd p o oling metho ds”, The Journal o f the A c oustic al So ci ety of Americ a 139 , 22 90–2298 (201 6). [26] L. Reb ollo-Neira and G . Aggar w al, “A dedicated greedy pursuit algorithm for sparse sp ectral mo delling of m usic sound”, Journal of The A c o ust ic S o ciety of Americ a , 140, 2933–2943 (2 016). [27] L. Reb ollo-Neira and I. Sanche s, , “Simple sche me for compressing sparse represen tation of melo dic mus ic”, Ele ctr onics L etters (201 7 ) DOI:10.10 4 9/el.2017.3908 [28] L. Reb ollo-Neira and A. Plastino, “Sparse represen tatio n of Gr avitational Sound”, Journal of Sound and Vibr a tion , 417 , 306–314 (2017) DOI:10.10 1 6/j.jsv.2017.12.007 [29] http://www .ligo.org/multimedia [30] L. Reb ollo-Neira and D . Lo w e, “Optimized orthogo nal matc hing pursuit approach ” , IEEE Signal Pr o c ess. L etters , 9 , 13 7–140 (2002) . [31] L. Reb ollo-Neira, “Co op erativ e greedy pursuit strategies for sparse signal represen tatio n by partitioning”, Signal Pr o c e ssing, 125 , 365–375 (2016). [32] http://www .nonlinear-approx.info/examples/node010.html [33] http://www .ux.uis.no/ ∼ karlsk/proj99 15

A dedicated codec for compression of Gravitational Waves Sound

Original Paper

Comments & Academic Discussion

Leave a Comment

Original Paper

Related Papers

Comments & Academic Discussion

Leave a Comment