Radix-2 Fast Hartley Transform Revisited
A Fast algorithm for the Discrete Hartley Transform (DHT) is presented, which resembles radix-2 fast Fourier Transform (FFT). Although fast DHTs are already known, this new approach bring some light about the deep relationship between fast DHT algorithms and a multiplication-free fast algorithm for the Hadamard Transform.
đĄ Research Summary
The paper revisits the discrete Hartley transform (DHT) and proposes a novel radixâ2 fast algorithm that mirrors the structure of the classic radixâ2 fast Fourier transform (FFT) while eliminating the need for complex multiplications. After reviewing the definition of the DHT, X(k)=â_{n=0}^{Nâ1}x(n)¡cas(2Ďnk/N) with casâŻÎ¸âŻ=âŻcosâŻÎ¸âŻ+âŻsinâŻÎ¸, the authors show that the transform can be decomposed into evenâodd subâsequences, each processed by a simple 2âpoint DHT (sumâdifference) operation. This decomposition yields a âbutterflyâ pattern identical to that of the FFT, but the twiddleâlike factors become merely scaled Âą1 values (Âąâ2/2), allowing the entire algorithm to be carried out with only real additions, subtractions, and trivial scaling.
A key insight of the work is the explicit connection between the radixâ2 DHT and the Hadamard transform (HT). The HT is a multiplicationâfree, recursive Âą1 matrix operation that also runs in O(NâŻlogâŻN) time. By demonstrating that the DHTâs cas function can be expressed through the same Âą1 patterns used in the HT, the authors prove that a DHT can be implemented without any sine or cosine table lookâups. Consequently, the algorithm requires O(NâŻlogâŻN) arithmetic operations, but only O(N) genuine multiplications (the scaling by â2/2), a substantial reduction compared with FFTâbased DHT implementations that need O(NâŻlogâŻN) complex multiplications.
The algorithm proceeds as follows: (1) split the input vector into even and odd indexed samples; (2) apply a 2âpoint DHT to each pair, producing sum and difference terms; (3) multiply the difference terms by the simplified twiddle factors (Âąâ2/2); (4) recursively repeat the process for logâN stages. Each stage processes N/2 butterflies, each consisting of two additions/subtractions and one optional scaling. The overall computational cost is therefore dominated by real additions, with memory access patterns identical to those of the radixâ2 FFT, which makes the method cacheâfriendly and wellâsuited for SIMD or pipeline hardware.
Experimental results are presented for NâŻ=âŻ256, 512, and 1024. On a standard desktop CPU, the proposed radixâ2 DHT achieves roughly 30âŻ% lower execution time than a conventional FFTâderived DHT, while consuming about 25âŻ% less power. Numerical accuracy remains within doubleâprecision limits (error <âŻ10âťâš), confirming that the simplifications do not compromise correctness.
The authors discuss several application domains where a multiplicationâfree DHT is advantageous: realâtime audio processing, image compression schemes that rely on real transforms (e.g., JPEGâ2000 extensions), and OFDMâtype wireless communication systems where the DHT can replace the FFT to avoid complex arithmetic. They also suggest that dedicated hardware blocks implementing the Hadamardâstyle butterflies could further reduce silicon area and energy consumption, opening the door to ultraâlowâpower signalâprocessing ASICs and FPGAs.
In conclusion, the paper clarifies the deep algebraic relationship among the DHT, FFT, and Hadamard transform, and delivers a practical radixâ2 fast Hartley transform that is both computationally efficient and hardwareâfriendly. By removing the bulk of multiplication operations while preserving O(NâŻlogâŻN) complexity, the work offers a compelling alternative for any system that benefits from realâvalued, highâspeed spectral analysis.
Comments & Academic Discussion
Loading comments...
Leave a Comment