EFFICIENT MAPPING OF ADVANCED SIGNAL PROCESSING ALGORITHMS ON MULTI-PROCESSOR ARCHITECTURES
Modern microprocessor technology is migrating from simply increasing clock speeds on a single processor to placing multiple processors on a die to increase throughput and power performance in every generation. To utilize the potential of such a system, signal processing algorithms have to be efficiently parallelized so that the load can be distributed evenly among the multiple processing units. In this paper, we study several advanced deterministic and stochastic signal processing algorithms and their computation using multiple processing units. Specifically, we consider two commonly used time-frequency signal representations, the short-time Fourier transform and the Wigner distribution, and we demonstrate their parallelization with low communication overhead. We also consider sequential Monte Carlo estimation techniques such as particle filtering, and we demonstrate that its multiple processor implementation requires large data exchanges and thus a high communication overhead. We propose a modified mapping scheme that reduces this overhead at the expense of a slight loss in accuracy, and we evaluate the performance of the scheme for a state estimation problem with respect to accuracy and scalability.
Summary
This paper analyzes techniques for mapping advanced deterministic and stochastic signal-processing algorithms onto multi-processor architectures, emphasizing parallel implementations with low communication overhead. Readers will learn concrete parallelization strategies for time-frequency representations (STFT and Wigner distribution) and for sequential/adaptive algorithms to improve throughput and scalability on multicore systems.
Key Takeaways
- Identify parallelization patterns for STFT and Wigner distribution that minimize inter-core communication.
- Apply workload partitioning and scheduling heuristics to evenly distribute computation across processors.
- Optimize memory and data-movement to reduce latency for sequential and adaptive DSP algorithms.
- Evaluate scalability and trade-offs between computation, communication, and synchronization for multicore DSP implementations.
Who Should Read This
Advanced DSP engineers, embedded systems architects, and researchers who need to parallelize spectral and adaptive signal-processing algorithms for multicore or many-core platforms to improve throughput and power efficiency.
Still RelevantAdvanced
Related Documents
- A New Approach to Linear Filtering and Prediction Problems TimelessAdvanced
- A Quadrature Signals Tutorial: Complex, But Not Complicated TimelessIntermediate
- An Introduction To Compressive Sampling TimelessIntermediate
- Lecture Notes on Elliptic Filter Design TimelessAdvanced
- Computing FFT Twiddle Factors TimelessAdvanced







