FPGA based Speech Separation using IPD Features
DOI: https://doi.org/10.14464/ess.v9i3.562
Abstract
Speaker separation is an established research field that goes back to the cocktail party problem defined in 1953. Methods have been developed and improved for decades, but their computational complexity is rarely considered, and neither is the possibility of using hardware acceleration. This paper therefore addresses the research question of how speaker separation can be realized on embedded systems by exploiting parallelization and intelligent hardware/software partitioning. To this end, a concept is described that uses an FPGA for parallelization to separate a speech signal arriving from an intended direction while providing a constant throughput rate. The implementation results show that the concept is independent of FPGA resources except for BRAM size, which demonstrates both its scalability and its real-time capability.
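The abstract does not spell out how IPD (inter-channel phase difference) features indicate direction, so the following is only a minimal sketch of the general idea, not the authors' FPGA implementation. It assumes two microphones, a far-field source, and a speed of sound of 343 m/s; the function names, the tolerance value, and the sign convention (a positive angle means the right channel lags the left) are all hypothetical choices made for illustration.

```python
import cmath
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value for illustration


def dft(frame):
    """Naive DFT of a real-valued frame; returns bins 0..N/2."""
    n = len(frame)
    return [
        sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        for k in range(n // 2 + 1)
    ]


def wrap(phase):
    """Wrap a phase value into the interval [-pi, pi]."""
    return math.atan2(math.sin(phase), math.cos(phase))


def ipd(left_bins, right_bins):
    """Per-bin phase difference between the left and right channel spectra."""
    return [cmath.phase(l * r.conjugate()) for l, r in zip(left_bins, right_bins)]


def expected_ipd(k, n, fs, mic_distance, angle_rad):
    """IPD that a far-field source at angle_rad would produce in bin k."""
    freq = k * fs / n
    delay = mic_distance * math.sin(angle_rad) / SPEED_OF_SOUND
    return wrap(2 * math.pi * freq * delay)


def bin_matches_direction(observed, expected, tol=0.3):
    """True if the observed IPD is within tol radians of the expected IPD."""
    return abs(wrap(observed - expected)) <= tol


# Example: a 1 kHz tone where the right channel lags the left by one sample.
n, fs, k = 64, 16000, 4
left = [math.cos(2 * math.pi * k * t / n) for t in range(n)]
right = [math.cos(2 * math.pi * k * (t - 1) / n) for t in range(n)]
observed = ipd(dft(left), dft(right))[k]
target = expected_ipd(k, n, fs, SPEED_OF_SOUND / fs, math.pi / 2)
keep_bin = bin_matches_direction(observed, target)
```

Each frequency bin is evaluated independently of the others, which is the kind of structure that lends itself to parallel hardware. Bins with negligible energy carry no reliable phase, so a practical version would also gate on magnitude.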

License
Copyright (c) 2022 André Böhle, Rene Schmidt, Wolfram Hardt

This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright for articles published in this journal is retained by the authors. The content is published under a Creative Commons Licence Attribution 4.0 International (CC BY 4.0). This permits use, distribution, and reproduction in any medium, provided the original work is properly cited, and is otherwise in compliance with the licence.