FPGA based Speech Separation using IPD Features

André Böhle; Rene Schmidt; Wolfram Hardt

doi:10.14464/ess.v9i3.562

Authors

André Böhle, Chemnitz University of Technology
Rene Schmidt, Chemnitz University of Technology
Wolfram Hardt, Chemnitz University of Technology

DOI:

https://doi.org/10.14464/ess.v9i3.562

Abstract

Speaker separation is an established research field that goes back to the cocktail party problem, defined in 1953. Methods have been developed and improved for decades, but their computational complexity is rarely considered, and neither is the possibility of using hardware acceleration. This paper therefore addresses the research question of how speaker separation can be realized on embedded systems by exploiting parallelization and intelligent hardware/software partitioning. To this end, it describes a concept that uses an FPGA for parallelization to separate a speech signal arriving from an intended direction while providing a constant throughput rate. The implementation results show the independence of FPGA resources, with the exception of BRAM size, demonstrating both the scalability of the concept and its real-time capability.

Published

2022-12-05

Issue

Vol. 9 No. 3 (2022): International Symposium on Computer Science and Educational Technology

Section

Articles

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright for articles published in this journal is retained by the authors. The content is published under a Creative Commons Licence Attribution 4.0 International (CC BY 4.0). This permits use, distribution, and reproduction in any medium, provided the original work is properly cited, and is otherwise in compliance with the licence.
