Video Demo: Real-time speech recognition on VisionFive2 with next-gen Kaldi

csukuangfj · June 8, 2023, 11:06am

I would like to share the news that we have just managed to run sherpa-ncnn, a subproject of next-gen Kaldi, on VisionFive2 for real-time speech recognition with a USB microphone.

You can find the documentation at
https://k2-fsa.github.io/sherpa/ncnn/examples/vision-five-2.html

Everything is open-source: the code, the model, the data, and the documentation.

The video demo is available at

Chloe · June 9, 2023, 1:31am

Good news!

starlord · June 9, 2023, 5:50pm

Have you tried the same thing on the Raspberry pi 4? If so, how do they compare?

csukuangfj · June 11, 2023, 1:41am

I have tried it on a Raspberry Pi 4 Model B. It is faster than VisionFive2 and can also run a larger model in real time.

VisionFive2 can only run a smaller model in real time.

By real-time, I mean the real-time factor (RTF) is less than 1.
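
For anyone unfamiliar with the term: RTF is usually defined as the time spent processing divided by the duration of the audio processed, so a value below 1 means recognition keeps up with the incoming speech. Here is a minimal sketch of that check. The function names and the numbers in the usage note are made up for illustration; they are not part of sherpa-ncnn and are not measurements from either board.

```python
def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """Return RTF = processing time / audio duration (illustrative helper)."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


def is_real_time(processing_seconds: float, audio_seconds: float) -> bool:
    """True when decoding is faster than the audio plays, i.e. RTF < 1."""
    return real_time_factor(processing_seconds, audio_seconds) < 1.0
```

With hypothetical numbers, decoding 10 seconds of audio in 5 seconds gives `real_time_factor(5.0, 10.0) == 0.5`, which counts as real time. Taking 12 seconds for the same audio gives an RTF above 1, which does not.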