What are common tools for speech recognition ? What are the advantages and disadvantages of each?

Spread the Knowledge

There are several ready tools for speech recognition, that one can use to train custom models given the appropriate dataset.

CMU Sphinx : Used more in an academic setting, one of the oldest libraries.

Kaldi – hard to set up, very flexible to use. Typically used by academics.

Deep Speech – Easy to set up, reasonably flexible to use

Google API : Speaker segmentation is supported.


Spread the Knowledge

Leave a Reply

Your email address will not be published. Required fields are marked *