Hi, thanks for the project!
I'm decoding on top of NeMo model with and without variable amount of hotwords and get the following runtime:
- No hotwords: 2 sec
- 100 hotwords: 23 sec
- 200 hotwords: 45 sec
Is there a plan on improve this amazing feature run-time using trie or any other solution?
Hi, thanks for the project!
I'm decoding on top of NeMo model with and without variable amount of hotwords and get the following runtime:
Is there a plan on improve this amazing feature run-time using trie or any other solution?