Stuff that needs to be fixed in PocketSphinx
- The "main page" part of the API documentation needs to be written.
- Frame reporting in hypotheses is broken
  - Needs to report input frame index, not dynamic frame index
- Fixed-point support for continuous density models
  This will happen by way of the generic GMM support in SphinxBase. Mini-TODO:
  - <gau_mix.h> needs to be implemented
  - Figure out how fixed-point is going to be handled here
- Decoder initialization is weird and bogus
  - Don't initialize the FSG or the LM search unless they're actually being used
  - Put all decoder state in the same place and merge the kb, uttproc, dict, etc. APIs
  - Begin the long and arduous task of making the decoder re-entrant.
- Fixed-size buffers are everywhere
  - Use a blockarray-list type scheme for the backpointer table
  - Use a growable array for acoustic features
  - Make the effective maximum utterance length 32767 frames (using int16 as frame counter). That is roughly 5.5 minutes of audio, assuming 100 frames per second. Long enough for anyone?
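
A minimal sketch of the "growable array for acoustic features" idea, capped at the int16 frame limit mentioned above. Everything here is hypothetical: featbuf_t, featbuf_init, featbuf_append, featbuf_free and MAX_UTT_FRAMES are illustrative names, not existing PocketSphinx or SphinxBase APIs, and the doubling growth policy is only one possible choice.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Assumed cap: the largest frame count an int16 counter can hold. */
#define MAX_UTT_FRAMES INT16_MAX /* 32767 */

/* Hypothetical growable buffer of feature frames (not a PocketSphinx type). */
typedef struct {
    float  *data;     /* n_alloc * dim floats, stored frame by frame */
    int16_t n_frames; /* frames stored so far */
    int32_t n_alloc;  /* frames currently allocated */
    int32_t dim;      /* floats per frame */
} featbuf_t;

/* Set up an empty buffer. Returns 0 on success, -1 on a bad dimension. */
static int featbuf_init(featbuf_t *fb, int32_t dim)
{
    assert(fb != NULL);
    if (dim <= 0)
        return -1;
    fb->data = NULL;
    fb->n_frames = 0;
    fb->n_alloc = 0;
    fb->dim = dim;
    return 0;
}

/* Append one frame, doubling the capacity when the buffer is full.
 * Returns the index of the new frame, or -1 if the frame cap has been
 * reached or the allocation fails. */
static int featbuf_append(featbuf_t *fb, const float *frame)
{
    assert(fb != NULL && frame != NULL);
    if (fb->n_frames == MAX_UTT_FRAMES)
        return -1;
    if (fb->n_frames == fb->n_alloc) {
        int32_t new_alloc = fb->n_alloc ? fb->n_alloc * 2 : 64;
        float *p;

        if (new_alloc > MAX_UTT_FRAMES)
            new_alloc = MAX_UTT_FRAMES;
        p = realloc(fb->data,
                    (size_t)new_alloc * (size_t)fb->dim * sizeof(*p));
        if (p == NULL)
            return -1; /* the old buffer is still valid */
        fb->data = p;
        fb->n_alloc = new_alloc;
    }
    memcpy(fb->data + (size_t)fb->n_frames * (size_t)fb->dim,
           frame, (size_t)fb->dim * sizeof(*frame));
    return fb->n_frames++;
}

/* Release the storage and reset the counters. */
static void featbuf_free(featbuf_t *fb)
{
    assert(fb != NULL);
    free(fb->data);
    fb->data = NULL;
    fb->n_frames = 0;
    fb->n_alloc = 0;
}
```

Doubling keeps the amortized cost of an append constant, and clamping the capacity at MAX_UTT_FRAMES means the int16 counter can never overflow. A block-list layout, as suggested for the backpointer table, would avoid the copy that realloc may perform, at the price of slightly more complex indexing.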
Stuff that would be nice to fix in the future
- Make the dictionary and lextree dynamically growable