I have been using Parakeet TDT v3 with just 0.6B params and it's insanely fast (feels instant, even on an M1 Air). The accuracy is all I could ask for - I don't see the benefit of a much larger 4B model?
Not knocking your app, but I'm asking because it seems very focused on one model, while others allow the user to pick according to their needs.