Currently, the text isn’t displayed until after I finish speaking, so sometimes I’m not entirely sure what the system has recognized—especially when I’m speaking at length, I have to go back and review the entire transcript.
The current recognition rate is actually quite high, and while I may not be familiar with the internal design, from a user’s perspective, being able to see the text generated in real time would provide greater peace of mind and allow me to spot recognition errors more quickly.
This is just a personal observation based on my experience, for your reference.
Please authenticate to join the conversation.
In Review
Feature Request
5 months ago

Shao Duong
Get notified by email when there are changes.
In Review
Feature Request
5 months ago

Shao Duong
Get notified by email when there are changes.