Precise Use of Actual Subtitles #323
Comments
Hi. If I understand correctly, you want the WebUI to update only the timestamps from the transcription and keep your own text? If hallucination is the problem, you can try the VAD (Voice Activity Detection) and BGM Separation filters in the WebUI. They feed cleaner audio to Whisper, and most hallucinations disappear once the noise is removed from the audio.
Yes, I have an original script for the video. The recognition result from Whisper is excellent, but it is not 100% accurate. For example, if the original script is: the result from WebUI comes out as:
So, I have to compare line by line and correct the text.
Another sample. WebUI result: In my case, the audio is music.
Transcribing music is a really good example of using the Background Music Remover filter in the WebUI.
This kind of case seems to be a difficult one. You might consider using a higher
As for the feature itself, I see it as a very specific one. I will implement it if others want it as well!
First of all, I want to say thank you. I have been using this tool a lot and it works very well for me.
In general, when you have a script for recorded videos, movies, or music, there is often a fully accurate script available. However, when using Whisper WebUI to convert speech to text, it often doesn't recognize certain words and sentences perfectly, so manual correction is required.
It's difficult, even for AI or humans, to completely understand dialogue just by listening. Therefore, it would be great if, when there is an original script, we could upload a script file (without timestamps) alongside the audio, and the AI could recognize and synchronize the original subtitles with the correct timing.
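To make the request concrete, here is a rough sketch of how a timestamp-free script could be matched against a Whisper result. It is not how Whisper WebUI works today, and none of these names come from the project: `align_script`, the `(start, end, text)` segment tuples, and the even spreading of a segment's duration over its words are all assumptions made for illustration. It uses only Python's standard `difflib` to match recognized words to script words, then gives each script line the time range of the words it matched.

```python
import re
from difflib import SequenceMatcher


def _words(text):
    """Lowercase a string and split it into bare words (punctuation dropped)."""
    return re.findall(r"[\w']+", text.lower())


def align_script(segments, script_lines):
    """Attach approximate timestamps to script lines.

    segments: list of (start_sec, end_sec, recognized_text) tuples
              (hypothetical format, not the WebUI's internal one).
    script_lines: list of strings from the original script, no timestamps.
    Returns a list of (start, end, line); start and end are None when no
    word of that line could be matched to the recognized text.
    """
    # Estimate a time range per recognized word by spreading each
    # segment's duration evenly over its words (a crude assumption).
    rec_words, rec_times = [], []
    for start, end, text in segments:
        ws = _words(text)
        if not ws:
            continue
        step = (end - start) / len(ws)
        for i, w in enumerate(ws):
            rec_words.append(w)
            rec_times.append((start + i * step, start + (i + 1) * step))

    # Flatten the script into words, remembering which line each came from.
    scr_words, scr_line = [], []
    for idx, line in enumerate(script_lines):
        for w in _words(line):
            scr_words.append(w)
            scr_line.append(idx)

    # Matching blocks come back in increasing order, so the first hit on a
    # line gives its start time and the last hit gives its end time.
    spans = [[None, None] for _ in script_lines]
    matcher = SequenceMatcher(a=rec_words, b=scr_words, autojunk=False)
    for block in matcher.get_matching_blocks():
        for k in range(block.size):
            t0, t1 = rec_times[block.a + k]
            line = scr_line[block.b + k]
            if spans[line][0] is None:
                spans[line][0] = t0
            spans[line][1] = t1
    return [(s, e, line) for (s, e), line in zip(spans, script_lines)]


if __name__ == "__main__":
    # Made-up sample data: the recognizer misheard "world" as "wurld".
    segments = [(0.0, 2.0, "hello wurld how are you"),
                (2.0, 4.0, "i am fine thanks")]
    script = ["Hello world, how are you?", "I am fine, thanks."]
    for start, end, line in align_script(segments, script):
        print(start, end, line)
```

The output keeps the script's exact wording and only borrows timing from the recognition result, which is the behavior I am asking for. A real implementation would probably need true word-level timestamps or a forced-alignment model, because even spreading within a segment is only a rough guess, especially for music.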