We are addressing the problem of enhancing the precision of transcriptions, particularly when it comes to company names and product references. Our solution involves a dual strategy that utilizes both the Whisper prompt parameter and GPT-4's post-processing capabilities.
Two approaches to correct inaccuracies are:
-
We input a list of correct spellings directly into Whisper's prompt parameter to guide the initial transcription.
-
We utilized GPT-4 to fix misspellings post transcription, again using the same list of correct spellings in the prompt.
These strategies aimed at ensuring precise transcription of unfamilar proper nouns.
Setup
To get started, let's:
- Import the OpenAI Python library (if you don't have it, you'll need to install it with
pip install openai
) - Download the audio file example