Tag archives: speechrecognition


Today we release our first self-hosted Auphonic Speech Recognition Engine using the open-source Whisper model by OpenAI!
With Whisper, you can now integrate automatic speech recognition in 99 languages into your Auphonic audio post-production workflow, without creating an external account and at no extra cost!

Whisper Speech Recognition in Auphonic

Until now, Auphonic users had to choose one of our integrated external service providers (Wit.ai, Google Cloud Speech, Amazon Transcribe, Speechmatics) for speech recognition, so audio files were transferred to an external server, using external computing power that users had to pay for ...

Speechmatics released a new API including an enhanced transcription engine (2h free per month!), which we have now integrated into the Auphonic Web Service.
In this blog post, we also compare the accuracy of all our integrated speech recognition services and present our results.


Automatic speech recognition is most useful to make audio searchable: Even if automatically generated transcripts are not perfect and might be difficult to read (spoken text is very different from written text), they are very valuable if you try to find a specific topic within a one-hour audio file or if you need the exact ...

Until recently, Amazon Transcribe supported speech recognition in English and Spanish only.
Now they have added French, Italian and Portuguese as well, and a few other languages (including German) are in private beta.

Update March 2019:
Amazon Transcribe now supports German and Korean as well.

Screenshot: The Auphonic Audio Inspector on the status page of a finished Multitrack Production including speech recognition.
Full-resolution version: https://auphonic.com/static/screenshots/inspector-mt-closed.png


Amazon Transcribe is integrated as a speech recognition engine within Auphonic and offers accurate transcriptions (compared to other services) at low cost, including keywords / custom ...

Back in late 2016, we introduced Speech Recognition at Auphonic. This allows our users to create transcripts of their recordings, and more usefully, this means podcasts become searchable.
We have now integrated two more speech recognition engines: Amazon Transcribe and Speechmatics. While integrating these services, we also took the opportunity to develop a completely new Transcript Editor:

Screenshot of our Transcript Editor with word confidence highlighting and the edit bar.
Try out the Transcript Editor Examples yourself!


The new Auphonic Transcript Editor is included directly in our HTML transcript output file, displays word confidence values to instantly ...

After an initial private beta phase, we are happy to open the Auphonic automatic speech recognition integration to all of our users!

Our WebVTT-based audio player with search in speech recognition transcripts and exact speaker names.

We built a layer on top of multiple engines to offer affordable speech recognition in over 80 languages. This blog post also includes 3 complete examples in English and German.
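
As a rough illustration of what searching within a speech recognition transcript can look like, the following minimal Python sketch scans the cues of a WebVTT file for a keyword and returns the start time, speaker and text of every matching cue. The sample transcript, the speaker names and the helper functions `parse_cues` and `search_vtt` are made up for this example; this is not the Auphonic player's implementation, and it only handles simple cues with optional `<v Speaker>` voice tags.

```python
import re

# Hypothetical sample transcript in WebVTT format (not real Auphonic output).
SAMPLE_VTT = """WEBVTT

00:00:01.000 --> 00:00:04.500
<v Anna>Welcome to the show, today we talk about loudness.

00:00:04.500 --> 00:00:09.000
<v Ben>Speech recognition makes this episode searchable.

00:00:09.000 --> 00:00:12.250
<v Anna>Let us start with loudness normalization.
"""

# Cue timing line, e.g. "00:00:01.000 --> 00:00:04.500" (hours are optional).
TIMESTAMP = r"(?:\d{2,}:)?\d{2}:\d{2}\.\d{3}"
TIMING = re.compile(rf"^({TIMESTAMP}) --> ({TIMESTAMP})")
VOICE_TAG = re.compile(r"<v ([^>]+)>")   # speaker name in a voice tag
ANY_TAG = re.compile(r"<[^>]+>")         # any markup tag inside the cue text


def parse_cues(vtt_text):
    """Return a list of cues as dicts with start time, speaker and plain text."""
    cues = []
    for block in vtt_text.strip().split("\n\n"):
        lines = block.splitlines()
        for i, line in enumerate(lines):
            match = TIMING.match(line)
            if match:
                # Everything after the timing line is the cue payload.
                payload = " ".join(lines[i + 1:])
                voice = VOICE_TAG.search(payload)
                cues.append({
                    "start": match.group(1),
                    "speaker": voice.group(1) if voice else None,
                    "text": ANY_TAG.sub("", payload).strip(),
                })
                break  # blocks without a timing line (e.g. the header) are skipped
    return cues


def search_vtt(vtt_text, keyword):
    """Case-insensitive keyword search; returns (start, speaker, text) tuples."""
    needle = keyword.lower()
    return [
        (cue["start"], cue["speaker"], cue["text"])
        for cue in parse_cues(vtt_text)
        if needle in cue["text"].lower()
    ]


if __name__ == "__main__":
    for start, speaker, text in search_vtt(SAMPLE_VTT, "loudness"):
        print(start, speaker, text)
```

Because every cue carries its own start time, a player could use the returned timestamps to jump directly to the matching position in the audio.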

Search within Audio and Video

One of the main problems of podcasts, audio and video is search.

Speech recognition is an important step to make audio searchable:
Although automatically generated ...

Podcasts are great, but they have a discovery problem – the technology to change that is available.
Today we release a private beta version of automatic speech recognition integrated in Auphonic.

UPDATE:
Please read our new and updated blog post "Make Podcasts Searchable (Auphonic Speech To Text Public Beta)"!

Automatic Speech Recognition in Auphonic

Until recently, most automatic speech recognition services were either very expensive or of very poor quality. Broadcasting corporations spent a lot of money to generate automatic transcripts to search within audio.
That has changed: there are now a couple of affordable (even free) services available, which can ...