{"id":2207,"date":"2023-05-25T05:15:13","date_gmt":"2023-05-25T05:15:13","guid":{"rendered":"https:\/\/ewebtoolz.com\/blog\/the-9-best-speech-to-text-apps-in-2023-tried-tested\/"},"modified":"2023-05-25T05:15:13","modified_gmt":"2023-05-25T05:15:13","slug":"the-9-best-speech-to-text-apps-in-2023-tried-tested","status":"publish","type":"post","link":"https:\/\/ewebtoolz.com\/blog\/the-9-best-speech-to-text-apps-in-2023-tried-tested\/","title":{"rendered":"The 9 Best Speech-to-Text Apps in 2023 (Tried &#038; Tested)"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div id=\"\">\n<p>Most speech recognition apps have no trouble transcribing a native speaker being recorded with a pro microphone in a quiet room. This isn\u2019t a challenge.<\/p>\n<p>So to test them more thoroughly, I created a nightmare recording of two non-native speakers with loud city background noise.<\/p>\n<p> <iframe width=\"100%\" height=\"166\" scrolling=\"no\" frameborder=\"no\" allow=\"autoplay\" src=\"https:\/\/w.soundcloud.com\/player\/?url=https%3A\/\/api.soundcloud.com\/tracks\/1519321681&amp;color=%23ff5500&amp;auto_play=false&amp;hide_related=false&amp;show_comments=true&amp;show_user=true&amp;show_reposts=false&amp;show_teaser=true\"><\/iframe><\/p>\n<p>How did they\u00a0fare?<\/p>\n<p>Let\u2019s find\u00a0out.<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image9-11.png\" alt=\"Otter.ai homepage&#10;\" class=\"wp-image-160021\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image9-11.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image9-11-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image9-11-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image9-11-1536x864.png 1536w\" sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/noscript><img decoding=\"async\" 
width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image9-11.png\" alt=\"Otter.ai homepage&#10;\" class=\"lazyload wp-image-160021\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image9-11.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image9-11-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image9-11-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image9-11-1536x864.png 1536w\" data-sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/figure>\n<p>Otter was one of the most frequently mentioned solutions when we asked for suggestions on Twitter and in the Ahrefs community. And for good reason. It is easy to set up, has an intuitive interface, and offers clear pricing.<\/p>\n<h3 class=\"wp-block-heading\">Unique features<\/h3>\n<p>What stands out from the rest is the app\u2019s ability to record online meetings and transcribe them\u2014simply by pasting the meeting URL. But you can also import a video\/audio file or record audio right in the\u00a0app.<\/p>\n<p>Besides, you can connect your calendar to never miss a meeting.<\/p>\n<h3 class=\"wp-block-heading\">Transcript quality<\/h3>\n<p>I got decent results, but there was a lot to edit\u00a0too.<\/p>\n<p>It didn\u2019t get some names right. 
But I can\u2019t blame any tool for not picking up \u201cAhrefs\u201d or \u201cTim Soulo\u201d 100% of the\u00a0time.<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image5-10.png\" alt=\"Otter.ai transcription results&#10;\" class=\"wp-image-160022\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image5-10.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image5-10-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image5-10-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image5-10-1536x864.png 1536w\" sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image5-10.png\" alt=\"Otter.ai transcription results&#10;\" class=\"lazyload wp-image-160022\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image5-10.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image5-10-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image5-10-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image5-10-1536x864.png 1536w\" data-sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/figure>\n<p>One thing I noticed is that after Otter notified me that the transcript was ready, it might still be working in the background (adjusting time stamps, tagging speakers, etc.), like a student still scribbling on a test paper while passing it to the teacher.<\/p>\n<h3 class=\"wp-block-heading\">Pricing<\/h3>\n<p>You can start for free and upgrade to a paid plan later. 
You can import up to three files and record 290 minutes of meetings before you need to upgrade (as of April\u00a02023).<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image7-8.png\" alt=\"Rev.com homepage&#10;\" class=\"wp-image-160024\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image7-8.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image7-8-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image7-8-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image7-8-1536x864.png 1536w\" sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image7-8.png\" alt=\"Rev.com homepage&#10;\" class=\"lazyload wp-image-160024\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image7-8.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image7-8-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image7-8-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image7-8-1536x864.png 1536w\" data-sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/figure>\n<p>Setting up an account was a no-brainer. I found the interface easy to navigate as well. One personal remark is that it felt a little too \u201ccold\u201d to use since I saw things like \u201cPlace Order,\u201d \u201cBilling,\u201d and \u201cInvoice\u201d way too\u00a0often.\u00a0<\/p>\n<p>You might get an impression that it was designed by an accounting team (as opposed to Descript that comes next in this roundup).<\/p>\n<h3 class=\"wp-block-heading\">Unique features<\/h3>\n<p>Besides auto-generated transcripts, Rev offers live captions for Zoom meetings. 
You also have the option to place an order for human transcriptions.<\/p>\n<h3 class=\"wp-block-heading\">Transcript quality<\/h3>\n<p>Poor audio with city noise was a bit too much for Rev. Some words were missing, while others were misrecognized. As a result, some paragraphs didn\u2019t make much sense, while others were\u00a0fine.<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image8-10.png\" alt=\"Rev.com transcription results&#10;\" class=\"wp-image-160025\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image8-10.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image8-10-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image8-10-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image8-10-1536x864.png 1536w\" sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image8-10.png\" alt=\"Rev.com transcription results&#10;\" class=\"lazyload wp-image-160025\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image8-10.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image8-10-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image8-10-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image8-10-1536x864.png 1536w\" data-sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/figure>\n<h3 class=\"wp-block-heading\">Pricing<\/h3>\n<p>You can transcribe the first audio file (up to 45 minutes) for free. I got a bill for $1.25 with a discount that resulted in a total of $0.00. Thanks, accounting team.\u00a0\ud83d\ude09<\/p>\n<p>Rev also has a 14-day trial of its paid plan. But that was tricky to find. 
To locate it, you need to go to the footer of the homepage and look for it under \u201cServices.\u201d<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1936\" height=\"1299\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-10.png\" alt=\"Footer of the homepage, via rev.com\" class=\"wp-image-160129\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-10.png 1936w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-10-633x425.png 633w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-10-768x515.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-10-1536x1031.png 1536w\" sizes=\"(max-width: 1936px) 100vw, 1936px\"\/><\/noscript><img decoding=\"async\" width=\"1936\" height=\"1299\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-10.png\" alt=\"Footer of the homepage, via rev.com\" class=\"lazyload wp-image-160129\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-10.png 1936w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-10-633x425.png 633w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-10-768x515.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-10-1536x1031.png 1536w\" data-sizes=\"(max-width: 1936px) 100vw, 1936px\"\/><\/figure>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1999\" height=\"1065\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image11-9.jpg\" alt=\"Descript's homepage\" class=\"wp-image-160029\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image11-9.jpg 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image11-9-680x362.jpg 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image11-9-768x409.jpg 768w, 
https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image11-9-1536x818.jpg 1536w\" sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/noscript><img decoding=\"async\" width=\"1999\" height=\"1065\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image11-9.jpg\" alt=\"Descript's homepage\" class=\"lazyload wp-image-160029\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image11-9.jpg 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image11-9-680x362.jpg 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image11-9-768x409.jpg 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image11-9-1536x818.jpg 1536w\" data-sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/figure>\n<p>Descript welcomed me by name (which was a nice coincidence). The main thing you have to know is that it is standalone software rather than a web service. It is much more than a speech-to-text converter. It\u2019s basically a video editing tool. And there\u2019s definitely a learning curve. 
But thankfully, onboarding is extremely funny and engaging.<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1999\" height=\"1076\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image21.jpg\" alt=\"Descript's onboarding is interactive and engaging&#10;\" class=\"wp-image-160030\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image21.jpg 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image21-680x366.jpg 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image21-768x413.jpg 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image21-1536x827.jpg 1536w\" sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/noscript><img decoding=\"async\" width=\"1999\" height=\"1076\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image21.jpg\" alt=\"Descript's onboarding is interactive and engaging&#10;\" class=\"lazyload wp-image-160030\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image21.jpg 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image21-680x366.jpg 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image21-768x413.jpg 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image21-1536x827.jpg 1536w\" data-sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/figure>\n<h3 class=\"wp-block-heading\">Unique features<\/h3>\n<p>As I mentioned, Descript is more of a video editing tool that is good with transcribing. I\u2019d call it \u201cCanva for video\/captions.\u201d You can add B-rolls, effects, animations, and\u00a0more.<\/p>\n<p>You can easily drag and drop and basically produce a complete video with its help. But if you just need a transcript or captions of a video or audio, you can do that\u00a0too.<\/p>\n<h3 class=\"wp-block-heading\">Transcript quality<\/h3>\n<p>My sample audio had quite muddy results. 
At times, it had difficulty recognizing abbreviations (e.g., SEO). I also had a problem with removing filler words like \u201cuh\u201d and\u00a0\u201cum.\u201d<\/p>\n<p>I found that if I didn\u2019t choose the option to remove them, they, <em>um<\/em>, just stayed there even though I didn\u2019t need them most of the time. But if I did choose to remove them, it occasionally ate up parts of other words, causing even more trouble.<\/p>\n<p>Also, it couldn\u2019t recognize parts that a human being would have no problem understanding just from context, e.g., \u201cJack of all trades\u201d became \u201cjackal, trades.\u201d<\/p>\n<p>On the bright side, I believe you can still understand what the text is\u00a0about.<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1711\" height=\"1214\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image19-7.png\" alt=\"Descript transcription results&#10;\" class=\"wp-image-160032\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image19-7.png 1711w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image19-7-599x425.png 599w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image19-7-768x545.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image19-7-1536x1090.png 1536w\" sizes=\"(max-width: 1711px) 100vw, 1711px\"\/><\/noscript><img decoding=\"async\" width=\"1711\" height=\"1214\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image19-7.png\" alt=\"Descript transcription results&#10;\" class=\"lazyload wp-image-160032\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image19-7.png 1711w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image19-7-599x425.png 599w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image19-7-768x545.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image19-7-1536x1090.png 1536w\" data-sizes=\"(max-width: 1711px) 100vw, 
1711px\"\/><\/figure>\n<h3 class=\"wp-block-heading\">Pricing<\/h3>\n<p>You can start with basic functions for free and upgrade if needed.<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image2.jpg\" alt=\"MacWhisper app on gumroad.com&#10;\" class=\"wp-image-160033\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image2.jpg 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image2-680x383.jpg 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image2-768x432.jpg 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image2-1536x864.jpg 1536w\" sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image2.jpg\" alt=\"MacWhisper app on gumroad.com&#10;\" class=\"lazyload wp-image-160033\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image2.jpg 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image2-680x383.jpg 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image2-768x432.jpg 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image2-1536x864.jpg 1536w\" data-sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/figure>\n<p>MacWhisper is a transcription tool powered by Whisper. It\u2019s an automatic speech recognition (ASR) system developed by OpenAI, the same company that brought us ChatGPT.<\/p>\n<p>As OpenAI states on its website:<\/p>\n<blockquote class=\"wp-block-quote\">\n<p>Whisper is trained on 680,000 hours of multilingual and multitask supervised data collected from the\u00a0web.<\/p>\n<\/blockquote>\n<p>Whisper is not something you can simply \u201crun\u201d as is. What\u2019s more, it is pretty complicated to set up if you do want to run it yourself. 
GitHub, Python: you get the\u00a0gist.<\/p>\n<p>Luckily, there are tools like MacWhisper that take this burden off your shoulders and let you use the power of AI in a simple user interface.<\/p>\n<h3 class=\"wp-block-heading\">Unique features<\/h3>\n<p>Just plain speech-to-text recognition with time stamps. Unfortunately, it doesn\u2019t auto-tag the speakers.<\/p>\n<h3 class=\"wp-block-heading\">Transcript quality<\/h3>\n<p>When you run the tool, you have to choose a \u201cmodel\u201d to work with. Basically, the lighter the model, the quicker it will run, but larger models generally produce better results. Also, in MacWhisper, those larger (better but slower) models are only available in the paid version.<\/p>\n<p>I decided to start with the free \u201csmall\u201d model, which is described as having \u201cnormal speed with good accuracy.\u201d<\/p>\n<p>It was OK, but no better than the competitors. I assumed it would work fine with high-quality audio, but not with the horrible sample I fed\u00a0it.<\/p>\n<p>\u201cAI is overrated,\u201d I thought. But before closing the Mac and switching back to my dear Windows PC, I decided to give the \u201clarge\u201d model a\u00a0try.<\/p>\n<p>And you know what? AI is not overrated. I found the results to be much better than those of any other tool I tested.<\/p>\n<p>The transcript was really, really good. It even got things like \u201cAhrefs\u201d and \u201cSaaS\u201d right! 
Though still not 100% of the\u00a0time.<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1999\" height=\"1299\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image18.jpg\" alt=\"MacWhisper transcription results&#10;\" class=\"wp-image-160035\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image18.jpg 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image18-654x425.jpg 654w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image18-768x499.jpg 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image18-1536x998.jpg 1536w\" sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/noscript><img decoding=\"async\" width=\"1999\" height=\"1299\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image18.jpg\" alt=\"MacWhisper transcription results&#10;\" class=\"lazyload wp-image-160035\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image18.jpg 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image18-654x425.jpg 654w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image18-768x499.jpg 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image18-1536x998.jpg 1536w\" data-sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/figure>\n<h3 class=\"wp-block-heading\">Pricing<\/h3>\n<p>You can run smaller models for free. 
For a large model, you\u2019ll need to purchase a license.<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image24-3.png\" alt=\"AI Transcriptions by Riverside homepage\" class=\"wp-image-160036\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image24-3.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image24-3-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image24-3-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image24-3-1536x864.png 1536w\" sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image24-3.png\" alt=\"AI Transcriptions by Riverside homepage\" class=\"lazyload wp-image-160036\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image24-3.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image24-3-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image24-3-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image24-3-1536x864.png 1536w\" data-sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/figure>\n<p>This tool is the easiest to use. Simply drag and drop your file\u2014then it\u2019s ready. 
It takes some time to process, though.<\/p>\n<h3 class=\"wp-block-heading\">Unique features<\/h3>\n<p>Nothing besides downloading a transcription.<\/p>\n<h3 class=\"wp-block-heading\">Transcript quality<\/h3>\n<p>My first impression was that the results were perfect because, visually, it delivered a confident-looking text:<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1600\" height=\"900\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image-191.png\" alt=\"AI Transcriptions by Riverside transcription results&#10;\" class=\"wp-image-160003\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image-191.png 1600w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image-191-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image-191-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image-191-1536x864.png 1536w\" sizes=\"(max-width: 1600px) 100vw, 1600px\"\/><\/noscript><img decoding=\"async\" width=\"1600\" height=\"900\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image-191.png\" alt=\"AI Transcriptions by Riverside transcription results&#10;\" class=\"lazyload wp-image-160003\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image-191.png 1600w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image-191-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image-191-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image-191-1536x864.png 1536w\" data-sizes=\"(max-width: 1600px) 100vw, 1600px\"\/><\/figure>\n<p>But after proofreading, I realized that it simply did not include the parts it failed to recognize\u2014sometimes several words in a\u00a0row.<\/p>\n<h3 class=\"wp-block-heading\">Pricing<\/h3>\n<p>It\u2019s free to\u00a0use.<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" 
width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image3-2.jpg\" alt=\"Adobe Premiere Pro homepage&#10;\" class=\"wp-image-160037\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image3-2.jpg 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image3-2-680x383.jpg 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image3-2-768x432.jpg 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image3-2-1536x864.jpg 1536w\" sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image3-2.jpg\" alt=\"Adobe Premiere Pro homepage&#10;\" class=\"lazyload wp-image-160037\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image3-2.jpg 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image3-2-680x383.jpg 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image3-2-768x432.jpg 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image3-2-1536x864.jpg 1536w\" data-sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/figure>\n<p>Premiere Pro is not exactly a \u201ctranscription tool\u201d but rather video editing software. 
I\u2019m including it because I assume that some companies may already have it in their arsenal (like we\u00a0do).<\/p>\n<p>To get to the transcription feature in Premiere Pro, just go to the \u201cCaptions and graphics\u201d workspace and click \u201cCreate transcription.\u201d<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1999\" height=\"1094\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image4-11.png\" alt=\"Premiere Pro interface\u2014you can generate transcriptions in the &quot;Captions and graphics&quot; workspace&#10;\" class=\"wp-image-160039\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image4-11.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image4-11-680x372.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image4-11-768x420.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image4-11-1536x841.png 1536w\" sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/noscript><img decoding=\"async\" width=\"1999\" height=\"1094\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image4-11.png\" alt=\"Premiere Pro interface\u2014you can generate transcriptions in the &quot;Captions and graphics&quot; workspace&#10;\" class=\"lazyload wp-image-160039\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image4-11.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image4-11-680x372.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image4-11-768x420.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image4-11-1536x841.png 1536w\" data-sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/figure>\n<h3 class=\"wp-block-heading\">Unique features<\/h3>\n<p>If we take only speech recognition into account here, what it does well is creating precise time stamps, auto-tagging the speakers and, if needed, automatically adding an editable captions track 
to a video project.<\/p>\n<h3 class=\"wp-block-heading\">Transcript quality<\/h3>\n<p>Let\u2019s be straightforward: I found the noisy audio transcript to be a failure. I couldn\u2019t comprehend what people were talking about in the first\u00a0place.<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1501\" height=\"1087\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image6-8.png\" alt=\"Adobe Premiere Pro transcription results&#10;\" class=\"wp-image-160041\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image6-8.png 1501w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image6-8-587x425.png 587w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image6-8-768x556.png 768w\" sizes=\"(max-width: 1501px) 100vw, 1501px\"\/><\/noscript><img decoding=\"async\" width=\"1501\" height=\"1087\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image6-8.png\" alt=\"Adobe Premiere Pro transcription results&#10;\" class=\"lazyload wp-image-160041\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image6-8.png 1501w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image6-8-587x425.png 587w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image6-8-768x556.png 768w\" data-sizes=\"(max-width: 1501px) 100vw, 1501px\"\/><\/figure>\n<p>Still, I think this feature can be really helpful if you are creating captions from high-quality audio. 
I used it myself several times and had nothing to complain about when the recording quality was\u00a0good.<\/p>\n<h3 class=\"wp-block-heading\">Pricing<\/h3>\n<p>You need an <a href=\"https:\/\/www.adobe.com\/creativecloud\/plans.html\">Adobe Creative Cloud<\/a> subscription to use Premiere Pro.<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-9.png\" alt=\"Happyscribe.com homepage&#10;\" class=\"wp-image-160043\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-9.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-9-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-9-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-9-1536x864.png 1536w\" sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-9.png\" alt=\"Happyscribe.com homepage&#10;\" class=\"lazyload wp-image-160043\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-9.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-9-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-9-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image10-9-1536x864.png 1536w\" data-sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/figure>\n<p>While signing up and uploading files is rather straightforward, you have to spend some time answering questions about you and your company before you can finally get to the tool itself. 
And no, you can\u2019t skip typing in your company name, your role, and your company size.<\/p>\n<p>But once you get through this, the interface is clean and intuitive.<\/p>\n<h3 class=\"wp-block-heading\">Unique features<\/h3>\n<p>You can generate a transcript or captions for video or audio. There is also an option to request a manual review of the transcript. Alternatively, you can generate subtitles in a different language, so you have transcription and translation in one\u00a0click.<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"954\" height=\"966\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image14-9.png\" alt=\"Happy Scribe features include transcription, subtitles, and foreign language subtitles&#10;\" class=\"wp-image-160044\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image14-9.png 954w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image14-9-420x425.png 420w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image14-9-768x778.png 768w\" sizes=\"(max-width: 954px) 100vw, 954px\"\/><\/noscript><img decoding=\"async\" width=\"954\" height=\"966\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image14-9.png\" alt=\"Happy Scribe features include transcription, subtitles, and foreign language subtitles&#10;\" class=\"lazyload wp-image-160044\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image14-9.png 954w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image14-9-420x425.png 420w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image14-9-768x778.png 768w\" data-sizes=\"(max-width: 954px) 100vw, 954px\"\/><\/figure>\n<h3 class=\"wp-block-heading\">Transcript quality<\/h3>\n<p>Happy Scribe did a really good job transcribing the audio. It had no problem with words like \u201cSEO\u201d and \u201cSaaS\u201d (obviously the weakest point for many tools). 
It could also auto-tag the speakers, which might be helpful in certain situations.<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image12-7.png\" alt=\"Happy Scribe transcription results&#10;\" class=\"wp-image-160045\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image12-7.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image12-7-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image12-7-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image12-7-1536x864.png 1536w\" sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image12-7.png\" alt=\"Happy Scribe transcription results&#10;\" class=\"lazyload wp-image-160045\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image12-7.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image12-7-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image12-7-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image12-7-1536x864.png 1536w\" data-sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/figure>\n<h3 class=\"wp-block-heading\">Pricing<\/h3>\n<p>I could test one file for free. 
After that, I would need to buy credits, which are spent per minute of video or audio transcribed.<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-8.png\" alt=\"Sonix.ai homepage&#10;\" class=\"wp-image-160046\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-8.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-8-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-8-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-8-1536x864.png 1536w\" sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-8.png\" alt=\"Sonix.ai homepage&#10;\" class=\"lazyload wp-image-160046\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-8.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-8-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-8-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-8-1536x864.png 1536w\" data-sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/figure>\n<p>Sonix is a tool for automatic transcriptions, translations, and integration with meeting apps.<\/p>\n<h3 class=\"wp-block-heading\">Unique features<\/h3>\n<p>Besides meeting integration, which is almost a given for most tools, AI summary generation is an interesting feature (in beta as of April 2023). 
But I already got impressive results from\u00a0it.<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1928\" height=\"1168\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image13-9.png\" alt=\"AI summary from Sonix&#10;\" class=\"wp-image-160047\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image13-9.png 1928w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image13-9-680x412.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image13-9-768x465.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image13-9-1536x931.png 1536w\" sizes=\"(max-width: 1928px) 100vw, 1928px\"\/><\/noscript><img decoding=\"async\" width=\"1928\" height=\"1168\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image13-9.png\" alt=\"AI summary from Sonix&#10;\" class=\"lazyload wp-image-160047\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image13-9.png 1928w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image13-9-680x412.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image13-9-768x465.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image13-9-1536x931.png 1536w\" data-sizes=\"(max-width: 1928px) 100vw, 1928px\"\/><\/figure>\n<p>You also get some extra tools to work with video captions\u2014a timeline view and an option to split captions into several lines. You can also import an existing transcript, and Sonix will sync it with the\u00a0audio.<\/p>\n<h3 class=\"wp-block-heading\">Transcript quality<\/h3>\n<p>Sonix has a custom vocabulary feature. I found that helped a bit with names like \u201cTim Soulo\u201d and \u201cAhrefs,\u201d but it didn\u2019t work 100% of the time. It mostly did well. 
But at times, it mistook SEO for CEO and returned the word \u201cExcel\u201d seemingly out of nowhere.<\/p>\n<p>The transcript made sense in general but required quite a lot of edits if it needed to be perfect.<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image17-5.png\" alt=\"Sonix.ai transcription results&#10;\" class=\"wp-image-160048\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image17-5.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image17-5-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image17-5-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image17-5-1536x864.png 1536w\" sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image17-5.png\" alt=\"Sonix.ai transcription results&#10;\" class=\"lazyload wp-image-160048\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image17-5.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image17-5-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image17-5-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image17-5-1536x864.png 1536w\" data-sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/figure>\n<h3 class=\"wp-block-heading\">Pricing<\/h3>\n<p>Sonix has a free trial for 25 minutes of transcriptions. 
After that, you need to purchase pay-as-you-go credits or get a subscription.<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image22-8.png\" alt=\"Notta.ai homepage&#10;\" class=\"wp-image-160049\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image22-8.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image22-8-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image22-8-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image22-8-1536x864.png 1536w\" sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image22-8.png\" alt=\"Notta.ai homepage&#10;\" class=\"lazyload wp-image-160049\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image22-8.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image22-8-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image22-8-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image22-8-1536x864.png 1536w\" data-sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/figure>\n<p>Notta is yet another transcription service that works for both real-time meetings and existing recordings.<\/p>\n<h3 class=\"wp-block-heading\">Unique features<\/h3>\n<p>Besides transcription, Notta focuses on streamlining certain workflows and offers features such as calendar sync and scheduler (in beta as of April\u00a02023).<\/p>\n<h3 class=\"wp-block-heading\">Transcript quality<\/h3>\n<p>Background noise and poor audio quality were not deal breakers for Notta. 
The transcription results turned out mostly OK but still had some problems.<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image1-13.png\" alt=\"Notta.ai transcription results&#10;\" class=\"wp-image-160050\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image1-13.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image1-13-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image1-13-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image1-13-1536x864.png 1536w\" sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/noscript><img decoding=\"async\" width=\"1999\" height=\"1125\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image1-13.png\" alt=\"Notta.ai transcription results&#10;\" class=\"lazyload wp-image-160050\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image1-13.png 1999w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image1-13-680x383.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image1-13-768x432.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image1-13-1536x864.png 1536w\" data-sizes=\"(max-width: 1999px) 100vw, 1999px\"\/><\/figure>\n<p>Sentence structure was sometimes a bit weird, certain words went missing, and my favorite \u201cJack of all trades\u201d part wasn\u2019t that neat this\u00a0time.<\/p>\n<figure class=\"wp-block-image size-full\"><noscript><img decoding=\"async\" width=\"1970\" height=\"322\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-9.png\" alt=\"Inconsistency in Notta's transcription&#10;\" class=\"wp-image-160130\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-9.png 1970w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-9-680x111.png 680w, 
https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-9-768x126.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-9-1536x251.png 1536w\" sizes=\"(max-width: 1970px) 100vw, 1970px\"\/><\/noscript><img decoding=\"async\" width=\"1970\" height=\"322\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-9.png\" alt=\"Inconsistency in Notta's transcription&#10;\" class=\"lazyload wp-image-160130\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-9.png 1970w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-9-680x111.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-9-768x126.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/05\/image20-9-1536x251.png 1536w\" data-sizes=\"(max-width: 1970px) 100vw, 1970px\"\/><\/figure>\n<p>Another thing worth noting is that, for some reason, it failed to recognize two speakers, and the whole interview was tagged as \u201cSpeaker 1.\u201d<\/p>\n<h3 class=\"wp-block-heading\">Pricing<\/h3>\n<p>You can start with a free basic subscription and try a three-day trial of the paid plan, Notta\u00a0Pro.<\/p>\n<h2 class=\"wp-block-heading\">Final thoughts<\/h2>\n<p>As you can see, there are plenty of tools to choose from. Still, it seems that OpenAI stirred things up a bit by releasing a free ASR (automatic speech recognition) system, which I found to be considerably more capable than others.<\/p>\n<p>But pure speech recognition quality is just one factor. Maybe you do need to record your Zoom meetings (Otter), work with captions in a large video project (Premiere Pro), or quickly create a Canva-style video (Descript).<\/p>\n<p>Also, I need to stress that I was trying to push these tools to the edge by giving them the worst-case scenario recording. 
For more natural uses, the differences in the outcome might be much less noticeable.<\/p>\n<p>It\u2019s great to see that there are so many options out there, and I hope this review will help a bit in finding the one that is perfect for\u00a0you.<\/p>\n<p>Got questions? Ping me <a href=\"https:\/\/twitter.com\/DolgikhGeorge\">on Twitter<\/a>.<\/p>\n<\/p><\/div>\n<p><script async src=\"\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><br \/>\n<br \/><br \/>\n<br \/><a href=\"https:\/\/ahrefs.com\/blog\/best-speech-to-text-apps\/\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Most speech recognition apps have no trouble transcribing a native speaker being recorded with a pro microphone in a quiet room. This isn\u2019t a challenge. So to test them more thoroughly, I created a nightmare recording of two non-native speakers with loud city background noise. How did they\u00a0fare? Let\u2019s find\u00a0out. Otter was one of the [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":2208,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[22],"tags":[],"class_list":["post-2207","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-seo"],"_links":{"self":[{"href":"https:\/\/ewebtoolz.com\/blog\/wp-json\/wp\/v2\/posts\/2207","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/ewebtoolz.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ewebtoolz.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ewebtoolz.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/ewebtoolz.com\/blog\/wp-json\/wp\/v2\/comments?post=2207"}],"version-history":[{"count":0,"href":"https:\/\/ewebtoolz.com\/blog\/wp-json\/wp\/v2\/posts\/2207\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/ewebtoolz.com\/blog\/
wp-json\/wp\/v2\/media\/2208"}],"wp:attachment":[{"href":"https:\/\/ewebtoolz.com\/blog\/wp-json\/wp\/v2\/media?parent=2207"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ewebtoolz.com\/blog\/wp-json\/wp\/v2\/categories?post=2207"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ewebtoolz.com\/blog\/wp-json\/wp\/v2\/tags?post=2207"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}