January 21, 2026

Argmax has built the best inference pipeline for Parakeet available on the market. It gives our users efficient, high accuracy, voice to text on their machine with privacy they can trust.
-Neil Chudleigh, founder of Superwhisper

In 2024, Argmax set out to upgrade on-device inference from a cost-saving solution to a profit center for the ASR industry by migrating the best-in-market cloud workloads to the edge. When Argmax matches cloud API accuracy but delivers superior latency on device, user retention skyrockets. The Superwhisper case study is the first in a series of studies we will publish over the next few months with real-world user growth and retention data.
Superwhisper enables users to switch between several cloud and on-device inference services with a single keystoke under a single flat monthly subscription fee. Hence, Superwhisper users freely shop around and only stick with an inference service if it consistently provides the best experience. This is a perfect experimental setup that removes cost as a factor and focuses only on the user experience and retention.
From Month 1 to Month 6, Argmax's MAU in Superwhisper grew 567% as tens of thousands of users organically discovered the new inference service. With the recent Argmax upgrade as the default onboarding service, the growth accelerated in January 2026.
The onboarding of Free Trial users with a paid inference service like Argmax initially presented a cost challenge to Superwhisper. However, the 2x increase in conversion rates to Pro led to a net profit while accelerating user growth.
Most inference platforms onboard dozens, if not hundreds, of models every year. In sharp conrast, Argmax has onboarded a whopping 4 models since our inception 2 years ago. Here's why:
Our insight is that each high-potential open-source model makes for a great launch demo. However, commercial adoption at scale requires the following:
When Nvidia released Parakeet v2 in May 2025, it ranked 1st on the OpenASR leaderboard. 9 months later, Parakeet v2's accuracy is still among the top, only marginally surpassed by a handful of 5-15x larger and slower models.
At Argmax, we immediately recognized Parakeet's high potential based on our in-house evaluations on customer private test sets. Within a few weeks, we shipped our first implementation in alpha and identified the following limitations based on customer feedback:
We fixed these showstopper issues in 3 months. The results:

Find out more about Argmax SDK today.