DeepL just bought its way into real-time speech translation
Key takeaways
- DeepL bought Mixhalo, a San Francisco startup built for ultra-low-latency live audio, in its first-ever acquisition
- The goal is real-time speech translation at events, you hear a speaker in your own language through your phone with almost no delay
- DeepL is opening a Bay Area office around the Mixhalo team to push harder into voice AI
- The deal lands weeks after DeepL cut around 250 jobs, a sign of how fast it is shifting money from text to voice
DeepL is the company you reach for when Google Translate isn't good enough. Text in, better text out. On Tuesday at VivaTech in Paris, it did something it had never done before: it bought another company. The target is Mixhalo, a San Francisco startup that streams audio to thousands of phones in a stadium with almost no lag.
That combination is the whole story. DeepL knows translation. Mixhalo knows how to get sound to a huge crowd in real time. Put them together and you get a speaker on stage being translated into a dozen languages, piped straight to each listener's earbuds, fast enough that nobody notices the gap.
Why a translation firm wants stadium audio
Mixhalo was founded in 2016 by Incubus guitarist Mike Einziger and violinist Ann Marie Simpson-Einziger. It started as a way to fix bad concert sound, then turned into a system for beaming pristine audio to fans at gigs and sporting events without the delay you usually get over a venue's network.
That low-latency plumbing is the hard part of live translation. Translating a sentence accurately is one problem. Doing it while someone is still talking, then delivering it to a packed room before the moment passes, is a different one. DeepL just bought the team that already solved the delivery half.
DeepL's own pitch is blunt. It says the Mixhalo team and technology will accelerate DeepL Voice for "live, large-scale environments," the kind of conferences and venues where simultaneous interpreters currently sit in booths doing the work by hand.
The Silicon Valley move
DeepL is German, based in Cologne. It is now opening a Bay Area office built around the Mixhalo team, its first real foothold in Silicon Valley. That matters for hiring. Voice AI talent clusters in California, and a Cologne address only gets you so far when you're competing for it.
There's a sharper edge to the timing. The acquisition lands just weeks after DeepL cut around 250 jobs in a restructuring. Buying a voice startup while trimming headcount tells you where the money is being pointed. Text translation is the cash cow. Voice is the bet.
What changes if it works
Picture an international conference. Right now you either speak the room's language or you wear a clunky headset wired to a human interpreter. If DeepL pulls this off, you open an app, pick your language, and listen to the speaker in your own tongue through earbuds you already own. No booth, no delay, no extra hardware.
That's a real shift for events, trade shows, and any room where people don't share a language. It won't kill professional interpreters overnight, accuracy and nuance still matter, and live AI translation makes mistakes. But for the everyday case of "I just want to follow the talk," the bar is about to drop a lot.
DeepL hasn't shared a price or a launch date for the combined product. What it has shown is intent. The best-known name in text translation just spent its first acquisition on getting voice into a stadium, and it's planting a flag in California to do it.