They Love Games

February 19, 2026

TheyLoveGames releases offline Automatic Speech Recognition, Translation, and Audio Analysis for Unity

Model Size Legend:

Tiny — The smallest ASR model. Fastest execution, lowest resource usage.
Base — Larger than Tiny. Better accuracy.
Small — Larger than Base. Significantly better accuracy.
Medium — The largest model in this collection. Highest accuracy.

Larger models are more accurate but require more memory and processing power.
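One way to reason about this trade-off is to pick the largest model that still fits a size budget. The sketch below is illustrative only and is not part of any package: it is written in Python for brevity, the function name `pick_largest_model` is hypothetical, the sizes are the English package download sizes listed on this page (GB values rounded to MB), and using download size as a stand-in for runtime memory cost is an assumption.

```python
# Download sizes in MB for the English packages, taken from this page.
# The 1.2 GB and 1.7 GB entries are rounded to 1200 and 1700 MB (approximate).
# Assumption: download size is used as a rough proxy for resource cost.
ENGLISH_MODELS_MB = {
    "tiny": 259.1,
    "base": 436.4,
    "small": 1200.0,
    "medium": 1700.0,
}


def pick_largest_model(budget_mb, models=ENGLISH_MODELS_MB):
    """Return the name of the largest model whose size fits the budget, or None."""
    fitting = [(size, name) for name, size in models.items() if size <= budget_mb]
    if not fitting:
        return None
    # max() compares the size first, so the largest fitting model wins.
    return max(fitting)[1]


if __name__ == "__main__":
    for budget in (100, 500, 1500, 4000):
        print(budget, "MB ->", pick_largest_model(budget))
```

The same lookup is straightforward to port to C# inside a Unity project; actual memory use at runtime will differ from the download size and should be measured on the target machine.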

Platform Support: These packages are designed for Windows 64-bit (x86_64) using ONNX Runtime with DirectML hardware acceleration.

Yamnet

Adds offline audio event classification to your Unity C# projects. Classifies 521 everyday sounds from audio clips or microphone input.

Download Size: 31.0 MB

Includes C# example scenes:

Classify sounds in .WAV files
Classify sounds from Microphone in real-time

Asset Store | Documentation
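An audio event classifier of this kind produces one score per sound class (521 classes here), and an application usually reports only the few highest-scoring labels. The sketch below shows that top-k step in isolation. It is written in Python for brevity, the function name `top_k` and the sample labels are hypothetical, and how the Yamnet package itself exposes its scores is not described on this page.

```python
def top_k(scores, labels, k=3):
    """Return the k highest-scoring (label, score) pairs, best first."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    # Pair each label with its score, then sort by score in descending order.
    ranked = sorted(zip(labels, scores), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


if __name__ == "__main__":
    # Hypothetical four-class example; a real classifier would supply 521 scores.
    labels = ["Speech", "Dog", "Music", "Silence"]
    scores = [0.10, 0.70, 0.15, 0.05]
    for label, score in top_k(scores, labels, k=2):
        print(label, score)
```

For microphone input, the same ranking would typically be applied to each new block of scores as it arrives.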

Whisper Base EN

Adds offline English speech-to-text to your Unity C# projects. This Base model offers a good balance of speed and accuracy.

Download Size: 436.4 MB

Includes C# example scenes:

Convert .WAV audio files to text
Convert Microphone input to text in real-time

Asset Store | Documentation

Whisper Small EN

Adds offline English speech-to-text to your Unity C# projects. This Small model provides better accuracy than Base for clearer transcription.

Download Size: 1.2 GB

Includes C# example scenes:

Convert .WAV audio files to text
Convert Microphone input to text in real-time

Asset Store | Documentation

Whisper Medium EN

Adds offline English speech-to-text to your Unity C# projects. This Medium model offers the highest English accuracy in this collection.

Download Size: 1.7 GB

Includes C# example scenes:

Convert .WAV audio files to text
Convert Microphone input to text in real-time

Asset Store | Documentation

Whisper Base Multi

Adds offline multilingual speech-to-text to your Unity C# projects. Includes the Qwen model to translate recognized text between languages. This Base model balances performance and quality.

Download Size: 1.3 GB

Includes C# example scenes:

Convert .WAV audio files to translated text
Convert Microphone input to translated text in real-time

Asset Store | Documentation

Whisper Small Multi

Adds offline multilingual speech-to-text to your Unity C# projects. Includes the Qwen model to translate recognized text between languages. This Small model improves transcription accuracy over Base.

Download Size: 1.5 GB

Includes C# example scenes:

Convert .WAV audio files to translated text
Convert Microphone input to translated text in real-time

Asset Store | Documentation

Whisper Medium Multi

Adds offline multilingual speech-to-text to your Unity C# projects. Includes the Qwen model to translate recognized text between languages. This Medium model provides the highest quality transcription and translation.

Download Size: 2.6 GB

Includes C# example scenes:

Convert .WAV audio files to translated text
Convert Microphone input to translated text in real-time

Asset Store | Documentation

Machine Learning packages for Unity (Pending Review)

Active machine learning integration using ONNX Runtime + DirectML (Windows x64, Unity Editor and Standalone, fully offline).

Whisper English speech-to-text

ml.onnxruntime.directml.whisper.tiny.en.unity — (Pending Review) Lightweight tiny.en model for fast English transcription and dictation. (Download Size: 259.1 MB)

Whisper multilingual speech-to-text + translation

ml.onnxruntime.directml.whisper.tiny.multi.unity — (Pending Review) Tiny multilingual ASR with `TranslateText(...)` powered by Qwen 2.5. (Download Size: 1.2 GB)

January 2, 2018

TheyLoveGames creates speech proxy

With Chrome Speech Proxy, both speech detection and synthesis can operate on non-WebGL platforms.

WebGL Speech Detection converts speech to text in real-time. (Download Size: 8.2 MB)

WebGL Speech Synthesis converts text to speech in real-time. (Download Size: 367.3 KB)

Web Demo: Speech Synthesis

February 23, 2017

TheyLoveGames launches two new products in the Unity Asset Store

WebGL Speech Detection converts speech to text in real-time. (Download Size: 8.2 MB)

WebGL Speech Synthesis converts text to speech in real-time. (Download Size: 367.3 KB)

Web Demo: Speech Synthesis
API: Documentation

February 23, 2017

TheyLoveGames releases an automation tool in the Unity Asset Store

Setup for Fuse CC makes Unity setup quick and easy for animation packs from Mixamo. (Download Size: 44.5 MB)