Latest packages under development
Active machine learning packages for Unity using ONNX Runtime + DirectML (Windows x64, Unity Editor and Standalone, fully offline).
Development note: these new packages are being refactored for Unity Asset Store compliance. Because model files cannot be bundled directly in the store package, a new runtime setup flow will dynamically download required ONNX model content on demand.
Audio event classification
- ml.onnxruntime.directml.yamnet.unity — YAMNet package for classifying 500+ everyday sounds from clips or microphone input, with top-K labels and scores.
Whisper English speech-to-text
- ml.onnxruntime.directml.whisper.tiny.en.unity — Lightweight tiny.en model for fast English transcription and dictation.
- ml.onnxruntime.directml.whisper.basemodel.en.unity — Base.en model balancing speed and accuracy for English ASR.
- ml.onnxruntime.directml.whisper.small.en.unity — Small.en model targeting higher English transcription quality.
- ml.onnxruntime.directml.whisper.medium.en.unity — Medium.en model for the strongest English accuracy in this lineup.
Whisper multilingual speech-to-text + translation
- ml.onnxruntime.directml.whisper.tiny.multi.unity — Tiny multilingual ASR with `TranslateText(...)` powered by Qwen 2.5.
- ml.onnxruntime.directml.whisper.basemodel.multi.unity — Base multilingual ASR plus integrated Qwen text translation.
- ml.onnxruntime.directml.whisper.small.multi.unity — Small multilingual ASR with improved quality and translation flow.
- ml.onnxruntime.directml.whisper.medium.multi.unity — Medium multilingual ASR + Qwen translation for highest quality in the multilingual set.
TheyLoveGames creates speech proxy
With Chrome Speech Proxy, both speech detection and synthesis can operate on non-WebGL platforms.
WebGL Speech Detection converts speech to text in real-time.
- Web Demo: Speech Dictation
- Web Demo: Speech Commands
- API: Documentation
WebGL Speech Synthesis converts text to speech in real-time.
- Web Demo: Speech Synthesis
TheyLoveGames launches two new products in the Unity Asset Store
WebGL Speech Detection converts speech to text in real-time.
- Web Demo: Speech Dictation
- Web Demo: Speech Commands
- API: Documentation
WebGL Speech Synthesis converts text to speech in real-time.
- Web Demo: Speech Synthesis
- API: Documentation
TheyLoveGames releases automation into the Unity Asset Store
Setup for Fuse CC makes Unity setup quick and easy for animation packs from Mixamo.