S2S Portable — "Perfect Build" Portable Edition
Created by Hardy Brielbeck
This software is free for personal, non-commercial use (home use).
No official support is provided for personal use. Please use this manual to resolve any issues.
Commercial / professional use requires a valid license. Support is provided exclusively to licensed users.
Contact: Brielbeck@hotmail.de
S2S Portable is a bidirectional, real-time Speech-to-Speech translator that runs entirely offline on your Windows PC. It was built as a love letter to the VRChat community.
S2S Portable requires a dedicated NVIDIA graphics card for real-time performance. An RTX 3060 (12GB) or better is recommended. CPU-only mode is available but significantly slower.
| GPU | VRAM | One-Way Latency | Rating |
|---|---|---|---|
| RTX 4090 / 5090 | 24GB | ~0.5–1s | Ultra |
| RTX 4080 | 16GB | ~0.7–1.3s | High |
| RTX 4070 Ti / 4070 | 12GB | ~1–1.5s | High |
| RTX 3070 | 8GB | ~1.5–2.5s | Medium |
| RTX 3060 | 12GB | ~1.5–2.5s | Medium |
| RTX 3050 | 8GB | ~2.5–4s | Low |
| CPU Only | N/A | ~10–20s | Fallback |
On a good PC (RTX 4070-class), expect about 4–5 seconds for a full bidirectional round-trip. High-end RTX cards achieve under 1 second one-way!
Run START.bat inside the folder. The app will initialize and open.
Full 2-way live translation. You speak in your language and your friend hears the translation in theirs, and vice versa. Perfect for real conversations.
Enable PTT so the app only listens when you hold a hotkey. Great for noisy environments.
Switch between speaking into your mic or typing text to be translated and spoken aloud.
A chat-style interface showing the conversation flow with both your messages and translated responses.
The architecture is modular. Faster Whisper handles speech recognition. Translation and TTS engines can be swapped for different models as better ones become available.
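As a rough illustration of what a swappable three-stage design can look like, here is a minimal Python sketch. It is not the app's actual code: the `Recognizer`, `Translator`, `Synthesizer`, and `Pipeline` names and their method signatures are assumptions made up for this example, and the three stand-in engines only shuffle text around so the sketch runs without any models installed.

```python
from dataclasses import dataclass
from typing import Protocol


class Recognizer(Protocol):
    """Speech recognition stage (hypothetical interface)."""
    def transcribe(self, audio: bytes) -> str: ...


class Translator(Protocol):
    """Text translation stage (hypothetical interface)."""
    def translate(self, text: str, source: str, target: str) -> str: ...


class Synthesizer(Protocol):
    """Text-to-speech stage (hypothetical interface)."""
    def speak(self, text: str, language: str) -> bytes: ...


@dataclass
class Pipeline:
    """Chains the three stages; any stage can be replaced independently."""
    recognizer: Recognizer
    translator: Translator
    synthesizer: Synthesizer

    def run(self, audio: bytes, source: str, target: str) -> bytes:
        text = self.recognizer.transcribe(audio)
        translated = self.translator.translate(text, source, target)
        return self.synthesizer.speak(translated, target)


# Stand-in engines so the sketch runs without any real models.
class FakeRecognizer:
    def transcribe(self, audio: bytes) -> str:
        return audio.decode("utf-8")


class TaggingTranslator:
    def translate(self, text: str, source: str, target: str) -> str:
        return f"[{source}->{target}] {text}"


class BytesSynthesizer:
    def speak(self, text: str, language: str) -> bytes:
        return text.encode("utf-8")


pipeline = Pipeline(FakeRecognizer(), TaggingTranslator(), BytesSynthesizer())
result = pipeline.run(b"hello", "en", "ja")
```

Because each stage is only described by the methods it must provide, replacing the translator with a different model would mean passing a different object to `Pipeline` and leaving the other two stages untouched.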
NEW! Context-aware AI translation powered by Alibaba's Qwen3. Understands idioms, slang, and humor. Faster AND smarter than traditional translation on mid-to-high GPUs.
First launch walks you through language, target, and mode selection. No manual setup needed.
Switch between Desktop/Calls mode (Zoom, Teams, Discord) and VR Gaming mode (VRChat, Resonite) with one click. Settings adjust automatically.
Every language below has full speech-to-speech translation support and a completely translated GUI:
English • Japanese • Korean • Chinese (Simplified) • Chinese (Traditional) • Chinese (Hong Kong) • Spanish • French • German • Italian • Portuguese • Portuguese (Brazil) • Russian • Ukrainian • Polish • Dutch • Swedish • Danish • Norwegian • Finnish • Czech • Slovak • Romanian • Hungarian • Greek • Turkish • Arabic • Vietnamese
The app automatically detects your hardware and recommends optimal settings. You can also manually select your GPU in the System tab.
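If you want to check by hand which row of the performance table your card falls into, the following Python sketch shows one possible way. It is not how the app itself detects hardware. It assumes the NVIDIA driver's `nvidia-smi` tool is on your PATH, and the `parse_gpu_line`, `lookup_rating`, and `detect_gpus` helpers are hypothetical names invented for this example. The ratings are copied from the table in this manual.

```python
import subprocess

# One-way latency and rating per GPU model, copied from the table in this manual.
PERFORMANCE_TABLE = {
    "5090": ("~0.5–1s", "Ultra"),
    "4090": ("~0.5–1s", "Ultra"),
    "4080": ("~0.7–1.3s", "High"),
    "4070": ("~1–1.5s", "High"),      # also matches "4070 Ti"
    "3070": ("~1.5–2.5s", "Medium"),
    "3060": ("~1.5–2.5s", "Medium"),
    "3050": ("~2.5–4s", "Low"),
}
CPU_FALLBACK = ("~10–20s", "Fallback")


def parse_gpu_line(line):
    """Split one 'name, memory' CSV line into (name, vram_in_mib)."""
    name, memory = line.rsplit(",", 1)
    return name.strip(), int(memory.strip())


def lookup_rating(gpu_name):
    """Return (latency, rating) for a listed model, or None if it is not in the table."""
    for model, entry in PERFORMANCE_TABLE.items():
        if model in gpu_name:
            return entry
    return None


def detect_gpus():
    """Ask nvidia-smi for GPU names and VRAM; return [] if it is unavailable."""
    try:
        output = subprocess.run(
            ["nvidia-smi", "--query-gpu=name,memory.total",
             "--format=csv,noheader,nounits"],
            capture_output=True, text=True, check=True,
        ).stdout
    except (FileNotFoundError, subprocess.CalledProcessError):
        return []
    return [parse_gpu_line(line) for line in output.splitlines() if line.strip()]
```

Calling `detect_gpus()` and passing each name to `lookup_rating` gives the matching table row. An empty list means no NVIDIA GPU was found, in which case `CPU_FALLBACK` applies. Cards that are not listed in the table return `None` rather than a guessed rating.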
For the fastest possible setup: use Faster Whisper (Tiny) + your preferred translation model + Piper TTS + Turbo Audio Mode. On an RTX 4070 Ti or better, this achieves under 1 second one-way latency!
S2S Portable works with any voice-chat application through Virtual Audio Cable (VB-Cable) routing.
For bidirectional conversations (hearing their speech translated back to you), you may need a second VB-Cable. The app's Audio tab has a built-in Cable Test to verify routing.
For startup cable auto-detection, the post-wizard setup summary, Open Windows Sound, and a Quest / VRChat / Windows routing table, see the standalone Virtual Audio Cable setup guide.
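To see at a glance whether one or two virtual cables are installed, you can scan the names of your Windows audio devices. The Python sketch below is only an illustration and is unrelated to the app's built-in Cable Test. It assumes VB-Cable endpoints carry "VB-Audio" in their device names, with "Input" marking the playback side and "Output" marking the recording side. The `find_virtual_cables` and `routing_summary` helpers are hypothetical, and the list of device names is something you supply yourself, for example by copying it from the Windows Sound panel.

```python
def find_virtual_cables(device_names):
    """Return the device names that look like VB-Audio virtual cable endpoints."""
    return [name for name in device_names if "vb-audio" in name.lower()]


def routing_summary(device_names):
    """Group cable endpoints by direction and report whether two cables are present."""
    cables = find_virtual_cables(device_names)
    playback = [name for name in cables if "input" in name.lower()]
    recording = [name for name in cables if "output" in name.lower()]
    return {
        "playback": playback,      # endpoints an app plays audio into
        "recording": recording,    # endpoints another app records from
        "two_cables": len(playback) >= 2 and len(recording) >= 2,
    }


# Example device list (assumed names, typed in by hand for this sketch).
devices = [
    "Speakers (Realtek Audio)",
    "Microphone (USB Headset)",
    "CABLE Input (VB-Audio Virtual Cable)",
    "CABLE Output (VB-Audio Virtual Cable)",
]
summary = routing_summary(devices)
```

With the example list above, `summary["two_cables"]` is `False`, which corresponds to the single-cable case where translated speech can be sent out but a second cable may still be needed for the return direction.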
Opens a large, always-on-top overlay window showing all translations as big, easy-to-read text. Perfect for deaf or hard-of-hearing users.
Type messages that get translated and spoken via TTS. Perfect for users who cannot speak but want to communicate in a different language.
Record 20 seconds of your voice, and OpenVoice V2 will adapt the translated speech to match your unique timbre. You'll sound like yourself, not a robot.
Always launch the app with START.bat, not individual Python files.
This software is provided "as-is" for personal use. No official support is given. Please use this manual to troubleshoot. Commercial users with a valid license receive dedicated support.
The following things are now fully automatic — no manual configuration needed:
9+ months of intensive development combining open-source tools. Built as a love letter to the VRChat community by Hardy Brielbeck.
If you enjoy barrier-free conversations, consider supporting continued development:
PayPal: paypal.me/HardyBrielbeck