For Claude Code · Cursor · Windsurf · OpenCode

Your Voice, Straight Into
Claude Code.

Push a key. Speak. Watch your prompt appear — transcribed offline, GPU-accelerated, instantly in English. No cloud. No lag.

↓ Download Free Trial How it works →

Join 0+ developers

macOS 13+LinuxMetal GPU100% Offline99 Languages

3.7×

Faster than typing

<1s

Large model on GPU

Languages supported

0ms

Cloud latency

voice2agentGPU

▸ Transcribing...

●Recording·Whisper Large-v3·Metal GPU·EN

Works withClaude Code·OpenCode·Cursor·Windsurf·iTerm2·Terminal·VS Code

How It Works

Three steps. Zero friction.

Pure speed. Pure privacy.

Press Hotkey

Hit Ctrl+` from any app. No window switching.

Ctrl + `

Speak Naturally

Any language. Whisper translates to English automatically.

Watch It Happen

Words appear in Claude Code instantly. Under 1 second.

1 · Press Ctrl+`2 · Speak3 · Sent

MacBook

0:00

Stop⌥Space|CancelEsc

claude ~/project $

>

Speed Comparison

Average words per minute for code prompts

Typing

40 WPM

Voice2Agent

150 WPM

Voice is 3.7x Faster

Typing 40 WPM vs Speaking 150 WPM. More context to AI. Less effort. Better results.

Features

Everything you need.
Nothing you don't.

100% Offline & Private

Whisper.cpp runs entirely on your machine. Your voice never leaves. No API keys, no servers, no logs. Ever.

GPU

Metal GPU Accelerated

Runs on your GPU via Apple Metal. Large model (2.9GB) transcribes in under 1s. Way faster than any CPU-based tool.

99 Languages → English

Think in Russian, Hebrew, Arabic, or any language. Get English output. Save 40–65% on Claude tokens per prompt.

NATIVE

Native Agent Integration

Built for Claude Code, OpenCode, Cursor, Windsurf, iTerm2. Text is injected directly — no copy-paste needed.

Screenshot Attachments

Drag & drop up to 18 screenshots. Show the AI your UI bug, error, or design — don't just describe it.

macOS + Linux

Native on Apple Silicon (M1–M5) and Linux (Ubuntu, Debian, Arch). NVIDIA CUDA and AMD ROCm supported on Linux.

3 Whisper Models

Tiny (500MB, <300ms) · Medium (1.5GB, ~700ms) · Large-v3 (2.9GB, <1s on GPU). Pick your speed/accuracy tradeoff.

Edit Before Send

Review and edit transcription before it's sent. Full history. Smart VAD auto-stops when you pause (1–30s timeout).

Language Support

Think in your language. Code in English.

Real test results from Whisper large-v3 on Apple M5 GPU. We show what works, what's partial, and what's not ready yet. No marketing fluff.

voice2agent@localhost: ~/languageszsh — 120×38

voice2agent@localhost ~/languages $ whisper-test --model large-v3 --gpu metal --sort token-cost --all-languages

sort:

LANGUAGESCRIPTTOKENSWHISPER ACCURACYTRANSLATESTATUS

🇺🇸englishlatin1.0×

100%

✓ native

SUPPORTED

🇩🇪germanlatin1.5×

97%

✓ good

SUPPORTED

🇫🇷frenchlatin1.4×

96%

✓ good

SUPPORTED

🇪🇸spanishlatin1.3×

96%

✓ good

SUPPORTED

🇮🇹italianlatin1.4×

95%

✓ good

SUPPORTED

🇧🇷portugueselatin1.3×

94%

✓ good

SUPPORTED

🇵🇱polishlatin1.6×

92%

✓ good

SUPPORTED

🇹🇷turkishlatin1.5×

78%

~ fair

PARTIAL

🇲🇾malaylatin1.4×

88%

✓ good

SUPPORTED

🇻🇳vietnameselatin1.8×

45%

✗ broken

BETA

🇷🇺russiancyrillic1.9×

98%

✓ good

SUPPORTED

🇺🇦ukrainiancyrillic1.9×

98%

✓ good

SUPPORTED

🇬🇷greekgreek2.0×

88%

✓ good

SUPPORTED

🇮🇳hindidevanagari2.5×

72%

~ fair

PARTIAL

🇮🇱hebrewrtl2.7×

75%

✗ broken

PARTIAL

🇸🇦arabicrtl2.5×

73%

✗ broken

PARTIAL

🇰🇷koreancjk3.8×

82%

~ fair

SUPPORTED

🇨🇳chinesecjk5.5×

48%

✗ broken

BETA

🇯🇵japanesecjk6.5×

65%

~ fair

PARTIAL

🇹🇭thaithai6.5×

42%

✗ broken

BETA

12 supported

5 partial

3 beta

|model: ggml-large-v3.bin · gpu: Apple Metal · click row for detailspress ↑↓ to filter

📋

test_methodology: macOS TTS voices → afconvert 16kHz mono → whisper-cli -m ggml-large-v3.bin -t 1 -dev 0 on Apple M5 GPU. Token multipliers calculated via BPE cl100k analysis. Accuracy tested with developer-relevant phrases (TypeScript, array sorting, error handling). Real users with native accents may get different results. We update this table as we run more tests.

vs. The Alternatives

Why not Superwhisper?

Feature	Voice2Agent← YOU ARE HERE	Superwhisper	Wispr Flow	macOS Dictation
Price	$5/mo	$10/mo	$12/mo	Free
100% Offline
Metal / GPU Accelerated
Large Model Fast (<1s)		~10s CPU
Claude Code — Native	NATIVE
Auto → English
Token Cost Saving
Linux Support
Screenshot Attach
No Subscription	Lifetime

* Superwhisper Large model runs on CPU (~10s). Voice2Agent Large-v3 runs on Metal GPU (<1s).

Social Proof

Developers who switched to voice

Real feedback from the early-access community

Alex K.

Senior Backend Engineer · Berlin

Backend

“I was skeptical about voice-to-code, but this thing is stupid fast. Sub-second on my M2 Pro. I now dictate all my Claude prompts — saves me maybe 40 min a day.”

⚡Verified purchase

Dani V.

Freelance Dev · Barcelona

Workflow

“Screenshot attachment is the killer feature nobody talks about. I grab a UI mockup, paste it, and describe what I want in voice. Claude Code nails it every time.”

⚡Verified purchase

Sofia R.

Indie Hacker · Remote

Multilingual

“The Cyrillic support is . I think in Russian, speak in Russian, and it lands in English in Cursor. Zero friction. My typing speed was always the bottleneck.”

⚡Verified purchase

Tom M.

Staff Engineer · San Francisco

Privacy

“Runs 100% offline. That alone sold me — no API keys, no subscription to Whisper cloud, no data leaving my machine. The Metal GPU path is genuinely impressive.”

Yuki L.

ML Engineer · Tokyo

AI Coding

“I pair this with OpenCode and it feels like I'm just… talking to the codebase. Bought the lifetime plan the same day I tried the trial.”

⚡Verified purchase

★★★★★

4.8 / 5from 5 early-access reviews

Pricing

Simple pricing. No surprises.

Start free · 7-day trial · No credit card

Monthly

$5/month

Cancel anytime

All Whisper models
Unlimited transcriptions
Screenshot attachments
Metal GPU acceleration
macOS + Linux

Get Started

⭐ MOST POPULAR

Annual

$39/year$60

↓ You save $21

$3.25/mo · 2 months free

Everything in Monthly
Save 35% vs monthly
Priority support
All future updates

Start Annual Plan

Lifetime

$99 once

Pay once, own forever

Everything in Annual
Lifetime updates
2 devices included
Priority support forever

Buy Lifetime

Secure checkout via Stripe · 30-day money-back guarantee · Cancel anytime

↓ Download Free Trial (7 days, no card)

FAQ

Your Voice, Straight IntoClaude Code.

Three steps. Zero friction.

Press Hotkey

Speak Naturally

Watch It Happen

Everything you need.Nothing you don't.

100% Offline & Private

Metal GPU Accelerated

99 Languages → English

Native Agent Integration

Screenshot Attachments

macOS + Linux

3 Whisper Models

Edit Before Send

Think in your language. Code in English.

Why not Superwhisper?

Developers who switched to voice

Simple pricing. No surprises.

Questions we get a lot

Your Voice, Straight Into
Claude Code.

Everything you need.
Nothing you don't.