Offline text prediction · macOS

Offline text prediction for Mac:
autocomplete with no internet.

Shadowtype finishes your sentences as inline ghost text in any app — and it does it without ever calling a server. There is no inference network request, period. Pull the Wi-Fi, board the flight, work air-gapped: it keeps predicting at full speed because the model runs 100% on your own Apple Silicon with llama.cpp and Metal.

  • Works fully offline
  • No inference call ever
  • Zero telemetry
  • One-time purchase
Why offline matters

The autocomplete that doesn’t need the cloud

Cloud autocomplete dies the moment your connection does — and it ships every keystroke to someone else’s server. Shadowtype runs the whole prediction loop on your Mac, so the network is never in the path between you and your next word.

No inference network call — ever

Predictions are computed locally with a downloaded model. There is no API request to OpenAI, to us, or to anyone, for any completion. Turn off Wi-Fi and watch the ghost text keep flowing — same words, same instant speed.

Built for flights & spotty Wi-Fi

Airplane Mode, a train tunnel, a hotel connection that drops every minute — none of it matters. Latency-free local inference means there’s no spinner, no “reconnecting,” no degraded suggestions when the bars disappear.

Safe for air-gapped & secure work

If your machine can’t talk to the internet by policy, Shadowtype still works. Nothing you type is transmitted because there’s nowhere for it to go — no telemetry backend, no analytics, no account. Activate once, then disconnect forever.

No per-call cost, no rate limits

Cloud completion bills per token and throttles you under load. Local inference is free to run as hard as your Mac allows — type all day, every day, with no usage meter on the model itself.

What actually touches the network

Three optional connections. None of them your text.

Being honest about “offline” means naming every byte that could leave the Mac. Here is the complete list — and your keystrokes, prompts, and completions are on none of it.

One-time model download

Pick a model (Gemma 4 or Qwen3.5 GGUF) and it downloads once to your Mac. After that the file is local forever — re-launch on a plane and it loads straight from disk.

One-time license activation

The one-time license verifies once, online, when you activate it. From then on it runs offline indefinitely — no recheck, no phone-home, no grace-period countdown. The free tier needs no activation at all.

Optional update check

Shadowtype can check for a new version periodically so you get fixes and new models. It’s the only recurring outbound traffic, it carries none of your content, and you can simply ignore it.

Want the why behind the architecture? See how the local on-device LLM runs entirely on your hardware, why that makes it private by construction, and the full feature list.

Same app, online or off

Everything works the same with the cable unplugged

Continuous ghost text in any app

Mail, Notes, Slack, your editor, a web form — Shadowtype predicts inline as you type. Tab accepts a word, ⌥Tab accepts the whole line. Offline changes none of it.

On-device selection rewrite

Select any text and press ⌥⌘K to rewrite it — shorter, clearer, a different tone — all computed locally. No connection required, your draft never leaves the machine.

Your choice of local model

Run Gemma 4 for speed or Qwen3.5 for richer predictions — both downloaded once and executed on-device. Swap models anytime; the inference path stays fully offline.

Questions

Offline autocomplete FAQ

Does it really work offline?
Completely. Predictions run on a local model with llama.cpp and Metal on your Apple Silicon — there is no inference network call, ever. Turn off Wi-Fi or go to Airplane Mode and the ghost text keeps coming at the same speed and quality. The connection is never in the path between your keystroke and your next word.
What network calls happen at all?
Exactly three, and none carry your text: a one-time model download when you first choose a model, a one-time license activation (online once, then offline forever), and an optional periodic update check you can ignore. Your keystrokes, prompts, and completions never leave the Mac — there’s no telemetry or analytics backend.
Is it good for flights or secure environments?
Yes — that’s where local inference shines. On a plane, a train, a flaky hotel connection, or an air-gapped secure machine, cloud autocomplete can’t run at all. Shadowtype doesn’t care: same instant suggestions, no spinner, no per-API-call cost, no latency. Activate once online, then disconnect for good.
Do I need an account?
No account, no sign-in. The free tier (100 accepted words/day) needs no login whatsoever, and the one-time license activates once online then runs offline indefinitely on up to 5 of your Macs. Requires macOS 14+ on Apple Silicon.
Ready when you are

Autocomplete that works with the Wi-Fi off.

Download Shadowtype free, accept your first word with Tab, then pull the connection and watch it keep going.

  • Works offline
  • No inference call
  • No account
  • Zero telemetry
  • 14-day refund