Dictaitor

On-device speech-to-text iOS app powered by NVIDIA Parakeet TDT 0.6B v3, running on the Apple Neural Engine.

How it works

Tap the mic button and speak
Text appears in real time as you talk (via Apple's Speech framework)
Tap stop — Parakeet v3 produces the final high-accuracy transcription on-device
Result is automatically copied to clipboard (syncs to Mac via Universal Clipboard)

Features

Real-time feedback — live transcription displayed as you speak
High-accuracy final result — Parakeet v3 (0.6B parameters) runs on Apple Neural Engine
25 European languages — Polish, English, German, French, Spanish, and more
Fully offline — after initial model download (~600MB), no network needed
Auto-copy — transcript copied to clipboard on stop
Universal Clipboard — paste directly on your Mac with ⌘V

Requirements

iPhone with Apple Silicon (A14+) — iPhone 12 or newer
iOS 17.0+
~700MB free storage for model download on first launch
Wi-Fi required only for the initial model download

Building

Prerequisites

Xcode 16+ with iOS SDK
Apple Developer account (free or paid)

Build & Run

# Point to your Xcode installation
sudo xcode-select -s /path/to/Xcode.app/Contents/Developer

# Build
xcodebuild -project Dictator/Dictator.xcodeproj \
  -target Dictator -sdk iphoneos -arch arm64 \
  -allowProvisioningUpdates \
  DEVELOPMENT_TEAM=YOUR_TEAM_ID \
  build

# Install on connected device
xcrun devicectl device install app \
  --device YOUR_DEVICE_ID \
  Dictator/build/Release-iphoneos/Dictator.app

Or open Dictator/Dictator.xcodeproj in Xcode, select your device, and hit Run.
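
The two commands above can also be wrapped in a small helper so that a forgotten placeholder fails early instead of partway through a build. This is a minimal sketch, not part of the repository: the function names and the `TEAM_ID`, `DEVICE_ID`, and `DRY_RUN` variables are hypothetical, while the `xcodebuild` and `devicectl` invocations are copied from the commands above.

```shell
# Hypothetical helper wrapping the README's build and install commands.
# TEAM_ID, DEVICE_ID and DRY_RUN are illustrative variable names.

PROJECT="Dictator/Dictator.xcodeproj"
APP_PATH="Dictator/build/Release-iphoneos/Dictator.app"

# Run a command, or only print it when DRY_RUN=1.
run() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

# Reject empty values and README placeholders that were left unedited.
require_value() {
  name=$1
  value=$2
  case "$value" in
    ""|YOUR_*)
      echo "error: set $name before building" >&2
      return 1
      ;;
  esac
}

build_and_install() {
  require_value TEAM_ID "${TEAM_ID:-}" || return 1
  require_value DEVICE_ID "${DEVICE_ID:-}" || return 1

  # Same build invocation as in the README.
  run xcodebuild -project "$PROJECT" \
    -target Dictator -sdk iphoneos -arch arm64 \
    -allowProvisioningUpdates \
    DEVELOPMENT_TEAM="$TEAM_ID" \
    build || return 1

  # Same install invocation as in the README.
  run xcrun devicectl device install app \
    --device "$DEVICE_ID" \
    "$APP_PATH"
}
```

After sourcing the snippet, calling `build_and_install` with `DRY_RUN=1` set only prints the commands it would run, which is a quick way to check the team and device values before starting a real build.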

Finding your device ID

xcrun devicectl list devices
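
To avoid copying the identifier by hand, the device listing can be written to a JSON file and filtered. The sketch below rests on two assumptions to verify on your machine: that your `devicectl` version accepts `--json-output <path>`, and that each device entry in that file carries an `identifier` field. The helper name `device_ids_from_json` is hypothetical.

```shell
# Hypothetical helper: print every "identifier" value found in a JSON file.
# Assumes the file was produced by something like:
#   xcrun devicectl list devices --json-output devices.json
# and that device entries contain a pair such as  "identifier" : "<id>".
device_ids_from_json() {
  grep -o '"identifier"[[:space:]]*:[[:space:]]*"[^"]*"' "$1" |
    sed 's/.*"\([^"]*\)"$/\1/'
}
```

For example, `device_ids_from_json devices.json | head -n 1` would print the first identifier in the file. If nothing is printed, inspect the JSON directly, since the field layout assumed here may differ between Xcode versions.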

Architecture

┌───────────────────────────────────────────────────────┐
│                     Dictaitor App                     │
├───────────────────────────────────────────────────────┤
│                                                       │
│  ┌───────────────┐     ┌──────────────────────────┐   │
│  │ AVAudioEngine │────▶│ Apple SFSpeechRecognizer │   │
│  │ (16kHz mono)  │     │  (live partial results)  │   │
│  └───────┬───────┘     └──────────────────────────┘   │
│          │                                            │
│          ▼ on stop                                    │
│  ┌──────────────────┐                                 │
│  │   Parakeet v3    │                                 │
│  │  (CoreML / ANE)  │──▶ Final text ──▶ Clipboard     │
│  │  via FluidAudio  │                                 │
│  └──────────────────┘                                 │
│                                                       │
└───────────────────────────────────────────────────────┘

Credits

FluidAudio — Swift framework for on-device audio ML
NVIDIA NeMo Parakeet — ASR model
FluidInference CoreML conversion — CoreML model artifacts

License

MIT — see LICENSE
