A powerful Chrome extension that converts speech to text using AI, with intelligent formatting based on context. Your dictated text is transcribed, formatted appropriately for the current website, and copied to your clipboard.
- π€ Voice-to-Text: Speak naturally and get accurately transcribed text
- π€ AI-Powered Formatting: Automatically formats your text based on the website context
- π Clipboard Integration: Text is copied to clipboard for easy pasting anywhere
- π Multilingual Support: Available in English, German, Spanish, and French
- β‘ Fast & Reliable: Uses Groq's Whisper for transcription and Qwen 3 32B for formatting
- π Privacy-Focused: No data storage - all processing via direct API calls
- β¨οΈ Keyboard Shortcut: Quick access with Cmd+Shift+1 (macOS) or Ctrl+Shift+1 (Windows/Linux/ChromeOS)
Before installing, ensure you have:
- Chrome 116 or later (required for audio recording)
- Operating System: macOS, Windows, Linux, or ChromeOS
- Microphone access (extension will request permission on first use, except on ChromeOS where it typically works automatically)
- Internet connection (for AI processing)
ChromeOS Users: This extension is optimized for ChromeOS with enhanced keyboard handling and seamless microphone access.
Note for Chromebook/Enterprise users: Some managed devices may restrict extension installation or microphone access. Contact your administrator if you encounter issues.
Option A: Chrome Web Store (Recommended)
- Visit the Chrome Web Store (coming soon)
- Click "Add to Chrome"
- Confirm by clicking "Add extension"
Option B: Manual Installation
- Download the latest release ZIP file
- Extract the ZIP to a folder on your computer
- Open Chrome and navigate to
chrome://extensions/ - Toggle on "Developer mode" (top-right corner)
- Click "Load unpacked" and select the extracted folder
- The extension icon should appear in your Chrome toolbar
Chrome will automatically request microphone access when first using the extension.
To grant permission:
- When prompted, click "Allow" in the permission dialog
- The extension will save this permission for future use
If permission was blocked or needs to be changed:
- Look for the π or π€ icon in Chrome's address bar (left side)
- Click the icon to open site permissions
- Set "Microphone" to "Allow"
- Refresh the page
To manage all Chrome permissions:
- Go to
chrome://settings/content/microphone - Ensure microphone access is enabled
- Add your website domains to "Allowed" if needed
For system-level microphone issues:
- Windows: Go to Settings > Privacy > Microphone > Allow apps to access microphone
- macOS: Go to System Preferences > Security & Privacy > Privacy > Microphone > Check Chrome
- ChromeOS: Settings > Advanced > Privacy and security > Site settings > Microphone
Why do I need this? The extension uses Groq's free AI services for speech recognition and text formatting.
- Create Account: Visit console.groq.com
- Sign Up: Click "Sign Up" and create a free account
- Generate Key:
- Go to "API Keys" in the dashboard
- Click "Create API Key"
- Copy the key (starts with
gsk_...) - Important: Save this key securely - you won't see it again!
- Open Settings: Click the extension icon β "Settings"
- Enter API Key: Paste your Groq API key
- Choose Language: Select your preferred language (English, German, Spanish, or French)
- Personalize (Optional): Add your name, addresses, or custom spellings for better accuracy
- Save: Click "Save" to store your settings
Default shortcuts by platform:
- macOS:
Cmd+Shift+1 - Windows/Linux/ChromeOS:
Ctrl+Shift+1
To customize:
- Go to
chrome://extensions/shortcuts - Find "AI Voice Dictation"
- Click the pencil icon next to "Start/Stop Dictation"
- Press your preferred key combination
- Click outside to save
Note: ChromeOS users benefit from additional keyboard handling optimizations for the best experience.
Tips for choosing shortcuts:
- Avoid conflicts with browser shortcuts (e.g., Ctrl+T, Ctrl+W)
- Try combinations like Ctrl+Shift+D or Alt+Shift+V
- On macOS, use Cmd instead of Ctrl
- Test your shortcut on different websites to ensure it works
Alternative trigger: Always available via the extension icon in your toolbar
- Test microphone: Click the extension icon to verify it loads
- Quick test: Press
Ctrl+Shift+1and say "Hello, this is a test" - Verify: Check if the recording indicator appears and text gets copied to clipboard
- Start Dictation:
- Press
Ctrl+Shift+1(anywhere on any webpage) - OR click the extension icon in toolbar
- Press
- Record Your Speech:
- Speak clearly and naturally
- Red recording indicator shows recording is active
- You have up to 60 seconds (warning at 50 seconds)
- Stop Recording:
- Press
EnterorEsc - OR click the recording indicator
- Press
- Use Your Text:
- Text is automatically copied to clipboard
- Press
Ctrl+Vto paste anywhere - Green notification confirms text is ready
- Speak naturally: No need to pause between words
- Context matters: The AI adapts to the website you're on
- Use anywhere: Works on all websites, even when no text field is selected
- Multiple languages: Switch language in settings for better recognition
The AI adapts writing style based on the website you're using:
- Gmail: Professional, formal tone
- Slack/Discord: Casual, friendly style
- LinkedIn: Professional networking tone
- Twitter/X: Concise, engaging posts
- Google Docs: Well-structured documents
- GitHub: Technical, precise language
- And more: Appropriate formatting for any website
- API Key: Your Groq API key (required for all functionality)
- Language: Choose from English, German, Spanish, or French
- AI Model: Qwen QWQ 32B (recommended) or other Groq models
Make your dictation more accurate by adding:
- Full Name: Your name for better recognition
- Business Name: Company name if you use it frequently
- Home Address: Your home address
- Work Address: Your work address
- Custom Spellings: Difficult names, places, or technical terms
- Force Mode: When enabled, always copies to clipboard (useful for complex websites)
- Custom Model Path: Use newer Groq models not in the dropdown
- Debug Mode: View logs for troubleshooting (click π button)
- Keyboard Shortcut: Customize via
chrome://extensions/shortcuts
- Settings sync across all your Chrome devices
- UI language changes immediately when selected
- Dictation language affects AI transcription accuracy
- Speech Recognition: Groq Whisper Large V3 Turbo
- Text Processing: Qwen QWQ 32B (32,000 token context)
- Recording Duration: Up to 60 seconds
- Chrome Version: Requires Chrome 116 or later
- Simplified Workflow: Text always copies to clipboard - no more text field detection
- 4 Languages: Added Spanish and French support
- Better AI Model: Qwen QWQ 32B is now the default recommendation
- Cleaner Interface: Removed complexity for a smoother experience
Manual installation fails:
- Ensure you've enabled "Developer mode" in
chrome://extensions/ - Check that you're selecting the folder (not the ZIP file)
- Try restarting Chrome after installation
Extension disabled automatically:
- Some enterprise/school devices block extensions
- Contact your IT administrator for permission
- Check Chrome management policies
"No microphone access" error:
- Click the π icon in Chrome's address bar
- Set "Microphone" to "Allow"
- Refresh the page and try again
Recording indicator doesn't appear:
- Ensure no other apps are using your microphone
- Check Windows/Mac sound settings
- Try unplugging/reconnecting external microphones
Recording stops immediately:
- This often happens on managed Chromebooks
- Try enabling "Force Mode" in settings
- Check if background audio apps are interfering
"Invalid API key" error:
- Verify your key starts with
gsk_ - Make sure you copied the entire key
- Try generating a new key at console.groq.com
"Network error" messages:
- Check your internet connection
- Verify Groq services are accessible
- Try again in a few minutes
Slow processing:
- Groq's free tier may have rate limits
- Try shorter dictations (under 30 seconds)
- Consider upgrading to Groq Pro for faster processing
Ctrl+Shift+1 not working:
- Check if another app is using this shortcut
- Go to
chrome://extensions/shortcutsto verify/change - On ChromeOS, try different key combinations
- On macOS, use Cmd+Shift+1 instead
- Restart Chrome after changing shortcuts
To troubleshoot shortcut conflicts:
- Press the shortcut in different applications to test
- Check system-wide shortcuts in OS settings:
- Windows: Settings > System > About > Advanced system settings > Environment Variables
- macOS: System Preferences > Keyboard > Shortcuts
- ChromeOS: Settings > Device > Keyboard
- Try alternative combinations like Ctrl+Alt+D or Ctrl+Shift+V
Shortcut works but nothing happens:
- The extension may not be loaded on the current page
- Try refreshing the page or restarting Chrome
- Check if the site has restricted permissions
- Test on a simple page like google.com first
Global shortcut alternatives:
- Some users prefer Ctrl+Shift+D or Alt+Shift+V
- Function key combinations like F12 (if not used by DevTools)
- Consider combinations that don't conflict with common browser shortcuts
Text goes to wrong place:
- Enable "Force Mode" in settings for clipboard-only mode
- This ensures text always goes to clipboard for manual pasting
Formatted text looks wrong:
- The AI adapts to each website's context
- Try customizing your personalization settings
- Report specific issues for better training
Extension blocked by admin:
- Contact your IT administrator
- Ask them to allowlist the extension
- Some managed devices completely block extensions
Microphone blocked:
- Check device management settings
- Some schools/companies restrict microphone access
- Try using on a personal device to verify functionality
For advanced troubleshooting:
- Open extension settings
- Click the "π View Debug Logs" button
- Open Chrome DevTools (F12)
- Check the console for detailed error messages
- Share these logs when reporting issues
- Try incognito mode - This helps identify extension conflicts
- Disable other extensions - Some extensions can interfere
- Update Chrome - Ensure you have Chrome 116 or later
- Clear extension data - Uninstall and reinstall the extension
- Report the issue - Contact support with debug logs
- Complex websites: Some sites like Google Docs work better in Force Mode
- Background noise: Works best in quiet environments
- Accents: Recognition accuracy varies by accent and language
- Technical terms: Add custom spellings in settings for better accuracy
- No audio recordings are stored
- No personal data is collected
- All processing happens via secure API calls to Groq
- Only the current page URL is sent for context (not the content)
This project is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).
- β Free for personal use
- β Free for non-commercial use with attribution
- β Commercial use requires explicit permission
See LICENSE.md for the full license terms.
For issues or feature requests, please visit our GitHub repository.
Made with β€οΈ for all Chrome users