A single-file, browser-based screen AI assistant. Share your screen, paste any AI provider API key, and a floating chat window lets you ask questions about what's on your screen — powered by live screenshots captured every 3 seconds.
Supports Gemini, OpenAI, Anthropic (Claude), and Groq out of the box.
- A modern browser (Chrome recommended for screen capture support)
- An API key from one of the supported providers:
- Gemini — key starts with
AIza - OpenAI — key starts with
sk- - Anthropic — key starts with
sk-ant- - Groq — key starts with
gsk_
- Gemini — key starts with
No installation needed. Just open lens.html in your browser.
- Open
lens.htmlin Chrome - Paste your API key — the provider is auto-detected
- Click Share Screen & Start and select the screen/window to share
- Click Open Chat Window — a floating popup will appear
- Move the chat window on top of your shared screen
- Click Scan & Help or type a question — the AI sees your screen automatically
└── 📄 lens.html # Entire app: UI, screen capture, and AI calls in one file
This project is licensed under the MIT License.