Feature/web gui skill scanner by xywen97 · Pull Request #62 · cisco-ai-defense/skill-scanner

xywen97 · 2026-03-14T16:42:40Z

Pull Request

Description

Summary:

This PR adds a lightweight local web UI for Skill Scanner, mounted under /ui on the existing FastAPI server.

Mounts a static frontend under skill_scanner/api and exposes it at /ui.
Adds a drag-and-drop interface to upload either a ZIP file, or a local skill folder (with SKILL.md) and run a scan using the existing analyzers.
Uses the existing HTMLReporter to embed interactive reports, and adds an option to download both HTML and Markdown reports.
Exposes an LLM configuration panel in the UI (model / base URL / API key) that is only used per request and is never stored server-side.

Implementation notes

New static frontend lives in skill_scanner/api/frontend/index.html and talks to:
- POST /scan-upload-html (HTML report)
- POST /scan-upload-markdown (Markdown report)
- POST /scan-html for scanning an on-disk skill directory.
skill_scanner/api/api.py mounts the frontend at /ui using StaticFiles.
router.py wires the new endpoints to the existing SkillScanner and reporters without changing CLI behavior.

Motivation:

This makes it much easier for users to quickly inspect scan results locally without having to construct CLI commands, and provides a more discoverable way to tweak scan options and view rich HTML reports.

Type of Change

Bug fix (non-breaking change that fixes an issue)
New feature (non-breaking change that adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation update
Performance improvement
Code refactoring
Test coverage improvement

Related Issues

Closes #[issue number]
Fixes #[issue number]
Related to #[issue number]

Changes Made

Change 1: Add local web UI mounted at /ui
- Mount a static frontend directory from skill_scanner/api via StaticFiles in api.py.
- New index.html provides a dark-themed UI with drag‑and‑drop upload, “Choose ZIP…” and “Choose folder…” actions, and an embedded iframe for interactive HTML reports.
Change 2: Extend API router to support UI workflows
- Extend ScanRequest and _build_analyzers to accept optional llm_model, llm_base_url, and llm_api_key, wiring them through to build_analyzers.
- Add /scan-html to scan an existing skill directory and return an HTML report using HTMLReporter.
- Add /scan-upload-html (HTML) and /scan-upload-markdown (Markdown) endpoints that reuse the existing SkillScanner pipeline and meta‑analyzer, streaming results back as reports for the UI.
Change 3: Improve report rendering & UX
- Update HTMLReporter table CSS to use fixed layout and word‑wrapping so long paths don’t require horizontal scrolling.
- Frontend UI now exposes optional LLM settings (model / base URL / API key) behind an “Enable LLM” toggle, shows step‑wise scan status (“upload / unpack”, “run analyzers”), and adds buttons to open the HTML report in a new tab or download both HTML and Markdown reports.

Testing

Test Coverage

Unit tests added/updated
Integration tests added/updated
All tests pass locally
Test coverage maintained or improved

Manual Testing

Describe manual testing performed:

# Commands run for testing
skill-scanner-api

Open http://localhost:8000/ui/ in your local browser.

Results:

Expected: Be able to run local scans through an in‑browser GUI with drag‑and‑drop, basic options, and a way to open or export the report.
Actual: Users can open /ui, drop a ZIP or choose a folder to scan, see an embedded HTML report, configure LLM options when needed, and download both HTML and Markdown reports directly from the UI.

Checklist

Code Quality

Code follows project style guidelines
Type hints added where applicable
Docstrings added/updated for public APIs
No hardcoded credentials or secrets
Error handling is comprehensive
Logging is appropriate

Documentation

README updated (if needed)
API documentation updated (if needed)
CHANGELOG updated
Code comments added for complex logic

Security

No new security vulnerabilities introduced
Input validation added where needed
Follows security best practices from workspace rules
No eval/exec on user input without sanitization

Testing

Tests pass: uv run pre-commit run --all-files
Benchmark passes: uv run python evals/runners/benchmark_runner.py
No regressions in existing functionality
Edge cases covered

Performance Impact

No significant performance regression
Performance benchmarks run (if applicable)
Resource usage is acceptable

Screenshots (if applicable)

Additional Notes

Any additional information reviewers should know.

Reviewer Checklist

For reviewers:

Code changes are clear and well-documented
Tests are comprehensive
No security issues introduced
Performance is acceptable
Documentation is updated

Made-with: Cursor

vineethsai7

Code Review — Security & Code Quality

Security Findings

1. CRITICAL: `llm_api_key` accepted as Form field / request body instead of Header

The llm_api_key is accepted via Form(...) on the upload endpoints (/scan-upload-html, /scan-upload-markdown) and as a Pydantic body field on ScanRequest. API keys in request bodies/form data are more likely to be logged by proxies, WAFs, and access logs than when sent via headers.

The existing pattern for vt_api_key and aidefense_api_key correctly uses Header(None, alias="X-...-Key"). The llm_api_key should follow the same convention.

Files: skill_scanner/api/router.py lines 199-201, 729-731, 917-919

2. HIGH: Missing Subresource Integrity (SRI) on external CDN script

<script src="https://cdn.jsdelivr.net/npm/jszip@3.10.1/dist/jszip.min.js"></script>

External scripts must use SRI hashes to prevent supply chain attacks. If the CDN is compromised, arbitrary JS would execute in the user's browser.

File: skill_scanner/api/frontend/index.html line 626

3. MEDIUM: `innerHTML` used with file names — DOM XSS sink

The setStatus function uses innerHTML to render status messages containing file names (e.g., file.name). While low risk for local files, this is a DOM XSS sink pattern. Should use textContent or escape HTML entities before interpolation.

File: skill_scanner/api/frontend/index.html lines 656-657, 711

4. MEDIUM: No Content-Security-Policy header or meta tag

The static HTML page and API responses don't set a CSP header. For a tool that renders untrusted HTML reports inside an iframe (via srcdoc), a CSP provides defense-in-depth against injected scripts.

File: skill_scanner/api/frontend/index.html

Code Quality Findings

5. HIGH: ~300 lines of duplicated upload/scan/extract logic

The three upload endpoints (/scan-upload, /scan-upload-html, /scan-upload-markdown) each independently implement the complete ZIP upload, validation, extraction, and scan pipeline (~150 lines each of nearly identical code). Should extract a shared helper.

File: skill_scanner/api/router.py

6. MEDIUM: `/scan-html` duplicates `/scan` endpoint logic

The /scan-html endpoint is a near-copy of /scan, differing only in the final rendering step. The scan + meta-analysis logic should be factored into a shared helper.

7. MEDIUM: Nested `asyncio.run()` inside `run_in_executor` is fragile

In scan_uploaded_skill_html and scan_uploaded_skill_markdown, meta-analysis creates a new event loop inside a thread inside the existing event loop. This can be simplified since the endpoint is already async.

8. LOW: Imports inside function bodies; `_frontend_dir` path computation is non-obvious

Multiple endpoints have import asyncio, import stat, import zipfile inside function bodies. Also Path(__file__).with_suffix("").parent / "frontend" could be simplified to Path(__file__).parent / "frontend".

Security: - Bundle JSZip locally to eliminate external CDN calls - Tighten CSP to default-src 'none' with explicit allowlist - Remove allow-same-origin from iframe sandbox - Add Referrer-Policy meta tag and server-side security headers (X-Content-Type-Options, X-Frame-Options, Cache-Control, Permissions-Policy) - Fix double-encoding bug in status/error display UI/UX: - Add light/dark mode with system preference detection and persistence - Add summary cards showing scan findings by severity - Add live scan timer and in-session scan history - Improve error handling with styled banner and retry button - Add ARIA roles, labels, and focus indicators for accessibility Packaging: - Move fastapi/uvicorn/python-multipart to optional [web] extras group - Guard API imports so core package works without web dependencies - Update error messages to guide users to install [web] extra Code quality: - Deduplicate shared helpers in API router

- Bundle JSZip locally (skill_scanner/api/frontend/js/jszip.min.js), remove CDN - Add Content-Security-Policy meta tag in frontend index.html - Extract _extract_uploaded_zip() for shared ZIP upload/validate/extract logic - Extract _run_scan_with_meta() for shared scan + meta-analysis; use in /scan, /scan-html, /scan-upload-html, /scan-upload-markdown - Replace nested asyncio.run() in upload handlers with direct await - Move asyncio, concurrent.futures, stat, zipfile to top-level imports in router - Use Path(__file__).parent for frontend dir in api.py

xywen97 added 2 commits March 15, 2026 00:11

Add local web UI for skill scanning

8d5100c

Made-with: Cursor

Document new API web UI helpers

6da0cec

Made-with: Cursor

vineethsai7 reviewed Mar 14, 2026

View reviewed changes

vineethsai7 and others added 2 commits March 14, 2026 11:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/web gui skill scanner#62

Feature/web gui skill scanner#62
xywen97 wants to merge 4 commits into
cisco-ai-defense:mainfrom
xywen97:feature/web-gui-skill-scanner

xywen97 commented Mar 14, 2026

Uh oh!

vineethsai7 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

xywen97 commented Mar 14, 2026

Pull Request

Description

Type of Change

Related Issues

Changes Made

Testing

Test Coverage

Manual Testing

Checklist

Code Quality

Documentation

Security

Testing

Performance Impact

Screenshots (if applicable)

Additional Notes

Reviewer Checklist

Uh oh!

vineethsai7 left a comment

Choose a reason for hiding this comment

Code Review — Security & Code Quality

Security Findings

1. CRITICAL: llm_api_key accepted as Form field / request body instead of Header

2. HIGH: Missing Subresource Integrity (SRI) on external CDN script

3. MEDIUM: innerHTML used with file names — DOM XSS sink

4. MEDIUM: No Content-Security-Policy header or meta tag

Code Quality Findings

5. HIGH: ~300 lines of duplicated upload/scan/extract logic

6. MEDIUM: /scan-html duplicates /scan endpoint logic

7. MEDIUM: Nested asyncio.run() inside run_in_executor is fragile

8. LOW: Imports inside function bodies; _frontend_dir path computation is non-obvious

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

1. CRITICAL: `llm_api_key` accepted as Form field / request body instead of Header

3. MEDIUM: `innerHTML` used with file names — DOM XSS sink

6. MEDIUM: `/scan-html` duplicates `/scan` endpoint logic

7. MEDIUM: Nested `asyncio.run()` inside `run_in_executor` is fragile

8. LOW: Imports inside function bodies; `_frontend_dir` path computation is non-obvious