- index.html: star badge 100k+ → 105k+ (NousResearch/hermes-agent at 105k)
- models/index.html: bump meta description year 2025 → 2026
- models/index.html: update date to April 20, 2026
- models/index.html: Opus 4.6 → Opus 4.7 across all 4 sections (Overall, Coding, Writing, Research)
- New stats: 70% CursorBench, 98.5% XBOW visual-acuity, 3.75MP image resolution
- Picker card 'Complex coding' updated to Opus 4.7
- Anthropic setup box: add claude-opus-4-7 as first model name
- compare/claude-code.html: update Opus 4.6 reference to 4.7
- compare/perplexity-computer.html: update Opus 4.6 reference to 4.7
On mobile (≤640px), .nav-back span{display:none} hid both spans inside
the back link, leaving an empty bordered pill shape visible next to the
logo — the 'broken little thing' users reported seeing to the right of ⚡ Hermes.
Fix: hide .nav-back itself. The hamburger menu provides the back navigation
link, so no information is lost.
Applies to all 13 sub-pages: memory, models, eli5, community, and all 9 compare pages.
- Updated to Grok 4.20: / per 1M (was /5), 2M context (was 256K)
- Arena Elo updated to 1496 (#3 overall, was 1486)
- Added 2M context pill, updated score display to Arena Elo
Coding section:
- Gemini 3.1 Pro: add Terminal-Bench 78.4% (highest of any frontier model on CLI/DevOps)
Score badge updated to show Terminal-Bench rather than SWE-bench Verified
- GPT-5.4: note Terminal-Bench 75.1% in description, consolidate pill text
- #5 DeepSeek V3.2 → Qwen 3.6-Plus: leads Terminal-Bench at 61.6%, 88.2% GPQA,
1M context, available now on Alibaba Cloud + OpenRouter
Writing section (reordered based on EQ-Bench CW scores):
- #1 Claude Sonnet 4.6 (1936 EQ-Bench CW — highest, best voice consistency)
- #2 Claude Opus 4.6 (Mazur 8.53, IF Arena #1, 1M context for literary depth)
- #3 Gemini 3.1 Pro (Arena CW #1 1487, AI-tell avoidance, 2M context)
- #4 GPT-5.4 (noted as ~9th on Arena CW, better for structured/commercial writing)
- #5 Meta Muse Spark → Kimi K2.5 (/usr/bin/bash.60/.50, ~1700 EQ-Bench CW, live API)
Muse Spark removed — no commercial API available yet
Reasoning section:
- Gemini 3.1 Pro GPQA: 95.45% → 94.1% (more conservative/recent figure, consistent
with both agents' data)
- Added ARC-AGI-2 77.1% for Gemini 3.1 Pro (#1 on visual reasoning too)
- Opus 4.6: added note that Sonnet leads GDPval-AA (1633 Elo #1) for throughput
- #5 DeepSeek V3.2 → Qwen 3.6-Plus (88.2% GPQA, 1M context, same model as coding)
Quick picker:
- Creative writing: Opus → Sonnet 4.6 (EQ-Bench #1, 85% cheaper)
- Hard reasoning: 95.45% → 94.1%, add ARC-AGI-2 mention
- Budget pick: DeepSeek V3.2 → Gemini 3 Flash Thinking (/usr/bin/bash.50/1M, 89.8% GPQA)
Setup boxes:
- Self-hosted: Muse Spark → Qwen 3.6-Plus + Gemma 4 26B MoE (Apache 2.0,
82.3% GPQA with 3.8B active params, best edge/self-hosted reasoning)
Overall section: unchanged (top 5 still correct per both agents)
Search section: unchanged (no new data from either agent)
Every page now has a consistent hamburger menu on mobile (≤640px) that
lets users navigate the full site without ever getting stranded.
eli5/index.html:
- Added hamburger button (#eli5-hamburger) + mobile menu (#eli5-mobile-menu)
- Menu links: Home, Why/Features/Compare/Install, ELI5, Models, Community, Get started
- Hamburger CSS + open/close JS added
models/index.html:
- Added hamburger button (#models-hamburger) + mobile menu (#models-mobile-menu)
- Menu includes section divider between main site links and sub-pages
- Hamburger CSS + open/close JS added (placed before tab-bar scroll JS)
compare/*.html (all 9 pages):
- Added hamburger button (#compare-hamburger) + mobile menu (#compare-mobile-menu)
- Menu: Home, ← All comparisons, divider, ELI5, Models, Community, Get started
- Same hamburger CSS (scoped to each page's inline style block)
- Same open/close JS before </script>
community/index.html:
- Existing hamburger kept, mobile menu updated to include ELI5 and Models links
(was missing them — now links to ../eli5/ and ../models/ in accent color)
Behaviour: hamburger hidden at desktop (display:none), shown at ≤640px.
Clicking toggles .open class. Clicking outside closes. All menus are
position:fixed so they overlay content cleanly. Each menu uses a unique
id to avoid collisions when pages are eventually combined.
eli5/index.html:
- Logo: ☿ → ⚡ to match compare/community pages
- Nav: glass effect (rgba 13,17,23, backdrop-filter blur) matching site nav
- Light mode nav override (rgba 246,248,250)
- CTA: github URL → hermes-agent.nousresearch.com install docs
models/index.html:
- Logo: ☿ → ⚡ to match compare/community pages
- Nav: glass effect matching site nav
- Light mode nav override
- CTA: github URL → install docs
- Tab bar: position:sticky so it stays visible while scrolling
- BUGFIX: malformed CSS comment /* HOW TO PICK --> was breaking CSS
parser and causing picker-card, picker-grid, MOBILE rules to be
silently ignored — picker cards had no background, mobile breakpoints
not applying. Fixed comment to /* HOW TO PICK */
- eli5/index.html: visual ELI5 explainer with SVG diagram, before/after,
step-by-step breakdown, and concept cards for new users
- models/index.html: 2025 model guide with 5 categories (Overall, Coding,
Writing, Search, Reasoning), 25 model cards with benchmark data, tab
nav, quick picker, and setup instructions for all providers
- index.html: add ELI5 and Models links to desktop nav and mobile menu
- .gitignore: exclude __pycache__ and .pyc files