Skip to content

feat: merge curated dev updates with MoE routing, reliability fixes, and CI hardening#9

Merged
sanjeevafk merged 10 commits into
mainfrom
development
Mar 28, 2026
Merged

feat: merge curated dev updates with MoE routing, reliability fixes, and CI hardening#9
sanjeevafk merged 10 commits into
mainfrom
development

Conversation

@sanjeevafk

@sanjeevafk sanjeevafk commented Mar 28, 2026

Copy link
Copy Markdown
Owner

This pull request introduces significant improvements and refactoring across documentation, environment configuration, CI/CD automation, and backend authentication caching. The main focus is on removing the dependency on LiteLLM, migrating to native provider-based inference, and enhancing both operational and developer experience. The README and environment files have been thoroughly updated to reflect these architectural changes, and new CI/CD workflows and backend cache management improvements have been introduced.

Key changes:

Provider Migration & Documentation Refactor

  • The README (README.md) has been rewritten to remove all references to LiteLLM, replacing them with detailed documentation of the new native provider-based routing system. This includes updated model routing logic, fallback chains, and a new section describing provider fallback matrices and frontend UX improvements. [1] [2] [3] [4] [5] [6] [7] [8] [9]

  • The .env.example file has been updated to remove LiteLLM-related variables and clarify configuration for new provider keys (Groq, Gemini, OpenRouter). Timeout settings and comments have been revised for clarity.

CI/CD Automation

  • A new GitHub Actions workflow (.github/workflows/vercel-deploy.yml) has been added for automated Vercel deployments. This workflow supports both production and preview deployments, validates required secrets, builds the project, deploys prebuilt artifacts, and blocks deployment if the /api/health endpoint is not healthy. [1] [2]

  • The CI workflow (.github/workflows/ci.yml) now includes backend log hygiene guards to prevent accidental print() statements and unsafe logging of raw user_id fields, enforcing privacy and code quality.

Backend Improvements

  • The authentication module (api/auth.py) has been improved with a more robust, thread-safe cache for user "pro" state. It now uses an OrderedDict with TTL and maximum entry pruning, and logs warnings via the structured logger instead of using print(). Supabase client getters are now cached and log missing credentials more cleanly. [1] [2]

These changes collectively modernize the codebase, improve security and reliability, and provide clearer documentation for both users and developers.

@cloudflare-workers-and-pages

cloudflare-workers-and-pages Bot commented Mar 28, 2026

Copy link
Copy Markdown

Deploying knowbear with  Cloudflare Pages  Cloudflare Pages

Latest commit: 037a9c8
Status: ✅  Deploy successful!
Preview URL: https://5455dd34.knowbear.pages.dev
Branch Preview URL: https://development.knowbear.pages.dev

View logs

@sanjeevafk sanjeevafk changed the title development refractor: removed LiteLLM, fixed v2 issues, improve docs and migrate to Vercel Mar 28, 2026
@sanjeevafk sanjeevafk changed the title refractor: removed LiteLLM, fixed v2 issues, improve docs and migrate to Vercel feat: merge curated dev updates with MoE routing, reliability fixes, and CI hardening Mar 28, 2026
@sanjeevafk sanjeevafk merged commit efa9822 into main Mar 28, 2026
4 checks passed
@sanjeevafk sanjeevafk deleted the development branch April 17, 2026 03:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant