VidBoard

A local-first music video storyboard tool. Uses Ollama (Qwen3) for AI planning and ComfyUI + FLUX.2 Klein for cinematic frame generation — no cloud dependencies, no API rate limits.

Purpose

VidBoard is designed to streamline the pre-production pipeline for AI-assisted music video creation.

You give it an artist, a track, lyrics, and a mood/theme. It researches the artist, builds a Visual Bible, breaks the track into scenes, and generates Start and End frame pairs for each scene — cinematic stills that bookend a video segment.

Those frame pairs are the actual deliverable. The intended workflow is:

1. VidBoard generates a Start frame and End frame for each scene
2. You take those two frames into Google Flow (Veo) and use them as the first and last frame inputs
3. Veo generates the video segment between them: motion, transitions, and everything in between
4. Repeat for each scene to build the full music video sequence

This means VidBoard handles the creative planning and visual consistency work, while Veo handles the video generation. The quality of the output depends heavily on how well the Start and End frames communicate the intended shot — which is what the scene planning and prompt generation pipeline is optimised for.
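
As a rough illustration of the hand-off described above, the sketch below models a scene with its frame pair and lists the first/last frame inputs you would take into Veo, in scene order. The type names, field names, and the `veoInputs` helper are hypothetical and are not taken from VidBoard's source.

```typescript
// Hypothetical data shapes for illustration; VidBoard's real types may differ.
interface FramePair {
  startFrame: string; // path or URL of the Start frame image
  endFrame: string; // path or URL of the End frame image
}

interface Scene {
  index: number; // position of the scene within the track
  lyrics: string; // lyric excerpt covered by the scene
  frames: FramePair;
}

// Return one [first frame, last frame] tuple per scene, ordered by scene index.
function veoInputs(scenes: Scene[]): Array<[string, string]> {
  return [...scenes]
    .sort((a, b) => a.index - b.index)
    .map((s): [string, string] => [s.frames.startFrame, s.frames.endFrame]);
}
```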

The approach to generating Start and End frames is still evolving. The goal is better visual continuity between the two frames of a pair without making them so similar that Veo has nothing interesting to interpolate between.

Prerequisites

1. Node.js 20+ and pnpm

2. Ollama

Install: https://ollama.com/download
Pull a planning model (tested with Gemma4):
```
ollama pull gemma4:e4b
```
Create a free account at https://ollama.com and generate an API key — required for Ollama's web search feature
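
To confirm the model was pulled, you can query the local Ollama server's `GET /api/tags` endpoint, which lists installed models. The sketch below is a minimal check that assumes Ollama is running on its default port (11434). The helper names are illustrative and not part of VidBoard.

```typescript
// The part of Ollama's GET /api/tags response this check relies on.
interface OllamaTags {
  models: { name: string }[];
}

// True if the given model tag (for example "gemma4:e4b") is in the local model list.
function hasModel(tags: OllamaTags, model: string): boolean {
  return tags.models.some((m) => m.name === model);
}

// Ask a local Ollama server whether a model is installed.
// Assumption: Ollama is reachable at its default address, http://localhost:11434.
async function isModelPulled(
  model: string,
  baseUrl = "http://localhost:11434",
): Promise<boolean> {
  const res = await fetch(`${baseUrl}/api/tags`);
  if (!res.ok) throw new Error(`Ollama responded with HTTP ${res.status}`);
  return hasModel((await res.json()) as OllamaTags, model);
}
```

For example, `await isModelPulled("gemma4:e4b")` should resolve to `true` once the pull has completed.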

3. ComfyUI & FLUX.2 Klein

Install ComfyUI: https://github.com/comfy-org/ComfyUI

To use the included workflows, download the following model files and place them in your ComfyUI installation:

| File | Destination |
| --- | --- |
| `flux-2-klein-4b-fp8.safetensors` | `ComfyUI/models/diffusion_models/` |
| `qwen_3_4b.safetensors` | `ComfyUI/models/text_encoders/` |
| `flux2-vae.safetensors` | `ComfyUI/models/vae/` |

All three model files are available on Hugging Face. Search for the filenames above.

Start ComfyUI so it listens on the network:

```
python main.py --listen
```

If you use the Windows Portable version, edit the startup batch file to include `--listen`:

```
.\python_embeded\python.exe -s ComfyUI\main.py --windows-standalone-build --listen
```

Comfy Desktop will run on a different port and is always set to listen. Check its startup log for the actual URL and set COMFYUI_BASE_URL accordingly (it will usually be port 8000).
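
Because the port differs between ComfyUI installs, it can help to resolve and probe the base URL before generating frames. The sketch below is one way to do that. It assumes a fallback of `http://127.0.0.1:8188` (ComfyUI's standard default port) when `COMFYUI_BASE_URL` is unset, which may not match the fallback VidBoard itself uses, and it probes ComfyUI's `/system_stats` endpoint. The function names are illustrative.

```typescript
// Resolve the ComfyUI base URL from an env-like object.
// Assumption: fall back to ComfyUI's standard default port (8188) when unset.
function resolveComfyBaseUrl(env: Record<string, string | undefined>): string {
  const raw = env["COMFYUI_BASE_URL"]?.trim();
  const url = raw && raw.length > 0 ? raw : "http://127.0.0.1:8188";
  return url.replace(/\/+$/, ""); // drop trailing slashes so paths join cleanly
}

// Probe ComfyUI's /system_stats endpoint; resolves to false if the server is unreachable.
async function comfyIsReachable(baseUrl: string): Promise<boolean> {
  try {
    const res = await fetch(`${baseUrl}/system_stats`);
    return res.ok;
  } catch {
    return false;
  }
}
```

With Comfy Desktop, for instance, `resolveComfyBaseUrl({ COMFYUI_BASE_URL: "http://localhost:8000/" })` yields `http://localhost:8000`.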

Setup

```
git clone https://github.com/LyAhn/VidBoard
cd VidBoard
pnpm install
cp .env.example .env.local
# Edit .env.local (at minimum, set OLLAMA_API_KEY)
pnpm run dev
```

Open http://localhost:3000.

Workflows

Three ComfyUI workflow files are included in comfyui/, all targeting FLUX.2 Klein 4B:

| Workflow | Description |
| --- | --- |
| `flux2-klein-txt2img` | Text prompt only |
| `flux2-klein-reference` | Text prompt + character reference image |
| `flux2-klein-reference-img2img` | Text prompt + character ref + init image (img2img) |

By default, VidBoard uses `flux2-klein-reference` for both the Start and End frames. For storyboard continuity (where the Start frame seeds the End frame), set the following in `.env.local`:

```
COMFYUI_START_WORKFLOW=flux2-klein-reference
COMFYUI_END_WORKFLOW=flux2-klein-reference-img2img
```
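
The defaulting behaviour described above can be pictured with a small sketch: each variable falls back to `flux2-klein-reference` when it is unset or blank. This is an illustration of the documented behaviour, not VidBoard's actual code, and `resolveFrameWorkflows` is a hypothetical name.

```typescript
// Documented default workflow for both frames.
const DEFAULT_WORKFLOW = "flux2-klein-reference";

interface FrameWorkflows {
  start: string;
  end: string;
}

// Pick the workflow for each frame, falling back to the default when unset or blank.
function resolveFrameWorkflows(env: Record<string, string | undefined>): FrameWorkflows {
  return {
    start: env["COMFYUI_START_WORKFLOW"]?.trim() || DEFAULT_WORKFLOW,
    end: env["COMFYUI_END_WORKFLOW"]?.trim() || DEFAULT_WORKFLOW,
  };
}
```

In a Node.js process this could be called as `resolveFrameWorkflows(process.env)`.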

Using a different model

The included workflows are built around FLUX.2 Klein, but VidBoard is not tied to any specific model. You can use any model supported by ComfyUI by exporting your own workflow in API format and registering it in comfyui/workflows.json. A helper script makes this straightforward:

```
pnpm run map-workflow comfyui/my-workflow.json
```

See comfyui/README.md for full workflow documentation, the workflows.json schema, environment variables, and instructions for bringing your own workflow or model.
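
For orientation, a workflow exported from ComfyUI in API format is a JSON object that maps node ids to `{ class_type, inputs }` entries, and ComfyUI accepts it wrapped as `{ "prompt": ... }` on its `POST /prompt` endpoint. The sketch below shows how such a file could be inspected and patched by hand. It does not describe what `map-workflow` does or the `workflows.json` schema, and the helper names are hypothetical.

```typescript
// A ComfyUI API-format workflow: node id -> node definition.
interface ComfyNode {
  class_type: string;
  inputs: Record<string, unknown>;
}
type ComfyWorkflow = Record<string, ComfyNode>;

// Return the ids of all nodes of a given class, for example "CLIPTextEncode".
function findNodeIds(workflow: ComfyWorkflow, classType: string): string[] {
  return Object.entries(workflow)
    .filter(([, node]) => node.class_type === classType)
    .map(([id]) => id);
}

// Return a copy of the workflow with one input on one node overridden.
function withInput(
  workflow: ComfyWorkflow,
  nodeId: string,
  key: string,
  value: unknown,
): ComfyWorkflow {
  const node = workflow[nodeId];
  if (!node) throw new Error(`No node with id ${nodeId}`);
  return { ...workflow, [nodeId]: { ...node, inputs: { ...node.inputs, [key]: value } } };
}

// Build the JSON body for ComfyUI's POST /prompt endpoint.
function toPromptRequest(workflow: ComfyWorkflow): { prompt: ComfyWorkflow } {
  return { prompt: workflow };
}
```

Finding the node ids for the text prompt, reference image, and init image inputs is the kind of information a workflow registration needs, whichever model the workflow targets.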

Recommended Hardware

GPU: RTX 3080 Ti / 12 GB+ VRAM (FLUX.2 Klein FP8 uses ~10–11 GB)
RAM: 16 GB+ system RAM
Storage: ~8 GB for models

Architecture

Frontend: Next.js 15 + React 19 + Tailwind CSS
Planning LLM: Ollama (qwen3:8b) with web search grounding
Image generation: ComfyUI + FLUX.2 Klein 4B FP8
