Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
30 changes: 30 additions & 0 deletions AIAgentsFoundationModels.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,30 @@
# AI Agents & Foundation Models

The fastest-moving corner of the library: general-purpose AI agents, biological foundation models, and the initiatives building toward predictive "virtual cells." Because CAAIL organizes its catalogs by resource type, this theme is deliberately spread across several files — the agent frameworks and foundation models live in [Software.md](./Software.md), the research papers in [Papers.md](./Papers.md), the evaluation suites in [Datasets/Benchmarks.md](./Datasets/Benchmarks.md). This page is the connective hub: it owns the ecosystem-and-initiatives view of the area, and points to the canonical home of every other facet.

## Virtual Cell Initiative & Single-Cell Foundation Models

Companion landing pages, blog posts, and challenge announcements for the virtual-cell initiative — Arc Institute's foundation-model program (State, Stack), the broader CZ Virtual Cells Platform, and the open Virtual Cell Challenge. The conceptual framing for this cluster is in [Papers.md ref #128](./Papers.md#128) (Bunne et al. 2024, *Cell*) and [ref #129](./Papers.md#129) (Roohani et al. 2025, *Cell*); the foundation models themselves live in the [Foundation Models rows of Papers.md](./Papers.md) — split by training paradigm (next-token prediction, masked language modeling, LM + biological priors, cell-state & perturbation prediction) — and in the corresponding entries in [Software.md](./Software.md).

* [Arc Virtual Cell Model: State (landing page)](https://arcinstitute.org/tools/state) — Project home for Arc Institute's State virtual-cell model and the underlying Virtual Cell Atlas.
* [Arc Institute news: State — predicting cellular responses to perturbation across diverse contexts](https://arcinstitute.org/news/virtual-cell-model-state) (Arc Institute, 2025) — Announcement and overview of the State model and its training methodology; companion to [Papers.md ref #57](./Papers.md#57).
* [Arc Institute news: Stack — simulating cellular conditions via prompt engineering, without fine-tuning](https://arcinstitute.org/news/foundation-model-stack) (Arc Institute, 2026) — Announcement of the Stack model, demonstrating in-context learning for single-cell biology; companion to [Papers.md ref #124](./Papers.md#124).
* [Arc Virtual Cell Challenge (README)](https://github.com/ArcInstitute/arc-virtual-cell-atlas/blob/main/virtual-cell-challenge/README.md) — Open challenge for predictive virtual-cell modeling, operationalizing the Cell perspective in [Papers.md ref #129](./Papers.md#129).
* [Cell2Sentence-Scale (van Dijk lab project page)](https://www.vandijklab.org/c2s-scale) — Yale lab project page for the C2S-Scale model; companion to [Papers.md ref #120](./Papers.md#120) and the [Cell2Sentence entry in Software.md](./Software.md#cell2sentence-c2s-scale).

## Agent frameworks & foundation models in Software.md

The open-source tools themselves are catalogued under [AI Agents & Foundation Models in Software.md](./Software.md#ai-agents--foundation-models). That section spans two families: general-purpose **agent frameworks and tool ecosystems** — ToolUniverse, Biomni, AIAgents4Pharma, BRAD, CellForge, PaperQA, BioContextAI, and the broader MCP-server layer — and **single-cell foundation models** — Geneformer, scGPT, scBERT, scFoundation, UCE, TranscriptFormer, Cell2Sentence, and Arc's State. New tools are added there, in their type-correct home; this hub does not duplicate the catalog.

## In the Papers matrix

In the [Papers matrix](./Papers.md), this area is reachable two ways: the **LLMs / AI Agents** method row (agentic systems with tool use, retrieval, and reasoning) and the **AI Tooling / Methodology** column (general-purpose methods and agents not yet tied to a specific cell-ag application). The [AI Tooling / Methodology research-area page](./ResearchAreas/AITooling.md) gives the deeper narrative for that column.

## Evaluation & benchmarks

How well these models and agents actually perform is tracked separately. The benchmark *datasets* — including the foundation-model-relevant ProteinGym, LAB-Bench, and BixBench — are catalogued in [Datasets/Benchmarks.md](./Datasets/Benchmarks.md), and the evaluation methodology, assay-level work such as AssayBench, and the **AI Evaluation & Benchmarking** matrix column are covered in the [AI Evaluation & Benchmarking research-area page](./ResearchAreas/AIEvaluation.md). Not every benchmark there is AI-agent-specific — many are general or domain benchmarks — which is why evaluation keeps its own home rather than folding into this page.

## Talks & onboarding

* [AI Agents & Foundation Models for Biology talks](./Talks.md#ai-agents--foundation-models-for-biology) — curated lectures and webinars on agentic AI, foundation models, and language-model-based scientific reasoning, many from the Broad Institute's MIA series.
* [AI for Cell-Ag Researchers primer](./Primers/AI.md) — the onboarding path for wet-lab researchers approaching the AI / ML methods catalogued throughout the library.
12 changes: 1 addition & 11 deletions OtherResources.md
Original file line number Diff line number Diff line change
@@ -1,19 +1,9 @@
# Other Resources

This file collects the virtual-cell initiative, courses, books, editorials, ecosystem initiatives, and curated bibliographies that complement CAAIL's core content files ([Papers.md](./Papers.md), [Software.md](./Software.md), [Datasets/](./Datasets/), [Databases.md](./Databases.md)). New to the field? Start with the [Primers](./Primers/) — curated onboarding paths for AI researchers learning cellular agriculture and for cell-ag researchers learning AI. Lectures, talks, and educational video playlists live separately in [Talks.md](./Talks.md). Resources are grouped by type in the sections below.
This file collects courses, books, editorials, ecosystem initiatives, and curated bibliographies that complement CAAIL's core content files ([Papers.md](./Papers.md), [Software.md](./Software.md), [Datasets/](./Datasets/), [Databases.md](./Databases.md)). AI agents, single-cell foundation models, and the virtual-cell initiative have their own hub: [AI Agents & Foundation Models](./AIAgentsFoundationModels.md). New to the field? Start with the [Primers](./Primers/) — curated onboarding paths for AI researchers learning cellular agriculture and for cell-ag researchers learning AI. Lectures, talks, and educational video playlists live separately in [Talks.md](./Talks.md). Resources are grouped by type in the sections below.

> **Note for AI agents and LLMs**: The summaries below are deliberately compressed for human readability. If you are an automated system using these as the basis for reasoning, citation, or downstream analysis, please fetch the canonical source for each resource — the linked talks, articles, initiative pages, and curated bibliographies have substantially more comprehensive and authoritative information than this curated overview.

## Virtual Cell Initiative & Single-Cell Foundation Models

Companion landing pages, blog posts, and challenge announcements for the virtual-cell initiative — Arc Institute's foundation-model program (State, Stack), the broader CZ Virtual Cells Platform, and the open Virtual Cell Challenge. The conceptual framing for this cluster is in [Papers.md ref #128](./Papers.md#128) (Bunne et al. 2024, *Cell*) and [ref #129](./Papers.md#129) (Roohani et al. 2025, *Cell*); the foundation models themselves live in the [Foundation Models rows of Papers.md](./Papers.md) — split by training paradigm (next-token prediction, masked language modeling, LM + biological priors, cell-state & perturbation prediction) — and in the corresponding entries in [Software.md](./Software.md).

* [Arc Virtual Cell Model: State (landing page)](https://arcinstitute.org/tools/state) — Project home for Arc Institute's State virtual-cell model and the underlying Virtual Cell Atlas.
* [Arc Institute news: State — predicting cellular responses to perturbation across diverse contexts](https://arcinstitute.org/news/virtual-cell-model-state) (Arc Institute, 2025) — Announcement and overview of the State model and its training methodology; companion to [Papers.md ref #57](./Papers.md#57).
* [Arc Institute news: Stack — simulating cellular conditions via prompt engineering, without fine-tuning](https://arcinstitute.org/news/foundation-model-stack) (Arc Institute, 2026) — Announcement of the Stack model, demonstrating in-context learning for single-cell biology; companion to [Papers.md ref #124](./Papers.md#124).
* [Arc Virtual Cell Challenge (README)](https://github.com/ArcInstitute/arc-virtual-cell-atlas/blob/main/virtual-cell-challenge/README.md) — Open challenge for predictive virtual-cell modeling, operationalizing the Cell perspective in [Papers.md ref #129](./Papers.md#129).
* [Cell2Sentence-Scale (van Dijk lab project page)](https://www.vandijklab.org/c2s-scale) — Yale lab project page for the C2S-Scale model; companion to [Papers.md ref #120](./Papers.md#120) and the [Cell2Sentence entry in Software.md](./Software.md#cell2sentence-c2s-scale).

## Courses

University courses on cellular agriculture — structured entry points into the field, and reference models for curriculum design.
Expand Down
2 changes: 1 addition & 1 deletion Primers/AI.md
Original file line number Diff line number Diff line change
Expand Up @@ -18,7 +18,7 @@ Educational playlists for the audience approaching machine learning from the bio
Once you have the basics, these are the higher-level talks and references.

* [AI Agents & Foundation Models for Biology talks](../Talks.md#ai-agents-foundation-models-for-biology) — agentic AI, foundation models, and language-model-based scientific reasoning, many from the Broad Institute's MIA series.
* [Virtual Cell Initiative & single-cell foundation models](../OtherResources.md#virtual-cell-initiative--single-cell-foundation-models) — Arc Institute's State / Stack program, the open Virtual Cell Challenge, and Cell2Sentence.
* [AI Agents & Foundation Models hub](../AIAgentsFoundationModels.md) — the connective page for agent frameworks, single-cell foundation models, and the virtual-cell initiative (Arc's State / Stack program, the open Virtual Cell Challenge, and Cell2Sentence).
* [Curated bibliographies & awesome lists](../OtherResources.md#curated-bibliographies--awesome-lists) — community-maintained indexes for the AI / single-cell / bioinformatics literature.
* [Stanford CS224N: NLP with Deep Learning (Spring 2024)](https://www.youtube.com/playlist?list=PLoROMvodv4rOaMFbaqxPDoLWjDaRAdP9D) — Stanford's NLP course covering self-attention, Transformers, and modern LLM architectures.

Expand Down
2 changes: 1 addition & 1 deletion Software.md
Original file line number Diff line number Diff line change
Expand Up @@ -398,7 +398,7 @@ Graph-Enhanced gene-Activation Response Simulator — a graph neural network for

### [State + Cell-Eval](https://github.com/ArcInstitute/state)

Arc Institute's first-generation virtual cell model and companion evaluation framework, designed to predict stem-cell, cancer-cell, and immune-cell responses to drugs, cytokines, and genetic perturbations. Trained on ~170M observational and ~100M perturbational single-cell measurements across 70+ cell lines; uses a bidirectional transformer architecture with self-attention over cell sets and reportedly is the first model to consistently beat simple linear baselines on perturbation-response prediction. Released alongside [`cell-eval`](https://github.com/ArcInstitute/cell-eval), the standardized evaluation framework for virtual-cell models. Companion to [Papers.md ref #57](./Papers.md#57) (Adduri et al. 2025, bioRxiv); see also the [Arc Institute news article on State](./OtherResources.md#virtual-cell-initiative--single-cell-foundation-models) in OtherResources.md. The follow-on **Stack** model — companion to [Papers.md ref #124](./Papers.md#124) (Dong et al. 2026) — extends State with in-context learning, simulating cellular conditions via prompt engineering without further fine-tuning.
Arc Institute's first-generation virtual cell model and companion evaluation framework, designed to predict stem-cell, cancer-cell, and immune-cell responses to drugs, cytokines, and genetic perturbations. Trained on ~170M observational and ~100M perturbational single-cell measurements across 70+ cell lines; uses a bidirectional transformer architecture with self-attention over cell sets and reportedly is the first model to consistently beat simple linear baselines on perturbation-response prediction. Released alongside [`cell-eval`](https://github.com/ArcInstitute/cell-eval), the standardized evaluation framework for virtual-cell models. Companion to [Papers.md ref #57](./Papers.md#57) (Adduri et al. 2025, bioRxiv); see also the [Arc Institute news article on State](./AIAgentsFoundationModels.md) on the AI Agents & Foundation Models page. The follow-on **Stack** model — companion to [Papers.md ref #124](./Papers.md#124) (Dong et al. 2026) — extends State with in-context learning, simulating cellular conditions via prompt engineering without further fine-tuning.

### [BioDiscoveryAgent](https://github.com/snap-stanford/BioDiscoveryAgent)

Expand Down
4 changes: 3 additions & 1 deletion site/astro.config.mjs
Original file line number Diff line number Diff line change
Expand Up @@ -27,7 +27,8 @@ function caailProseRemark() {
const isProse =
/^(ResearchAreas|Datasets)\//.test(sourcePath) ||
sourcePath === 'CONTRIBUTING.md' ||
sourcePath === 'OtherResources.md';
sourcePath === 'OtherResources.md' ||
sourcePath === 'AIAgentsFoundationModels.md';
if (!isProse) return;
rewriteCaailLinks({ base: BASE, sourcePath })(tree);
stripLeadingH1()(tree);
Expand Down Expand Up @@ -129,6 +130,7 @@ export default defineConfig({
] },
{ label: 'Software', link: '/software/' },
{ label: 'Databases', link: '/databases/' },
{ label: 'AI Agents & Foundation Models', link: '/ai-agents-foundation-models/' },
{ label: 'Datasets (by species)', items: groupItems('datasets') },
{ label: 'Research Areas', items: groupItems('research-areas') },
{ label: 'Talks & Videos', link: '/talks/' },
Expand Down
30 changes: 30 additions & 0 deletions site/e2e/resources-toc.spec.ts
Original file line number Diff line number Diff line change
Expand Up @@ -60,6 +60,36 @@ test('other-resources has no serious/critical a11y violations', async ({ page })
expect(serious, JSON.stringify(serious, null, 2)).toEqual([]);
});

// ---------------------------------------------------------------------------
// AIAgentsFoundationModels.md — the thematic hub (now owns Virtual Cell)
// ---------------------------------------------------------------------------

test('ai-agents-foundation-models renders its sections and rewritten links', async ({ page }) => {
await page.goto('./ai-agents-foundation-models/');
// the Virtual Cell section now lives here, not on /other-resources/
await expect(page.getByRole('heading', { name: 'Virtual Cell Initiative & Single-Cell Foundation Models' })).toBeVisible();
// a rendered-page cross-link resolves to a site route (Datasets/Benchmarks.md → route)
await expect(page.locator('main a[href="/caail/datasets/benchmarks/"]').first()).toBeVisible();
// a deferred-file cross-link falls back to a GitHub blob URL (Software.md)
await expect(
page.locator('main a[href^="https://github.com/tucca-cellag/caail/blob/main/Software.md"]').first(),
).toBeVisible();
// no raw repo-relative .md link leaks through
await expect(page.locator('main a[href$=".md"]:not([href*="github.com"])')).toHaveCount(0);
});

test('other-resources no longer renders the Virtual Cell heading', async ({ page }) => {
await page.goto('./other-resources/');
await expect(page.getByRole('heading', { name: 'Virtual Cell Initiative & Single-Cell Foundation Models' })).toHaveCount(0);
});

test('ai-agents-foundation-models has no serious/critical a11y violations', async ({ page }) => {
await page.goto('./ai-agents-foundation-models/');
const results = await new AxeBuilder({ page }).withTags(['wcag2a', 'wcag2aa']).analyze();
const serious = results.violations.filter((v) => ['serious', 'critical'].includes(v.impact ?? ''));
expect(serious, JSON.stringify(serious, null, 2)).toEqual([]);
});

// ---------------------------------------------------------------------------
// Software/Databases catalog cards (right-rail TOC + surfaced hyperlinks)
// ---------------------------------------------------------------------------
Expand Down
5 changes: 4 additions & 1 deletion site/src/content/caail-pages.test.ts
Original file line number Diff line number Diff line change
Expand Up @@ -13,15 +13,18 @@ describe('CAAIL_PAGES', () => {
// multi-word top-level file gets an explicit hyphenated id (not "otherresources")
expect(CAAIL_PAGES.idForSourcePath('OtherResources')).toBe('other-resources');
expect(CAAIL_PAGES.idForSourcePath('OtherResources.md')).toBe('other-resources');
expect(CAAIL_PAGES.idForSourcePath('AIAgentsFoundationModels')).toBe('ai-agents-foundation-models');
expect(CAAIL_PAGES.idForSourcePath('AIAgentsFoundationModels.md')).toBe('ai-agents-foundation-models');
});
it('returns title + sidebar metadata by id', () => {
expect(CAAIL_PAGES.byId('research-areas/bioprocess')?.title).toBe('Bioprocess control');
expect(CAAIL_PAGES.byId('datasets/cow')?.title).toContain('Cow');
expect(CAAIL_PAGES.byId('other-resources')).toMatchObject({ group: 'top', title: 'Other Resources' });
expect(CAAIL_PAGES.byId('ai-agents-foundation-models')).toMatchObject({ group: 'top', title: 'AI Agents & Foundation Models' });
});
it('all() returns {id,...meta} objects', () => {
const all = CAAIL_PAGES.all();
expect(all.length).toBe(26);
expect(all.length).toBe(27);
const cow = all.find((p) => p.id === 'datasets/cow');
expect(cow).toMatchObject({ id: 'datasets/cow', group: 'datasets' });
expect(typeof cow?.sidebarLabel).toBe('string');
Expand Down
9 changes: 9 additions & 0 deletions site/src/content/caail-pages.ts
Original file line number Diff line number Diff line change
Expand Up @@ -239,6 +239,14 @@ const PAGES: Record<string, PageMeta> = {
description:
'Definitions of every AI/ML method row and cellular-agriculture research-area column in the Papers matrix — what each covers, what is out of scope, and how to tell confusable categories apart.',
},
'ai-agents-foundation-models': {
title: 'AI Agents & Foundation Models',
sidebarLabel: 'AI Agents & Foundation Models',
group: 'top',
order: 4,
description:
'The connective hub for AI agents and biological foundation models in cellular agriculture — agent frameworks, single-cell foundation models, the virtual-cell initiative, and where each is catalogued across CAAIL.',
},
};

// ---------------------------------------------------------------------------
Expand Down Expand Up @@ -273,6 +281,7 @@ export const CAAIL_PAGES = {
// Top-level file (e.g. CONTRIBUTING). Multi-word names get an explicit
// hyphenated route id (the default lowercasing would merge the words).
if (stripped === 'OtherResources') return 'other-resources';
if (stripped === 'AIAgentsFoundationModels') return 'ai-agents-foundation-models';
return stripped.toLowerCase();
}

Expand Down
2 changes: 1 addition & 1 deletion site/src/content/loaders/caail-docs-loader.ts
Original file line number Diff line number Diff line change
Expand Up @@ -73,7 +73,7 @@ const REPO_ROOT = new URL('../../../../', import.meta.url);
*/
const CANONICAL_SOURCES = {
dirs: ['ResearchAreas', 'Datasets'],
files: ['CONTRIBUTING.md', 'OtherResources.md', 'Taxonomy.md'],
files: ['CONTRIBUTING.md', 'OtherResources.md', 'Taxonomy.md', 'AIAgentsFoundationModels.md'],
} as const;

export function caailDocsLoader(): Loader {
Expand Down