Gleaph

Gleaph is a graph database designed to run on the Internet Computer.

Gleaph combines:

GQL (Graph Query Language, ISO/IEC 39075) for standard graph querying
LARA, a CSR-derived stable graph storage layout optimized for Internet Computer canisters
Prepared Queries for safe frontend-to-canister execution
Federated graph shards for scaling graph data across canisters
A path toward vector-aware graph search and GraphRAG

Gleaph is not only a place to store connected data. Its long-term direction is to make relationships, properties, embeddings, and access-control-aware query execution available through one graph-native execution model.

Why Gleaph?

Modern applications increasingly need to combine structured relationships with semantic similarity.

A knowledge graph may tell you:

which documents cite each other
which users can access which records
which entities belong to the same project
which events happened before or after another event

A vector index may tell you:

which passages are semantically close
which entities are similar
which memories or documents are relevant to a prompt

Gleaph aims to bring these together: graph traversal, filtering, authorization, and vector-aware ranking should be expressible as one query plan instead of being stitched together across unrelated systems.

GQL

Gleaph uses GQL (Graph Query Language, ISO/IEC 39075) as its query language.

The gleaph-gql and gleaph-gql-planner crates are intended to remain general-purpose GQL crates. Internet Computer-specific behavior is implemented outside those portable layers.

Storage: LARA

Gleaph’s storage model is based on LARA (Localized Adjacency Relocation Array), a graph representation derived from Compressed Sparse Row (CSR).

LARA is designed for stable memory and canister constraints: compact adjacency storage, predictable traversal, and local relocation without assuming a conventional server runtime.

Prepared Queries

Prepared Queries allow administrators to pre-register GQL programs that can be executed by less-privileged callers.

This is especially important on the Internet Computer: frontends can safely call Gleaph directly without requiring a custom backend canister for every application query.

Vector-Aware Graph Search

Gleaph is being shaped to support vector payloads and vector predicates alongside graph structure.

The near-term model is:

store vectors as graph-associated payloads
filter candidates through graph patterns, labels, properties, or access rules
apply vector predicates such as distance thresholds
return graph-shaped results rather than isolated vector hits

This is different from positioning Gleaph as a standalone vector database. The important idea is graph-first retrieval: vectors become part of the graph execution model.

Future work may include:

vector indexes for approximate nearest-neighbor search
vector-aware GQL extensions or procedures
hybrid ranking over graph distance, edge weights, properties, and embedding distance
shard-aware vector retrieval for federated deployments

GraphRAG Direction

Gleaph is a natural fit for GraphRAG-style systems because it can represent both the retrieval substrate and the authorization model.

A GraphRAG application could use Gleaph to model:

documents, chunks, entities, claims, and citations
relationships between extracted entities
provenance from generated answers back to source material
tenant, user, and role-based access control
semantic similarity between chunks or entities

Instead of retrieving text chunks first and reconstructing context later, Gleaph can make retrieval graph-aware from the beginning:

find semantically relevant chunks
traverse to related entities, documents, or citations
filter by caller identity and prepared-query permissions
return a bounded, explainable context subgraph for generation

Internet Computer Integration

Gleaph extends GQL for Internet Computer use cases with:

IC.PRINCIPAL
IC.MSG_CALLER()

These allow prepared queries to express caller-aware access patterns directly.

Access Control

The router canister enforces a role hierarchy:

Executor
Read
Write
Manager
Admin

Graph shards do not expose arbitrary user GQL. They execute planned work from the router or trusted peer shards.

Design Documentation

Architecture, GQL layers, federation, execution, storage, and security notes live in design/.

License

This project is licensed under either of Apache License, Version 2.0 or MIT License at your option.

Name		Name	Last commit message	Last commit date
Latest commit History 774 Commits
.skills		.skills
assets		assets
crates		crates
design		design
frontend		frontend
scripts		scripts
sdk		sdk
.gitignore		.gitignore
AGENTS.md		AGENTS.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE-APACHE		LICENSE-APACHE
LICENSE-MIT		LICENSE-MIT
README.md		README.md
icp.yaml		icp.yaml
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml
rust-toolchain.toml		rust-toolchain.toml
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Gleaph

Why Gleaph?

GQL

Storage: LARA

Prepared Queries

Vector-Aware Graph Search

GraphRAG Direction

Internet Computer Integration

Access Control

Design Documentation

License

About

Licenses found

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Gleaph

Why Gleaph?

GQL

Storage: LARA

Prepared Queries

Vector-Aware Graph Search

GraphRAG Direction

Internet Computer Integration

Access Control

Design Documentation

License

About

Topics

Resources

License

Licenses found

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages