feat(rag): add VoyageAI voyage-context-4 contextualized embedding support by fzowl · Pull Request #6368 · crewAIInc/crewAI

fzowl · 2026-06-27T13:31:25Z

Add support for VoyageAI models: voyage-context-4

feat(rag): add VoyageAI voyage-context-4 contextualized embedding support
Fix JSON crew version pin (Fix JSON crew version pin #6342)

Summary by CodeRabbit

New Features
- Added support for VoyageAI contextualized embedding models, including automatic chunking for large inputs.
- Standard embedding behavior remains unchanged for other model types.
Tests
- Added coverage for model-based routing, chunk size handling, input normalization, and output flattening for contextualized embeddings.

…port Route voyage-context-* models through the contextualized embeddings endpoint (client.contextualized_embed) with chunk_size set to the 32000 maximum. Input is passed as a flat list of strings and the per-document chunk embeddings are flattened into the returned vectors.

coderabbitai · 2026-06-27T13:33:27Z

📝 Walkthrough

Walkthrough

Adds support for VoyageAI "contextualized" models in VoyageAIEmbeddingFunction. A new constant CONTEXTUALIZED_CHUNK_SIZE = 32000 is introduced. __call__ now routes to contextualized_embed when the model name starts with voyage-context, flattening per-chunk embeddings. A new test module covers routing, output flattening, chunk size, and input normalization.

VoyageAI contextualized embedding

Layer / File(s)	Summary
Contextualized embed routing and flattening `lib/crewai/src/crewai/rag/embeddings/providers/voyageai/embedding_callable.py`	Adds `CONTEXTUALIZED_CHUNK_SIZE = 32000` and branches `__call__` to call `contextualized_embed` with auto-chunking for `voyage-context` models, flattening `result.results[].embeddings` into the returned list; non-contextualized models use the refactored `embed` path with a local `model` variable.
Unit tests for embed routing and input handling `lib/crewai/tests/rag/embeddings/test_voyageai_embedding_callable.py`	New `TestVoyageAIEmbeddingFunction` class with five tests: standard model uses `embed`, `voyage-context-4` uses `contextualized_embed` with flattened output, `chunk_size` equals `CONTEXTUALIZED_CHUNK_SIZE` (32000), inputs passed as flat `List[str]`, and single string input normalized to one-element list.

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately captures the main change: adding contextualized embedding support for VoyageAI voyage-context-4.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands.}

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

lib/crewai/tests/rag/embeddings/test_voyageai_embedding_callable.py (1)

30-49: 📐 Maintainability & Code Quality | 🔵 Trivial | ⚡ Quick win

Cover the single-document multi-chunk shape directly.

This test only proves concatenation across result.results. The new behavior that matters here is flattening result.results[*].embeddings when one input auto-chunks into multiple vectors, so a fixture with a single result containing two embeddings would make the regression target much clearer.

Suggested test tweak

     mock_client.contextualized_embed.return_value = MagicMock(
         results=[
-            MagicMock(embeddings=[[0.1, 0.2]]),
-            MagicMock(embeddings=[[0.3, 0.4]]),
+            MagicMock(embeddings=[[0.1, 0.2], [0.3, 0.4]]),
         ]
     )
@@
-    result = fn(["aa", "bb"])
+    result = fn(["aa"])
@@
     mock_client.embed.assert_not_called()
     mock_client.contextualized_embed.assert_called_once()
     assert np.allclose(result, [[0.1, 0.2], [0.3, 0.4]])

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@lib/crewai/tests/rag/embeddings/test_voyageai_embedding_callable.py` around
lines 30 - 49, Update the contextualized embedding test in
VoyageAIEmbeddingFunction so it directly covers the single-document multi-chunk
case: keep the existing VoyageAIEmbeddingFunction and contextualized_embed
setup, but make the mocked contextualized_embed response return one result whose
embeddings contains multiple vectors, then assert the callable flattens
result.results[*].embeddings into the final output. This should replace the
current multi-result concatenation check while still verifying embed is not
called and contextualized_embed is used.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In
`@lib/crewai/src/crewai/rag/embeddings/providers/voyageai/embedding_callable.py`:
- Around line 58-66: The voyage-context path in embedding_callable’s callable
currently hardcodes input_type to document, ignoring any configured value.
Update the contextualized_embed call to use the configured input_type from
self._config when appropriate, or add explicit validation in the embedding
callable to reject non-document configurations for voyage-context models. Make
the fix in the branch that handles model.startswith("voyage-context") so the
behavior matches the selected config.

---

Nitpick comments:
In `@lib/crewai/tests/rag/embeddings/test_voyageai_embedding_callable.py`:
- Around line 30-49: Update the contextualized embedding test in
VoyageAIEmbeddingFunction so it directly covers the single-document multi-chunk
case: keep the existing VoyageAIEmbeddingFunction and contextualized_embed
setup, but make the mocked contextualized_embed response return one result whose
embeddings contains multiple vectors, then assert the callable flattens
result.results[*].embeddings into the final output. This should replace the
current multi-result concatenation check while still verifying embed is not
called and contextualized_embed is used.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: 5a57c8b2-22c8-4709-bd45-3bc212afb30b

📥 Commits

Reviewing files that changed from the base of the PR and between 6491f5a and 042bd71.

📒 Files selected for processing (2)

lib/crewai/src/crewai/rag/embeddings/providers/voyageai/embedding_callable.py
lib/crewai/tests/rag/embeddings/test_voyageai_embedding_callable.py

coderabbitai · 2026-06-27T13:38:37Z

+        if model.startswith("voyage-context"):
+            result = self._client.contextualized_embed(
+                inputs=input,
+                model=model,
+                input_type="document",
+                output_dtype=self._config.get("output_dtype"),
+                output_dimension=self._config.get("output_dimension"),
+                enable_auto_chunking=True,
+                chunk_size=CONTEXTUALIZED_CHUNK_SIZE,


🎯 Functional Correctness | 🟡 Minor | ⚡ Quick win

🧩 Analysis chain

🌐 Web query:

For voyageai Python client version 0.3.5, what is the signature of Client.contextualized_embed, and which input_typevalues does it support? In particular, isquery supported or is the endpoint document-only?

💡 Result:

The contextualized_embed method in the Voyage AI Python client supports both query and document types [1][2]. It is not restricted to documents only [1][2]. The method signature for voyageai.Client.contextualized_embed is as follows [1][2]: def contextualized_embed( self, inputs: Union[List[List[str]], List[str]], model: str, input_type: Optional[str] = None, output_dimension: Optional[int] = None, output_dtype: Optional[str] = "float", enable_auto_chunking: Optional[bool] = False, chunk_size: Optional[int] = 512, chunk_overlap: Optional[int] = 0, chunk_fn: Optional[Callable[[str], List[str]]] = None,) The input_type parameter accepts the values None, "query", and "document" [1][2]. When using "query", each inner list should typically contain a single query string [3][4]. When using "document", each inner list typically contains chunks from a document [3][4]. The input_type argument is optional, but setting it to "query" or "document" is recommended for retrieval tasks as it prompts the model appropriately [1][2].

Citations:

1: https://www.mongodb.com/docs/voyageai/models/contextualized-chunk-embeddings/

2: https://docs.voyageai.com/docs/contextualized-chunk-embeddings

3: https://www.mongodb.com/docs/api/doc/atlas-embedding-and-reranking-api/operation/operation-createcontextualizedembedding

4: https://docs.rs/mongodb-voyageai/latest/mongodb_voyageai/client/struct.ContextualizedEmbedBuilder.html

🏁 Script executed:

sed -n '1,160p' lib/crewai/src/crewai/rag/embeddings/providers/voyageai/embedding_callable.py printf '\n---\n' rg -n "voyage-context|contextualized_embed|input_type" lib/crewai/src/crewai/rag -S

Repository: crewAIInc/crewAI

Length of output: 3654

Respect input_type for voyage-context models contextualized_embed() accepts both query and document, so hardcoding "document" here ignores the configured value. Either forward self._config.get("input_type") or reject non-document configs explicitly.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@lib/crewai/src/crewai/rag/embeddings/providers/voyageai/embedding_callable.py` around lines 58 - 66, The voyage-context path in embedding_callable’s callable currently hardcodes input_type to document, ignoring any configured value. Update the contextualized_embed call to use the configured input_type from self._config when appropriate, or add explicit validation in the embedding callable to reject non-document configurations for voyage-context models. Make the fix in the branch that handles model.startswith("voyage-context") so the behavior matches the selected config.

coderabbitai Bot reviewed Jun 27, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(rag): add VoyageAI voyage-context-4 contextualized embedding support#6368

feat(rag): add VoyageAI voyage-context-4 contextualized embedding support#6368
fzowl wants to merge 1 commit into
crewAIInc:mainfrom
fzowl:feat/embedding-model-voyage-context-4

fzowl commented Jun 27, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented Jun 27, 2026 •

edited

Loading

Walkthrough

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot Jun 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

fzowl commented Jun 27, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Jun 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 27, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

fzowl commented Jun 27, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Jun 27, 2026 •

edited

Loading