Skip to content

test(vertexai): stabilize search & anthropic beta integration tests#1842

Open
Mason Daugherty (mdrxy) wants to merge 1 commit into
mainfrom
open-swe/fix-vertexai-integration-tests
Open

test(vertexai): stabilize search & anthropic beta integration tests#1842
Mason Daugherty (mdrxy) wants to merge 1 commit into
mainfrom
open-swe/fix-vertexai-integration-tests

Conversation

@mdrxy

Copy link
Copy Markdown
Member

Description

The Google Cloud Build langchain-google-vertexai-us (llm-integration-tests) job was failing on tests unrelated to the package-version metadata feature (#1841), which only adds _add_version(...) validators and does not touch Anthropic beta handling or Vertex search. This makes those live tests deterministic while preserving their intent.

  • test_search_builtin: replaced the context-dependent follow-up ("that last story"), which the model could answer with a clarification (no grounding), with a self-contained current-events query. Still asserts grounding metadata/annotations for both streaming and invoke.
  • Anthropic async beta tests: hard-coded beta header context-1m-2025-08-07 was rejected (BadRequestError). Now skips when the live endpoint rejects the beta header, and adds deterministic unit tests verifying betas route through the Anthropic beta client for async invoke and stream.

Authored with the help of an AI agent (Open SWE).

Release Note

none

Test Plan

  • uv run --group test pytest tests/unit_tests/test_model_garden_betas.py (passes locally)
  • Integration job re-runs green / skips gracefully when the beta header is unavailable

Made by Open SWE

…n tests

The Google Cloud Build llm-integration-tests job was failing on tests
unrelated to the package-version metadata feature (#1841):

- test_search_builtin used a context-dependent follow-up ("that last
  story") that the model could answer with a clarification, returning no
  grounding metadata. Use a self-contained current-events query so the
  invoke path reliably grounds while still asserting grounding metadata
  and annotations for both streaming and invoke.
- The anthropic async beta tests hard-coded an expired/unaccepted beta
  header value, causing anthropic.BadRequestError. Skip when the live
  endpoint rejects the beta header so beta rotation no longer fails CI,
  and add deterministic unit tests that verify betas route through the
  Anthropic beta client for async invoke and async streaming.

Co-authored-by: open-swe[bot] <open-swe@users.noreply.github.com>
@mdrxy Mason Daugherty (mdrxy) marked this pull request as ready for review June 15, 2026 04:44
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant