Generalize ChatQnA external LLM to external inferencing support by eero-t · Pull Request #1167 · opea-project/GenAIInfra

eero-t · 2025-07-18T20:14:20Z

Description

vLLM already support all ChatQnA inferencing sub-services: LLM, embedding, reranking, guardrails. In ChatQnA example value files, all except embedding are HW accelerated. Additionally, KubeAI already supports first three, and OPEA has Enterprise-Inferencing subproject: https://github.com/opea-project/Enterprise-Inference

Therefore it seems relevant to start discussion on how current external LLM support in the Helm charts could be changed to a more generic external inferencing support, before current support gets into too wide use.

To start that discussing, this PR includes draft of such changes for ChatQnA, and TODOs for items currently missing from ChatQnA code in GenAIExamples.

Issues

n/a.

Type of change

New feature (non-breaking change which adds new functionality)

Dependencies

This is dependency disconnection.

Tests

Not relevant yet.

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

CICD-at-OPEA · 2025-09-10T22:37:13Z

This PR is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.

CICD-at-OPEA · 2025-09-18T22:36:47Z

This PR was closed because it has been stalled for 7 days with no activity.

eero-t · 2025-10-01T18:47:10Z

KubeAI just got support for LLM /rerank API: kubeai-project/kubeai#565

It's not in any release yet though.

eero-t requested review from lianhao and yongfengdu as code owners July 18, 2025 20:14

eero-t marked this pull request as draft July 18, 2025 20:14

eero-t mentioned this pull request Jul 18, 2025

External LLM endpoint usage improvements + fixes #1166

Merged

2 tasks

eero-t changed the title ~~Generalize ChatQnA external LLM to external inferencing support (WIP)~~ Generalize ChatQnA external LLM to external inferencing support Jul 18, 2025

eero-t force-pushed the external-inferencing branch from 30ae64a to 4742ac4 Compare August 11, 2025 09:42

WIP: Generalize ChatQnA external LLM to external inferencing support

09c57c6

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

eero-t force-pushed the external-inferencing branch from 2a50e5d to 09c57c6 Compare August 11, 2025 09:48

CICD-at-OPEA added the Stale label Sep 10, 2025

CICD-at-OPEA closed this Sep 18, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generalize ChatQnA external LLM to external inferencing support#1167

Generalize ChatQnA external LLM to external inferencing support#1167
eero-t wants to merge 1 commit into
opea-project:mainfrom
eero-t:external-inferencing

eero-t commented Jul 18, 2025 •

edited

Loading

Uh oh!

CICD-at-OPEA commented Sep 10, 2025

Uh oh!

CICD-at-OPEA commented Sep 18, 2025

Uh oh!

eero-t commented Oct 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

eero-t commented Jul 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Issues

Type of change

Dependencies

Tests

Uh oh!

CICD-at-OPEA commented Sep 10, 2025

Uh oh!

CICD-at-OPEA commented Sep 18, 2025

Uh oh!

eero-t commented Oct 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

eero-t commented Jul 18, 2025 •

edited

Loading