Enable CodeGen deployment for Intel Arc Pro B-series GPU (XPU) by tintisimone · Pull Request #2467 · opea-project/GenAIExamples

tintisimone · 2026-06-03T16:23:34Z

Add Intel XPU support for CodeGen example with vLLM optimization.

Features:

Intel vLLM 0.14.1-xpu Docker image with XPU-specific configuration
XPU environment variables (VLLM_TARGET_DEVICE, ZE_FLAT_DEVICE_HIERARCHY, ONEAPI_DEVICE_SELECTOR)
GPU device mounting (/dev/dri) with privileged mode
10GB shared memory allocation for model inference
Full stack deployment: vLLM -> LLM Service -> Backend -> UI
Qwen/Qwen2.5-Coder-7B-Instruct model support

Configuration files:

compose.yaml: Docker Compose with XPU optimizations
set_env.sh: Environment setup script
README.md: Comprehensive deployment documentation
QUICK_START.md: Quick reference guide
Validation and testing scripts

Changes:

Added CodeGen/docker_compose/intel/xpu/arc/ directory structure
Updated CodeGen/README.md with Intel Arc GPU deployment option
Consistent with Intel CPU example deployment pattern

Tested and validated on Intel Arc Pro B-series GPU.

Description

The summary of the proposed changes as long as the relevant motivation and context.

Issues

List the issue or RFC link this PR is working on. If there is no such link, please mark it as n/a.

Type of change

List the type of change like below. Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds new functionality)
Breaking change (fix or feature that would break existing design and interface)
Others (enhancement, documentation, validation, etc.)

Dependencies

List the newly introduced 3rd party dependency if exists.

Tests

Describe the tests that you ran to verify your changes.

Add Intel XPU support for CodeGen example with vLLM optimization. Features: - Intel vLLM 0.14.1-xpu Docker image with XPU-specific configuration - XPU environment variables (VLLM_TARGET_DEVICE, ZE_FLAT_DEVICE_HIERARCHY, ONEAPI_DEVICE_SELECTOR) - GPU device mounting (/dev/dri) with privileged mode - 10GB shared memory allocation for model inference - Full stack deployment: vLLM -> LLM Service -> Backend -> UI - Qwen/Qwen2.5-Coder-7B-Instruct model support Configuration files: - compose.yaml: Docker Compose with XPU optimizations - set_env.sh: Environment setup script - README.md: Comprehensive deployment documentation - QUICK_START.md: Quick reference guide - Validation and testing scripts Changes: - Added CodeGen/docker_compose/intel/xpu/arc/ directory structure - Updated CodeGen/README.md with Intel Arc GPU deployment option - Consistent with Intel CPU example deployment pattern Tested and validated on Intel Arc Pro B-series GPU. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

github-actions · 2026-06-03T16:23:54Z

Dependency Review

✅ No vulnerabilities or license issues found.

Scanned Files

None

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Adds an Intel Arc (XPU) Docker Compose deployment option for the CodeGen example, including environment setup scripts, validation/testing helpers, and detailed deployment documentation.

Changes:

Introduces a new intel/xpu/arc Docker Compose stack (compose + env script) for running CodeGen with Intel vLLM (XPU).
Adds helper scripts to validate configuration and check deployment readiness.
Adds multiple docs (README/quick start/test summaries) and links the new guide from CodeGen/README.md.

Reviewed changes

Copilot reviewed 11 out of 11 changed files in this pull request and generated 13 comments.

Show a summary per file

File	Description
CodeGen/docker_compose/intel/xpu/arc/validate_config.sh	New config validation script (env, GPU device, YAML, summary output).
CodeGen/docker_compose/intel/xpu/arc/test_deployment.sh	New deployment readiness checker for docker compose configuration.
CodeGen/docker_compose/intel/xpu/arc/set_env.sh	New environment variable setup for Arc/XPU docker compose deployment.
CodeGen/docker_compose/intel/xpu/arc/compose.yaml	New compose stack for vLLM (XPU) + llm server + backend + UI.
CodeGen/docker_compose/intel/xpu/arc/README.md	Full Arc/XPU deployment guide.
CodeGen/docker_compose/intel/xpu/arc/QUICK_START.md	Short quick-start instructions and common commands.
CodeGen/docker_compose/intel/xpu/arc/TEST_RESULTS.md	Captures validation/test results for the configuration.
CodeGen/docker_compose/intel/xpu/arc/DEPLOYMENT_TEST_SUMMARY.md	Extended narrative of configuration validation steps/results.
CodeGen/docker_compose/intel/xpu/arc/DEPLOYMENT_SUCCESS.md	Deployment success report and usage examples.
CodeGen/docker_compose/intel/xpu/arc/.gitignore	Ignores the model cache directory.
CodeGen/README.md	Adds Arc/XPU guide link + validated configuration row.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+if [ -d "/dev/dri" ]; then
+    ls -la /dev/dri/ | grep -E "card|render"
+    echo "   ✓ Intel GPU devices found"
+else
+    echo "   ✗ /dev/dri not found - Intel GPU may not be available"
+    exit 1
+fi


+if command -v python3 &> /dev/null; then
+    python3 -c "import yaml; yaml.safe_load(open('compose.yaml'))" 2>&1
+    if [ $? -eq 0 ]; then
+        echo "   ✓ compose.yaml syntax is valid"
+    else
+        echo "   ✗ compose.yaml has syntax errors"
+        exit 1
+    fi


+if [ -z "$HF_TOKEN" ]; then
+    echo "   ✗ HF_TOKEN not set"
+    exit 1
+else
+    echo "   ✓ HF_TOKEN: ${HF_TOKEN:0:10}..."
+fi


+$COMPOSE_CMD config > /dev/null 2>&1
+if [ $? -eq 0 ]; then
+    echo "   ✓ Docker Compose configuration is valid"
+else
+    echo "   ✗ Docker Compose configuration has errors"
+    exit 1
+fi


+if [ -d "/dev/dri" ]; then
+    ls -la /dev/dri/ | grep -E "card|render"
+    echo "   ✓ Intel GPU devices found"
+else
+    echo "   ✗ /dev/dri not found - Intel GPU may not be available"
+    exit 1
+fi


+    devices:
+      - /dev/dri:/dev/dri
+    privileged: true


+    healthcheck:
+      test: ["CMD-SHELL", "curl -f http://localhost:80/health || exit 1"]


+
+### Step 1: Setup Environment (1 minute)
+```bash
+cd /home/gta/GenAIExamples/CodeGen/docker_compose/intel/xpu/arc


+
+### Test 1: Health Check
+```bash
+curl http://your_host_ip:8028/health


+1. **Port Conflict Resolution**: Successfully changed LLM service port from 9000 to 9001
+2. **.env File Requirement**: Docker Compose requires .env file for proper variable expansion


Copilot AI review requested due to automatic review settings June 3, 2026 16:23

tintisimone requested review from lvliang-intel and yao531441 as code owners June 3, 2026 16:23

Copilot AI reviewed Jun 3, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable CodeGen deployment for Intel Arc Pro B-series GPU (XPU)#2467

Enable CodeGen deployment for Intel Arc Pro B-series GPU (XPU)#2467
tintisimone wants to merge 1 commit into
opea-project:mainfrom
tintisimone:bmg_enablement

tintisimone commented Jun 3, 2026

Uh oh!

github-actions Bot commented Jun 3, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		healthcheck:
		test: ["CMD-SHELL", "curl -f http://localhost:80/health \|\| exit 1"]

		1. Port Conflict Resolution: Successfully changed LLM service port from 9000 to 9001
		2. .env File Requirement: Docker Compose requires .env file for proper variable expansion

Conversation

tintisimone commented Jun 3, 2026

Description

Issues

Type of change

Dependencies

Tests

Uh oh!

github-actions Bot commented Jun 3, 2026

Dependency Review

Scanned Files

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants