Enable CodeGen deployment for Intel Arc Pro B-series GPU (XPU) #2467
Open
tintisimone wants to merge 1 commit into
Add Intel XPU support for CodeGen example with vLLM optimization.

Features:
- Intel vLLM 0.14.1-xpu Docker image with XPU-specific configuration
- XPU environment variables (VLLM_TARGET_DEVICE, ZE_FLAT_DEVICE_HIERARCHY, ONEAPI_DEVICE_SELECTOR)
- GPU device mounting (/dev/dri) with privileged mode
- 10GB shared memory allocation for model inference
- Full stack deployment: vLLM -> LLM Service -> Backend -> UI
- Qwen/Qwen2.5-Coder-7B-Instruct model support

Configuration files:
- compose.yaml: Docker Compose with XPU optimizations
- set_env.sh: Environment setup script
- README.md: Comprehensive deployment documentation
- QUICK_START.md: Quick reference guide
- Validation and testing scripts

Changes:
- Added CodeGen/docker_compose/intel/xpu/arc/ directory structure
- Updated CodeGen/README.md with Intel Arc GPU deployment option
- Consistent with Intel CPU example deployment pattern

Tested and validated on Intel Arc Pro B-series GPU.

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Dependency Review: ✅ No vulnerabilities or license issues found. Scanned files: none.
Pull request overview
Note
Copilot was unable to run its full agentic suite in this review.
Adds an Intel Arc (XPU) Docker Compose deployment option for the CodeGen example, including environment setup scripts, validation/testing helpers, and detailed deployment documentation.
Changes:
- Introduces a new `intel/xpu/arc` Docker Compose stack (compose + env script) for running CodeGen with Intel vLLM (XPU).
- Adds helper scripts to validate configuration and check deployment readiness.
- Adds multiple docs (README/quick start/test summaries) and links the new guide from `CodeGen/README.md`.
Reviewed changes
Copilot reviewed 11 out of 11 changed files in this pull request and generated 13 comments.
| File | Description |
|---|---|
| CodeGen/docker_compose/intel/xpu/arc/validate_config.sh | New config validation script (env, GPU device, YAML, summary output). |
| CodeGen/docker_compose/intel/xpu/arc/test_deployment.sh | New deployment readiness checker for docker compose configuration. |
| CodeGen/docker_compose/intel/xpu/arc/set_env.sh | New environment variable setup for Arc/XPU docker compose deployment. |
| CodeGen/docker_compose/intel/xpu/arc/compose.yaml | New compose stack for vLLM (XPU) + llm server + backend + UI. |
| CodeGen/docker_compose/intel/xpu/arc/README.md | Full Arc/XPU deployment guide. |
| CodeGen/docker_compose/intel/xpu/arc/QUICK_START.md | Short quick-start instructions and common commands. |
| CodeGen/docker_compose/intel/xpu/arc/TEST_RESULTS.md | Captures validation/test results for the configuration. |
| CodeGen/docker_compose/intel/xpu/arc/DEPLOYMENT_TEST_SUMMARY.md | Extended narrative of configuration validation steps/results. |
| CodeGen/docker_compose/intel/xpu/arc/DEPLOYMENT_SUCCESS.md | Deployment success report and usage examples. |
| CodeGen/docker_compose/intel/xpu/arc/.gitignore | Ignores the model cache directory. |
| CodeGen/README.md | Adds Arc/XPU guide link + validated configuration row. |
Comment on lines +18 to +24

    if [ -d "/dev/dri" ]; then
        ls -la /dev/dri/ | grep -E "card|render"
        echo " ✓ Intel GPU devices found"
    else
        echo " ✗ /dev/dri not found - Intel GPU may not be available"
        exit 1
    fi
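
As quoted, the check prints the success message whenever `/dev/dri` exists, even if the `grep` finds no `card` or `render` entries. A minimal sketch of a stricter variant follows. The function name `check_gpu_nodes` and its optional directory argument are illustrative assumptions, not code from this PR.

```shell
# Sketch only: succeed when at least one card*/renderD* node is present.
# check_gpu_nodes and its directory argument are hypothetical names.
check_gpu_nodes() {
    local dri_dir="${1:-/dev/dri}"
    local node
    local found=0
    if [ ! -d "$dri_dir" ]; then
        echo "  ✗ $dri_dir not found - Intel GPU may not be available"
        return 1
    fi
    for node in "$dri_dir"/card* "$dri_dir"/renderD*; do
        # An unmatched glob stays literal, so test for existence explicitly.
        if [ -e "$node" ]; then
            echo "    $node"
            found=1
        fi
    done
    if [ "$found" -eq 1 ]; then
        echo "  ✓ GPU device nodes found"
        return 0
    fi
    echo "  ✗ $dri_dir exists but contains no card*/renderD* nodes"
    return 1
}
```

Calling `check_gpu_nodes || exit 1` would keep the script's existing fail-fast behaviour. Note that a device node being present does not by itself show that the device is an Intel GPU.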
Comment on lines +51 to +58

    if command -v python3 &> /dev/null; then
        python3 -c "import yaml; yaml.safe_load(open('compose.yaml'))" 2>&1
        if [ $? -eq 0 ]; then
            echo " ✓ compose.yaml syntax is valid"
        else
            echo " ✗ compose.yaml has syntax errors"
            exit 1
        fi
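
In the quoted check, a missing PyYAML module makes the `python3 -c` call fail in the same way as a malformed file, so the script would report a syntax error that is not one. A hedged sketch that separates the two cases is below. The function name `validate_yaml` and the return code 2 for "skipped" are assumptions made for illustration.

```shell
# Sketch only: 0 = valid, 1 = invalid or unreadable, 2 = check skipped.
# validate_yaml and the return-code convention are hypothetical.
validate_yaml() {
    local file="${1:-compose.yaml}"
    if ! command -v python3 >/dev/null 2>&1; then
        echo "  - python3 not found, skipping YAML check"
        return 2
    fi
    if ! python3 -c "import yaml" >/dev/null 2>&1; then
        echo "  - PyYAML not installed, skipping YAML check"
        return 2
    fi
    if [ ! -r "$file" ]; then
        echo "  ✗ $file is missing or not readable"
        return 1
    fi
    # Pass the path as an argument instead of embedding it in Python source.
    if python3 -c 'import sys, yaml
with open(sys.argv[1]) as fh:
    yaml.safe_load(fh)' "$file"; then
        echo "  ✓ $file syntax is valid"
        return 0
    fi
    echo "  ✗ $file has syntax errors"
    return 1
}
```

Testing the command directly in the `if` also avoids the separate `$?` comparison, which is easy to break when a line is later inserted between the command and the test.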
Comment on lines +35 to +40

    if [ -z "$HF_TOKEN" ]; then
        echo " ✗ HF_TOKEN not set"
        exit 1
    else
        echo " ✓ HF_TOKEN: ${HF_TOKEN:0:10}..."
    fi
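
The quoted snippet echoes the first ten characters of `HF_TOKEN`, which can end up in terminal scrollback or CI logs. A minimal sketch that confirms the token is present without printing any part of it follows. The function name `report_token` is a hypothetical helper, not part of the PR.

```shell
# Sketch only: report whether a token is set without revealing its value.
# report_token is a hypothetical helper name.
report_token() {
    local token="${1-}"
    if [ -z "$token" ]; then
        echo "  ✗ HF_TOKEN not set"
        return 1
    fi
    echo "  ✓ HF_TOKEN is set (value hidden)"
}
```

In the validation script this could be called as `report_token "${HF_TOKEN-}" || exit 1`.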
Comment on lines +61 to +67

    $COMPOSE_CMD config > /dev/null 2>&1
    if [ $? -eq 0 ]; then
        echo " ✓ Docker Compose configuration is valid"
    else
        echo " ✗ Docker Compose configuration has errors"
        exit 1
    fi
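
Because the quoted check sends both output streams to `/dev/null`, a failing configuration is reported without any detail about what went wrong. One possible shape for a check that keeps the diagnostics is sketched below. `run_config_check` is a hypothetical wrapper that takes the command to run as its arguments.

```shell
# Sketch only: run a validation command, show its output only on failure.
# run_config_check is a hypothetical wrapper name.
run_config_check() {
    local output
    if output=$("$@" 2>&1); then
        echo "  ✓ Docker Compose configuration is valid"
        return 0
    fi
    echo "  ✗ Docker Compose configuration has errors:"
    # Indent the captured diagnostics so they stand out in the report.
    printf '%s\n' "$output" | sed 's/^/      /'
    return 1
}
```

With the script's existing variable this might be invoked as `run_config_check $COMPOSE_CMD config || exit 1`, leaving `$COMPOSE_CMD` unquoted so that a value such as `docker compose` still splits into two words.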
Comment on lines +18 to +24

    if [ -d "/dev/dri" ]; then
        ls -la /dev/dri/ | grep -E "card|render"
        echo " ✓ Intel GPU devices found"
    else
        echo " ✗ /dev/dri not found - Intel GPU may not be available"
        exit 1
    fi
Comment on lines +13 to +15

    devices:
      - /dev/dri:/dev/dri
    privileged: true
Comment on lines +26 to +27

    healthcheck:
      test: ["CMD-SHELL", "curl -f http://localhost:80/health || exit 1"]

### Step 1: Setup Environment (1 minute)
```bash
cd /home/gta/GenAIExamples/CodeGen/docker_compose/intel/xpu/arc
```
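
The quoted quick-start step changes into an absolute path under `/home/gta`, which is specific to one machine's layout. A small sketch of building the same directory from a configurable checkout location is shown here. `GENAIEXAMPLES_DIR` and `arc_compose_dir` are hypothetical names introduced for illustration.

```shell
# Sketch only: derive the Arc compose directory from the repository root.
# GENAIEXAMPLES_DIR and arc_compose_dir are hypothetical names.
arc_compose_dir() {
    local root="${1:-${GENAIEXAMPLES_DIR:-$PWD/GenAIExamples}}"
    echo "$root/CodeGen/docker_compose/intel/xpu/arc"
}
```

A reader would then run something like `cd "$(arc_compose_dir "$HOME/GenAIExamples")"`, substituting wherever the repository was cloned.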

### Test 1: Health Check
```bash
curl http://your_host_ip:8028/health
```
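
A single `curl` right after startup can fail simply because the model is still loading. A hedged sketch of a retry loop around an arbitrary probe command follows. `wait_until_healthy` is a hypothetical helper, and the attempt count and delay in the usage note are illustrative values, not measurements from this deployment.

```shell
# Sketch only: retry a probe command until it succeeds or attempts run out.
# wait_until_healthy is a hypothetical helper name.
wait_until_healthy() {
    local attempts="$1"
    local delay="$2"
    shift 2
    local i=1
    while [ "$i" -le "$attempts" ]; do
        if "$@" >/dev/null 2>&1; then
            echo "healthy after $i attempt(s)"
            return 0
        fi
        sleep "$delay"
        i=$((i + 1))
    done
    echo "still unhealthy after $attempts attempt(s)"
    return 1
}
```

For the endpoint quoted above, a call could look like `wait_until_healthy 30 10 curl -fsS http://your_host_ip:8028/health`, where `-f` makes `curl` return a non-zero status on HTTP errors.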
Comment on lines +249 to +250

1. **Port Conflict Resolution**: Successfully changed LLM service port from 9000 to 9001
2. **.env File Requirement**: Docker Compose requires .env file for proper variable expansion
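
If a `.env` file is wanted next to `compose.yaml`, as the second quoted item suggests, it can be generated from the variables already exported by the environment script instead of being maintained by hand. The sketch below assumes the values contain no newlines. `write_env_file` is a hypothetical helper name.

```shell
# Sketch only: write NAME=value lines for the listed exported variables.
# write_env_file is a hypothetical helper; values must not contain newlines.
write_env_file() {
    local out="$1"
    shift
    local name
    local value
    : > "$out"
    # The file may hold secrets such as HF_TOKEN, so restrict access first.
    chmod 600 "$out"
    for name in "$@"; do
        # printenv only sees exported variables; unset ones become empty.
        value=$(printenv "$name") || value=""
        printf '%s=%s\n' "$name" "$value" >> "$out"
    done
}
```

A possible invocation after sourcing `set_env.sh` is `write_env_file .env HF_TOKEN VLLM_TARGET_DEVICE`. Both variable names appear in this PR, and the list would need extending to whatever else `compose.yaml` references.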
Description
The summary of the proposed changes as well as the relevant motivation and context.
Issues
List the issue or RFC link this PR is working on. If there is no such link, please mark it as n/a.
Type of change
List the type of change like below. Please delete options that are not relevant.
Dependencies
List the newly introduced 3rd party dependency if exists.
Tests
Describe the tests that you ran to verify your changes.