AI-QE

AI-QE is a tool designed to enhance quality assurance processes by leveraging Large Language Models (LLMs). It automates and streamlines testing workflows through two core functions:

Test Case Generation: Utilizes a framework to automatically generate comprehensive test cases, ensuring broad coverage and efficiency in identifying potential issues.
Test Case Execution and Summarization: Leverages the tool-calling capabilities of LLMs to execute these generated (or user-provided) test cases and provide summarized results, facilitating quick insights into software performance and reliability.

Installation

This section covers how to set up AI-QE and its dependencies.

Python Dependencies

First, install the required Python packages:

$ pip install -r requirements.txt

Virtualization Tools (Optional)

If you plan to use the demo codes that test libvirt RNG features, you will need to install virtualization-related packages.

$ sudo dnf install -y libvirt qemu-kvm
$ sudo systemctl start virtqemud virtnetworkd

Setting up an LLM

AI-QE supports local Ollama models, Google's Gemini models, and Claude models via Google Vertex AI. Choose the setup that best fits your needs.

Using Ollama

This option allows you to run LLM inference locally, which is ideal if you have local GPU resources.

Hardware Requirements: Ensure you have a host, VM, or container with an NVIDIA GPU and the CUDA toolkit installed for optimal performance.
Install Ollama: Follow the instructions on the Ollama GitHub repository to install it.
Pull a Tool-Calling Model: Pull a model that supports tool calling, such as jacob-ebey/phi4-tools:
```
$ ollama pull jacob-ebey/phi4-tools
```
Configure Ollama Host (Optional, for network access): If you need to access Ollama from another machine, configure OLLAMA_HOST in its service file. For example, add Environment="OLLAMA_HOST=0.0.0.0" to /etc/systemd/system/ollama.service and restart the service:
```
# cat /etc/systemd/system/ollama.service |grep Environment
Environment="OLLAMA_HOST=0.0.0.0"
# systemctl daemon-reload
# systemctl restart ollama.service
```

Using Gemini Models

If you prefer to use Google's Gemini models, you only need an API key. This option does not require local GPU setup.

Obtain a GEMINI_API_KEY: Get your API key from the Google AI Studio or Google Cloud Console.
Set Environment Variable: Before running ai-qe, set your GEMINI_API_KEY environment variable:
```
$ export GEMINI_API_KEY=YOUR_API_KEY_HERE
```
(Replace YOUR_API_KEY_HERE with your actual key.)

Using Claude Models

Claude models are supported through Google Vertex AI, providing access to Anthropic's Claude models via Google Cloud infrastructure.

Set up Google Cloud Project: Ensure you have a Google Cloud project with Vertex AI enabled.
Configure Authentication: Set up authentication for Google Cloud (e.g., using service account credentials or gcloud auth application-default login).
Set Environment Variables: Before running ai-qe, set the required environment variables:
```
$ export ANTHROPIC_VERTEX_PROJECT_ID=your-gcp-project-id
$ export CLOUD_ML_REGION=your-preferred-region
```
(Replace your-gcp-project-id with your actual Google Cloud project ID and your-preferred-region with your preferred region, e.g., us-central1.)
Install Required Dependencies: Ensure you have the necessary packages installed:
```
$ pip install langchain_google_vertexai anthropic
```

Usage

AI-QE can be run either through a web-based Gradio interface or directly from the command line.

Running the Web Interface

This starts a local web server, allowing you to interact with AI-QE via your browser.

With Ollama: Specify your Ollama server's IP address.
```
$ ./ai-qe -w -s {ollama_server_ip}
```
Now, open your browser and access http://localhost:7860 to use the demo.
With Gemini: Ensure GEMINI_API_KEY is set and specify the model.
```
$ export GEMINI_API_KEY=XXXXXXXX
$ ./ai-qe -w -m gemini-2.5-flash
```
Then, open your browser and access http://localhost:7860.

With Claude: Ensure environment variables are set and specify the Claude model.

$ export ANTHROPIC_VERTEX_PROJECT_ID=your-gcp-project-id
$ export CLOUD_ML_REGION=your-preferred-region
$ ./ai-qe -w -m claude-sonnet-4

Then, open your browser and access http://localhost:7860.

Running via Command Line

This allows for direct execution of test case generation and execution.

Generate&Execute Test Cases (Ollama Example): This command will generate and execute 2 test cases related to the rng device hotplug with rng and memory features.
```
$ ./ai-qe -r "generate test case test rng device hotplug with rng and memory feature" -s {ollama_server_ip} -n 2
```
Execute Manual Test Cases (Ollama Example): You can pass one or more manual test case files for AI-QE to execute.
```
$ ./ai-qe -s {ollama_server_ip} -c case1.txt -c case2.txt
```

Execute Manual Test Cases (Gemini Example): Using Gemini models.

$ export GEMINI_API_KEY=XXXXXXXX
$ ./ai-qe -m gemini-2.5-flash -c case1.txt -c case2.txt

Execute Manual Test Cases (Claude Example): Using Claude models.

$ export ANTHROPIC_VERTEX_PROJECT_ID=your-gcp-project-id
$ export CLOUD_ML_REGION=your-preferred-region
$ ./ai-qe -m claude-sonnet-4 -c case1.txt -c case2.txt

Remote Test Execution

AI-QE supports executing test cases on remote hosts via SSH, enabling distributed testing and execution in isolated environments.

Configuration

Remote execution can be configured via the configuration YAML file or command-line arguments.

Via Configuration File

Add the following section to your default_config.yaml:

remote_execution:
  enabled: true
  host: "test-server.example.com"
  port: 22
  username: "testuser"
  key_file: "/home/user/.ssh/id_rsa"  # SSH private key path
  password: ""                        # Alternative to key_file
  remote_work_dir: "/tmp/ai-qe-remote"
  sync_files: true

Via Command Line

$ ./ai-qe -s {ollama_server_ip} -c test_case.txt --remote-host test-server.example.com --remote-user testuser

Authentication Methods

AI-QE supports two SSH authentication methods:

SSH Key Authentication (Recommended):
```
key_file: "/path/to/private/key"
```
Password Authentication:
```
password: "your_password"
```

Features

Automatic Directory Management: Creates remote working directories as needed
File Synchronization: Automatically transfers files between local and remote hosts
Secure Communication: All operations use SSH encryption
Error Handling: Comprehensive error handling and logging for remote operations
Seamless Integration: Same AI agent workflow works for both local and remote execution

Example Usage

# Execute test case on remote host with SSH key
$ ./ai-qe -s localhost -c test_case.txt --remote-host production-server.com --remote-user deploy

# Generate and execute test cases remotely
$ ./ai-qe -r "test network connectivity" -s localhost --remote-host test-env.lab.com --remote-user tester

Command-Line Arguments

Here's a quick reference for common ai-qe arguments:

-r "{prompt}", --request "{prompt}": Natural language request for test case generation.
-t {path}, --template-yaml {path}: Path to a template YAML file for test case generation.
-c {file}, --test-case {file}: Path(s) to raw test case files for execution (can be used multiple times).
-n {num}, --case-number {num}: Randomly pick up num test cases to run (used with -r or -t).
-d {path}, --results-dir {path}: Directory path to store result files (default: ./results).
-s {ip}, --server-ip {ip}: Ollama server IP address (required for Ollama).
-p {port}, --server-port {port}: Ollama server Port number.
-m {model_name}, --model {model_name}: LLM model name (e.g., gemini-2.5-flash, jacob-ebey/phi4-tools, claude-sonnet-4).
-f {path}, --config-yaml {path}: Path to a configuration YAML file.
-l {level}, --log-level {level}: Set the logging level (DEBUG, INFO, WARNING, ERROR, CRITICAL; default: WARNING).
-w, --web: Run the web interface (starts a Gradio server).
--use-vertex-ai: Use Vertex AI for Gemini models (default: False).
--remote-host {host}: Remote host for test execution (enables remote mode).
--remote-user {user}: Remote host username for SSH authentication.

Set Log Level

To control the verbosity of the output when using the command line, use the --log-level option. Available levels are DEBUG, INFO, WARNING, ERROR, and CRITICAL. The default level is WARNING.

Example:

$ ./ai-qe -r "generate test case" --log-level DEBUG

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
ai_qe		ai_qe
test_items		test_items
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
ai-qe		ai-qe
default_config.yaml		default_config.yaml
guest.xml		guest.xml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI-QE

Table of Contents

Installation

Python Dependencies

Virtualization Tools (Optional)

Setting up an LLM

Using Ollama

Using Gemini Models

Using Claude Models

Usage

Running the Web Interface

Running via Command Line

Remote Test Execution

Configuration

Via Configuration File

Via Command Line

Authentication Methods

Features

Example Usage

Command-Line Arguments

Set Log Level

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AI-QE

Table of Contents

Installation

Python Dependencies

Virtualization Tools (Optional)

Setting up an LLM

Using Ollama

Using Gemini Models

Using Claude Models

Usage

Running the Web Interface

Running via Command Line

Remote Test Execution

Configuration

Via Configuration File

Via Command Line

Authentication Methods

Features

Example Usage

Command-Line Arguments

Set Log Level

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages