FlopperamUnrealMCP

Unreal Engine 5.5 C++ project featuring AI-powered conversational MetaHuman with Text-to-Speech, facial animation via NVIDIA Audio2Face, and chatbot integration.

Features

Text-to-Speech Component: Blueprint-compatible component using OpenAI's TTS API (model: tts-1, voice: alloy)
NVIDIA Audio2Face Integration: Real-time facial animation generation from audio via NVIDIA ACE
Dify Chatbot Integration: AI conversation management with streaming responses
MetaHuman Support: Full ARKit to MetaHuman blendshape mapping (55+ facial curves)
Voice Capture: Real-time microphone input with Silero VAD (Voice Activity Detection)
Runtime Audio: Uses USoundWaveProcedural for dynamic audio playback

Prerequisites

Software Requirements

Requirement	Version	Notes
Unreal Engine	5.5.4	Required for MetaHuman and ACE plugin compatibility
Visual Studio	2022	For C++ compilation
NVIDIA MetaHuman	Hadley	Required for Audio2Face integration
NVIDIA ACE Plugin	2.5+	Required for Audio2Face integration
Operating System	Windows 10/11	Linux not currently supported

Hardware Requirements

Component	Minimum	Recommended
GPU	NVIDIA RTX 2080	RTX 3080 or higher
RAM	16 GB	32 GB
VRAM	8 GB	10+ GB
Storage	50 GB free	SSD recommended

API Keys Required

You must obtain the following API keys and set them as environment variables:

Service	Environment Variable	Purpose	Get Key From
OpenAI	`OPENAI_API_KEY`	Text-to-Speech (TTS)	platform.openai.com
NVIDIA	`NVIDIA_API_KEY`	Audio2Face facial animation	build.nvidia.com
Dify	`DIFY_API_KEY`	Chatbot conversation management	Your Dify instance dashboard

Setting Environment Variables

Windows PowerShell:

$env:OPENAI_API_KEY="sk-proj-your-key-here"
$env:NVIDIA_API_KEY="nvapi-your-key-here"
$env:DIFY_API_KEY="app-your-key-here"

Windows CMD:

set OPENAI_API_KEY=sk-proj-your-key-here
set NVIDIA_API_KEY=nvapi-your-key-here
set DIFY_API_KEY=app-your-key-here

Windows System Environment (Permanent):

Open System Properties → Advanced → Environment Variables
Add new User or System variables for each key

NVIDIA ACE Plugin Installation

Download the ACE Plugin from NVIDIA:
- ACE Unreal Plugin Installation Guide
Install the plugin to your Unreal Engine installation
Enable the plugin in your project settings
Verify installation by checking for "NV ACE Reference" in Plugins menu

Installation

Clone the repository
Right-click on FlopperamUnrealMCP.uproject and select "Generate Visual Studio project files"
Open the project in Unreal Engine 5.5
Compile the C++ code

Usage

In Blueprint:

Add a TTSComponent to your actor
Set the OpenAI API key using SetAPIKey node
Call PlayTTSAudio with your text to speak
Or use PlayAudioFromFile to play local WAV files

In C++:

// Add component
UTTSComponent* TTS = CreateDefaultSubobject<UTTSComponent>(TEXT("TTSComponent"));

// Set API key
TTS->SetAPIKey("your-api-key");

// Play TTS
TTS->PlayTTSAudio("Hello world");

// Or play from file
TTS->PlayAudioFromFile("path/to/audio.wav");

NVIDIA Audio2Face Emotion Parameters

The project includes comprehensive emotion parameter controls for NVIDIA Audio2Face facial animation generation. These parameters are fully validated against NVIDIA's official ACE plugin source code and native library documentation.

Parameter Configuration

All parameters are configurable in the Details Panel when selecting an Audio2FaceClient component:

Audio2Face Configuration Category

Parameter	Range	Default	Description
PreferredEmotionStrength	0.0 - 1.0	1.0	Overall emotion intensity (maps to `OverallEmotionStrength`)
EmotionStrength	0.0 - 1.0	1.0	Legacy parameter for backward compatibility

Audio2Face Emotion Category

Parameter	Range	Default	NVIDIA Mapping	Description
DetectedEmotionContrast	0.3 - 3.0	1.0	`emotion_contrast`	Increases spread between emotion values by pushing them higher or lower
MaxDetectedEmotions	1 - 6	3	`max_emotions`	Sets firm limit on quantity of emotion sliders engaged by A2E
DetectedEmotionSmoothing	0.0 - 1.0	0.7	`live_blend_coef`	Coefficient for smoothing emotions over time (0=jittery, 1=no updates)
bEnableEmotionOverride	boolean	true	`enable_preferred_emotion`	Activate blending between app-provided emotions and A2F detection
EmotionOverrideStrength	0.0 - 1.0	0.5	`preferred_emotion_strength`	Balance between app overrides (1.0) and A2F detection (0.0)

Official Parameter Validation

These parameters are validated against NVIDIA's official sources:

Source Files Verified:

/Engine/Plugins/Marketplace/NV_ACE_Reference/Source/LegacyA2FRemote/Private/LegacyA2FRemote.cpp (lines 404-409)
/Engine/Plugins/Marketplace/NV_ACE_Reference/Source/ThirdParty/Nvacl/Include/nvacl.h

Official Parameter Mapping:

NvACEEmotionParameters NvParams;
NvParams.emotion_contrast = InEmotionParameters->DetectedEmotionContrast;
NvParams.live_blend_coef = InEmotionParameters->DetectedEmotionSmoothing;
NvParams.enable_preferred_emotion = InEmotionParameters->bEnableEmotionOverride;
NvParams.preferred_emotion_strength = InEmotionParameters->EmotionOverrideStrength;
NvParams.emotion_strength = InEmotionParameters->OverallEmotionStrength;
NvParams.max_emotions = InEmotionParameters->MaxDetectedEmotions;

Usage Example

// Configure emotion parameters in Blueprint or C++
UAudio2FaceClient* A2FClient = GetComponentByClass<UAudio2FaceClient>();

// High emotion intensity with strong contrast
A2FClient->PreferredEmotionStrength = 1.0f;      // Full emotion strength
A2FClient->DetectedEmotionContrast = 2.0f;       // Strong emotion contrast
A2FClient->MaxDetectedEmotions = 4;              // Allow up to 4 emotions
A2FClient->DetectedEmotionSmoothing = 0.5f;      // Moderate smoothing
A2FClient->bEnableEmotionOverride = true;        // Enable blending
A2FClient->EmotionOverrideStrength = 0.3f;       // Favor A2F detection

// Process audio with these parameters
A2FClient->ProcessAudioToAnimation(AudioData, AudioDuration);

Parameter Effects

DetectedEmotionContrast (emotion_contrast):

0.3 = Subtle emotion differences
1.0 = Natural emotion spread (default)
3.0 = Extreme emotion differences

DetectedEmotionSmoothing (live_blend_coef):

0.0 = No smoothing (can be jittery)
0.7 = Good balance (default)
1.0 = Maximum smoothing (emotions barely change)

EmotionOverrideStrength (preferred_emotion_strength):

0.0 = Use only A2F-detected emotions
0.5 = Blend app and detected emotions (default)
1.0 = Use only app-provided emotion overrides

Debug Logging

Enable bEnableDebugLogging to see parameter values in console:

🎭 Audio2FaceClient: Emotion Parameters:
🎭   OverallEmotionStrength: 1.00
🎭   DetectedEmotionContrast: 1.00
🎭   MaxDetectedEmotions: 3
🎭   DetectedEmotionSmoothing: 0.70
🎭   bEnableEmotionOverride: true
🎭   EmotionOverrideStrength: 0.50

Recent Fix: 2x Playback Speed Issue (2025-08-28)

Problem

Audio from OpenAI's TTS API was playing at double speed in UE5.5, making speech unintelligible.

Timeline of Fix Attempts

Date/Time	Attempt	Result	Issue
Session 1	Manual WAV parsing	2x speed	Incorrect PCM offset/size calculation
Session 2	Pitch adjustment (0.5x)	Correct speed	Voice became masculine
Session 3	Sample duplication	Correct speed	Lowered pitch, masculine voice
Session 4	Zero padding	Correct speed	Audio artifacts, messy sound
Session 5	Linear interpolation	Crash	Array bounds error
Session 6	Sample rate lie (48kHz)	2x speed	UE5 ignored the lie
Session 7	Cubic interpolation	Unknown	Over-engineered solution
FINAL	UE5 built-in parser	✅ FIXED	Perfect playback

Root Cause Analysis

The issue wasn't with UE5's ability to play 24kHz audio. The manual WAV parser was incorrectly reading the PCM data offset or size, causing UE5 to receive malformed audio data.

The Solution

Use Unreal Engine's built-in FWaveModInfo class from AudioDecompress.h:

#include "AudioDecompress.h"

bool UTTSComponent::ParseWav(const TArray<uint8>& WavBytes, 
                             int32& OutSampleRate, 
                             int32& OutNumChannels, 
                             int32& OutBitsPerSample, 
                             const uint8*& OutPCM, 
                             int32& OutPCMSize) const
{
    FWaveModInfo Info;
    if (!Info.ReadWaveInfo((uint8*)WavBytes.GetData(), WavBytes.Num()))
        return false;

    OutNumChannels   = *Info.pChannels;
    OutSampleRate    = *Info.pSamplesPerSec;
    OutBitsPerSample = *Info.pBitsPerSample;
    OutPCM          = Info.SampleDataStart;
    OutPCMSize      = static_cast<int32>(Info.SampleDataSize);
    
    return true;
}

Key Insights

Always use framework tools: UE5's FWaveModInfo handles edge cases that manual parsing misses
Don't manipulate audio data: Pass PCM directly without modification
Use correct settings: SOUNDGROUP_Voice for TTS audio
Trust the framework: UE5 handles 24kHz audio perfectly when given correct data

Lessons Learned

Simplicity wins: The fix was simpler than all the workarounds attempted
Framework knowledge: Knowing about AudioDecompress.h would have saved hours
Wrong assumptions: The "bug" was in the parsing, not in UE5's audio system
Over-engineering trap: Complex solutions (interpolation, resampling) were unnecessary

Project Structure

FlopperamUnrealMCP/
├── Source/
│   └── FlopperamUnrealMCP/
│       ├── Public/
│       │   └── TTSComponent.h      # Component header
│       └── Private/
│           └── TTSComponent.cpp     # Implementation
├── Content/                         # Unreal assets
├── Config/                         # Project configuration
└── FlopperamUnrealMCP.uproject    # Project file

Contributing

Please follow the version control protocol:

Only modify files in Source/ directory
Don't modify .uasset files
Commit after each meaningful change
Use descriptive commit messages

License

[Project License Information]

Acknowledgments

OpenAI for the TTS API
Unreal Engine community for audio system documentation
Special thanks to the debugging sessions that led to the final fix

Name		Name	Last commit message	Last commit date
Latest commit History 525 Commits
.claude		.claude
.serena		.serena
Config		Config
Content		Content
ConvaiLog		ConvaiLog
Plugins/UnrealMCP		Plugins/UnrealMCP
Source/FlopperamUnrealMCP		Source/FlopperamUnrealMCP
__pycache__		__pycache__
nvidia_ace		nvidia_ace
protos		protos
protos_nvidia		protos_nvidia
.gitattributes		.gitattributes
.gitignore		.gitignore
.vsconfig		.vsconfig
=1.60.0		=1.60.0
ACE_Blendshape_Research.md		ACE_Blendshape_Research.md
API_KEYS_SETUP.md		API_KEYS_SETUP.md
BuildProject.bat		BuildProject.bat
CLAUDE.md		CLAUDE.md
COMPONENT_BASED_MIGRATION_GUIDE.md		COMPONENT_BASED_MIGRATION_GUIDE.md
CleanAndBuild.bat		CleanAndBuild.bat
CleanBuild.bat		CleanBuild.bat
CompileProject.bat		CompileProject.bat
DESIGN_IMPROVEMENTS.md		DESIGN_IMPROVEMENTS.md
DIFY_CHAT_INTEGRATION_GUIDE.md		DIFY_CHAT_INTEGRATION_GUIDE.md
FIX_LIVECODING_ERROR.md		FIX_LIVECODING_ERROR.md
FIX_WIDGET_BINDING_ISSUE.md		FIX_WIDGET_BINDING_ISSUE.md
FixLiveCoding.bat		FixLiveCoding.bat
FlopperamUnrealMCP.code-workspace		FlopperamUnrealMCP.code-workspace
FlopperamUnrealMCP.uproject		FlopperamUnrealMCP.uproject
GenerateProjectFiles.bat		GenerateProjectFiles.bat
LIVE_CODING_FIX.md		LIVE_CODING_FIX.md
LaunchProject.bat		LaunchProject.bat
LaunchWithCompile.bat		LaunchWithCompile.bat
README.md		README.md
README_BUILD.md		README_BUILD.md
RebuildProject.bat		RebuildProject.bat
RecoverWidget.md		RecoverWidget.md
SILERO_VAD_SETUP.md		SILERO_VAD_SETUP.md
Stage3_COMPLETE_Instructions.md		Stage3_COMPLETE_Instructions.md
TEST_README.md		TEST_README.md
USAGE_INSTRUCTIONS.md		USAGE_INSTRUCTIONS.md
UpgradeLog.htm		UpgradeLog.htm
VAD_Testing_Instructions.md		VAD_Testing_Instructions.md
VOICE_BUTTON_SETUP.md		VOICE_BUTTON_SETUP.md
VOICE_CAPTURE_INTEGRATION.md		VOICE_CAPTURE_INTEGRATION.md
animation_output.csv		animation_output.csv
audio2face_network_test.py		audio2face_network_test.py
audio_ready.signal		audio_ready.signal
bp.PNG		bp.PNG
nvidia_ace.a2f.v1_pb2_grpc.py		nvidia_ace.a2f.v1_pb2_grpc.py
nvidia_ace.a2f_authoring.v1_pb2_grpc.py		nvidia_ace.a2f_authoring.v1_pb2_grpc.py
nvidia_ace.animation_data.v1_pb2_grpc.py		nvidia_ace.animation_data.v1_pb2_grpc.py
nvidia_ace.animation_id.v1_pb2_grpc.py		nvidia_ace.animation_id.v1_pb2_grpc.py
nvidia_ace.audio.v1_pb2_grpc.py		nvidia_ace.audio.v1_pb2_grpc.py
nvidia_ace.controller.v1_pb2_grpc.py		nvidia_ace.controller.v1_pb2_grpc.py
nvidia_ace.emotion_aggregate.v1_pb2_grpc.py		nvidia_ace.emotion_aggregate.v1_pb2_grpc.py
nvidia_ace.emotion_with_timecode.v1_pb2_grpc.py		nvidia_ace.emotion_with_timecode.v1_pb2_grpc.py
nvidia_ace.health_pb2_grpc.py		nvidia_ace.health_pb2_grpc.py
nvidia_ace.services.a2f.v1_pb2_grpc.py		nvidia_ace.services.a2f.v1_pb2_grpc.py
nvidia_ace.services.a2f_authoring.v1_pb2_grpc.py		nvidia_ace.services.a2f_authoring.v1_pb2_grpc.py
nvidia_ace.services.a2f_controller.v1_pb2_grpc.py		nvidia_ace.services.a2f_controller.v1_pb2_grpc.py
nvidia_ace.services.a2x_export_config.v1_pb2_grpc.py		nvidia_ace.services.a2x_export_config.v1_pb2_grpc.py
nvidia_ace.services.animation_controller.v1_pb2_grpc.py		nvidia_ace.services.animation_controller.v1_pb2_grpc.py
nvidia_ace.services.animation_data.v1_pb2_grpc.py		nvidia_ace.services.animation_data.v1_pb2_grpc.py
nvidia_ace.status.v1_pb2_grpc.py		nvidia_ace.status.v1_pb2_grpc.py
nvidia_ace_a2f_v1_pb2.py		nvidia_ace_a2f_v1_pb2.py
nvidia_ace_a2f_v1_pb2_grpc.py		nvidia_ace_a2f_v1_pb2_grpc.py
nvidia_ace_animation_data_v1_pb2.py		nvidia_ace_animation_data_v1_pb2.py
nvidia_ace_animation_data_v1_pb2_grpc.py		nvidia_ace_animation_data_v1_pb2_grpc.py
nvidia_ace_audio_v1_pb2.py		nvidia_ace_audio_v1_pb2.py
nvidia_ace_audio_v1_pb2_grpc.py		nvidia_ace_audio_v1_pb2_grpc.py
nvidia_ace_status_v1_pb2.py		nvidia_ace_status_v1_pb2.py
nvidia_ace_status_v1_pb2_grpc.py		nvidia_ace_status_v1_pb2_grpc.py
proto_loader.py		proto_loader.py
requirements_test.txt		requirements_test.txt
test_audio2face_grpc_proper.py		test_audio2face_grpc_proper.py
test_audio2face_nvidia.py		test_audio2face_nvidia.py
test_tts_output.wav		test_tts_output.wav

Folders and files

Latest commit

History

Repository files navigation

FlopperamUnrealMCP

Features

Prerequisites

Software Requirements

Hardware Requirements

API Keys Required

Setting Environment Variables

NVIDIA ACE Plugin Installation

Installation

Usage

In Blueprint:

In C++:

NVIDIA Audio2Face Emotion Parameters

Parameter Configuration

Audio2Face Configuration Category

Audio2Face Emotion Category

Official Parameter Validation

Usage Example

Parameter Effects

Debug Logging

Recent Fix: 2x Playback Speed Issue (2025-08-28)

Problem

Timeline of Fix Attempts

Root Cause Analysis

The Solution

Key Insights

Lessons Learned

Project Structure

Contributing

License

Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages