Skip to content

Mirai-Gadget-Lab/Multimodal_Emotion_Recognition

Repository files navigation

Multimodal_Emotion_Recognition

MultiModal Emotion Recognition using Cross modal Interaction module and multiloss

  • Data: KEMDy19
  • Modality: Audio, Text

Installation

Requirements

  • Linux
  • Python 3.7+
  • PyTorch 1.11.0 or higher and CUDA

a. Create a conda virtual environment and activate it.

conda create -n MER python=3.7
conda activate MER

b. Install PyTorch and torchvision following the official instructions

c. Clone this repository.

git clone https://github.com/Mirai-Gadget-Lab/Multimodal_Emotion_Recognition
cd Multimodal_Emotion_Recognition

d. Install requirments.

pip install -r requirements.txt

Prepare for training

a. Prepare data

  • root_path: Original KEMD19 path Ex) /home/ubuntu/data/KEMD_19/
  • save_path: save folder, default: ./data/
python preprocess.py --root_path your_KEMD_19_path --save_path ./data/

Here is the preprocess flow chart.

Note that, wav_length cliping is conducted in train_hf.sh or inference.py

b. Set config

Change config.py for your environment.

But, i recommand default config setting.

Train

Run Training code

bash train_hf.sh

Check your GPU, and change train_hf.sh properly.

Inference

if you train the model your self using above code, execute below codes.

CUDA_VISIBLE_DEVICES=0 python inference.py --model_save_path ./models_zoo/checkpoint/

Result

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors