Multimodal_Emotion_Recognition

Multimodal emotion recognition using a cross-modal attention module and contrastive loss

Data: KEMDy19
Modality: Audio, Text

Installation

Experiment setting

Linux
Python 3.8.16
PyTorch 1.13.1 and CUDA 11.7

a. Create a conda virtual environment and activate it.

conda create -n MER python=3.8
conda activate MER

b. Install PyTorch and torchvision following the official instructions.

c. Clone this repository.

d. Install the requirements.

pip install -r requirements.txt

e. Install DeepSpeed.

DeepSpeed needs libaio-dev first. Install it with:

sudo apt-get install libaio-dev

Then install DeepSpeed with:

DS_BUILD_CPU_ADAM=1 DS_BUILD_FUSED_ADAM=1 DS_BUILD_UTILS=1 DS_BUILD_AIO=1 pip install deepspeed==0.9.0 --global-option="build_ext" --global-option="-j11" --no-cache-dir

For details, see the installation instructions in the official DeepSpeed GitHub repository.
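Once the environment is set up, it can help to confirm that it matches the experiment setting listed above. The following is a minimal sketch, not part of this repository; the function names are illustrative, and the PyTorch check is skipped when torch is not installed.

```python
import importlib.util
import sys

# Python version from the experiment setting (major, minor).
EXPECTED_PYTHON = (3, 8)


def python_matches(expected=EXPECTED_PYTHON, actual=None):
    """Return True when the running Python has the expected major.minor."""
    if actual is None:
        actual = sys.version_info[:2]
    return tuple(actual) == tuple(expected)


def torch_summary():
    """Report PyTorch and CUDA versions, or None if torch is not installed."""
    if importlib.util.find_spec("torch") is None:
        return None
    import torch

    return {
        "torch": torch.__version__,            # expected to start with 1.13.1
        "cuda": torch.version.cuda,            # expected 11.7
        "cuda_available": torch.cuda.is_available(),
    }


if __name__ == "__main__":
    print("Python 3.8:", python_matches())
    print("PyTorch:", torch_summary())
```

If the reported versions differ from the experiment setting, results may not reproduce exactly.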

Prepare for training

a. Prepare data

root_path: path to the original KEMDy19 dataset, e.g. /home/ubuntu/data/KEMD_19/
save_path: output folder (default: ./data/)

python preprocess.py --root_path your_KEMD_19_path --save_path ./data/
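For reference, the two arguments above could be declared as follows. This is only a sketch of how such a command-line interface is commonly written with argparse; it is an assumption and may not match what preprocess.py actually does.

```python
import argparse


def build_parser():
    """Illustrative parser for the two documented arguments (hypothetical)."""
    parser = argparse.ArgumentParser(
        description="Preprocess KEMDy19 (illustrative sketch)"
    )
    # Path to the original dataset; no default, so it must be given.
    parser.add_argument("--root_path", required=True,
                        help="path to the original KEMDy19 dataset")
    # Output folder; the default follows the README.
    parser.add_argument("--save_path", default="./data/",
                        help="folder where preprocessed data is written")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args.root_path, args.save_path)
```

With this layout, omitting --save_path writes to ./data/, which is the default stated above.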

Here is the preprocess flow chart.

Note that wav_length clipping is not part of preprocessing; it is performed in train_hf.sh or inference.py.
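The clipping mentioned in the note can be pictured as truncating each waveform to a maximum length. A minimal sketch, assuming a 16 kHz sample rate and a hypothetical max_seconds parameter; the repository's own wav_length setting may be expressed differently (for example, directly in samples).

```python
def clip_waveform(waveform, max_seconds, sample_rate=16000):
    """Truncate a 1-D waveform to at most max_seconds of audio.

    Works on any sliceable sequence (list, NumPy array, 1-D tensor).
    Waveforms that are already short enough are returned unchanged.
    """
    max_samples = int(max_seconds * sample_rate)
    return waveform[:max_samples]
```

Clipping at training or inference time, rather than during preprocessing, lets the maximum length be changed without regenerating the data.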

Train

Run the training code:

bash train_hf.sh

Check your GPU setup and adjust train_hf.sh and the files in configs accordingly.

Tensorboard

You can view the training logs with TensorBoard:

tensorboard --logdir ./output/log/tensorboard_what_you_want/version_0/

Inference

Because this repository trains with DeepSpeed stage 2, the checkpoint is saved as shards split across GPUs. Merge the shards into a single set of model weights before running inference:

python make_model_weights.py

Then run inference:

CUDA_VISIBLE_DEVICES=0 python inference.py
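The merge step that precedes inference can be sketched with DeepSpeed's own conversion helper. This is an assumption about the general approach, not a copy of make_model_weights.py; the function names, the "model." prefix, and any paths passed in are hypothetical.

```python
from pathlib import Path


def strip_prefix(state_dict, prefix="model."):
    """Drop a wrapper prefix from parameter names, if present (hypothetical)."""
    return {
        (key[len(prefix):] if key.startswith(prefix) else key): value
        for key, value in state_dict.items()
    }


def merge_zero_checkpoint(checkpoint_dir, output_file, prefix="model."):
    """Consolidate a DeepSpeed ZeRO checkpoint folder into one weights file."""
    # Imported lazily so strip_prefix stays usable without DeepSpeed installed.
    import torch
    from deepspeed.utils.zero_to_fp32 import (
        get_fp32_state_dict_from_zero_checkpoint,
    )

    # checkpoint_dir is the folder that contains the "latest" tag file.
    state_dict = get_fp32_state_dict_from_zero_checkpoint(str(checkpoint_dir))
    state_dict = strip_prefix(state_dict, prefix)

    Path(output_file).parent.mkdir(parents=True, exist_ok=True)
    torch.save(state_dict, output_file)
    return output_file
```

Whether a prefix needs stripping depends on how the model was wrapped during training, so check the parameter names in the merged file against the model before loading.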

Result

In the table, CE denotes cross-entropy loss and CA denotes contrastive loss.

Multimodal(CAT) fuses the two modalities by concatenation, and Multimodal(CMA) fuses them with cross-modal attention.
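The difference between the two fusion strategies can be illustrated with NumPy. This is a generic sketch of concatenation versus single-head scaled dot-product cross-attention, not the model code in this repository; shapes, names, and the absence of learned projections are simplifying assumptions.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def fuse_concat(audio_vec, text_vec):
    """CAT-style fusion: join the pooled audio and text vectors."""
    return np.concatenate([audio_vec, text_vec], axis=-1)


def cross_modal_attention(query_seq, context_seq):
    """CMA-style fusion: each query step attends over the other modality.

    query_seq:   (T_q, d) features of one modality (e.g. text tokens)
    context_seq: (T_c, d) features of the other modality (e.g. audio frames)
    Returns a (T_q, d) array; learned projections are omitted for brevity.
    """
    dim = query_seq.shape[-1]
    scores = query_seq @ context_seq.T / np.sqrt(dim)   # (T_q, T_c)
    weights = softmax(scores, axis=-1)                  # each row sums to 1
    return weights @ context_seq


rng = np.random.default_rng(0)
audio = rng.normal(size=(50, 16))   # 50 audio frames, 16-dim features
text = rng.normal(size=(12, 16))    # 12 text tokens, 16-dim features

cat_features = fuse_concat(audio.mean(axis=0), text.mean(axis=0))
cma_features = cross_modal_attention(text, audio)
```

Concatenation treats the modalities independently until the classifier, while cross-modal attention lets each text step weight the audio frames it is most related to.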
