Skip to content

VaJavorek/T5_for_SLT

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

149 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

T5_for_SLT

Installation

cd T5_for_SLT/
conda create -n t5slt python=3.12
conda activate t5slt
pip install -r requirements.txt

Data

Our T5 dataloader expects the following data structure. The example is giver for YouTubeASL dataset. This dataset consists of multiple videos. Each video consists of multiple clips. Each clip has its own annotation.

YT-ASL
|---YouTubeASL.annotation.train.json
|---YouTubeASL.annotation.dev.json
|---keypoints
|     |--- YouTubeASL.keypoints.train.json
|     |--- YouTubeASL.keypoints.train.0.h5
|     |--- YouTubeASL.keypoints.train.1.h5
|     |--- ...
|     |--- YouTubeASL.keypoints.dev.json
|     |--- YouTubeASL.keypoints.dev.0.h5
|     |--- YouTubeASL.keypoints.dev.1.h5
|     |--- ...

Config file takes following inputs:

  • Annotation file (YouTubeASL.annotation.${split}.json)
{
    ${video_id}: {
                "clip_order": [${clip_name}, ..., ],
                ${clip_name}: {
                                "translation": ....}
                ${clip_name}: ....,
                },
    ${video_id}: ...
}
  • Metadata file (YouTubeASL.${modality}.${split}.json)
    • This file stores a dictionary with {video: shard_id} mapping for h5 shards.
    • Metadata file name must be the same as the shard names.

Each h5 shard has the following structure:

{
    ${video_id}: {
                    ${clip_name}: numpy.array (np.float16),
                    ${clip_name}: numpy.array, ...}
    ${video_id}:...
}

Usage

# Setup correct path
cd T5_for_SLT/
export PYTHONPATH=$PYTHONPATH:$(pwd)

# Run training script
python train/run_finetuning.py --config_file configs/config.yaml

# Run predict script
python eval/predict_attention_store_batch.py --config_file configs/predict_config_attention_isharah.yaml

Use --verbose for extended output.

For more details see scripts or config readme.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Jupyter Notebook 93.8%
  • Python 6.2%