Evaluating the Evaluators: Metrics for Compositional Text-to-Image Generation

📖 Overview

This repository contains the code and resources for our paper:

Evaluating the Evaluators: Metrics for Compositional Text-to-Image Generation
Seyed Amir Kasaei, Ali Aghayari, Arash Marioriyad, Niki Sepasian, MohammadAmin Fazli,
Mahdieh Soleymani Baghshah, Mohammad Hossein Rohban

Paper (arXiv) | Project Page

🚀 Introduction

Text-to-image generation has made impressive progress, but evaluating compositional alignment remains a fundamental challenge.
This project systematically analyzes widely used evaluation metrics to understand their reliability, limitations, and correspondence to human judgment.

We study 12 metrics across 8 compositional categories using T2I-CompBench++. The main takeaways are:

No single metric is universally reliable.
Embedding-based metrics (e.g., ImageReward, HPS) and VQA-based metrics (e.g., DA Score, VQA Score) each have strengths and weaknesses.
Image-only metrics (e.g., CLIP-IQA, Aesthetic Score) contribute little to compositional evaluation.

📊 Results

Correlation analysis (Pearson, Kendall, Spearman) across compositional categories.
Regression analysis to examine combined contributions of metrics.
Distributional study highlighting biases and limitations.
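
As a rough illustration of the correlation analysis, the sketch below computes Pearson, Spearman, and Kendall (tau-b) coefficients between one metric's scores and human ratings. It is a minimal pure-Python sketch, not the code used in the paper: the function names and the toy numbers are hypothetical, and in practice `scipy.stats.pearsonr`, `spearmanr`, and `kendalltau` would do the same job.

```python
from math import sqrt


def pearson(x, y):
    """Pearson linear correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


def kendall_tau_b(x, y):
    """Kendall tau-b, which corrects for ties in either variable."""
    concordant = discordant = ties_x = ties_y = 0
    n = len(x)
    for i in range(n):
        for j in range(i + 1, n):
            dx, dy = x[i] - x[j], y[i] - y[j]
            if dx == 0 and dy == 0:
                continue  # tied in both variables: counted in neither term
            if dx == 0:
                ties_x += 1
            elif dy == 0:
                ties_y += 1
            elif dx * dy > 0:
                concordant += 1
            else:
                discordant += 1
    pairs = concordant + discordant
    return (concordant - discordant) / sqrt((pairs + ties_x) * (pairs + ties_y))


if __name__ == "__main__":
    # Hypothetical toy data: one metric's scores and human ratings per image.
    metric_scores = [0.21, 0.35, 0.50, 0.62, 0.90]
    human_ratings = [1, 3, 2, 4, 5]
    print("Pearson :", pearson(metric_scores, human_ratings))
    print("Spearman:", spearman(metric_scores, human_ratings))
    print("Kendall :", kendall_tau_b(metric_scores, human_ratings))
```

Running this once per metric and per compositional category would give a table of coefficients of the kind the correlation analysis compares.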

Key finding: A combination of complementary metrics is essential for trustworthy evaluation.
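
One way to examine how metrics combine is to regress human ratings on several metric scores at once and inspect the fitted weights and R². The sketch below is a minimal ordinary-least-squares fit in pure Python under that assumption. The exact regression setup used in the paper is not described here, and `fit_ols`, `solve_linear_system`, and the toy data are hypothetical.

```python
def solve_linear_system(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def fit_ols(features, target):
    """Least-squares fit of target on features plus an intercept.

    Returns (coefficients, r_squared); coefficients[0] is the intercept.
    """
    rows = [[1.0] + list(f) for f in features]
    k = len(rows[0])
    # Normal equations: (X^T X) beta = X^T y
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * t for r, t in zip(rows, target)) for i in range(k)]
    coef = solve_linear_system(xtx, xty)
    pred = [sum(c * v for c, v in zip(coef, r)) for r in rows]
    mean_t = sum(target) / len(target)
    ss_res = sum((t - p) ** 2 for t, p in zip(target, pred))
    ss_tot = sum((t - mean_t) ** 2 for t in target)
    return coef, 1.0 - ss_res / ss_tot


if __name__ == "__main__":
    # Hypothetical toy data: (metric_a, metric_b) scores and human ratings.
    scores = [(0.1, 0.9), (0.4, 0.2), (0.7, 0.5), (0.9, 0.8), (0.3, 0.6)]
    human = [3.1, 2.2, 3.6, 4.9, 3.0]
    coef, r2 = fit_ols(scores, human)
    print("intercept and weights:", coef)
    print("R^2 with both metrics:", r2)
    _, r2_a = fit_ols([(a,) for a, _ in scores], human)
    print("R^2 with metric_a only:", r2_a)
```

Comparing the R² of the combined fit against single-metric fits gives a simple view of whether the metrics carry complementary information.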
