TIFA

Study / Research

A paper that proposes using few-shot learning to decompose a prompt into atomic properties (yes/no questions) for quantitative evaluation of text-to-image faithfulness by MLLMs.

Mentioned in 1 video