man lifting a dumbbell

Concept · Mentioned in 1 video

An incorrect image-captioning output, used to illustrate how attention maps can reveal what the model was focusing on when it produced the caption: the person's arms rather than the mug.
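To make the idea of reading an attention map concrete, here is a minimal sketch in plain Python. It assumes the model assigns one raw score per image region at the step where it emitted the wrong word. The region names, the scores, and the helper names `softmax` and `most_attended` are all hypothetical and chosen only for illustration; they are not taken from the video or from any particular captioning model.

```python
import math

def softmax(scores):
    """Convert raw attention scores into weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def most_attended(region_names, scores):
    """Return the region with the highest attention weight, plus all weights."""
    weights = softmax(scores)
    best = max(range(len(weights)), key=weights.__getitem__)
    return region_names[best], weights

# Hypothetical regions and raw scores for the decoding step that
# produced the word "dumbbell" (illustrative numbers only).
regions = ["arms", "mug", "background"]
scores = [2.0, 0.5, 0.1]

focus, weights = most_attended(regions, scores)
print(focus)  # prints "arms"
```

In a real captioning model the weights would cover a grid of image locations and be overlaid on the image as a heat map, but the reading is the same: the largest weight marks where the model was looking when it chose the word.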