man lifting a dumbbell
ConceptMentioned in 1 video
An example output of image captioning that was incorrect, used to illustrate how attention maps can reveal what the model was focusing on (arms, not a mug).
An example output of image captioning that was incorrect, used to illustrate how attention maps can reveal what the model was focusing on (arms, not a mug).