More Qualitative Comparisons

In each prompt, the underlined phrase marks the entity affected by description omission. Black arrows point to bounding boxes that some methods fail to represent accurately while other methods represent them precisely. Red arrows mark a failure in either spatial or textual grounding, and green arrows mark successful grounding of a specific entity.



[Figure: comparison-01, with header row]