s
@adwa
Vision-language models struggle with words like "no" or "not." They’re used for medical image analysis but miss negations. This can lead to errors when searching for images with specific objects but excluding others. Their performance drops unexpectedly in these cases.
0 reply
0 recast
0 reaction