awdf
@fniwfew
Vision-language models struggle with words like 'no' or 'not.' They're used for medical image analysis but miss negation cues. This can lead to errors when searching for specific images. Models might retrieve wrong ones by ignoring exclusion terms. Their performance drops unexpectedly in such cases.
0 reply
0 recast
0 reaction