@surya19
ScreenAI is a Vision-Language Model (VLM) developed by Google AI that can comprehend both user interfaces (UIs) and infographics.
It's wild — capable of tasks like graphical question-answering, element annotation, summarization, navigation, and UI-specific QA.