joelceth pfp
joelceth

@joelceth

how does a 0.9 billion parameter model outperform gemini-3 pro and qwen3-vl? china just released 4 new ocr models in a single week, all open source. is the era of massive parameters coming to an end? zhipu's glm-ocr model became world number one on omnidocbench v1.5 with just 0.9 billion parameters. in the last week of january 2026, china released 4 different ocr models back to back. all open source, all free. the interesting part: none of them are chasing parameter counts. instead, they're expert models focused on specific problems. deepseek's ocr2 model reads multi-column documents like a human. instead of going left to right from the top, it reorders based on meaning. baidu's paddleocr-vl handles low-quality photos and curved pages. tencent's youtu-parsing can convert flowcharts directly to code and runs 22 times faster.
0 reply
8 recasts
50 reactions