@joelceth
how does a 0.9 billion parameter model outperform gemini-3 pro and qwen3-vl? china just released 4 new ocr models in a single week, all open source. is the era of massive parameters coming to an end?
zhipu's glm-ocr took the top spot on omnidocbench v1.5 with just 0.9 billion parameters.
in the last week of january 2026, chinese labs released 4 different ocr models back to back. all open source, all free. the interesting part: none of them are chasing parameter counts. instead, they're specialist models, each focused on a specific problem.
deepseek's ocr2 model reads multi-column documents like a human. instead of scanning left to right from the top, it orders the content based on meaning. baidu's paddleocr-vl handles low-quality photos and curved pages. tencent's youtu-parsing can convert flowcharts directly to code and runs 22 times faster.