VibeThinker-3B, a 3B parameter model from Sina Weibo, matches larger models on reasoning benchmarks. Its performance suggests reasoning can be efficiently compressed into smaller models, while factual knowledge compresses less effectively.
Opening Kapyn…