adrian_b 2 hours ago [-]
The comparison results seem very plausible.
From the conclusion, I agree with:
> I wouldn't make either one the top-level coordinator by default.
But I do not agree with the follow-up sentence:
> The best shape is still a frontier coordinator or judge above them: GPT-5.5 or Claude Opus deciding what to delegate, checking the finished work, and rerunning narrow pieces when the answer looks wrong. These models make the worker layer much more serious, not the coordinator layer unnecessary.
For the coordinator or judge above them I would put myself, not an overly expensive LLM under the control of an external entity, thereby achieving higher quality, lower cost and greater security all at once.
throwa356262 21 minutes ago [-]
A lot of LLM discussion is driven by people who cannot code themselves.
There are multiple AI influencers on YouTube who can't code 5 lines of Python to save their lives. But they do own 3 DGX Sparks and a stack of maxed-out Mac minis...
scottchiefbaker 9 hours ago [-]
FWIW Opencode Go is giving 3x MiniMax M3 access right now. According to their chart you get almost 10x as much access to MM3 vs GLM 5.2.
Considering how close the models are, the extra free queries may be worth it.
oceanwaves 4 hours ago [-]
Yes, that's what I'm finding too. There seems to be a concerted promotional pricing campaign tied to M3's release across providers. Since the differences between the two models are subtle, it makes a lot of sense to fan out to M3.
killingtime74 2 hours ago [-]
I've used both and they are great. It would be better to have a GPT or Opus benchmark as well.
oceanwaves 14 hours ago [-]
GLM 5.2 edges ahead as the safer pick when the tasks are more challenging from-scratch builds and the result needs to arrive as a complete, runnable project. MiniMax M3 is the value pick for a lot of worker traffic.
ashenke 2 hours ago [-]
I'd love to see a comparison with both DeepSeek v4 models as well.