Robo2u

Code Models

Data as of June 12, 2026
Anthropic | ? | 200K | $15/$75 | 📝👁️
SWE-Pro 65.0 | SWE 88.1 | Term2 69.2 | Arena 1580
SOC2; HIPAA eligible. System card 2026-05-29: 3.7% code-summary dishonesty, 5x fewer dishonest agentic reports vs 4.7; incremental capability gain
Anthropic | ? | 200K | $15/$75 | 📝👁️
SWE-Pro 64.3 | SWE 87.6 | Term2 68.5 | Arena 1573
SOC2; HIPAA eligible
Anthropic | ? | 200K | $5/$25401.78 | 📝👁️
SWE-Pro 53.4 | SWE 80.8 | Term2 65.4 | SWE-Multi 77.8 | Arena 1548
SOC2; HIPAA eligible
#4.5
OpenAI | ? | 400K | 📝👁️🔊
SWE-Pro 58.6 | SWE 82.6 | Term2 73.2 | Arena 1509
Google | ? | 1M | $2/$1210929.7 | 📝👁️🔊
SWE-Pro 54.2 | SWE 80.6 | Term2 68.5 | SWE-Multi 76.9 | Arena 1456
Anthropic | ? | 200K | $3/$15441.48 | 📝👁️
SWE 79.6 | Arena 1521
SOC2; HIPAA eligible
Alibaba | ? | 1M | $3/$8 | 📝👁️
SWE-Pro 60.6 | SWE 80.4 | Term2 69.7
Qwen3.7 flagship; agent-centric coding. SWE-Bench Verified 80.4, SWE-Bench Pro 60.6.
8. GLM 5.1 ●
#9.5
Z.ai | 754B MoE | 200K | $0.95/$3741.64 | 📝👁️
SWE 77.8 | Arena 1530
Open-weight; #1 SWE-Bench Pro (58.4)
Alibaba | ? | 128K | 962.44 | 📝👁️
SWE 78.8 | Arena 1453
10. Kimi K2.6 ●
#9.5
Moonshot | 1T (32B active, MoE) | 256K | 📝👁️
SWE-Pro 58.6 | SWE 80.2 | Term2 66.7 | SWE-Multi 76.7
Open-weight; Kimi table reports SOTA open coding/agent scores
11. MiniMax M2.7 ●
#10
MiniMax | 230B (10B active, MoE) | 128K | 452.53 | 📝👁️
SWE 80.2 | Arena 1425
Google | ? | 1M | $0.50/$31591.19 | 📝👁️🔊
SWE 78.0 | Arena 1436
13. MIMO v2 Pro ●
#11.5
Xiaomi | 32B | 128K | 1292.14 | 📝
SWE 78.0 | Arena 1433
14. GLM 4.6 ●
#12
Z.ai | 754B MoE | 200K | 📝
SWE 77.8
Open-weight MoE; ~216GB VRAM (Q4_K_M); basis for Big Pickle on OpenCode Zen
#12
OpenAI | ? | 128K | $2/$1474151.8 | 📝👁️🔊
SWE-Pro 57.7 | SWE 76.9 | Term2 65.4 | Arena 1457
#12.5
Z.ai | 744B (40B active, MoE) | 200K | $1/$4 | 📝👁️🎥
Native multimodal coding model; CogViT vision encoder; 30+ task joint RL; leads AndroidWorld, WebVoyager, ZClawBench; design-to-code specialist
#12.5
NVIDIA | 120B (12B active, MoE) | 1M | 📝
Hybrid Mamba-Attention MoE w/ LatentMoE; trained on 25T tokens; 2.2x throughput vs GPT-OSS-120B, 7.5x vs Qwen3.5-122B; native MTP speculative decoding
#12.5
OpenRouter | ? | 1M | free/free | 📝
High-performance foundation model for agentic workloads; tool use, code generation, automated workflows; compatible with Claude Code and OpenClaw. Provider may log prompts/completions.
19. Qwen3.6-27B ●
#12.5
Alibaba | 27B | 262K | 📝
Open-weight dense; ~80 tps on RTX 5090 with 218K context (vLLM 0.19)
#12.5
Alibaba | 35B (3B active) | 256K | 📝
Open-weight MoE; 35B total, 3B active; outperforms Claude Opus 4.7 on Simon Willison's pelican test
#12.5
OpenCode Zen | 754B MoE | 200K | free/free | 📝
SWE 77.8
Stealth model (GLM 4.6); free on OpenCode Zen; reasoning + tool calling; text-only
22. Llama 3.3 70B (Groq) ●
#12.5
Groq | 70B | 128K | $0.59/$0.793940.3 | 📝
23. DeepSeek V4 ●
#12.5
DeepSeek | 1.6T (49B active, MoE) | 1M | 📝
China API; self-host OK
24. Ring-2.6-1T ●
#12.5
InclusionAI | 1T (63B active, MoE) | 262K | 📝
Trillion-parameter thinking model; PinchBench 87.6; optimized for coding agents, tool use, multi-turn agent workflows
25. Ling-2.6-1T ●
#12.5
InclusionAI | 1T | 262K | $0.30/$3 | 📝
Instant instruct model; fast-thinking; SOTA on AIME26 and SWE-bench Verified; hybrid MLA + Linear Attention architecture
#12.5
InclusionAI | 104B (7.4B active, MoE) | 262K | 📝
Fast instruct model; lightweight companion to Ling-2.6-1T; coding, document processing, agent workflows
#12.5
Meta | 📝👁️
Meta's flagship coding-capable model.
28. MiniMax M3 ●
#12.5
MiniMax | ? | 1M | $0.60/$2 | 📝👁️
SWE-Pro 59.0 | Term2 66.0
Surpasses GPT-5.5 & Gemini 3.1 Pro on SWE-Bench Pro at ~5-10% of cost; approaches Claude Opus 4.7. SWE-fficiency 34.8.
Google | ? | 1M | $2/$9 | 📝👁️🔊
Term2 76.2
Beats Gemini 3.1 Pro on coding/agentic suite; MCP Atlas 83.6.
#12.5
Alibaba | ? | 1M | $0.40/$2 | 📝👁️🎥
AA coding 46.5; ScreenSpot Pro 79.0 (GUI grounding).
#12.5
NVIDIA | 550B (55B active, MoE) | 1M | 146.3 | 📝
43 programming languages; AA Intelligence Index 48.
Microsoft AI | 35B active (SMoE) | 256K | 📝
SWE-Pro 53.0
Microsoft's first reasoning model; SWE-Bench Pro 53.
Microsoft AI | 5B active | ? | 📝
SWE-Pro 51.0
Inference-efficient agentic coding model for GitHub Copilot/VS Code; Haiku-class size, cheaper.
#13.5
Mistral | 128B | 256K | $2/$8 | 📝👁️
SWE 77.6
Merged instruct/reasoning/coding flagship; replaces Devstral 2 in Mistral Vibe; self-hostable on as few as 4 GPUs
Moonshot | 1T | 128K | 452.38 | 📝👁️
SWE-Pro 50.7 | SWE 76.8 | Term2 50.8 | SWE-Multi 73.0 | Arena 1429
China API
36. Hy3 Preview ●
#16
Tencent Hunyuan | 295B (21B active, MoE) | 256K | 📝
SWE 74.4
Open-weight Hunyuan 3 preview; rebuilt RL/pretraining stack; strong code and agent benchmarks
37. Llama 4 Scout (Groq) ●
#16.5
Groq | 17Bx16E | 128K | $0.11/$0.345940.3 | 📝👁️
SWE 74.4
Together AI | 397B | 128K | $0.27/$0.8596 | 📝👁️
SWE 76.4 | Arena 1386
#17.5
Mistral | 675B MoE (41B active) | 256K | $0.50/$2481.04 | 📝👁️
Arena 1222
#19
OpenAI | ? | 128K | $2/$1411157.5 | 📝👁️🔊
SWE 76.3 | Arena 1339
DeepSeek | 685B | 128K | $0.55/$2353.75 | 📝
SWE 73.1 | Arena 1368
China API; self-host OK
xAI | ? | 128K | $0.20/$0.5023310.33 | 📝👁️
SWE 70.8 | Arena 1393
Anthropic | ? | 200K | $1/$51381.16 | 📝👁️
SWE 73.3 | Arena 1315
#21.5
Groq | 27B | 128K | $0.29/$0.596620.3 | 📝
SWE 72.4 | Arena 1344
45. DeepSeek V3.2 (Together) ●
#22
Together AI | 685B | 128K | $0.28/$0.42 | 📝
SWE 73.1 | Arena 1330
46. Devstral 2 ●
#25
Mistral | 24B | 128K | 📝
SWE 61.6 | Arena 1197

AI App Builders (prompt → working app)

Data as of April 26, 2026
1. v0
Vercel | Custom (built on Claude / GPT) | React / Next.js / Tailwind / shadcn | Vercel functions / API routes | One-click Vercel | Free / $20 Premium / $50 Team | Proprietary
First mover in the prompt-to-app space; tightly Vercel-integrated
2. Magic Patterns
Magic Patterns | Claude / GPT | React / Tailwind / shadcn / others | API integrations | GitHub export / Vercel | Free / $30 / $90 Pro+ | Proprietary
Strong design-fidelity output; screenshot-to-code
Tempo | Claude / GPT | React / Tailwind / shadcn | Supabase / external APIs | Tempo / GitHub export | Free / $30 / $80 Team | Proprietary
Visual editor + AI; component-first workflow
Lovable | Claude / GPT | React / Tailwind / shadcn | Supabase | Lovable hosting / GitHub export | Free / $20 / $50 / $100 Team | Proprietary
Strong Supabase integration; ex-GPT Engineer team
5. GitWit
GitWit | Claude / GPT | React / Vue | GitHub-native | GitHub Actions / direct PR | Free / $20 | Proprietary
GitHub-first; PR-based collaboration
6. Webcrumbs
Webcrumbs | Claude / GPT | React / Next.js | Webcrumbs / external | Webcrumbs / Vercel / GitHub | Free / $20 / $50 | Proprietary
Visual canvas; snippet-style component library
7. Trickle
Trickle | Claude / GPT | React | Trickle hosting | Trickle / GitHub | Free / $20 / $50 | Proprietary
AI-native website builder; landing-page focused
Replit | Claude (managed) | Any (Replit envs) | Replit deployments | Replit hosting (free tier limited) | $25/mo Replit Core | Proprietary
Long-tail framework support via Replit envs; agent that drives the IDE
9. Codev
Codev | Claude / GPT | Next.js + Supabase | Supabase | Vercel / GitHub | Free / $25 | Proprietary
Generates auth-ready full-stack apps end to end
StackBlitz | Claude / GPT / Gemini selectable | React / Vue / Svelte / Astro / others | WebContainers (Node in browser) | Netlify / GitHub / Cloudflare | Free / $20 / $50 Pro+ | Proprietary
Browser-native; uses WebContainers for full Node runtime
11. Same
Same | Claude / GPT / Gemini | React / Vue / Svelte | Various | Same hosting / Vercel / Netlify | Free / $20 / $40 | Proprietary
Multi-framework; clone-from-URL feature
12. Tldraw Computer ●
Tldraw | Claude / GPT | Generated UI in tldraw canvas | n/a (single-page demos) | Browser only | Free | Apache 2.0
Draw a layout → generates working interactive UI on the canvas
13. Open Lovable ●
Mendable | Any (BYOK) | React / Tailwind | Supabase / others | Self-hosted | Free (self-host) | AGPLv3
Open-source clone of Lovable; bring your own model + database