BREAKING
Cognition Adds Claude Sonnet 5 to Devin
Sonnet 5 vs Opus 4.8
Sonnet 5 (New)
● Frontier-level coding
● Lower cost tier
● Backend for Devin
Opus 4.8 (High-end)
● Top-tier model
● Extended score 51.8%
● Beaten on FrontierCode
FrontierCode Extended Tasks
Extended: 150
Main: 100
Diamond: 50
How Devin Works Autonomously
1. Plan
2. Execute
3. Test
4. Debug
5. Pull request
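The five stages above can be read as a simple loop. The sketch below is only an illustration of that flow, not Cognition's implementation: the names `run_task`, `RunLog`, and `max_debug_rounds` are hypothetical, and the idea that a failed test leads to a debug round followed by a re-test is an assumption that the graphic does not state.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class RunLog:
    """Records which stages ran, in order (hypothetical helper)."""
    stages: List[str] = field(default_factory=list)
    opened_pr: bool = False


def run_task(run_tests: Callable[[int], bool], max_debug_rounds: int = 3) -> RunLog:
    """Walk the plan -> execute -> test -> debug -> pull request flow.

    `run_tests(attempt)` stands in for a real test suite and returns True
    when the tests pass on the given attempt (0-based).
    """
    log = RunLog()
    log.stages.append("plan")
    log.stages.append("execute")
    for attempt in range(max_debug_rounds + 1):
        log.stages.append("test")
        if run_tests(attempt):
            # Tests pass: the last stage is opening a pull request.
            log.stages.append("pull_request")
            log.opened_pr = True
            return log
        if attempt < max_debug_rounds:
            # Assumption: a failed test triggers a debug round, then a re-test.
            log.stages.append("debug")
    # Debug budget exhausted without passing tests: no pull request is opened.
    return log


if __name__ == "__main__":
    # Tests fail once, then pass after a single debug round.
    result = run_task(lambda attempt: attempt >= 1)
    print(" -> ".join(result.stages))
```

Capping the debug rounds keeps the loop from running forever when the tests never pass; in that case the sketch simply stops without opening a pull request.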
[Animated counters, values not captured: Planning gain (%), End-to-end eval (%), Speedup (x)]
Real-World Verdict Still Pending
AI NEWS BLITZ
Cognition just brought Anthropic's Claude Sonnet 5 to its AI software engineer Devin.