πŸ€– Eval Model (피평가 λͺ¨λΈ)
βš–οΈ Judge Model (μ±„μ μž)
πŸ“Š Dataset
Grade
Difficulty
1 600
1 6
1 10

FINAL Bench v3.1 · 🧬 Darwin-gpt-ernie-20b (Friendli) + SWE-bench_Verified
AGI Verification Β· Non-AGI vs Proto-AGI Β· ζœ¨η«εœŸι‡‘ζ°΄
Apache 2.0 Β· Ginigen AI β€” Choi Sunyoung