Position: Coding Benchmarks Are Misaligned with Agentic Software Engineering

(arxiv.org)

2 points | by wek 9 hours ago

No comments yet.