Show HN: ACE – A dynamic benchmark measuring the cost to break AI agents

(fabraix.com)

9 points | by zachdotai 4 days ago

4 comments