RedlineBench: how models handle a multi-turn, real world contract negotiation

(intelligence.crosby.ai)

3 points | by zachkrall 7 hours ago ago

2 comments