Vending-Bench: A Benchmark for Long-Term Coherence of Autonomous Agents