minus-squareCorngood@lemmy.mltoTechnology@lemmy.ml•AI can't even run a vending machine -- Vending-Bench: A Benchmark for Long-Term Coherence of Autonomous Agentslinkfedilinkarrow-up17·20 days agoIt’s well worth reading the entire paper. It’s one of the funniest things I’ve ever read. linkfedilink
It’s well worth reading the entire paper. It’s one of the funniest things I’ve ever read.