AI Agents Struggle With Strategic Reasoning in New Benchmark | aib vote