New Benchmark Proves AI Agents Master Complex Software Engineering | aib vote