Multimodal AI Struggles to Read Text as Pixels | aib vote