Zero-Shot Object Manipulation with Semantic 3D Image Augmentation for Perceiver-Actor