06/11/2023 / Last updated : 07/11/2023 shimura Associative Transformer is a Sparse Representation Learner