Associative transformer is a sparse representation learner