Improving Length Generalization in Algorithmic Tasks with Looped Transformers: A Study on n-RASP-L Problems
Recent research highlights that Transformers, though successful in tasks like arithmetic and algorithms, need help with length generalization, where models handle inputs of unseen lengths. This is crucial for algorithmic…