Stochastic Prompt Construction for Effective In-Context Reinforcement Learning in Large Language Models
Large language models (LLMs) have demonstrated strong in-context learning (ICL), a form of supervised learning in which the model learns from examples placed in its prompt, without any parameter updates. However, researchers are now exploring whether this ability…