PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training Paper โข 2309.10400 โข Published Sep 19, 2023 โข 26
CausalLM is not optimal for in-context learning Paper โข 2308.06912 โข Published Aug 14, 2023 โข 18