initial commit, for 12-layer gpt2-like transformer (checkpoint 27500) bcb66e3 Kristijan commited on Mar 30, 2023