Commit History

feat: release action
4381fec

boris commited on

feat: use default from_pretrained function
4ac66e4

boris commited on

style: reformat
dca3ada

boris commited on

feat(train): use new HF _do_init api
6b84155

boris commited on

fix: model compatible with do_init
f3a8cbb

boris commited on

fix: apply learning rate offset only when requested
c6263f3

boris commited on

Merge pull request #171 from borisdayma/reduced-requests
209ade7
unverified

Pedro Cuenca commited on

feat(train): arg to offset lr for resumed runs
89b4c45

boris commited on

Half the number of requests.
d6e9f87

Pedro Cuenca commited on

feat(demo): use vaild model
23c1ef6

boris commited on

feat(mega): switch to gelu
cdefdd0

boris commited on

feat: layernorm > rmsnorm in long runs
0f2cf98

boris commited on

fix: use correctly cache during inference + allow unscan (#170)
42968cf
unverified

boris commited on

fix: allow non-scanned models (#168)
8ae9176
unverified

boris commited on

feat(demo): do not log search
3500e67

boris commited on

feat: vmap optimizer (#166)
b993d27
unverified

boris commited on

feat(demo): use fixed commit
2f1e5d9

boris commited on

doc: reference dalle playground
648305a

boris commited on

feat: scan layers + gradient checkpointing (#161)
07a6f9a
unverified

boris commited on

Merge pull request #162 from borisdayma/demo-improvements
0199604
unverified

Pedro Cuenca commited on

Merge branch 'main' of https://github.com/borisdayma/dalle-mini into main
bcd360f

boris commited on

Style: run isort.
58b9afd

Pedro Cuenca commited on

Style: run black.
2d6dbdd

Pedro Cuenca commited on

Display info about model and run, below predictions.
13caa60

Pedro Cuenca commited on

Update year: 2021-2022
f58c732

Pedro Cuenca commited on

Log dates
687a914

Pedro Cuenca commited on

Move client api to backend.py
63679e9

Pedro Cuenca commited on

feat: better multi-node support (#158)
728a3c3
unverified

boris commited on

doc: link to new demo
49be45e

boris commited on

feat(demo): high load
d68264e

boris commited on

feat(demo): update space
20a3626

boris commited on

feat(text): support emojis (#154)
7ef7bd9
unverified

boris commited on

feat: update shampoo
9ecdd3f

boris commited on

fix: smelu
7f2f8ed

boris commited on

fix: sinkformer
2c583b3

boris commited on

fix: support smelu
a2dcee4

boris commited on

fix: einops is required
179282e

boris commited on

feat: allow relative position (#156)
769d20a
unverified

boris commited on

feat: sinkhorn in lse mode (#155)
00d4661
unverified

boris commited on

feat(demo): update model
b9a1a7d

boris commited on

fix: sinkformer gradient
eed4896

boris commited on

feat(model): allow bias (#152)
361a994
unverified

boris commited on

feat(train): google-cloud-storage is optional
02b2308

boris commited on

feat(train): rename logged config
955dc20

boris commited on

feat: add sinkformer + custom final ln + pre-ln (#151)
f139b0b
unverified

boris commited on

feat: placeholders for more config
69bcbeb

boris commited on

feat: add mini_glu config
a7e5050

boris commited on

feat: force final ln in encoder
32f4ba5

boris commited on

feat: allow more configurations
5bd4c20

boris commited on

fix: DeepNet doesn't scale weights of embedding/output layers (#150)
503d6b4
unverified

Shuming Ma Shuming Ma commited on