Commit History

Update decode method in tokenizer
d8a6cfc

zxdu20 commited on

Add support for parallel quantization on Mac
f6b88da

zxdu20 commited on

Remove assert in load_cpu_kernel
63d66b0

zxdu20 commited on

Sync with chatglm-6b
f55a108

zxdu20 commited on

Remove pytorch_model.bin.index.json
e02ba89

zxdu20 commited on

Update slack link
6498797

zxdu20 commited on

Add pytorch_model.bin.index.json
1e40d96

zxdu20 commited on

Add assertion when loading cpu and cuda kernel fails
630d0ef

songxxzp commited on

Add assertion when loading cpu and cuda kernel fails
bcc35f0

songxxzp commited on

Merge branch 'dev'
fe0674f

songxxzp commited on

Update CPU kernel loading method
c7d8998

songxxzp commited on

Fix gmask
3485994

zxdu20 commited on

Add empty_init option
9333486

zxdu20 commited on

Update README.md
6466cdc

zxdu20 commited on

Fix eos token in tokenizer
9163f7e

zxdu20 commited on

Update dependency
649466f

zxdu20 commited on

Fix attention score on mps
41fda88

zxdu20 commited on

Fix logit processor
a7272d4

zxdu20 commited on

Merge branch 'slim' of https://huggingface.co/THUDM/chatglm-6b-int4 into slim
96de7a2

zxdu20 commited on

Fix embedding quantization
5fc46d2

zxdu20 commited on

Upload pytorch_model.bin
7edbdfe

zxdu20 commited on

Slim embedding
bfb1a8f

zxdu20 commited on

Fix bugs when compiling cpu kernels
68873da

DrSong commited on

Drop icetk dependency
1f34060

zxdu20 commited on

Fix position ids expand
19685a5

zxdu20 commited on

Synchronize with chatglm 6b repo
7aaf3fe

DrSong commited on

Fix parallel cpu kernel
7458231

DrSong commited on

Fix bugs in quantization when loading kernels
dac03c3

DrSong commited on

Fix Chinese punctuation
debaf00

zxdu20 commited on

Update README.md
3ba9437

Sengxian commited on

Update README.md
0d0e806

Sengxian commited on

Update README.md
7ad727c

Sengxian commited on

init commmit
a93efa9

Sengxian commited on

initial commit
62a9758

zxdu20 commited on