Commit History

Upload nvidia_fp8_unet/params.safetensors with huggingface_hub
d9e66a0
verified

GiusFra commited on

Upload nvidia_fp8_unet/quant_params.json with huggingface_hub
730c8f5
verified

GiusFra commited on

Upload nvidia_fp8_unet/results_mlperf.json with huggingface_hub
f4037ed
verified

GiusFra commited on

Upload nvidia_fp8_unet/args.json with huggingface_hub
4e70299
verified

GiusFra commited on

Create config.json
b0f9624
verified

GiusFra commited on

Create config.json
b7db598
verified

GiusFra commited on

Create config.json
864a3a2
verified

GiusFra commited on

Create config.json
25e566b
verified

GiusFra commited on

Updated sdpa fp8 models
fa0155f

nickfraser commited on

Added models that are fully quantized with FP8.
cfd94d7

nickfraser commited on

Added SDPA math model & test
3fea540

nickfraser commited on

Fix names
740d40f

GiusFra commited on

MI250 QKV fused and all linear layers sym, FP8 attention, guidance scale 8, calib steps 8
b8d5ec9
verified

GiusFra commited on

Fix names
08a2fb9

GiusFra commited on

MI250 QKV fused and all linear layers sym, FP8 attention, guidance scale 8, calib steps 10
7c9637e
verified

GiusFra commited on

MI250 QKV fused and all layers sym, FP8 attention, guidance scale 8, calib steps 10
99f92dc
verified

GiusFra commited on

MI250 QKV fused and all layers sym, FP8 attention, guidance scale 8, calib steps 10
4d701a1
verified

GiusFra commited on

updated quant_params with QKV fusion
6751dca
verified

GiusFra commited on

update int8+fp8 safetensors with fused QKV
7d9a30f
verified

GiusFra commited on

update int8+fp8 safetensors
16771c1
verified

GiusFra commited on

updated quant_params for FNUZ
f4b2bb6
verified

GiusFra commited on

add missing smoothquant_mul
6b39796
verified

GiusFra commited on

update int8+fp8 safetensors
9886f46
verified

GiusFra commited on

update int8+fp8 safetensors
fa8dc75
verified

GiusFra commited on

update int8+fp8 quant_param
7a5baa7
verified

GiusFra commited on

Upload sdxl.safetensors with huggingface_hub
7c9bbe7
verified

bowenbaoamd commited on

Upload sdxl.json with huggingface_hub
8b25dab
verified

bowenbaoamd commited on