nvidia 's Collections

Model Optimizer

A collection of generative models quantized and optimized with TensorRT Model Optimizer. Support FP8, INT4 precisions.

This collection has no items.