Megatron-GPT2-Classification
Description
The megatron-gpt2-classification
model is a language model trained using Megatron and Accelerate frameworks. It has been fine-tuned for classification tasks and benefits from distributed training across 4 GPUs (RTX 4070).
Key Features
- Trained with Megatron and Accelerate.
- Distributed training on 4 GPUs (RTX 4070).
- Fine-tuned for classification tasks.
- Downloads last month
- 7
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.