sashakunitsyn
commited on
Commit
•
67c80a5
1
Parent(s):
8f90d90
Update README.md
Browse files
README.md
CHANGED
@@ -12,7 +12,7 @@ base_model: Salesforce/blip2-opt-2.7b
|
|
12 |
---
|
13 |
# VLRM
|
14 |
This repository contains the weights of BLIP-2 OPT-2.7B model fine-tuned by reinforcement learning method introduced in the paper [VLRM: Vision-Language Models act as
|
15 |
-
Reward Models for Image Captioning
|
16 |
|
17 |
The RL-tuned model is able to generate longer and more comprehensive descriptions with zero computational overhead compared to the original model.
|
18 |
|
|
|
12 |
---
|
13 |
# VLRM
|
14 |
This repository contains the weights of BLIP-2 OPT-2.7B model fine-tuned by reinforcement learning method introduced in the paper [VLRM: Vision-Language Models act as
|
15 |
+
Reward Models for Image Captioning](https://arxiv.org/abs/2404.01911).
|
16 |
|
17 |
The RL-tuned model is able to generate longer and more comprehensive descriptions with zero computational overhead compared to the original model.
|
18 |
|