update readme
Browse files
README.md
CHANGED
@@ -16,15 +16,15 @@ These models have been trained at scale on high-quality image caption datasets a
|
|
16 |
* The **instruct** fine-tuned model, `xgen-mm-phi3-mini-instruct-r-v1`, achieves state-of-the-art performance among open-source and closed-source VLMs under 5b parameters.
|
17 |
* `xgen-mm-phi3-mini-instruct-r-v1` supports flexible high-resolution image encoding with efficient visual token sampling.
|
18 |
|
19 |
-
|
20 |
|
21 |
|
22 |
# Datasets
|
23 |
|
24 |
| Dataset Type| Dataset(s) Used |
|
25 |
|--------|------------------------------------------|
|
26 |
-
| Pretrain | caption data:
|
27 |
-
| Instruction Tuning |
|
28 |
|
29 |
# Results
|
30 |
|
|
|
16 |
* The **instruct** fine-tuned model, `xgen-mm-phi3-mini-instruct-r-v1`, achieves state-of-the-art performance among open-source and closed-source VLMs under 5b parameters.
|
17 |
* `xgen-mm-phi3-mini-instruct-r-v1` supports flexible high-resolution image encoding with efficient visual token sampling.
|
18 |
|
19 |
+
The model is for research purposes, more technical details will come with a technical report soon.
|
20 |
|
21 |
|
22 |
# Datasets
|
23 |
|
24 |
| Dataset Type| Dataset(s) Used |
|
25 |
|--------|------------------------------------------|
|
26 |
+
| Pretrain | caption data: high-quality image caption datasets and interleaved datasets |
|
27 |
+
| Instruction Tuning | visual instruction following and caption datasets, a mixture of academic VQA data including OCR/Document/Chart-focused tasks, publicly available text-only instruction data |
|
28 |
|
29 |
# Results
|
30 |
|