Spaces:

clip-italian
/

clip-italian-demo

Running

vinid commited on Jul 17, 2021

Commit

80200b5

•

1 Parent(s): 7369efb

adding readme.md

Files changed (2) hide show

app.py CHANGED Viewed

@@ -1,6 +1,7 @@
 import streamlit as st
 import os
 import torch
 from transformers import AutoTokenizer
 from jax import numpy as jnp
 import json
@@ -89,3 +90,10 @@ if query:
         )
     st.image(image_paths)

 import streamlit as st
 import os
 import torch
+from pathlib import Path
 from transformers import AutoTokenizer
 from jax import numpy as jnp
 import json
         )
     st.image(image_paths)
+def read_markdown_file(markdown_file):
+    return Path(markdown_file).read_text()
+intro_markdown = read_markdown_file("readme.md")
+st.markdown(intro_markdown, unsafe_allow_html=True)

readme.md ADDED Viewed

+# Italian CLIP
+....
+# Novel Contributions
+The original CLIP model was trained on 400millions text-image pairs; this amount of data is not available for Italian and the only datasets for captioning in the literature are MSCOCO-IT (translated version of MSCOCO) and WIT. To get competitive results we follewed three directions: 1) more data 2) better augmentation and 3) better training.
+## More Data
+## Better Augmentations
+## Better Training
+different optimizer and backbone freezing
+# Scientific Validity
+To better understand how well our clip-italian model works we run an experimental evaluation. Since this is the first clip-based model in Italian, we used the multilingual CLIP model as a comparison baseline.
+We selected two different tasks:
++ image-retrieval
++ zero-shot classification
+## Image Retrieval
+## Zero-shot classification
+# Broader Outlook