Spaces: flax-community/image-captioning
ydshieh committed on Jul 18, 2021

Commit 5c4c715 • 1 Parent(s): c0cae7b

improve desc

Files changed (1)

app.py CHANGED

@@ -11,6 +11,7 @@ st.sidebar.markdown(
     """
     An image caption model [ViT-GPT2](https://huggingface.co/flax-community/vit-gpt2/tree/main) by combining the ViT model and a French GPT2 model.
     [Part of the [Huggingface JAX/Flax event](https://discuss.huggingface.co/t/open-to-the-community-community-week-using-jax-flax-for-nlp-cv/).]\n
+    The GPT2 model source code is modified so it can accept an encoder's output.
     The pretained weights of both models are loaded, with a set of randomly initialized cross-attention weigths.
     The model is trained on 65000 images from the COCO dataset for about 1500 steps, with the original english cpationis are translated to french for training purpose.
     """