Update README.md
README.md CHANGED

@@ -8,6 +8,7 @@ tags:
pipeline_tag: text-to-video
---

+
# Model Card
## Details
This model underwent training using CLIP4Clip, a video retrieval method based on the CLIP framework, as described in the paper [CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval](https://arxiv.org/abs/2104.08860).
@@ -17,6 +18,7 @@ The training process involved 150,000 videos obtained from the [WebVid Dataset](

To adapt the CLIP model obtained during training to the implementation of [clip-vit-base-patch32](https://huggingface.co/openai/clip-vit-base-patch32), we have made modifications to the weights.

+
### Use with Transformers
### Extracting Text Embeddings:

@@ -41,7 +43,9 @@ print("sequence_output: ", sequence_output)
```

### Extracting Video Embeddings:
-
+
+An additional [notebook](https://huggingface.co/Diangle/clip4clip-webvid/blob/main/Notebooks/GSI_VideoRetrieval_EmbedVideos.ipynb) is available that provides instructions on how to perform video embedding.
+

## Model Intended Use

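The Python snippet that sits between the second and third hunks (the block ending in the `print("sequence_output: ", sequence_output)` line referenced by the last hunk header) is not shown in this diff. As a rough sketch of what extracting a text embedding from the adapted checkpoint can look like with the standard Transformers CLIP classes — the query string, the choice of `CLIPTextModelWithProjection`, and the final normalization step are illustrative assumptions, not the README's exact code:

```python
import torch
from transformers import CLIPTokenizer, CLIPTextModelWithProjection

# Adapted CLIP4Clip weights, published in the clip-vit-base-patch32 layout.
tokenizer = CLIPTokenizer.from_pretrained("Diangle/clip4clip-webvid")
model = CLIPTextModelWithProjection.from_pretrained("Diangle/clip4clip-webvid").eval()

# Hypothetical search query; any natural-language sentence works.
inputs = tokenizer(text="a basketball player performing a slam dunk", return_tensors="pt")

with torch.no_grad():
    outputs = model(input_ids=inputs["input_ids"], attention_mask=inputs["attention_mask"])

# L2-normalize the projected text embedding so it can be scored against
# video embeddings with a plain dot product (cosine similarity).
sequence_output = outputs.text_embeds / outputs.text_embeds.norm(dim=-1, keepdim=True)
print("sequence_output: ", sequence_output)
```

The normalized vector can then be compared against video embeddings by dot product, which is how CLIP4Clip scores text–video pairs at retrieval time.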
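The change itself only adds the pointer to the embedding notebook above. For orientation, below is a hedged sketch of one way to build a CLIP4Clip-style video embedding by mean-pooling per-frame CLIP image embeddings. It assumes the checkpoint's vision tower loads through `CLIPVisionModelWithProjection`, borrows the `openai/clip-vit-base-patch32` preprocessor, and uses a placeholder video path and frame count; the linked notebook remains the authoritative recipe.

```python
import cv2
import numpy as np
import torch
from PIL import Image
from transformers import CLIPImageProcessor, CLIPVisionModelWithProjection

video_path = "example.mp4"  # placeholder: path to a local video file
num_frames = 8              # assumption: a small, evenly spaced frame sample

# Preprocessor taken from the base CLIP checkpoint (assumption: the adapted
# weights follow the clip-vit-base-patch32 preprocessing).
processor = CLIPImageProcessor.from_pretrained("openai/clip-vit-base-patch32")
model = CLIPVisionModelWithProjection.from_pretrained("Diangle/clip4clip-webvid").eval()

# Read the video and keep `num_frames` evenly spaced RGB frames.
capture = cv2.VideoCapture(video_path)
total = max(int(capture.get(cv2.CAP_PROP_FRAME_COUNT)), 1)
frames = []
for idx in np.linspace(0, total - 1, num_frames, dtype=int):
    capture.set(cv2.CAP_PROP_POS_FRAMES, int(idx))
    ok, frame = capture.read()
    if ok:
        frames.append(Image.fromarray(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)))
capture.release()

# Embed each frame, normalize, then mean-pool over frames and re-normalize.
inputs = processor(images=frames, return_tensors="pt")
with torch.no_grad():
    frame_embeds = model(pixel_values=inputs["pixel_values"]).image_embeds
frame_embeds = frame_embeds / frame_embeds.norm(dim=-1, keepdim=True)
video_embed = frame_embeds.mean(dim=0)
video_embed = video_embed / video_embed.norm()
print("video embedding shape:", tuple(video_embed.shape))
```

Evenly spaced frame sampling followed by mean pooling corresponds to the parameter-free "meanP" similarity variant studied in the CLIP4Clip paper, which is why no extra temporal module is needed here.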