Update paper "VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control"
#11, opened by HenryHZY
papers.csv CHANGED +1 -1
@@ -258,7 +258,7 @@ Prompt-Guided Image Captioning for VQA with GPT-3,"Hu, Yushi*; Hua, Hang; Yang,
 Grounded Image Text Matching with Mismatched Relation Reasoning,"Wu, Yu*; Wei, Yana; Wang, Haozhe; Liu, Yongfei; Yang, Sibei; He, Xuming",poster,2308.01236,https://arxiv.org/abs/2308.01236,,https://huggingface.co/papers/2308.01236,,,,6,0
 GePSAn: Generative Procedure Step Anticipation in Cooking Videos,"Abdelsalam, Mohamed A*; Rangrej, Samrudhdhi B.; Hadji, Isma; DVORNIK, NIKITA; Derpanis, Konstantinos G; Fazly, Afsaneh",poster,,,,,,,,,
 LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models,"Song, Chan Hee*; Wu, Jiaman; Washington, Clayton B; Sadler, Brian M; Chao, Wei-Lun; Su, Yu",poster,,,,,,,,,
-VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control,"Hu, Zi-Yuan*; Li, Yanyang; Lyu, Michael R; Wang, Liwei",poster
+VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control,"Hu, Zi-Yuan*; Li, Yanyang; Lyu, Michael R; Wang, Liwei",poster,2308.09804,https://arxiv.org/abs/2308.09804,https://github.com/HenryHZY/VL-PET,https://huggingface.co/papers/2308.09804,,,,4,1
 With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning,"Barraco, Manuele; Sarto, Sara; Cornia, Marcella*; Baraldi, Lorenzo; Cucchiara, Rita",poster,2308.12383,https://arxiv.org/abs/2308.12383,https://github.com/aimagelab/PMA-Net,https://huggingface.co/papers/2308.12383,,,,5,0
 Improving Zero-Shot Generalization for CLIP with Synthesized Prompts,"Wang, Zhengbo*; Liang, Jian; He, Ran; Xu, Nan; Wang, Zilei; Tan, Tieniu",poster,2307.07397,https://arxiv.org/abs/2307.07397,https://github.com/mrflogs/SHIP,https://huggingface.co/papers/2307.07397,,,,6,0
 DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models,"Cho, Jaemin*; Zala, Abhaysinh S; Bansal, Mohit",poster,,,,,,,,,