Jiaqi Wang PRO

myownskyW7

Recent Activity

liked a model 16 days ago

genmo/mochi-1-preview

upvoted a paper 29 days ago

commented a paper 29 days ago

Organizations

myownskyW7's activity

commented a paper 29 days ago

MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models

Paper • 2410.17637 • Published 30 days ago • 34

commented a paper 30 days ago

PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction

Paper • 2410.17247 • Published about 1 month ago • 43

commented 3 papers about 1 month ago

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Paper • 2410.16268 • Published Oct 21 • 65

BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way

Paper • 2410.06241 • Published Oct 8 • 10

Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate

Paper • 2410.07167 • Published Oct 9 • 37

New activity in FiVA/FiVA 3 months ago

Delete data/full_data/{i:05d}.zip

#3 opened 3 months ago by myownskyW7

Delete data/full_data/{i:05d}.zip

#2 opened 3 months ago by myownskyW7

New activity in internlm/internlm-xcomposer2d5-7b-4bit 4 months ago

Update modeling_internlm_xcomposer2.py

#4 opened 4 months ago by yuhangzang

New activity in internlm/internlm-xcomposer2d5-7b 4 months ago

Update modeling_internlm_xcomposer2.py

#14 opened 4 months ago by yuhangzang

Update modeling_internlm_xcomposer2.py

#13 opened 4 months ago by yuhangzang

xcomposer2d5 3rd party sft support

#12 opened 4 months ago by tastelikefeet

commented a paper 5 months ago

Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images

Paper • 2407.06191 • Published Jul 8 • 10

New activity in internlm/internlm-xcomposer2d5-7b 5 months ago

TypeError: 'NoneType' object is not callable

#5 opened 5 months ago by catworld1212

commented 5 papers 5 months ago

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Paper • 2407.03320 • Published Jul 3 • 92

MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs

Paper • 2406.11833 • Published Jun 17 • 61

New activity in google/gemma-7b-it 9 months ago

Bug about number generation?

#30 opened 9 months ago by myownskyW7

New activity in internlm/internlm-xcomposer2-7b 10 months ago

Add paper url

#1 opened 10 months ago by osanseviero