Symbol-LLM

AI & ML interests

Natural Language Processing, Large Language Models, Neuro-Symbolic

Symbol-LLM's activity

Reacted to their post with 🚀🔥 7 days ago
🥳 Thrilled to introduce our recent efforts on bootstrapping VLMs for multi-modal chain-of-thought reasoning!

📕 Title: Vision-Language Models Can Self-Improve Reasoning via Reflection

🔗 Link: Vision-Language Models Can Self-Improve Reasoning via Reflection (2411.00855)

😇 Takeaways:

- We found that VLMs can self-improve their reasoning performance through a reflection mechanism and, importantly, that this approach can scale with test-time compute.

- Evaluations on comprehensive and diverse vision-language reasoning tasks are included!
posted an update 7 days ago
updated a Space 10 days ago
Reacted to maxiw's post with 🚀👍 15 days ago
Exciting to see open-source models thriving in the computer agent space! 🔥
I just built a demo for OS-ATLAS: A Foundation Action Model For Generalist GUI Agents. Check it out here: maxiw/OS-ATLAS

This demo takes a screenshot and an instruction as input and predicts bounding boxes.