Edit model card
YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models

Model details:

Chain-of-Spot encourages Large Vision-Language Models to identify the region of interest (ROI) in the image condition on the question and reasoning through an interactive manner, thereby improving the ability of visual understanding.

Where to send questions or comments about the model: https://github.com/dongyh20/Chain-of-Spot

Paper or resources for more information: https://sites.google.com/view/chain-of-spot/

Downloads last month
7
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.