Gurumurthi V Ramanan's picture

112 391

Gurumurthi V Ramanan

GVR

·

https://surasys.co

AI & ML interests

ML

Recent Activity

liked a model about 19 hours ago

alibaba-damo/mgp-str-base

upvoted a collection 2 days ago

upvoted an article 2 days ago

Organizations

GVR's activity

upvoted a collection 2 days ago

OpenScholar_V1

The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 7 items • Updated 3 days ago • 13

upvoted 2 articles 2 days ago

Article

Halo: Open Source Health Tracking with Wearables

By

•

2 days ago

• 62

Article

ColFlor: Towards BERT-Size Vision-Language Document Retrieval Models

By

•

Oct 18

• 16

upvoted a collection 8 days ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated 3 days ago • 224

upvoted an article 18 days ago

Article

Decoding Strategies in Large Language Models

By

•

23 days ago

• 38

upvoted a collection 18 days ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 10 items • Updated about 7 hours ago • 172

upvoted a collection 19 days ago

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 8 items • Updated 15 days ago • 95

upvoted a paper 24 days ago

Unbounded: A Generative Infinite Game of Character Life Simulation

Paper • 2410.18975 • Published 28 days ago • 34

upvoted a paper 28 days ago

OmniParser for Pure Vision Based GUI Agent

Paper • 2408.00203 • Published Aug 1 • 23

upvoted an article about 1 month ago

Article

OCR Processing and Text in Image Analysis with Florence-2-base and Qwen2-VL-2B

By

•

Oct 18

• 13

upvoted 4 collections about 1 month ago

LayerSkip

Models continually pretrained using LayerSkip - https://arxiv.org/abs/2404.16710 • 8 items • Updated about 7 hours ago • 43

DocLayout-YOLO

Dataset and model for DocLayout-YOLO • 9 items • Updated about 1 month ago • 12

Gemma-APS Release

Gemma models for text-to-propositions segmentation. The models are distilled from fine-tuned Gemini Pro model applied to multi-domain synthetic data. • 3 items • Updated Oct 15 • 19

LLM Reasoning Papers

Papers to improve reasoning capabilities of LLMs • 15 items • Updated 19 days ago • 76

upvoted a paper about 1 month ago

Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations

Paper • 2410.02762 • Published Oct 3 • 9

upvoted an article about 1 month ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

By

•

Oct 14

• 55

upvoted 2 papers about 1 month ago

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17 • 72

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1 • 144

upvoted 2 collections about 2 months ago

Zeroshot Classifiers

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 112

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 7 days ago • 271