Rohit Voleti's picture

Rohit Voleti

rvoleti89

·

AI & ML interests

NLP, Speech

Recent Activity

liked a model 3 days ago

togethercomputer/m2-bert-80M-32k-retrieval

liked a model about 2 months ago

allenai/Molmo-72B-0924

View all activity

Organizations

None yet

rvoleti89's activity

upvoted a collection 2 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Oct 24 • 512

upvoted a paper 4 months ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8 • 155

upvoted a paper 5 months ago

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Paper • 2407.03320 • Published Jul 3 • 92

upvoted a collection 5 months ago

Long Context

44 items • Updated 1 day ago • 3

upvoted a paper 5 months ago

Sparser is Faster and Less is More: Efficient Sparse Attention for Long-Range Transformers

Paper • 2406.16747 • Published Jun 24 • 18

upvoted 4 papers 6 months ago

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

Paper • 2406.10149 • Published Jun 14 • 48

Mixture-of-Agents Enhances Large Language Model Capabilities

Paper • 2406.04692 • Published Jun 7 • 55

Towards a Personal Health Large Language Model

Paper • 2406.06474 • Published Jun 10 • 18

MedFuzz: Exploring the Robustness of Large Language Models in Medical Question Answering

Paper • 2406.06573 • Published Jun 3 • 9

upvoted a collection 8 months ago

WizardLM

0 items • Updated Jul 11 • 103

upvoted a collection 9 months ago

Cybertron 7B [Uniform Neural Alignment & MGS]

Another rockstar model, was born as a leader. Tamed with UNA, MGS, DPO, SFT. • 6 items • Updated 8 days ago • 6