Shivaen Ramshetty's picture

5 64

Shivaen Ramshetty

shivr

·

sramshetty

AI & ML interests

NLP, CV, Multimodal

Organizations

shivr's activity

commented 2 papers 8 months ago

The Unreasonable Ineffectiveness of the Deeper Layers

Paper • 2403.17887 • Published Mar 26 • 78 •

The Unreasonable Ineffectiveness of the Deeper Layers

Paper • 2403.17887 • Published Mar 26 • 78 •

commented 2 papers 9 months ago

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6 • 62 •

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6 • 62 •

New activity in facebook/mms-tts-eng 11 months ago

Transpose output for scipy writing

#7 opened 11 months ago by

New activity in shivr/gpt2-xl_local-narratives-reduced-overlap_lora about 1 year ago

Librarian Bot: Add base_model information to model

#1 opened about 1 year ago by

New activity in shivr/gpt2-xl_grit_and_local-narratives_lora about 1 year ago

Librarian Bot: Add base_model information to model

#1 opened about 1 year ago by