barun's picture

3

barun

adwit

AdwitBarun

AI & ML interests

machine learning

Organizations

None yet

adwit's activity

upvoted 3 papers 5 months ago

Breaking Boundaries: Investigating the Effects of Model Editing on Cross-linguistic Performance

Paper • 2406.11139 • Published Jun 17 • 12

Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations

Paper • 2406.11801 • Published Jun 17 • 15

SafeInfer: Context Adaptive Decoding Time Safety Alignment for Large Language Models

Paper • 2406.12274 • Published Jun 18 • 14