Ajith V Prabhakar

ajithprabhakar

https://www.ajithp.com

AI & ML interests

NLP, Responsible AI, Generative AI

Organizations

Posts 2

Post

Hi All,
My latest blog post is a comprehensive guide to LLM benchmarking. It covers:
➟ 20+ key benchmarks, from MMLU to TruthfulQA
➟ How each benchmark assesses different LLM capabilities
➟ Why benchmarking matters for real-world AI applications
➟ Future trends in AI evaluation
Read the blog here: https://wp.me/p7Qix-wO

Please let me know your thoughts, suggestions, and comments.

Post

Can AI cheat or lie?

In this blog post, we explore research by experts from MIT, Australian Catholic University, and the Center for AI Safety to better understand the nature of AI deception, its various forms, and the risks it poses. We examine real-world examples and the underlying mechanisms that enable AI systems to deceive.

Learn more at: https://ajithp.com/2024/05/12/ai-deception-risks-real-world-examples-and-proactive-solutions/

Collections 1

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

OneLLM: One Framework to Align All Modalities with Language

Generative Multimodal Models are In-Context Learners

The LLM Surgeon
