Sveriges mest populära poddar

Thinking Machines: AI & Philosophy

On Adversarial Training & Robustness with Bhavna Gopal

44 min • 8 maj 2024

"Understanding what's going on in a model is important to fine-tune it for specific tasks and to build trust."

Bhavna Gopal is a PhD candidate at Duke, research intern at Slingshot with experience at Apple, Amazon and Vellum.

We discuss

  • How adversarial robustness research impacts the field of AI explainability.
  • How do you evaluate a model's ability to generalize?
  • What adversarial attacks should we be concerned about with LLMs?
00:00 -00:00