-
EP 14: Past Tense Pitfalls: The Curious Case of Refusal Training in AI Language Models
- 2024/07/17
- 再生時間: 10 分
- ポッドキャスト
-
サマリー
あらすじ・解説
In this episode of "You Are A Helpful (Research) Assistant," delve into the AI-generated, human-curated exploration of refusal training vulnerabilities in language models. Uncover the past tense attack's impact on model behavior in this insightful discussion.