enough high-quality interactions can train a weaker “student” model to imitate useful behaviors, especially in narrow domains such as vulnerability discovery.
High-Quality Interactions Train Student Models