Enter your email and password to get started.
Ask any question about your agent, such as "where is it reward hacking?" or "why did it fail?"
Docent first converts your question into a precise behavior rubric by reading through your data, asking you about ambiguities, and suggesting concrete rewrites based on your feedback.