<aside> 🪤
Add any reading you'd recommend:
- other evaluation framework papers
- relevant survey techniques
- step-by-step or rules-based reasoning
- etc.
</aside>
Background Reading
Evaluation Framework (Project)
Human Moral Learning
Meeting Notes
https://arxiv.org/abs/2211.09110
- HELM (Holistic Evaluation of Language Models): a state-of-the-art LLM evaluation benchmark; ideally we reuse its methodology.