Robustly improving LLM fairness in realistic settings via interpretability

arxiv.org

1 point by like_any_other 10 hours ago