Current AI alignment approaches capture what humans prefer but discard why they prefer it. Iron Sun Works develops methodology that preserves the reasoning — and uses it to build richer signal for model calibration than preference ranking alone can provide.
A structured methodology for capturing how human cognition diverges from model reasoning.
An AI-assisted executive function support system grounded in neuroscience.
A self-selecting recruitment mechanism for the humans the research requires.