Utilitarian Decision-Making in Models – Evaluation and Steering

2024-12-02 | Boris

I recently took part in a weekend hackathon with Sinem Erisken and Pandelis Mouratoglou. We investigated the behaviour of Llama, a popular LLM, under various conditions. Our paper won 3rd place! You can read our paper, browse the code, or watch this 6-minute summary: https://boristhebrave.com/permanent/24/12/Utilitarian%20Decision-Making%20in%20Models.mp4