Research by Harvard students on catastrophic risks from advanced AI.
Managing risks from advanced artificial intelligence is one of the most important problems of our time.¹ We are a community of technical and policy researchers at Harvard working to reduce these risks and to steer the trajectory of AI development for the better.
We run a semester-long introductory technical reading group on AI safety research, covering topics like neural network interpretability,¹ learning from human feedback,² goal misgeneralization in reinforcement learning agents,³ eliciting latent knowledge,⁴ and evaluating dangerous capabilities in models.⁵
We also run an introductory AI policy reading group, where we discuss core strategic issues posed by the development of transformative AI systems.
Join our mailing list →
Our members have worked with: [organization logos]
Note: Use of organizational logos does not imply current affiliation with or endorsement by these organizations.