I am Yaowen Ye (or Elwin, or 叶耀文), a first-year CS PhD student at UC Berkeley advised by Prof. Jacob Steinhardt and Prof. Stuart Russell. I work on understanding the limitations of human oversight of AI systems and developing scalable approaches to address them. Directions I am exploring include:
- Building scaling laws for evaluating scalable oversight proposals.
- Investigating risks in multi-agent systems where existing oversight methods may fail to scale.
- Studying the robustness of RLHF under unreliable human supervision.
Feel free to reach out if you're interested in my research! I also enjoy mentoring, so if you are an undergrad and think my advice might be helpful, I'd be happy to connect.
Before joining Berkeley, I did my undergrad at The University of Hong Kong. During my undergrad, I also worked on cognitive reasoning, intuitive physics, learning on graphs, and recommender systems. I was fortunate to be advised by Prof. Yixin Zhu at PKU Cognitive Reasoning Lab and Prof. Chao Huang at HKU Data Intelligence Lab.
Links: [X] [Email] [Give me feedback!]
