I am a Ph.D. candidate at Princeton University advised by Prof. Pramod Viswanath. I am also a part-time student researcher at ByteDance Seed mentored by Dr. Jiashuo Liu. Before Princeton, I completed my B.Eng. in Computer Science from Yao Class at Tsinghua, graduating summa cum laude and earning the prestigious Yao Award.
My research interests are primarily centered around data, evaluation, mechanism design, and blockchains, with the goal of promoting fairness and transparency in AI era and a strong focus on real-world impact. My research has contributed to high-profile startups:
Beyond research, I am a member of the Competitive Programming Hall of Fame. I served as the President of the Yao Class Students’ Congress during undergraduate; and I was once a contestant on the TV show “Super Brain” (江苏卫视“最强大脑第10季”).
I’m always open to research and industry collaborations. Feel free to contact and chat!
Ph.D. student (2023 - now)
Electrical and Computer Engineering, Princeton University
B.Eng. in Computer Science (2019 - 2023)
Yao Class, the Insititute for Interdisciplinary Information Sciences (IIIS), Tsinghua University
CAIA gets accepted and selected for oral presentation (top 10%) to AAAI26 AI4Finance!
Several papers that I contributed to are online now, and will be presented in different venues in the near future!
First-author papers:
Co-first-author papers:
Co-author papers:
Among those, PeerBench and LiveCodeBench Pro will be presented at NeurIPS 2025 Main Conference in San Diego on Dec 3; CAIA will be presented at ICAIF 2025 in Singapore (AI4F on Nov 15, AI-R2D2 on Nov 16); and OML Primitive will be presented at NeurIPS 2025 Lock-LLM on Dec 6. Stay tuned for them!
The AI benchmark paper LiveCodeBench Pro that I co-first-authored is online now!
Two AI benchmark papers that I co-authored are online now!
For most recent updates, please refer to my Google Scholar profile. Here are some selected publications.
OML: Open, Monetizable, Loyal AI (2024, NeurIPS 2025 Lock-LLM)
zkBridge (ACM CCS 2022)
LiveCodeBench Pro (NeurIPS 2025) - Comprehensive, hard, and contamination-free code generation benchmark
SPIN-Bench (COLM 2025) - Strategic planning & social reasoning for LLMs
PeerBench (NeurIPS 2025) - New paradigm for AI evaluation based on peer review
Humanity’s Last Exam (2025) - Ultimate test for AI capabilities