I am a Researcher at Microsoft Research. Prior to this, I completed internships at Mila and ByteDance AI Lab. I received my Ph.D. in Computer Science from Beijing Institute of Technology.
My research focuses on building AI that can continuously evolve.
Reward Reasoning Model
Jiaxin Guo, Zewen Chi, Li Dong*, Qingxiu Dong, Xun Wu, Shaohan Huang, Furu Wei
[model]
InfoXLM: An Information-Theoretic Framework for Cross-Lingual Language Model Pre-Training
Zewen Chi, Li Dong, Furu Wei, Nan Yang, Saksham Singhal, Wenhui Wang, Xia Song, Xian-Ling Mao, Heyan Huang, Ming Zhou
North American Chapter of the Association for Computational Linguistics (NAACL), Long paper, 2021.
[blog] [model] [code]
On the Representation Collapse of Sparse Mixture of Experts
Zewen Chi, Li Dong, Shaohan Huang, Damai Dai, Shuming Ma, Barun Patra, Saksham Singhal, Payal Bajaj, Xia Song, Xian-Ling Mao, Heyan Huang, Furu Wei
Neural Information Processing Systems (NeurIPS), 2022.
[code]
I enjoy building interesting things with AI. Here are some examples of what I've made.
Play the AI-generated 4096 game directly here!
More AI-generated projects are coming soon!
Feel free to reach out if you'd like to collaborate, discuss research, or just say hello.