I am currently an Assistant Professor of Institute for AI Industry Research (AIR). I received my PhD degree and Bachelor’s degree in Computer Science, both from Peking University. I had also worked as a Visiting PhD Student at Carnegie Mellon University. Before joining AIR, I was a Senior Researcher in the Systems Research Group at Microsoft Research Asia.
My research interest lies in the area of Edge AI Systems/Applications. I’ve published in premier venues of mobile computing, artificial intelligence, and software engineering, including a best paper nomination in UbiComp 2016, a best paper in IS-EUD 2017, and a best paper in GreenCom 2023. Some of the papers have become popular open-source tools (DroidBot, PrivacyStreams, Humanoid, etc.) in the area.
I’m recently enthusiastic about building EdgeLLM (large language models at the edge) and MobileAgent (intelligent personal agents on mobile devices) powered by EdgeLLM. Check out our position & survey paper.
Our team is recruiting PostDocs, research engineers, and interns. Please feel free to contact me if you are interested.
Recent News
- 📢 2024/06 – In collaboration with Huawei, we published the white paper on AI Phone (in Chinese).
- 📢 2024/06 – Successfully hosted the 1st workshop on edge foundation models (EdgeFM 2024) in Tokyo! See you next year!
- 📢 2024/05 – Paper “SwapMoE: Serving Off-the-shelf MoE-based Large Language Models with Tunable Memory Budget” accepted to ACL 2024 (main track). Congrats to Rui Kong and others!
- 📢 2024/03 – Our group won the First Prize in AIR Winter Camp 2024. Congrats to winning team members Shanhui Zhao, Hao Wen, Wenjie Du and Cheng Liang!
- 📢 2024/03 – One paper got accepted to ACM MobiSys 2024: “Empowering In-Browser Deep Learning Inference on Edge Through Just-In-Time Kernel Optimization”. Congrats to all collaborators!
- 📢 2024/01 – Our position & survey paper on mobile LLM agents “Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security” was released. Feeling excited to work in this new area! [arXiv], [GitHub], [机器之心报道]
- More…
Summary of My Research
- Improving Edge AI Efficiency
- Enhancing Edge AI Reliability
- Building Mobile LLM Agents
- UI-grounded mobile task automation (ICSE17, ASE19, MobiCom24a)
- On-device data management & analytics (UbiComp17a, BigData18)
- Lightweight LLM at the edge (ACL24, arxiv)