Welcome to Kaiyi Ji's Homepage


About Me

I am an assistant professor in the Department of Computer Science and Engineering at the University at Buffalo, The State University of New York. I received my Ph.D. degree from the Department of Electrical and Computer Engineering at The Ohio State University in December 2021, advised by Prof. Yingbin Liang. In 2022, I was a postdoctoral research fellow in the Electrical Engineering and Computer Science Department at the University of Michigan, Ann Arbor, and I was previously a visiting student research collaborator in the Department of Electrical Engineering at Princeton University. I obtained my B.S. degree from the University of Science and Technology of China in 2016. I received the CSE Junior Faculty Research Award (2023) and the NSF CAREER Award (2025).

Research

I am particularly interested in the intersection of optimization, machine learning, and deep learning, from both theoretical and algorithmic perspectives. My current focus is on:

  • Large-Scale Optimization: We design provably efficient optimization algorithms for machine (deep) learning tasks with strong empirical performance. Recent efforts focus on preconditioned methods for large language model (LLM) training, bilevel optimization for hierarchical tasks, and multi-objective optimization for tasks with multiple competing objectives.

  • Machine (Deep) Learning: We develop algorithms to tackle key challenges in modern machine learning problems. Our current interests lie in multi-task learning (such as conflict resolution, balancing, and scalability), lifelong/continual learning (such as data selection, forgetting, and memory efficiency), and meta-learning (such as task heterogeneity, convergence stability, and computational cost). We also pursue theoretical foundations to understand the mechanisms, generalization behavior, and computational complexity of these methods.

  • Foundation Models: We explore computationally efficient and theoretically grounded approaches for training and fine-tuning LLMs in resource-constrained scenarios. Current directions include continual prompt tuning, computationally efficient low-rank adaptation, and training-free methods for LLMs.

  • Applications: We seek to ground our algorithmic designs in real-world conditions by actively collaborating with experts in robotics, ad recommendation, and natural language processing.

Recent News!

  • [Service] 08/2025 I will be serving as an Area Chair for ICLR 2026.

  • [Library] 07/2025 Our open-source project DeepMTL2R, led by Amazon, is available on GitHub. DeepMTL2R is a deep learning framework for multi-task learning-to-rank tasks, integrating a wide range of existing preference-based and non-preference-based MTL algorithms.

    • DeepMTL2R: A Library for Deep Multi-task Learning to Rank. Chaosheng Dong, Peiyao Xiao, Kaiyi Ji, and Aleix Martinez.

  • [Talk] 07/2025 I gave a talk on tuning-free bilevel optimization at ICCOPT 2025, Los Angeles, CA. Thanks to Prof. Shiqian Ma and Prof. Tong Zhang for the invitation.

  • [Service] 07/2025 I will be serving as a reviewer for SODA 2026.

  • [Manuscript] 07/2025 Our new paper on Task-Agnostic Continual Prompt Tuning for LLM training is available online. We propose GRID, which integrates a task-aware decoding mechanism that improves backward transfer by leveraging representative inputs, automatic task identification, and constrained decoding. It reduces forgotten tasks by up to 80% without sacrificing forward transfer performance. Check out our code on GitHub.

  • [Award] 04/2025 Glad to receive the NSF CAREER Award [news]. Thanks to my students!

  • [Manuscript] 02/2025 Our new multi-task learning (MTL) paper is out! We present LDC-MTL, a balanced MTL approach with O(1) time and memory complexity, offering both high accuracy and efficiency. It reduces loss discrepancy, minimizes gradient conflicts, and outperforms weight-swept linear scalarization through dynamic weight adjustment. Check out our paper and code. Please star us if you find it helpful!

  • [Manuscript] 02/2025 A new AI4Science manuscript on Space-Aware Crystal Prediction is available online. We explore reciprocal space to encode long-range interactions and leverage a mixture-of-experts (MoE) approach for multi-property prediction.

  • [Service] 12/2024 A couple of upcoming services: serving as an NSF panelist, a TPC member for ACM MobiHoc 2025, and a TPC member for the IEEE Information Theory Workshop (ITW'25).

  • [Talk] 12/2024 I gave a talk on bilevel optimization for machine learning at the Computer Science Seminar Series at Johns Hopkins University. Thanks for the invitation!

  • [Talk] 10/2024 I gave a talk on federated optimization at the INFORMS Annual Meeting in Seattle.

  • [Software] 09/2024 Our FairGrad method for multi-task/objective learning is now supported by the open-source MTL library LibMTL. Feel free to explore it and see if it can benefit your research!

Recent Featured Works

Selected Publications