Nikki Lijing Kuang

PhD candidate in Computer Science

UC San Diego

About Me

I’m a PhD candidate in Computer Science at UC San Diego, where I am fortunate to be advised by Prof. Yian Ma, and closely collaborate with Prof. Tara Javidi, Prof. Sean Gao.

My primary research interests span reinforcement learning (RL), foundation models and Bayesian inference, with a focus on addressing fundamental challenges in sequential decision making under uncertainty. More recently, I am particularly interested in LLM alignment and reasoning, exploring how RL plays a role in these topics. The goal of my research is to design provably efficient and practical algorithms with performance guarantee, achieving both statistical and computational benefits.

During my PhD, I interned at IBM research, Amazon and Honda Research Institute, working on LLM for personlization, RL for ranking and recommendation systems, and robotics.

I have experience with fine-tuning LLMs and reward models, designing CoT prompting and reasoning frameworks, LLM decoding, and training R1-style reasoning LLMs using RL (e.g. PPO, GRPO).

Education

PhD in Computer Science, Expected 2025
University of California San Diego
MSc in Computer Science, 2020
University of California San Diego

Selected Publications

Towards Personalized Language Models via Inference-time Human Preference Optimization

(NeurIPS 2024 AFM) We introduce a novel approach to LLM alignment for personalized preference based on decode-time frameworks.

Nikki Lijing Kuang, Wei Sun, Scott McFaddin, Markus Ettl, Yi-An Ma

Towards Personalized Language Models via Inference-time Human Preference Optimization

Diff-BBO: Diffusion-Based Inverse Modeling for Black-Box Optimization

(NeurIPS 2024 BDU) We propose an inverse modeling approach for efficient online black-box optimization by resorting to classifier-free conditional diffusion models with a novel uncertainty-aware acquisition function.

Dongxia Wu*, Nikki Lijing Kuang*, Ruijia Niu, Yi-An Ma, Rose Yu

Log-concave Sampling from a Convex Body with a Barrier: a Robust and Unified Dikin Walk

(NeurIPS 2024) We design a Dikin walk for log-concave sampling over polytopes and spectrahedra with optimal mixing time and efficient per-iteration cost.

(Alphabetical) Yuzhou Gu, Nikki Lijing Kuang, Yi-An Ma, Zhao Song, Lichen Zhang

Log-concave Sampling from a Convex Body with a Barrier: a Robust and Unified Dikin Walk

Posterior sampling with delayed feedback for reinforcement learning with linear function approximation

(NeurIPS 2023) We provide the first theoretical analysis for the class of posterior sampling algorithms to handle delayed feedback in RL frameworks.

Nikki Lijing Kuang*, Ming Yin*, Mengdi Wang, Yu-Xiang Wang, Yi-An Ma

Langevin Thompson sampling with logarithmic communication: bandits and reinforcement learning

(ICML 2023) We study approximate Thompson Sampling with Markov Chain Monte Carlo in bandit and reinforcement learning frameworks, providing algorithms that achieve optimal performance with low computation and communication cost.

Nikki Lijing Kuang*, Siddharth Mitra*, Amin Karbasi, Yi-An Ma

See all publications

Invited Talks

Inference-time Alignment for Personalized LLMs
- MIT-IBM Lab, 2024
- IBM Research, 2024
Posterior Sampling in RL with delayed feedback
- SOCAMS, 2024
- TILOS-Intel Workshop, 2024
- IBM Research, 2024
Robust Human Intention Estimation in Robot Teleoperation
- Honda Research Institute PC Seminar, 2024
Efficient Langevin Thompson Sampling in Bandits and RL
- HDSI Industry Research Review, 2023
Batched Approximate Thompson Sampling
- TILOS AI Institute Trainee Workshop, 2022

Professional Service

Conference Reviewers
- NeurIPS (2023 - )
- AISTATS (2023 - )
- AAAI (2023 - )
- ICML (2024 - )
- ICLR (2024 - )
- ISIT (2024)
Journal Reviewers
- IEEE Transactions on Circuits and Systems for Video Technology (2024 - )
- IEEE Transactions on Information Theory (2025 - )

Featured Awards

NSF AIVO Travel Grant
NeurIPS 2023 Top Reviewer (Top 10%)
NeurIPS Scholar Award
HDSI Fellowship
UCSD GSA Travel Grant (2023, 2019)
National Scholarship