Nikki Lijing Kuang

Nikki Lijing Kuang

Research Scientist, PhD in CS

UC San Diego

About Me

I am a Research Scientist at Snowflake AI Research, where I work on LLM reasoning and agentic frameworks, with a particular focus on how reinforcement learning (RL) contributes to these emerging paradigms.

I received my Ph.D. in Computer Science from UC San Diego, where I was fortunate to be advised by Prof. Yian Ma, and closely collaborated with Prof. Tara Javidi, Prof. Sean Gao. During my PhD, I primarily work on RL, foundation Models (LLMs, Diffusion Models) and Bayesian inference, with the aim to address the fundamental challenges in sequential decision-making under uncertainty. The goal of my research is to design provably efficient and practical algorithms with performance guarantee, achieving both statistical and computational benefits. Besides, I have gained hands-on experience with fine-tuning LLMs and reward models, designing CoT prompting and reasoning frameworks, LLM decoding, and training R1-style reasoning LLMs using RL (e.g. PPO, GRPO).

I'm a PhD candidate in Computer Science at UC San Diego, where I am fortunate to be advised by [**Prof. Yian Ma**](https://sites.google.com/view/yianma/home), and closely collaborate with [**Prof. Tara Javidi**](https://tjavidi.eng.ucsd.edu/), [**Prof. Sean Gao**](https://scungao.github.io/).

My primary research interests span reinforcement learning (RL), foundation models and Bayesian inference, with a focus on addressing fundamental challenges in sequential decision making under uncertainty. More recently, I am particularly interested in LLM alignment and reasoning, exploring how RL plays a role in these topics. The goal of my research is to design provably efficient and practical algorithms with performance guarantee, achieving both statistical and computational benefits.

Previously, I interned at IBM research, Amazon and Honda Research Institute, working on LLM for personlization, RL for ranking and recommendation systems, and robotics.

Education
  • PhD in Computer Science, 2025

    University of California San Diego

  • MSc in Computer Science, 2020

    University of California San Diego

Selected Publications

Invited Talks

  • Inference-time Alignment for Personalized LLMs
    • MIT-IBM Lab, 2024
    • IBM Research, 2024
  • Posterior Sampling in RL with delayed feedback
    • SOCAMS, 2024
    • TILOS-Intel Workshop, 2024
    • IBM Research, 2024
  • Robust Human Intention Estimation in Robot Teleoperation
    • Honda Research Institute PC Seminar, 2024
  • Efficient Langevin Thompson Sampling in Bandits and RL
    • HDSI Industry Research Review, 2023
  • Batched Approximate Thompson Sampling
    • TILOS AI Institute Trainee Workshop, 2022

Professional Service

  • Conference Reviewers
    • NeurIPS (2023 - )
    • AISTATS (2023 - )
    • AAAI (2023 - )
    • ICML (2024 - )
    • ICLR (2024 - )
    • ISIT (2024)
  • Journal Reviewers
    • IEEE Transactions on Circuits and Systems for Video Technology (2024 - )
    • IEEE Transactions on Information Theory (2025 - )

Featured Awards

  • NSF AIVO Travel Grant
  • NeurIPS 2023 Top Reviewer (Top 10%)
  • NeurIPS Scholar Award
  • HDSI Fellowship
  • UCSD GSA Travel Grant (2023, 2019)
  • National Scholarship