Nikki Lijing Kuang
Nikki Lijing Kuang
Home
Publications
Talks
Service
Light
Dark
Automatic
Reinforcement Learning
Posterior sampling with delayed feedback for reinforcement learning with linear function approximation
(NeurIPS 2023) We provide the first theoretical analysis for the class of posterior sampling algorithms to handle delayed feedback in RL frameworks.
Nikki Lijing Kuang*
,
Ming Yin*
,
Mengdi Wang
,
Yu-Xiang Wang
,
Yi-An Ma
PDF
Cite
Project
Poster
Langevin Thompson sampling with logarithmic communication: bandits and reinforcement learning
(ICML 2023) We study approximate Thompson Sampling with Markov Chain Monte Carlo in bandit and reinforcement learning frameworks, providing algorithms that achieve optimal performance with low computation and communication cost.
Nikki Lijing Kuang*
,
Siddharth Mitra*
,
Amin Karbasi
,
Yi-An Ma
PDF
Cite
Project
Poster
Sample Efficient Constrained Reinforcement Learning
We propose sample efficient algorithms for constrained RL frameworks.
Nikki Lijing Kuang
PDF
Cite
Cite
×