Google Scholar

On Oracle-Efficient PAC RL with Rich Observations. [arXiv]
(NIPS-18, w/ spotlight talk) Christoph Dann, Nan Jiang, Akshay Krishnamurthy, Alekh Agarwal, John Langford, Robert E. Schapire.

Completing State Representations using Spectral Learning. []
(NIPS-18) Nan Jiang, Alex Kulesza, Satinder Singh.

Open Problem: The Dependence of Sample Complexity Lower Bounds on Planning Horizon. [pdf]
(COLT-18) Nan Jiang, Alekh Agarwal.

Hierarchical Imitation and Reinforcement Learning. [arXiv]
(ICML-18) Hoang M. Le, Nan Jiang, Alekh Agarwal, Miroslav Dudík, Yisong Yue, Hal Daumé III.

Markov Decision Processes with Continuous Side Information. [arXiv]
(ALT-18) Aditya Modi, Nan Jiang, Satinder Singh, Ambuj Tewari.

PAC Reinforcement Learning with an Imperfect Model. [pdf, poster]
(AAAI-18) Nan Jiang.

Repeated Inverse Reinforcement Learning. [arXiv, poster, talk video]
(NIPS-17, w/ spotlight talk) Kareem Amin*, Nan Jiang*, Satinder Singh (*Equal contribution).

Contextual Decision Processes with Low Bellman Rank are PAC-Learnable. [ICML version, arXiv, poster, talk video]
(ICML-17) Nan Jiang, Akshay Krishnamurthy, Alekh Agarwal, John Langford, Robert E. Schapire.

Doubly Robust Off-policy Value Evaluation for Reinforcement Learning. [pdf, poster]
(ICML-16) Nan Jiang, Lihong Li.

On Structural Properties of MDPs that Bound Loss due to Shallow Planning. [pdf]
(IJCAI-16) Nan Jiang, Satinder Singh, Ambuj Tewari.

Improving Predictive State Representations via Gradient Descent. [pdf]
(AAAI-16) Nan Jiang, Alex Kulesza, Satinder Singh.

Abstraction Selection in Model-based Reinforcement Learning. [pdf, talk video]
(ICML-15) Nan Jiang, Alex Kulesza, Satinder Singh.

The Dependence of Effective Planning Horizon on Model Accuracy. [pdf, errata, poster, talk video]
(AAMAS-15, best paper award) Nan Jiang, Alex Kulesza, Satinder Singh, Richard Lewis.

Low-Rank Spectral Learning with Weighted Loss Functions. [pdf]
(AISTATS-15) Alex Kulesza, Nan Jiang, Satinder Singh.

Spectral Learning of Predictive State Representations with Insufficient Statistics. [pdf]
(AAAI-15) Alex Kulesza, Nan Jiang, Satinder Singh.

Improving UCT Planning via Approximate Homomorphisms. [pdf, supplement]
(AAMAS-14) Nan Jiang, Satinder Singh, Richard Lewis.


PhD Thesis

A Theory of Model Selection in Reinforcement Learning. [pdf]
(2017) Nan Jiang.