Most of my work can be grouped into several clusters: Exploration ( data collection in RL) Value function and policy learning ( training in RL) Off-policy learning ( evaluation in RL) Applications to the Web Applications to NLP
More information can be found in Google Scholar , DBLP , LinkedIn . Somewhat up-to-date CV , and short bio . I am also on Twitter as @LihongLi20 .