Table 1 Inverse reinforcement learning algorithms
Max margin | Bayesian | Max entropy | Misc. |
|---|---|---|---|
Ng and Russell (2000) | Ramachandran and Amir (2007) | Ziebart et al. (2008) | Ho and Ermon (2016) |
Abbeel and Ng (2004) | Rothkopf and Ballard (2013) | Ziebart et al. (2009) | Hausman et al. (2017) |
Abbeel et al. (2008) | Michini and How (2012) | Ziebart et al. (2012) | Henderson et al. (2018) |
Syed and Schapire (2008) | Levine et al. (2011) | Aghasadeghi and Bretl (2011) | Hahn and Zoubir (2015) |
Syed et al. (2008) | Choi and Kim (2011) | Kalakrishnan et al. (2013) | Doerr et al. (2015) |
Valko et al. (2013) | Qiao and Beling (2011) | Kalakrishnan et al. (2010) | Chen et al. (2016) |
Zhou and Li (2018) | Qiao and Beling (2013) | Levine and Koltun (2012) | Babes et al. (2011) |
Neu and Szepesvári (2007) | Choi and Kim (2012) | Ziebart et al. (2013) | Nguyen et al. (201) |
Ratliff et al. (2006) | Choi and Kim (2013) | Bloem and Bambos (2014) | Levine et al. (2010) |
Ratliff et al. (2006) | Budhraja and Oates (2017) | Zhou et al. (2018) | Klein et al. (2011) |
Ratliff et al. (2009) | Michini and How (2012) | Shiarlis et al. (2016) | Majumdar et al. (2017) |
Zou et al. (2018) | Michini et al. (2013) | Audiffren et al. (2015) | Singh et al. (2018) |
Boularias and Chaib-Draa (2013) | Michini et al. (2015) | Bogert et al. (2016) | Ratliff et al. (2009) |
Choi and Kim (2011) | Šošic et al. (2018) | Bogert and Doshi (2017) | Dvijotham and Todorov (2010) |
Chinaei and Chaib-Draa (2012) | Šošić et al. (2018) | Byravan et al. (2015) | |
Lee et al. (2016) | Shimosaka et al. (2017) | ||
Zheng et al. (2014) | Wulfmeier et al. (2016) | ||
Rothkopf and Dimitrakakis (2011) | Wulfmeier et al. (2017) | ||
Dimitrakakis and Rothkopf (2011) | Chen et al. (2019) | ||
Surana (2014) | Finn et al. (2016) | ||
Brown and Niekum (2018) | Mendez et al. (2018) | ||
Yu et al. (2019) | |||
Ranchod et al. (2015) | |||
Boularias et al. (2012) |