Livejournal
Log in
Post
Friends
My journal
am
(no subject)
Feb 09, 2019 04:16
J. Leike, et al. DeepMind
(2018) "
Scalable agent align-
ment via reward modeling:
a research direction.
"
tl
,
imit
,
rl
,
pomdp
Leave a comment
Previous post
Next post
Up