Richard S. Sutton
E753566
Richard S. Sutton is a pioneering computer scientist best known for his foundational contributions to reinforcement learning, including co-authoring the influential textbook "Reinforcement Learning: An Introduction."
All labels observed (2)
| Label | Occurrences |
|---|---|
| Richard S. Sutton (canonical) | 3 |
| Richard L. Sutton | 1 |
Statements (45)
| Predicate | Object |
|---|---|
| instanceOf | computer scientist; researcher |
| affiliation | Alberta Machine Intelligence Institute; DeepMind |
| almaMater | University of Massachusetts Amherst |
| citizenship | Canada; United States of America (surface form: United States) |
| coAuthorOf | "Reinforcement Learning: An Introduction" |
| coAuthorWith | Andrew G. Barto |
| employer | University of Alberta |
| fieldOfStudy | computer science |
| fieldOfWork | artificial intelligence; machine learning; reinforcement learning |
| hasAcademicAdvisor | Andrew G. Barto |
| hasContribution | formalization of temporal-difference methods; integration of learning, planning, and acting in RL; popularization of reinforcement learning through textbooks and tutorials; theoretical foundations of reinforcement learning |
| hasRole | pioneer of reinforcement learning |
| influenced | applications of reinforcement learning in games; applications of reinforcement learning in robotics; development of modern reinforcement learning |
| knownFor | actor-critic methods; foundational contributions to reinforcement learning; on-policy and off-policy learning methods; options framework for temporally extended actions; policy gradient methods; temporal-difference learning; the Dyna architecture; the book "Reinforcement Learning: An Introduction" |
| language | English |
| nationality | American; Canadian |
| notableWork | Dyna reinforcement learning architecture; Q-learning-related research; TD(λ); actor-critic architectures; options framework in hierarchical reinforcement learning; policy gradient theorem; temporal-difference learning algorithms |
| position | professor; research scientist |
| workInstitution | DeepMind Alberta; University of Alberta |
Referenced by (4)
Full triples — surface form annotated when it differs from this entity's canonical label.
this entity surface form: Richard L. Sutton