CS7642_Week10 Flashcards

1
Q

What is a DEC-POMDP?

A

Decentralized Partially Observable MDP. It’s a way of redefining MDPs that are more suited to coordination.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are three general ways that a human can communicate to an agent?

A
  1. Demonstrations
  2. Rewards
  3. Policies
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a TTD-MDP?

A

Targeted Trajectory MDP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

TTD-MDPs can be solved in linear time? (True/False)

A

True (linear in the length of the story)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly