Implemented agent which chooses to play winning, losing or random moves by solving a simplified MDP model of the game using policy iteration.

Portions of MDP/solver code by Ciaran O'Reilly and Ravi Mohan used under MIT license.
This commit is contained in:
Woody Folsom
2012-04-30 13:35:40 -04:00
parent c06f7ab38e
commit d0ee1e647b
35 changed files with 2500 additions and 3 deletions

View File

@@ -0,0 +1,21 @@
package aima.core.probability.mdp;
/**
* An interface for MDP reward functions.
*
* @param <S>
* the state type.
* @author Ciaran O'Reilly
* @author Ravi Mohan
*/
public interface RewardFunction<S> {
/**
* Get the reward associated with being in state s.
*
* @param s
* the state whose award is sought.
* @return the reward associated with being in state s.
*/
double reward(S s);
}