Baird’s counterexample

source("./counterexample.R")

Figure 11.2: Demonstration of instability on Baird’s counterexample

plot_fig11.2()

Figure 11.5: The behavior of the TDC algorithm

plot_fig11.5()

Figure 11.6: The behavior of the one-step Emphatic-TD algorithm in expectation

plot_fig11.6()