🏢 Courant Institute of Mathematical Sciences, New York University
The surprising efficiency of temporal difference learning for rare event prediction
·1614 words·8 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 Courant Institute of Mathematical Sciences, New York University
TD learning surprisingly outperforms Monte Carlo methods for rare event prediction in Markov chains, achieving relative accuracy with polynomially, instead of exponentially, many observed transitions.