Skip to main content

🏢 Courant Institute of Mathematical Sciences, New York University

The surprising efficiency of temporal difference learning for rare event prediction
·1614 words·8 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Courant Institute of Mathematical Sciences, New York University
TD learning surprisingly outperforms Monte Carlo methods for rare event prediction in Markov chains, achieving relative accuracy with polynomially, instead of exponentially, many observed transitions.