↓Skip to main content

🏢 Courant Institute of Mathematical Sciences, New York University

The surprising efficiency of temporal difference learning for rare event prediction

26 September 2024·1614 words·8 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Courant Institute of Mathematical Sciences, New York University

TD learning surprisingly outperforms Monte Carlo methods for rare event prediction in Markov chains, achieving relative accuracy with polynomially, instead of exponentially, many observed transitions.