Skip to main content

🏢 CREST, ENSAE

Strategic Multi-Armed Bandit Problems Under Debt-Free Reporting
·1460 words·7 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 CREST, ENSAE
Incentive-aware algorithm achieves low regret in strategic multi-armed bandits under debt-free reporting, establishing truthful equilibrium among arms.
Improved Algorithms for Contextual Dynamic Pricing
·515 words·3 mins· loading · loading
AI Generated AI Theory Optimization 🏢 CREST, ENSAE
New algorithms achieve optimal regret bounds for contextual dynamic pricing under minimal assumptions, improving revenue management with better price adjustments.