🏢 CREST, ENSAE

Strategic Multi-Armed Bandit Problems Under Debt-Free Reporting

26 September 2024·1460 words·7 mins

Machine Learning Reinforcement Learning 🏢 CREST, ENSAE

An incentive-aware algorithm achieves low regret in strategic multi-armed bandits under debt-free reporting while making truthful reporting an equilibrium among the arms.

Improved Algorithms for Contextual Dynamic Pricing

26 September 2024·515 words·3 mins

AI Generated AI Theory Optimization 🏢 CREST, ENSAE

New algorithms achieve optimal regret bounds for contextual dynamic pricing under minimal assumptions, improving revenue management with better price adjustments.