Lyft’s Reinforcement Learning Platform
Lyft Engineering
MARCH 12, 2024
It then chooses the better performing actions for a state while maintaining some level of exploration to detect changes over time. More typically, we perform batch updates anywhere from every 10 minutes to 24 hours. It recommends different news article categories to a user based on the time of day and changing preferences over time.
Let's personalize your content