Français Anglais
Accueil Annuaire Plan du site
Accueil > Evenements > Séminaires
Séminaire d'équipe(s) Learning and Optimization
Continuous MCTS for hydroelectric scheduling
Adrien Couëtoux

16 July 2013, 14:30 - 16 July 2013, 15:30
Salle/Bat : 1/DIG-Moulon
Contact :

Activités de recherche :

Résumé :
Hydroelectric scheduling (HS) problems, so far, have mostly been solved by mathematical programming methods (linear programming, stochastic dual dynamic programming, etc), because it's the only known method that works well to solve sequential decision making with very large action space (~10000 variables and more).
However, these methods need to simplify the model in several ways, including: assuming the cost function is convex, and simplifying the underlying random processes (like assuming they are Markovian).
Both these assumptions are wrong in reality for hydroelectric systems.

We investigate an alternative way to solve these problems, Monte Carlo Tree Search. It became famous for improving significantly the level of computers for the game of Go. We extended its reach to continuous domains.
However, MCTS suffers from some limitations to work on HS problems: it does not do well on very large action spaces, and it struggles with long time horizon.
This is why we worked on two fronts: first, we developped a framework to mix MCTS with existing suboptimal policies (like Linear Programming), in a way that can get the best out of both worlds.
Second, we are working on modifications of the traditional bandit inspired formula that dictates how to spend computing power in the tree of MCTS. Alternative methods to direct the computing effort show promising results on simple problems.

Pour en savoir plus :
Séminaires
Measuring Similarity between Logical Arguments
Automated Reasoning
Monday 06 March 2023 - 00:00
Salle : 0 - 650
Victor David .............................................

Imputing Out-of-Vocabulary Embeddings with LOVE Ma
Data-Centric Languages and Systems
Monday 20 February 2023 - 00:00
Salle : 455 - PCRI-N
Lihu Chen .............................................

On the Interplay between Software Product Lines an
Automated Reasoning
Tuesday 18 October 2022 - 14:15
Salle : 2013 - DIG-Moulon
Vander Alves .............................................

Combining randomized and observational data: Towar
Automated Reasoning
Thursday 13 October 2022 - 10:30
Salle : 2011 - DIG-Moulon
Bénédicte Colnet .............................................

New Achievements of Artificial Intelligence in Mul
Automated Reasoning
Tuesday 11 October 2022 - 14:15
Salle : 2013 - DIG-Moulon
.............................................