Solving COPs by searching and learning Taking the best of the two worlds
5
Proposed approach
6
DP notation
7
From DP to CP
8
Proposed Framework
9
DL, RL and Search Architecture
10
Illustration on TSP
11
Link To RL environment
12
Constraint programming search
13
Adding Constraints
14
TSPTW: A DP model
15
TSPTW: Results
16
4- Moments Portfolio Optimization
17
PORT: Results
18
Conclusion and perspectives
19
Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization
Description:
Explore a hybrid approach combining reinforcement learning and constraint programming for solving complex combinatorial optimization problems in this 28-minute lecture by Louis-Martin Rousseau from École Polytechnique de Montréal. Delve into the challenges of state-space explosion in combinatorial optimization and learn how deep reinforcement learning can be used to design effective heuristics for NP-hard problems. Discover a novel framework that bridges dynamic programming, constraint programming, and reinforcement learning to overcome limitations of current methods. Examine the application of this hybrid approach to two challenging problems: the traveling salesman problem with time windows and the 4-moments portfolio optimization problem. Gain insights into the performance advantages of this integrated solution compared to standalone reinforcement learning and constraint programming approaches, as well as its competitiveness with industrial solvers.
Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization