Constrained Shortest Path/ Multi-commodity Flow

Problem Definition

Define G = (V, E) as a bidirectional graph with vertices v ∈ V and weighted edges e ∈ E.

  1. Shortest-Path: Generate optimal routes and transit times between pairs of vertices.
  2. Constrained Shortest-Path: Given a list of demands and capacities on the edges, where a demand is a commodity described by an origin node, a destination node, and a payload, develop a shortest-path algorithm that minimizes transit time for all demands while accounting for edge capacities. Note that demands can be split and transported across multiple routes. A sketch of how such an instance might be represented follows this list.
  3. Reinforcement Learning: Provide an alternative version of the constrained shortest-path algorithm using reinforcement learning.
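
As referenced above, a minimal sketch of how such an instance might be represented (hypothetical names and numbers, not the classes in "graph_classes.py"):

```python
from dataclasses import dataclass

@dataclass
class Demand:
    """A commodity to be shipped: origin node, destination node and payload."""
    origin: str
    destination: str
    payload: float

# Hypothetical toy instance; node names and numbers are illustrative only.
# Each edge carries a transit time (its weight) and, for Part 2, a capacity.
edges = {
    ("a", "b"): {"transit_time": 4.0, "capacity": 10.0},
    ("b", "c"): {"transit_time": 2.0, "capacity": 5.0},
    ("a", "c"): {"transit_time": 7.0, "capacity": 8.0},
}
demands = [Demand("a", "c", 6.0), Demand("b", "c", 3.0)]
```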

The problem is solved through the implementation of the following scripts -

  1. tools.py
  2. graph_classes.py
  3. agents.py
  4. environments.py
  5. custom_data_structures.py

The main files to be run for the three sections are -

  1. "main_sp_part1.py"
  2. "main_csp_part2.py"
  3. "main_rl_part3.py"

Part 1 - Shortest-Path - "main_sp_part1.py"

This is the instance of the graph provided in "exercise_baseline.json". The shortest-path algorithms are developed as methods of the Graph object. Please run "main_sp_part1.py" for the following sections.

a) Shortest path between any two vertices - graph.shortest_path_source_to_dest(src = 'co', dest = 'wr', export_output = True, show_output = True)


Output exported to - "SrcToDest_ShortestPath.json"
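
For reference, a single-pair shortest path of this kind is commonly computed with Dijkstra's algorithm. The following is a minimal, self-contained sketch, independent of the repo's Graph class:

```python
import heapq

def dijkstra(adj, src, dest):
    """Shortest path on a graph given as {node: [(neighbour, weight), ...]}.
    Returns (total_transit_time, path) or (float('inf'), []) if unreachable."""
    dist = {src: 0.0}
    prev = {}
    pq = [(0.0, src)]
    visited = set()
    while pq:
        d, u = heapq.heappop(pq)
        if u in visited:
            continue
        visited.add(u)
        if u == dest:
            break
        for v, w in adj.get(u, []):
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                prev[v] = u
                heapq.heappush(pq, (nd, v))
    if dest not in dist:
        return float("inf"), []
    # Walk the predecessor map back from the destination to recover the route.
    path, node = [dest], dest
    while node != src:
        node = prev[node]
        path.append(node)
    return dist[dest], path[::-1]
```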

b) Shortest paths to a single vertex from every other vertex - graph.shortest_path_dest_from_all_nodes(dest = 'co', export_output = True, show_output = True)

Output exported to - "DestFromAllNodes_ShortestPaths.json"

c) Shortest path from a single vertex to every other vertex - graph.shortest_path_source_to_all_nodes(src = 'co', export_output = True, show_output = True)

Output exported to - "SrcToAllNodes_ShortestPaths.json"

d) Shortest paths between every pair of vertices - graph.shortest_path_all_pairs(export_output = True, show_output = False)

Output hidden. Please run "main_sp_part1.py" to see the output of the all-pairs shortest-paths algorithm, and set the show_output argument to True to print the shortest paths for all pairs to your window.
Output exported to - "AllPairs_ShortestPaths.json"

e) Test runs using randomly generated graphs - generate_graph(vertices, edges, show_graph_visual, size = (14,8), export_output= True)

Random graphs are generated using the generate_graph() method in the "tools.py" script.
An example is a randomly generated graph with 12 vertices and 40 edges; the graph is exported to "RandomlyGenerated_Graph.json".
Output exported to - "AllPairs_ShortestPaths.json"
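
For illustration, a random instance of this kind can be generated roughly as follows (a hypothetical sketch, not the actual signature or behaviour of generate_graph in "tools.py"):

```python
import random

def random_graph(n_vertices, n_edges, max_weight=20, seed=None):
    """Generate a connected weighted graph as an edge list of (u, v, weight)
    tuples with vertices labelled v0..v{n-1}.
    Assumes n_edges does not exceed the number of possible vertex pairs."""
    rng = random.Random(seed)
    vertices = [f"v{i}" for i in range(n_vertices)]
    edges = set()
    # First thread a random spanning path so every vertex is reachable.
    shuffled = vertices[:]
    rng.shuffle(shuffled)
    for u, v in zip(shuffled, shuffled[1:]):
        edges.add((u, v))
    # Then add random extra edges until the requested count is reached.
    while len(edges) < n_edges:
        u, v = rng.sample(vertices, 2)
        if (u, v) not in edges and (v, u) not in edges:
            edges.add((u, v))
    return [(u, v, rng.randint(1, max_weight)) for u, v in edges]
```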

Part 2 - Constrained Shortest Path - "main_csp_part2.py"

This is the instance of the graph provided in "exercise_bonus.json". The constrained shortest path problem is solved with linear programming. The calculate_flow_milp() method of the Graph class creates an LP model using Google OR-Tools. The script "main_csp_part2.py" displays results for two separate methods:

  1. A complete LP approach whose solution is optimal. The total transit time comes to 100954.6. Each line of the output can be read as: a demand from node A to node B with payload x is fulfilled through (Route1, Flow1), (Route2, Flow2), etc., where the sum of the flows equals the total payload of the demand. The output is exported to "MILP_Routes.json". A rough OR-Tools sketch of this kind of formulation follows this list.
  2. A greedy method based on sequential allocation of demands. The greedy method iterates through the demands and allocates each one immediately; once a demand is allocated, the capacities of the arcs used are updated, and the remaining demands are routed in the same fashion. The total transit time comes to 103806.2.
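
As mentioned above, an arc-based multicommodity-flow LP in Google OR-Tools can be sketched roughly as follows. The toy data and variable names are illustrative assumptions, not the repo's calculate_flow_milp() implementation:

```python
from ortools.linear_solver import pywraplp

# Toy instance: arcs with (transit_time, capacity); both directions listed explicitly.
arcs = {("a", "b"): (4.0, 10.0), ("b", "a"): (4.0, 10.0),
        ("b", "c"): (2.0, 5.0),  ("c", "b"): (2.0, 5.0),
        ("a", "c"): (7.0, 8.0),  ("c", "a"): (7.0, 8.0)}
# Demands as (origin, destination, payload).
demands = [("a", "c", 6.0), ("b", "c", 3.0)]
nodes = {n for arc in arcs for n in arc}

solver = pywraplp.Solver.CreateSolver("GLOP")

# flow[k][arc]: amount of demand k routed over that arc (demands may split).
flow = [{arc: solver.NumVar(0.0, cap, f"f_{k}_{arc[0]}_{arc[1]}")
         for arc, (_, cap) in arcs.items()} for k in range(len(demands))]

# Arc capacities are shared across all demands.
for arc, (_, cap) in arcs.items():
    solver.Add(sum(flow[k][arc] for k in range(len(demands))) <= cap)

# Flow conservation: net outflow is +payload at the origin,
# -payload at the destination, and 0 at every other node.
for k, (o, d, p) in enumerate(demands):
    for n in nodes:
        out_f = sum(flow[k][a] for a in arcs if a[0] == n)
        in_f = sum(flow[k][a] for a in arcs if a[1] == n)
        rhs = p if n == o else (-p if n == d else 0.0)
        solver.Add(out_f - in_f == rhs)

# Minimise total transit time weighted by the flow carried on each arc.
solver.Minimize(sum(t * flow[k][a] for a, (t, _) in arcs.items()
                    for k in range(len(demands))))
if solver.Solve() == pywraplp.Solver.OPTIMAL:
    print("total transit time:", solver.Objective().Value())
```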

Part 3 - Reinforcement Learning based shortest paths - "main_rl_part3.py"

The problem is modelled as a sequential demand allocation problem, with the objective of finding the optimal order in which to allocate demands so as to minimize total transit time. The states, actions, and rewards of the MDP are defined below.
State - The demands that have already been allocated and the capacities of the arcs under the current allocation.
Action - The next demand to be allocated.
Rewards - Computed using an LP: we find the optimal routes and flows for a demand given the current arc capacities, and the objective value of the LP is used as the reward for the state-action pair. A stripped-down sketch of one environment step is shown below.
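
The sketch uses hypothetical names and does not mirror the classes in "environments.py" or "agents.py":

```python
def step(state, action, solve_single_demand_lp, n_demands):
    """One transition of the sequential-allocation MDP.
    state: (set of allocated demand indices, dict of residual arc capacities)
    action: index of the next demand to allocate
    solve_single_demand_lp: callable returning (transit_time, {arc: flow})
        for one demand under the current residual capacities."""
    allocated, capacities = state
    transit_time, flows = solve_single_demand_lp(action, capacities)

    # Deduct the routed flow from the residual capacity of each arc used.
    new_capacities = dict(capacities)
    for arc, amount in flows.items():
        new_capacities[arc] -= amount

    new_state = (allocated | {action}, new_capacities)
    reward = transit_time                      # objective value of the per-demand LP
    done = len(new_state[0]) == n_demands      # terminal once every demand is allocated
    return new_state, reward, done
```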

Description of the method

An offline reinforcement learning algorithm is implemented using fitted Q-iteration, a batch learning method. We begin by creating a dataset from 100 iterations of the problem with random actions. Each row consists of the features of the current state, the next state given the chosen action and the associated reward, a binary variable indicating whether the next state is terminal, and all possible states that can be reached after one further action. With the dataset ready, the learning process begins by initialising a ZeroEstimator that predicts zero for any input. At each iteration, a target value y is computed for each state in the dataset. We use these values to train an MLP estimator - a feedforward artificial neural network - which seeks to approximate the optimal Q-function. After training in each iteration, the updated estimator is used to compute the next TD target values. Actions are chosen based on the estimator's predictions, with the best action being the one that leads to the state with the lowest value. A rough sketch of this loop is shown below.
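
The sketch uses scikit-learn's MLPRegressor as a stand-in for the MLP estimator; the dataset field names are assumptions, not the repo's actual format:

```python
import numpy as np
from sklearn.neural_network import MLPRegressor

def fitted_q_iteration(dataset, n_iterations=20, gamma=1.0):
    """dataset: list of transitions, each a dict with
    'state' (feature vector), 'reward', 'terminal' (bool) and
    'next_candidates' (feature vectors of all states reachable by one more action)."""
    X = np.array([row["state"] for row in dataset])
    q = None  # iteration 0 behaves like a ZeroEstimator: predicted values are 0 everywhere
    for _ in range(n_iterations):
        targets = []
        for row in dataset:
            if row["terminal"] or q is None:
                future = 0.0
            else:
                # Transit times are minimised, so the best next action
                # is the one leading to the state with the lowest value.
                future = min(q.predict(np.array(row["next_candidates"])))
            targets.append(row["reward"] + gamma * future)
        # Refit the function approximator on the updated TD targets.
        q = MLPRegressor(hidden_layer_sizes=(64, 64), max_iter=2000)
        q.fit(X, np.array(targets))
    return q
```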

We observe the following over 10 runs of the reinforcement learning algorithm. The offline DQN algorithm obtains an average total transit time of 102716.32.

Total Transit Time

  1. MILP (Optimal) - 100954.6
  2. Greedy (Baseline) - 103806.2
  3. Offline DQN (Average) - 102716.32

The routes and flows generated by the reinforcement learning algorithm are exported to the "RL_OfflineAgent_Routes.json" file.

Thanks, Adel Sakkir
