Graph Traversal

st-connectivity is a decision problem determining if there is a path between two vertices $s$ and $t$ in a graph $G$ .

Breadth-First Search (BFS)

Let $G = (V, E)$ be a graph and $s \in V$ be a starting node.
- The layers $L_{1}, L_{2}, \dots$ of the node $s$ in the graph $G$ constructed by BFS are defined as follows:
  - $L_{1} = {v \in V ∣ (s, v) \in E}$ (the set of vertices adjacent to $s$ )
  - $L_{i + 1} = {v \in V ∣ (u, v) \in E, u \in L_{i}, v \in / L_{i}}$ (the set of vertices adjacent to the vertices in $L_{i}$ that are not in $L_{i - 1}$ )
- $L_{i}$ is the set of vertices at distance $i$ from $s$ .
- BFS is not only determining the nodes reachable from $s$ , but also the shortest path from $s$ to them.
- For each $j \geq 1$ , layer $L_{j}$ produced by BFS consists of all nodes at distance exactly $j$ from $s$ .
- There is a path from $s$ to $t$ if and only if $t$ appears in some layer.
- BFS produces a BFS tree rooted at $s$ which is a tree $T = (V_{T}, E_{T})$ .
  - $V_{T} = {s} \cup i \geq 1 ⋃ L_{i}$ (the set of nodes reachable from $s$ )
  - $E_{T} \subseteq E$
  - If $v \in L_{i}, u \in L_{j}$ and $(u, v) \in E$ , then $∣ i - j ∣ \leq 1$ .
  - (3.4) If $(u, v) \in E$ and $(u, v) \in / E_{T}$ (a non-tree edge), then either $u$ and $v$ are in the same layer, or they are in adjacent layers.
- BFS explores the connected component in the graph $G$ containing $s$ .
  - $R$ is the set of nodes reachable from $s$ , produced as the following algorithm:
    - Start with $R = {s}$
    - While there exists an edge $(u, v) \in E$ such that $u \in R$ and $v \in / R$ , add $v$ to $R$ .
  - (note: in this context, connected component refers to the set of nodes, not the subgraph induced by them)

Layered BFS Algorithm

BFS Algorithm:
Runs in O(n+m) time if the graph is represented as an adjacency list.
---
BFS(s):
	Discovered[s]=true 
	for all other v, Discovered[v]=false
	
	L[0] = {s}    // layer 0
	i = 0         // layer counter
	T = {} // BFS tree initially empty
	
	while L[i] != {} do
	    L[i+1] = {}
	    for each v in L[i] do
	      for each (v,u) in E do
	        if not Discovered[u] then
		        Discovered[u] = true
				Add u to L[i+1]
				Add (v,u) to T
		i = i+1

Queue-based BFS Algorithm

BFS Algorithm: (Queue-based)
---
BFS(s):
    Create a queue Q
    Enqueue s onto Q
    Discovered[s] = true
    
    while Q is not empty do
        v = Dequeue from Q
        for each neighbor u of v do
            if not Discovered[u] then
                Discovered[u] = true
                Parent[u] = v // Optional: 
                Enqueue u onto Q
                Add (v, u) to T  // Optional: if building a BFS tree

Print nodes of a tree T in BFS order:
---
Print(T, s):
    Create a queue Q
    Enqueue s onto Q
    Discovered[s] = true
    
    while Q is not empty do
        dequeue v from Q and print v
        for each child u of v do
            Enqueue u onto Q

Depth-First Search (DFS)

DFS (Undirected)

An edge $(v, u)$ is a tree edge if $u$ is visited for the first time when $(v, u)$ is explored. (i.e. visited(u) = false)
An edge $(v, u)$ is a back edge if $u$ is already visited when $(v, u)$ is explored. (i.e. visited(u) = true)
The tree that DFS constructs is called a DFS tree.

Algorithm 2.2 DFS(G)
Input: G = (V, E) connected undirected graph on n vertices
---
for all v ∈ V do
	vistied(v) ← false
for all v ∈ V do
	if not visited(v) then
		DFS-Explore(G, v)

Algorithm 2.3 DFS-Explore(G, v)
---
visited(v) ← true
for all (v, u) ∈ E do
	if not visited(u) then
		DFS-Explore(G, u)

Stack-based DFS Algorithm

DFS Algorithm: (Stack-based)
Runnning time: O(n+m) (if the graph is represented as an adjacency list)
---
DFS(s):
	Initialize S to be a stack with one element s
	While S is not empty do
	    Pop v from S
	    If Explored[u] = false then 
		    Set Explored[u] = true
			For each edge (u,v) incident to u do
			    Push v onto S

binary tree traversal and DFS

todo which one of the bianry tree traversal algorithms (inorder, preorder, postorder) is a DFS algorithm on a binary tree?

DFS (Directed)

A tree edge is an edge $(u, v)$ such that $v$ was first discovered by exploring $(u, v)$ .
Back edges are those edges $(u, v)$ connecting a vertex $u$ to an ancestor $v$ in a depth-first tree. We consider self-loops, which may occur in directed graphs, as back edges.
Forward edges are those nontree edges $(u, v)$ connecting a vertex $u$ to a proper descendant $v$ in a depth-first tree.
Cross edges are all other edges. They can go between vertices in the same depth-first tree, as long as one vertex is not an ancestor of the other, or they can go between vertices in different depth-first trees.

DFS_visits_explore(G,v):

visited[v] = true
clock++
pre[v] = clock
for each edge (v,u) in E do 
	if not visited[u] then
	    DFS_visits_explore(G,u) 
clock++
post[v] = clock

Finding the Set of All Connected Components

Algorithm:
1. Find the connected components that contain the node $s$ using BFS or DFS.
2. Find a node $v$ (if any) in the graph that is not in any connected component found in step 1.
3. Repeat step 1 with $v$ as the starting node.
This algorithm runs in $O (n + m)$ time. (even though it may run BFS or DFS multiple times, it spends a constant amount of time on each edge and node)

Testing Bipartiteness

We can implement an algorithm to test whether a graph is bipartite by simply taking the implementation of BFS and adding an extra array Color over the nodes.
Whenever we get to a step in BFS where we are adding a node $v$ to a list $L [i + 1]$ , we assign:
- Color[v] = red if $i + 1$ is an even number,
- Color[v] = blue if $i + 1$ is an odd number.
At the end of this procedure, we simply scan all the edges:
- if there is any edge for which both ends received the same color, then the graph is not bipartite.
- Otherwise, the graph is bipartite.
Thus, the total running time for the coloring algorithm is $O (m + n)$ , just as it is for BFS.

Directed Graphs

Directed Graph Representation

We can use a version of adjacency list representation for directed graphs, which is, instead of each node having a single list of neighbors, each node has two lists associated with it:
- AdjOut[v] contains all the vertices $u$ such that $(v, u) \in E$ (outgoing edges)
- AdjIn[v] contains all the vertices $u$ such that $(u, v) \in E$ (incoming edges)

Directed Graph Traversal

Given a directed graph $G = (V, E)$ :
- $G^{rev}$ is the reverse graph of $G$ , where $G^{rev} = (V, E^{rev})$ and $E^{rev} = {(u, v) ∣ (v, u) \in E}$ .
- A node $v$ has a path to $s$ in $G$ if and only if $s$ has a path to $v$ in $G^{rev}$ .
- By running BFS(s) (or DFS(s)) on $G$ , we can find the set of all nodes reachable from $s$ in $G$ .
- By running BFS(s) (or DFS(s)) on $G^{rev}$ , we can find the set of all nodes that can reach a given node $s$ in $G$ .
- There is a simple linear-time algorithm to test if a directed graph is strongly connected
  - Run BFS(s) on $G$ for some node $s$ .
  - Run BFS(s) on $G^{rev}$ for the node $s$
  - If all nodes are reachable from $s$ in both runs, then the graph is strongly connected.

lecture 3 notes

probelms:
- how to find all shortest paths from a given node $s$ to a given node $t$ in a graph $G$ ?
problem solving using reduction (reduction is the process of transforming one problem into an easier eqvivalent problem)
- given an undirected heaph G with edges with weights. we ewant an algorithme that finds the shortest path between in terms of the sum of the weights of the edges.
  - will runnig of BFS algo. solve the problem?
    - ans: no. counter example was shown in the lecture.
  - solution:
    - by using the reduction method, we can build a new graph $G^{'}$ by replacing each edge $(u, v)$ with a path of length of the minimal weight is in the original graph.
    - then, we can run BFS on the new graph $G^{'}$ to find the shortest path.
    - the reducion does not lose or add any shortest path. i.e. we can prove:
      - for every $k$ -length path from $a$ to $b$ in $G$ , there is a $k$ -length path from $a$ to $b$ in $G^{'}$ . (length=the sum of the weights of the edges)
      - for every every $k$ -length path from $a$ to $b$ in $G^{'}$ (where $a$ and $b$ are nodes in $G$ ), there is a $k$ -length path from $a$ to $b$ in $G$ .
    - the running time of the new algorithm is $O (n + m)$
exercise:
- given a neighberhood with two-way streets. but there is a traffic jam in the city.
- we want to make the roads one-way to solve the traffic jam.
  - we have to make sure that the city remains strongly connected.
    - is it always possible? no.todo
  - bridge in an undirected graph is an edge in an undirected connected graph is a bridge if removing it disconnects the graph
- we’ll use in Bridges algo. in time O(n+m) that cheeck if G has a bridge or not.
- our question is: give an algorithm that make all roads one-way such that the city remains strongly connected.
  - hint: dfs has two types of edges: (in undirected graph. in directed graph, we have more types of edges)
    - tree edges - edges that are in the dfs tree
    - back edges - edges that are not in the dfs tree BUT connects a node to an ancestor in the dfs tree.
  - algo:
    - if there is a bridge in the graph, so this road should be two-way, so we can’t make the graph directed and strongly connected.
    - otherwise, we can make the graph directed and strongly connected
  - correctness proof:
    - every node is reachable from the root in the dfs tree.
varient of the problem: what if we want make roads one way as much as possible..

Shortest Path Problem

The shortest path problem is the problem of finding the shortest path between two nodes in a graph.
- single-source: finding the shortest path from a given node $s$ to all other nodes in the graph.
- single-destination: finding the shortest path from all nodes to a given node $t$ .
- all-pairs: finding the shortest path between all pairs of nodes in the graph.
- single-pair: finding the shortest path between a given pair of nodes $(s, t)$ .
The shortest path problem can be defined for graphs whether undirected, directed, or mixed.
The shortest path problem can be defined for graphs with or without weights on the edges.

Undirected Graphs

we can use the BFS algorithm to find the shortest path in an undirected unweighted graph.
for a weighted graph, we can replace each edge with a path of length equal to the weight of the edge and then run the BFS algorithm.
- cons: 1. the new graph may have multiple edges between two nodes. 2. it’s not good for weights of real numbers.

Dijkstra’s Algorithm

Dijkstra’s algorithm is an algorithm for finding the shortest path (in terms of the sum of weights) from a given start node to every other node in a directed graph with non-negative weights.
- Although this algorithm is for a directed graph, it can be used for an undirected graph by replacing each undirected edge $(u, v)$ with two directed edges $(u, v)$ and $(v, u)$ , each with the same weight.
- Assumptions:
  - the graph is connected
  - non-negative weights
The output of Dijkstra’s algorithm is a tree called the shortest path tree rooted at the start node.
Dijkstra’s algorithm is using a priority queue which can be implemented using:
- (unsorted) Doubly linked list - $Θ (∣ V ∣^{2})$
- Binary heap $Θ (∣ E ∣ \cdot lo g ∣ V ∣)$
- Fibonacci heap $Θ (∣ E ∣ + ∣ V ∣ lo g ∣ V ∣)$

# Dijkstra's Algorithm using Priority Queue
Dijkstra(G, s, w):
input: 
	G = (V, E) directed graph
	non-negative weights w
	s = source node
output: for each u ∈ V:
	d[u] = the shortest path estimate from s to u
	π[u] = the predecessor of u in the shortest path tree
----
InitPriorityQueue(Q)
d[s] = 0
Q.Insert(s, 0)

For each u ∈ G.V: 
	if u ≠ s:
		d[u] = ∞
		π[u] = null  
		Q.Insert(u, ∞)

While Q ≠ ∅:
    u ← Q.ExtractMin()   # Remove & return the node with the smallest d[u]
    S ← S ∪ {u}                      # Mark u as processed
    For each neighbor v ∈ G.Adj[u]:
        If d[u] + w(u, v) < d[v]:    # Relax the edge (u, v)
            d[v] = d[u] + w(u, v)
            π[v] = u                 # Update the predecessor (optional)
            DECREASE-KEY(Q, v, d[v]) # Update v's priority in Q

Flight Times and Layovers (lecture 4 exercise)

You are given:

A list of cities and their unique identifiers.

Flight times between pairs of cities, represented as a weighted directed graph:

Each edge represents a flight with a time cost (in hours).

Waiting times at each city (in hours), which must be added to the travel time whenever a flight lands there.

Goal: Write a program to calculate:

The shortest total travel time from a given starting city to every other city.

The path taken for the shortest travel time to each city.

Bellman–Ford algorithm

The Bellman–Ford algorithm is an algorithm for finding the shortest path from a single source node to all other nodes in a directed weighted graph.
- It is slower than Dijkstra’s algorithm for the same problem, but more versatile, as it is capable of handling graphs in which some of the edge weights are negative numbers
If there is a negative cycle reachable from $s$ , then there is no shortest path from $s$ to any node, and $dist (v) = - \infty$

Explorer

Graph algorithms

Graph Traversal

Breadth-First Search (BFS)

Layered BFS Algorithm

Queue-based BFS Algorithm

Depth-First Search (DFS)

DFS (Undirected)

Stack-based DFS Algorithm

binary tree traversal and DFS

DFS (Directed)

Finding the Set of All Connected Components

Testing Bipartiteness

Directed Graphs

Directed Graph Representation

Directed Graph Traversal

lecture 3 notes

Shortest Path Problem

Undirected Graphs

Dijkstra’s Algorithm

Bellman–Ford algorithm

Table of Contents

Backlinks