Girish Mahajan (Editor)

Breadth first search

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit
Class
  
Search algorithm

Data structure
  
Breadth-first search

Worst-case performance
  
O ( | V | + | E | ) = O ( b d ) {displaystyle O(|V|+|E|)=O(b^{d})}

Worst-case space complexity
  
O ( | V | ) = O ( b d ) {displaystyle O(|V|)=O(b^{d})}

Breadth-first search (BFS) is an algorithm for traversing or searching tree or graph data structures. It starts at the tree root (or some arbitrary node of a graph, sometimes referred to as a 'search key') and explores the neighbor nodes first, before moving to the next level neighbors.

Contents

BFS was invented in the late 1950s by E. F. Moore, who used it to find the shortest path out of a maze, and discovered independently by C. Y. Lee as a wire routing algorithm (published 1961).

Pseudocode

Input: A graph Graph and a starting vertex root of Graph

Output: Goal state. The parent links trace the shortest path back to root

A non-recursive implementation of breadth-first search:

More details

This non-recursive implementation is similar to the non-recursive implementation of depth-first search, but differs from it in two ways:

  1. it uses a queue (First In First Out) instead of a stack and
  2. it checks whether a vertex has been discovered before enqueueing the vertex rather than delaying this check until the vertex is dequeued from the queue.

The Q queue contains the frontier along which the algorithm is currently searching.

The S set is used to track which vertices have been visited (required for a general graph search, but not for a tree search). At the beginning of the algorithm, the set is empty. At the end of the algorithm, it contains all vertices with a distance from root less than the goal.

The parent attribute of each vertex is useful for accessing the nodes in a shortest path, for example by backtracking from the destination node up to the starting node, once the BFS has been run, and the predecessors nodes have been set.

The NIL is just a symbol that represents the absence of something, in this case it represents the absence of a parent (or predecessor) node; sometimes instead of the word NIL, words such as null, none or nothing can also be used.

Note that the word node is usually interchangeable with the word vertex.

Breadth-first search produces a so-called breadth first tree. You can see how a breadth first tree looks in the following example.

Example

The following is an example of the breadth-first tree obtained by running a BFS starting from Frankfurt:

Time and space complexity

The time complexity can be expressed as O ( | V | + | E | ) , since every vertex and every edge will be explored in the worst case. | V | is the number of vertices and | E | is the number of edges in the graph. Note that O ( | E | ) may vary between O ( 1 ) and O ( | V | 2 ) , depending on how sparse the input graph is.

When the number of vertices in the graph is known ahead of time, and additional data structures are used to determine which vertices have already been added to the queue, the space complexity can be expressed as O ( | V | ) , where | V | is the cardinality of the set of vertices (as said before). If the graph is represented by an adjacency list it occupies Θ ( | V | + | E | ) space in memory, while an adjacency matrix representation occupies Θ ( | V | 2 ) .

When working with graphs that are too large to store explicitly (or infinite), it is more practical to describe the complexity of breadth-first search in different terms: to find the nodes that are at distance d from the start node (measured in number of edge traversals), BFS takes O(bd + 1) time and memory, where b is the "branching factor" of the graph (the average out-degree).

Completeness and optimality

In the analysis of algorithms, the input to breadth-first search is assumed to be a finite graph, represented explicitly as an adjacency list or similar representation. However, in the application of graph traversal methods in artificial intelligence the input may be an implicit representation of an infinite graph. In this context, a search method is described as being complete if it is guaranteed to find a goal state if one exists. Breadth-first search is complete, but depth-first search is not. When applied to infinite graphs represented implicitly, breadth-first search will eventually find the goal state, but depth-first search may get lost in parts of the graph that have no goal state and never return.

BFS ordering

An enumeration of the vertices of a graph is said to be a BFS ordering if it is the possible output of the application of BFS to this graph.

Let G = ( V , E ) be a graph with n vertices. Recall that N ( v ) is the set of neighbors of v . For σ = ( v 1 , , v m ) be a list of distinct elements of V , for v V { v 1 , , v m } , let ν σ ( v ) be the least i such that v i is a neighbor of v , if such a i exists, and be otherwise.

Let σ = ( v 1 , , v n ) be an enumeration of the vertices of V . The enumeration σ is said to be a BFS ordering (with source v 1 ) if, for all 1 < i n , v i is the vertex w V { v 1 , , v i 1 } such that ν ( v 1 , , v i 1 ) ( w ) is minimal. Equivalently, σ is a BFS ordering if, for all 1 i < j < k n with v i N ( v j ) N ( v k ) , there exists a neighbor v m of v j such that m < i .

Applications

Breadth-first search can be used to solve many problems in graph theory, for example:

  • Copying garbage collection, Cheney's algorithm
  • Finding the shortest path between two nodes u and v, with path length measured by number of edges (an advantage over depth-first search)
  • (Reverse) Cuthill–McKee mesh numbering
  • Ford–Fulkerson method for computing the maximum flow in a flow network
  • Serialization/Deserialization of a binary tree vs serialization in sorted order, allows the tree to be re-constructed in an efficient manner.
  • Construction of the failure function of the Aho-Corasick pattern matcher.
  • Testing bipartiteness of a graph.
  • References

    Breadth-first search Wikipedia