Girish Mahajan (Editor)

Bagplot

A bagplot, or starburst plot, is a method in robust statistics for visualizing two-dimensional statistical data, analogous to the one-dimensional box plot. Introduced in 1999 by Rousseeuw et al., the bagplot shows the location, spread, skewness, and outliers of a data set.

Construction

The bagplot consists of three nested polygons, called the "bag", the "fence", and the "loop".

  • The inner polygon, called the "bag", is constructed on the basis of Tukey depth. The Tukey depth of a point is the smallest number of observations that can be contained in a closed half-plane that also contains that point. The bag contains at most 50% of the data points.
  • The outermost of the three polygons, called the "fence", is not drawn as part of the bagplot but is used to construct it. It is formed by inflating the bag by a certain factor (usually 3). Observations outside the fence are flagged as outliers.
  • The observations that are not marked as outliers are surrounded by a loop, the convex hull of the observations within the fence.
  • An asterisk symbol (*) near the center of the graph marks the depth median, the point with the highest possible Tukey depth. Each observation between the bag and the fence is joined to the bag by a line segment lying on the line from that observation to the depth median.
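The Tukey depth and the fence step described above can be sketched in a few lines of Python. This is a minimal illustration, not the algorithm of Rousseeuw et al.: the function names are hypothetical, the depth is computed by a simple O(n²) scan over candidate half-planes, the bag polygon is supplied by hand rather than computed, the inflation is assumed to be taken relative to the depth median, and the deepest sample point stands in for the depth median.

```python
def tukey_depth(p, points):
    """Smallest number of points in any closed half-plane that contains p."""
    px, py = p
    offsets = [(x - px, y - py) for x, y in points]
    # Points coinciding with p lie in every closed half-plane through p.
    at_p = sum(1 for d in offsets if d == (0, 0))
    others = [d for d in offsets if d != (0, 0)]
    best = len(others)
    for dx, dy in others:
        # Classify every point against the line through p with direction (dx, dy).
        left = right = ahead = behind = 0
        for ex, ey in others:
            cross = dx * ey - dy * ex
            if cross > 0:
                left += 1
            elif cross < 0:
                right += 1
            elif dx * ex + dy * ey > 0:
                ahead += 1      # on the line, same side of p as (dx, dy)
            else:
                behind += 1     # on the line, opposite side of p
        # Tilting the line slightly sends "ahead" and "behind" to opposite sides.
        best = min(best, left + ahead, left + behind,
                   right + ahead, right + behind)
    return at_p + best


def inflate(polygon, center, factor=3.0):
    """Scale a polygon about a center point (used here to turn a bag into a fence)."""
    cx, cy = center
    return [(cx + factor * (x - cx), cy + factor * (y - cy)) for x, y in polygon]


def inside_convex(pt, polygon):
    """True if pt is inside or on a convex polygon listed counter-clockwise."""
    n = len(polygon)
    for i in range(n):
        ax, ay = polygon[i]
        bx, by = polygon[(i + 1) % n]
        if (bx - ax) * (pt[1] - ay) - (by - ay) * (pt[0] - ax) < 0:
            return False
    return True


# Toy data: a square, its center, and one far-away observation.
sample = [(0, 0), (2, 0), (2, 2), (0, 2), (1, 1), (10, 10)]
deepest = max(sample, key=lambda q: tukey_depth(q, sample))  # stand-in for the depth median

# Hypothetical bag, chosen by hand for illustration only.
bag = [(0.5, 0.5), (1.5, 0.5), (1.5, 1.5), (0.5, 1.5)]
fence = inflate(bag, deepest, factor=3.0)
outliers = [q for q in sample if not inside_convex(q, fence)]
```

With this toy data the center of the square is the deepest sample point, and only the far-away observation falls outside the inflated polygon. A real bagplot would derive the bag from the depth contours of the data instead of taking it as given.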

Properties

The bagplot is invariant under affine transformations of the plane, and robust against outliers.
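The affine invariance can be checked numerically on the quantity the bagplot is built from: the Tukey depth of each observation should be unchanged when the same invertible affine map is applied to every point. The sketch below assumes a simple O(n²) depth routine and an arbitrary integer map chosen for illustration; both the function names and the map are hypothetical.

```python
def tukey_depth(p, points):
    """Smallest number of points in any closed half-plane that contains p."""
    px, py = p
    offsets = [(x - px, y - py) for x, y in points]
    at_p = sum(1 for d in offsets if d == (0, 0))
    others = [d for d in offsets if d != (0, 0)]
    best = len(others)
    for dx, dy in others:
        left = right = ahead = behind = 0
        for ex, ey in others:
            cross = dx * ey - dy * ex
            if cross > 0:
                left += 1
            elif cross < 0:
                right += 1
            elif dx * ex + dy * ey > 0:
                ahead += 1
            else:
                behind += 1
        best = min(best, left + ahead, left + behind,
                   right + ahead, right + behind)
    return at_p + best


def affine(pt):
    """Illustrative invertible affine map: matrix [[2, 1], [1, -3]] plus a shift."""
    x, y = pt
    return (2 * x + y + 5, x - 3 * y - 1)


sample = [(0, 0), (2, 0), (2, 2), (0, 2), (1, 1), (10, 10)]
mapped = [affine(p) for p in sample]

depth_before = [tukey_depth(p, sample) for p in sample]
depth_after = [tukey_depth(q, mapped) for q in mapped]
print(depth_before == depth_after)
```

Because half-planes map to half-planes under an invertible affine transformation, the two depth lists agree, so the depth ordering of the observations (and hence the depth median and bag) moves with the data.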
