UpSet plot
This article, UpSet plot, has recently been created via the Articles for creation process. Please check to see if the reviewer has accidentally left this template after accepting the draft and take appropriate action as necessary.
Reviewer tools: Inform author |
UpSet plots are a data visualization method for showing set data with more than three intersecting sets. UpSet shows intersections in a matrix, with the rows of the matrix corresponding to the sets, and the columns to the intersections between these sets (or vice versa). The size of the sets and of the intersections are shown as bar charts.

History
UpSet plots were first proposed in 2014 [1]. The first prototype was implemented as an interactive, web-based application[2]. UpSet plots are also related to Mosaic Plots, though Mosaic plots are designed for categorical instead of set data. UpSet plots became popular as they were re-implemented in various programming languages, such as R (based on Matplotlib)[3], Python, and others[4]. As of May 2022, UpSet R has been downloaded from C-RAN more than 1 million times[5]. UpSet plots are now frequently used instead of Venn diagrams, especially in life sciences [6].
Usage


UpSet plots visualize intersections between sets in a matrix. In a vertical UpSet plot, the columns of the matrix correspond to the sets, the rows correspond to the intersections. For each row, the cells that are part of an intersection are filled in. If there are multiple filled-in cells, they are connected with a line, to emphasize the reading direction of the plot. As sets vary in size, the size of the set is plotted as bar charts on top of the columns. The size of the intersections are shown aligned with the rows, also as bar charts. This layout facilitates the comparison between the sizes of individual intersections, as the size of the bars is easy to compare.
To answer specific questions, UpSet plots can be sorted in various ways. A common sorting approach, for example, is to sort by cardinality (the size of an intersection), which places the biggest intersections on top. Alternative sortings are by the degree of the intersection, or by sets.
UpSet plots can also be used to visualize attributes about the intersection, by placing attribute visualizations next to the bar charts[7] .
Advanced features of UpSet plots include querying, grouping and aggregating data[1]. These features tend to be available only in interactive, web-based implementations of UpSet[4].
Benefits and Limitations
UpSet plots tend to perform better than Venn diagrams for larger numbers of sets and when it is desirable to also show contextual information about the set intersections[8]. For visualizing diagrams with less than three sets, or when there are only few intersections, Venn and Euler diagram are generally preferred, because they tend to be more familiar and intuitive to read[9].
UpSet plots tend to be limited to 20-30 sets, though specific depends on the actual data [9]. An alternative for larger datasets is to show a co-occurence heat map, though these cannot show higher-order intersections[9].
References
- ^ a b Lex, Alexander; Gehlenborg, Nils; Strobelt, Hendrik; Vuillemot, Romain; Pfister, Hanspeter (31 December 2014). "UpSet: Visualization of Intersecting Sets". IEEE Transactions on Visualization and Computer Graphics. 20 (12): 1983–1992. doi:10.1109/TVCG.2014.2346248.
- ^ Lex, Alexander; Gehlenborg, Nils; Strobelt, Hendrik; Vuillemot, Romain; Pfister, Hanspeter. ""UpSet - Visualizing Intersecting Sets"".
- ^ Conway, Jake R; Lex, Alexander; Gehlenborg, Nils (15 September 2017). "UpSetR: an R package for the visualization of intersecting sets and their properties". Bioinformatics. 33 (18): 2938–2940. doi:10.1093/bioinformatics/btx364.
- ^ a b Lex, Alexander. "UpSet". upset.app. Retrieved 18 February 2022.
- ^ UpSet R Developers. "UpSetR". Retrieved 28 April 2022.
- ^ Gadhave, Kiran. ""UpSet 2: From Prototype to Tool"". Retrieved 28 April 2022.
- ^ Gu, Zuguang. "UpSet plot". ComplexHeatmap Complete Reference.
- ^ Laura, Ellis. "Set Analysis: A face off between Venn diagrams and UpSet plots". Little Miss Data. Retrieved 28 April 2022.
- ^ a b c Lex, Alexander; Gehlenborg, Nils (1 August 2014). "Sets and intersections". Nature Methods. 11 (8): 779–779. doi:10.1038/nmeth.3033.