Neha Patil (Editor)

NodeXL

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit
Initial release
  
July 2008 (2008-07)

Written in
  
C#, .NET

Development status
  
Released

Operating system
  
Windows

NodeXL

Developer(s)
  
Social Media Research Foundation

Stable release
  
1.0.1.238 / 8 April 2013; 3 years ago (2013-04-08)

NodeXL Basic is a free and open-source network analysis and visualization software package for Microsoft Excel 2007/2010/2013/2016. NodeXL Pro is a fee based fully featured version of NodeXL that includes access to social media network data importers, advanced network metrics, and automation. It is a popular package similar to other network visualization tools such as Pajek, UCINet, and Gephi.

Contents

Codebase

NodeXL is a set of prebuilt class libraries using a custom Windows Presentation Foundation control. Additional .NET assemblies can be developed as "plug-ins" to import data from outside data providers. Currently-implemented data providers for NodeXL include Facebook, Twitter, Wikipedia (the MediaWiki understructure), web hyperlinks, Microsoft Exchange Server.

Features

NodeXL is intended for users with little or no programming experience to allow them to collect, analyze, and visualize a variety of networks. NodeXL integrates into Microsoft Excel 2007, 2010, 2013 and 2016 and opens as a workbook with a variety of worksheets containing the elements of a graph structure such as edges and nodes. NodeXL can also import a variety of graph formats such as edgelists, adjacency matrices, GraphML, UCINet .dl, and Pajek .net.

Data Import

NodeXL imports UCINet and GraphML files, as well as Excel spreadsheets containing edge lists or adjacency matrices, into NodeXL workbooks. NodeXL also allows for quick collection of social media data via a set of import tools which can collect network data from e-mail, Twitter, YouTube, and Flickr. NodeXL requests the user's permission before collecting any personal data and focuses on the collection of publicly available data, such as Twitter statuses and follows relationships for users who have made their accounts public. These features allow NodeXL users to instantly get working on relevant social media data and integrate aspects of social media data collection and analysis into one tool.

Data Representation

NodeXL workbooks contain four worksheets: Edges, Vertices, Groups, and Overall Metrics. The relevant data about entities in the graph and relationships between them are located in the appropriate worksheet in row format. For example, the edges worksheet contains a minimum of two columns, and each row has a minimum of two elements corresponding to the two vertices that make up an edge in the graph. Graph metrics and edge and vertex visual properties appear as additional columns in the respective worksheets. This representation allows the user to leverage the Excel spreadsheet to quickly edit existing node properties and to generate new ones, for instance by applying Excel formulas to existing columns.

Graph Analysis

NodeXL contains a library of commonly used graph metrics: centrality, clustering coefficient, diameter. NodeXL differentiates between directed and undirected networks. NodeXL implements a variety of community detection algorithms to allow the user to automatically discover clusters in their social networks.

Graph Visualization

NodeXL generates an interactive canvas for visualizing graphs. The project allows users to pick from several well-known Force-directed graph drawing layout algorithms such as Fruchterman-Reingold and Harel-Koren. NodeXL allows the user to multi-select, drag and drop nodes on the canvas and to manually edit their visual properties (size, color, and opacity). In addition, NodeXL allows users to map the visual properties of nodes and edges to metrics it calculates, and in general to any column in the edges and vertices worksheet.

Research

NodeXL has been used by news outlets like Foreign Policy to visualize the structure of conversations about political topics as well as organizations like the World Bank to analyze voting data. NodeXL has been used as an analytical tool in dozens of research papers in the social, information, and computer sciences as well as the focus of research in human computer interaction, data mining, and data visualization.

Himelboim, I., McCreery, S., & Smith, M. (2013). Birds of a feather tweet together: Integrating network and content analyses to examine cross-ideology exposure on Twitter. Journal of Computer-Mediated Communication, 18(2), 40-60. DOI: 10.1111/jcc4.12001

Bonsignore, EM, Dunne, C, Rotman, D, Smith, M, Capone, T, Hansen, DL, Shneiderman, B (2009). First Steps to NetViz Nirvana: Evaluating Social Network Analysis with NodeXL. International Symposium on Social Intelligence and Networking (SIN09), Aug 29-31, Vancouver, Canada.

Smith, M, Shneiderman, B, Milic-Frayling, N, Rodrigues, E, Barash, V, Dunne, C, Capone, T, Perer, A, Gleave, E (April 2009). Analyzing (Social Media) Networks with NodeXL. Proc. Communities & Technologies Conference, Springer (June 2009).

Hansen, DL (2011) Exploring social media relationships, 43-51. In On the Horizon 19 (1).

Hansen, DL, Smith, MA, Shneiderman, B (2011) EventGraphs: Charting Collections of Conference Connections. In Forty-Fourth Annual Hawaii International Conference on System Sciences (HICSS). Also see EventGraph SlideShare Presentation

Hansen, DL, Shneiderman, B, Smith, MA (2010) Visualizing threaded conversation networks: mining message boards and email lists for actionable insights, 47-62. In Proc. Active Media Technology 2010, Lecture Notes in Computer Science 6335.

Hansen, D, Rotman, D, Bonsignore, E, Milic-Frayling, N, Rodrigues, E, Smith, M, Shneiderman, B . Do You Know the Way to SNA?: A Process Model for Analyzing and Visualizing Social Media Data. HCIL-2009-17 Tech Report.

References

NodeXL Wikipedia