SNAP: Network analysis and graph mining library for massive networks

Tag: memetracker, snap, Stanford Network Analysis Package

Stanford Network Analysis Package (SNAP) is a general purpose network analysis and graph mining library that is easily scales to massive networks, is efficient and easily extendible.

It naturally supports rich networks with complex data types associated with nodes and edges of the network. SNAP was developed in course of my PhD studies and was build on top of a general purpose STL (Standard Template Library)-like library GLib that was developed at Jozef Stefan Institute.

SNAP distinguishes between graphs and networks. Graphs describe raw topologies. That is nodes with unique integer ids and directed/undirected/multiple edges between the nodes of the graph. Networks are graphs with data on nodes and/or edges of the network. Data types that reside on nodes and edges are simply passed as template parameters which provides a very fast and convenient way to implement various kinds of networks with rich data on nodes and edges.

SNAP is based on node and edge iterators which allows for efficient implementation of algorithms that work on networks regardless of their type (directed, undirected, graphs, networks) and specific implementation.

For more information on SNAP refer to documentation

 

476 Million Twitter tweets

The 476 million twitter tweets cover a 7 month period from June 2009 to December 31 2009. The SNAP team estimated that this was about 20-30% of all public tweets published on Twitter during the particular frame.

Take a look at them here.

MemeTracker

MemeTracker is an approach for extracting short textual phrases from web documents (news articles and blog posts) and then tracking how such prases spread over the Web and how they change and evolve as they spread.

Microsoft Instant Messenger network dataset

The dataset contains summary properties of 30 billion conversations among 240 million people. From the data, SNAP constructed a communication graph with 180 million nodes and 1.3 billion undirected edges, creating the largest social network constructed and analyzed to date

Read full report here

Some papers published by SNAP:

Supervised Random Walks: Predicting and Recommending Links in Social Networks
L. Backstrom, J. Leskovec.
ACM International Conference on Web Search and Data Minig (WSDM), 2011.

Correcting for Missing Data in Information Cascades
E. Sadikov, M. Medina, J. Leskovec, H. Garcia-Molina.
ACM International Conference on Web Search and Data Minig (WSDM), 2011.

Patterns of Temporal Variation in Online Media
J. Yang, J. Leskovec.
ACM International Conference on Web Search and Data Minig (WSDM), 2011.
 

The latest news from SNAP:

  • Aug 21 2011: Tutorial on Social Media Analytics. More info.
  • Apr 17 2011: SNAP now computes k-core decomposition and analyzes information cascades.
  • Oct 21 2010: New datasets: 476 million tweets, memetracker data, Amazon product data.
  • Oct 1 2010: SNAP now also computes network motifs and overlapping network communities.
  • Jul 10 2010: SNAP networks are now part of the University of Florida Sparse Matrix collection.
  • April 25 2010: SNAP is now available through the NodeXL which is a graphical front-end that integrates network analysis and SNAP into Microsoft Office and Excel. Using NodeXL, users without programming skills can make use of key elements of the SNAP library.
  • November 25 2009: more datasets added, new version of the library.
  • June 28 2009: website launched with 36 network datasets and SNAP network analysis library.

Go to their website 
http://snap.stanford.edu/about.html


Comments

Post new comment

The content of this field is kept private and will not be shown publicly.
CAPTCHA
This is to prevent spammers. Authenticated users can skip the CAPTCHA security measure.
Image CAPTCHA
Enter the characters shown in the image.

Who's new

  • ciberprov
  • michael.nguyen
  • mornjinfeng
  • aniketdaptardar
  • hadriker
  • Alanw

Security vids #1

Team Cymru Research NFP is a specialized Internet security research firm and 501(c)3 non-profit dedicated to making the Internet more secure. Team Cymru helps organizations identify and eradicate problems in their networks, providing insight that improves lives.

Team Cymru the video series 1 to 10
Team Cymru the video series 11 to 20
Team Cymru the video series 21 to 30
Team Cymru the video series 31 to 40
Team Cymru the video series 41 to 50
Team Cymru the video series 51 to 60

CWZ Books

Who's online

There are currently 0 users and 19 guests online.

Security vids #2

The Center for Education and Research in Information Assurance and Security (CERIAS) is currently viewed as one of the world’s leading centers for research and education in areas of information security that are crucial to the protection of critical computing and communication infrastructure.

CERIAS is unique among such national centers in its multidisciplinary approach to the problems, ranging from purely technical issues (e.g., intrusion detection, network security, etc) to ethical, legal, educational, communicational, linguistic, and economic issues, and the subtle interactions and dependencies among them.

CERIAS Security: Attribute-Based Access Control
CERIAS Security: Information Flow Analysis in Security Enhanced Linux
CERIAS Security: Towards Mining Syslog Data
Weapons of Mass Disruption Gallery Launch: Reitinger Remarks
Weapons of Mass Disruption: Mike McConnell on The Nightmare Scenario



#Cyberwar