There are many use cases for graph databases and analytics

Business users are becoming more comfortable with graph analytics.

GraphLab graphThe rise of sensors and connected devices will lead to applications that draw from network/graph data management and analytics. As the number of devices surpasses the number of people — Cisco estimates 50 billion connected devices by 2020 — one can imagine applications that depend on data stored in graphs with many more nodes and edges than the ones currently maintained by social media companies.

This means that researchers and companies will need to produce real-time tools and techniques that scale to much larger graphs (measured in terms of nodes & edges). I previously listed tools for tapping into graph data, and I continue to track improvements in accessibility, scalability, and performance. For example, at the just-concluded Spark Summit, it was apparent that GraphX remains a high-priority project within the Spark1 ecosystem.

Another reason to be optimistic is that tools for graph data are getting tested in many different settings. It’s true that social media applications remain natural users of graph databases and analytics. But there are a growing number of applications outside the “social” realm. In his recent Strata Santa Clara talk and book, Neo Technology’s founder and CEO Emil Eifrem listed other uses cases for graph databases and analytics:

  • Network impact analysis (including root cause analysis in data centers)
  • Route finding (going from point A to point B)
  • Recommendations
  • Logistics
  • Authorization and access control
  • Fraud detection
  • Investment management and finance (including securities and debt)

The widening number of applications means that business users are becoming more comfortable with graph analytics. In some domains network science dashboards are beginning to appear. More recently, analytic tools like GraphLab Create make it easier to unlock and build applications with graph2 data. Various applications that build upon graph search/traversal are becoming common, and users are beginning to be comfortable with notions like “centrality” and “community structure”.

A quick way to immerse yourself in the graph analysis space is to attend the third GraphLab conference in San Francisco — a showcase of the best tools3 for graph data management, visualization, and analytics, as well as interesting use cases. For instance, MusicGraph will be on hand to give an overview of their massive graph database from the music industry, Ravel Law will demonstrate how they leverage graph tools and analytics to improve search for the legal profession, and Lumiata is assembling a database to help improve medical science using evidence-based tools powered by graph analytics.

Graphistry graphInteractive analyzer of Uber trips across San Francisco’s micro-communities

(Note: If you’re interested in attending the GraphLab conference, use the following 20% discount code.)


1. Full disclosure: I am an advisor to Databricks — a startup commercializing Apache Spark.

2. As I noted in a previous post, GraphLab has been extended to handle general machine learning problems (not just graphs).

3. Exhibitors at the GraphLab conference will include creators of several major graph databases, visualization tools, and Python tools for data scientists.


Related Content:

tags: , , , , ,

Get the O’Reilly Data Newsletter

Stay informed. Receive weekly insight from industry insiders.

  • http://linkurio.us Jean Villedieu

    For those who are interested in learning more about use cases for graph technologies, I recommend :
    – GraphGist (gist.neo4j.org) : a community driven project where people share their graph stories

    – Linkurious (linkurio.us/blog) : Linkurious is a graph visualization startup that shares use cases for graph technologies on its blog

  • http://www.noaddiction-powder.in Naina Katyal

    This was a really outstanding post. In theory I’d like to compose like this too – taking time and real effort to make a solid article.
    Step up height increaser
    Step up Height increaser

  • Tim M

    The best network analysis and visualisation toolkit I’ve seen is KeyLines http://keylines.com/
    Comes from a team who spent years building products specifically for law enforcement & fraud investigation – seems this lifts what they’ve learnt into a product that can be more easily applied to many more scenarios.

    (Disclosure: I don’t work for KeyLines but did work with some of the staff at the aforementioned company before they struck out on their own)

  • Don Kittle

    We’re using OrientDB (http://www.orientechnologies.com) as Graph Database for a couple of projects and I can definitely suggest it: commercial friendly license and the ability to work as document database make it the almost perfect database, the most complete for sure.