visualizing doc graph in gephi june 2013

18
Analyzing DocGraph in Gephi Janos G. Hajagos Stony Brook School of Medicine 1 NYC Open Data Meetup June 24, 2013

Upload: vivian-s-zhang

Post on 26-Jan-2015

106 views

Category:

Education


0 download

DESCRIPTION

NYC Data Science Academy, NYC Open Data Meetup, Big Data, Data Science, NYC, Vivian Zhang, SupStat Inc,NYC, Gephi, Stony Brook Medicine school, Janos Hajagos, NPPES

TRANSCRIPT

Page 1: Visualizing doc graph in gephi june 2013

Analyzing DocGraph in Gephi

Janos G. Hajagos Stony Brook School of Medicine

1

NYC Open Data Meetup June 24, 2013

Page 2: Visualizing doc graph in gephi june 2013

DocGraph

• Based on FOIA request to CMS by Fred Trotter • Medicare providers (more than doctors) • CY 2011 date of service • Share 10 or more patients in a 30 day forward

window • Initial access restricted to MedStartr funders

but as of June 2013 open access

2

Page 3: Visualizing doc graph in gephi june 2013

Geographic Visualization

http://isurfsoftware.com/blog/2012/12/13/visualizing-geographic-connections-between-us-doctors/

3

Page 4: Visualizing doc graph in gephi june 2013

DocGraph by the numbers

• Directed graph • Average total degree 52.8 • 940,492 providers (graph nodes/vertices) • 49,685,810 shared edges

4

Page 5: Visualizing doc graph in gephi june 2013

DocGraph Data

5

Page 6: Visualizing doc graph in gephi june 2013

6

Page 7: Visualizing doc graph in gephi june 2013

NPPES

• National Plan and Provider Enumeration System

• Source of NPI (National Provider Identifier) • Information is entered and updated by

provider • CSV file with 314 columns • MySQL load script generated by Python script

to normalize database

7

Page 8: Visualizing doc graph in gephi june 2013

Selecting a sub-graph

8

Page 9: Visualizing doc graph in gephi june 2013

Core nodes

9

Page 10: Visualizing doc graph in gephi june 2013

Leaf nodes

10

Page 11: Visualizing doc graph in gephi june 2013

Core-to-core edges

11

Page 12: Visualizing doc graph in gephi june 2013

Core-to-leaf edges

12

Page 13: Visualizing doc graph in gephi june 2013

Leaf-to-leaf edges

13

Page 14: Visualizing doc graph in gephi june 2013

Generating GraphML

• XML based file format for graphs • Readable by a large number of tools

– Gephi – Mathematica – igraph (R)

• NetworkX Python library for graphs can easily export to GraphML

14

Page 15: Visualizing doc graph in gephi june 2013

15

Page 16: Visualizing doc graph in gephi june 2013

16

Gephi

Page 17: Visualizing doc graph in gephi june 2013

Subset defined from 2 Brooklyn zip codes (11215 - Park Slope & 11212 - Brownsville)

17

Page 18: Visualizing doc graph in gephi june 2013

Links

http://strata.oreilly.com/2012/11/docgraph-open-social-doctor-data.html (information) https://github.com/jhajagos/DocGraph (code) https://github.com/ftrotter/DocGraph (data) https://groups.google.com/forum/#!forum/docgraph (mailing list) http://bit.ly/1459NXn (sample Brooklyn GraphML file) http://strataconf.com/rx2013/public/schedule/detail/29840 (StrataRX workshop with Fred Trotter)

18