okram edited this page May 13, 2011 · 10 revisions


Enron Corporation was an American energy, commodities, and services company based in Houston, Texas. Before its bankruptcy in late 2001, Enron employed approximately 22,000 staff and was one of the world’s leading electricity, natural gas, communications, and pulp and paper companies, with claimed revenues of nearly $101 billion in 2000.[1] Fortune named Enron “America’s Most Innovative Company” for six consecutive years. At the end of 2001, it was revealed that its reported financial condition was sustained substantially by institutionalized, systematic, and creatively planned accounting fraud, known as the “Enron scandal”.

Loading the Dataset

Download at Infochimps

// open (or create) a Neo4j graph at /data/enron, then load the GraphML dump into it
g = new Neo4jGraph('/data/enron')
GraphMLReader.inputGraph(g, new FileInputStream('/data/Enron_Dataset_v0.12.graphml'))

Graph Schema

Gremlin Examples

Basic Statistics

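Who sent the most emails?

// count the outgoing SENT edges of every email address
m = [:]
g.idx(T.v)[[type:'Email Address']].sideEffect{m.put(it['address'], it.outE('SENT').count())}
// ascending sort: the most active addresses are at the end
m.sort{a,b -> a.value <=> b.value}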
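The sort above is ascending, so the largest counts end up at the bottom of the console output. A minimal plain-Groovy sketch of a descending top-N listing follows. The `counts` map and its addresses are made-up stand-ins for `m`, not dataset results, and `take` assumes a reasonably recent Groovy.

```groovy
// Toy stand-in for the address -> count map built above (values are made up).
def counts = ['a@example.com': 3, 'b@example.com': 12, 'c@example.com': 7]

// Sort entries by count, largest first, and keep the top two.
def top = counts.sort { a, b -> b.value <=> a.value }.take(2)

// Print one "address<TAB>count" line per entry.
top.each { address, n -> println "${address}\t${n}" }
```

In the Gremlin console the same idea would presumably be `m.sort{a,b -> b.value <=> a.value}` followed by a `take` on the result.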

Who is Jeff Dasovich?

v = g.idx(T.v)[[address:'jeff.dasovich@enron.com']] >> 1

Who does Jeff Dasovich receive emails from?

// reset m so the counts from the previous query do not leak into this one
m = [:]
v.in('RECEIVED_BY').in('SENT').in('USED_EMAIL_ADDRESS').transform{it.firstName + ' ' + it.lastName}.groupCount(m)
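For readers new to `groupCount(m)`: it bumps a counter in `m` for every object that flows through the step. A rough plain-Groovy analogue, assuming a hypothetical list of sender names (placeholders, not real dataset output), might look like this:

```groovy
// Hypothetical sender names, one entry per received email (not dataset results).
def senders = ['Alice Smith', 'Bob Jones', 'Alice Smith', 'Carol Lee', 'Alice Smith']

// Emulate groupCount(m): increment a counter for every name that passes by.
def m = [:]
senders.each { name -> m[name] = (m[name] ?: 0) + 1 }

println m
```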

Who is Christopher Calger?

// look him up by last name, then list the people he directly reported to
v = g.idx(T.v)[[lastName:'Calger']] >> 1
v.out('DIRECTLY_REPORTED_TO').transform{it.firstName + ' ' + it.lastName}

Who is Louise Kitchen?

Email Centrality

Who is the smartest guy in the room?

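// c bounds how many times objects are sent back around the loop; filter{false} suppresses console output
m = [:]; c = 0;
g.idx(T.v)[[type:'Person']].out('USED_EMAIL_ADDRESS').out('SENT').out('RECEIVED_BY').in('USED_EMAIL_ADDRESS').groupCount(m).loop(5){c++ < 1000}.filter{false}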
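The traversal above repeatedly hops from a person to the people they emailed and counts every arrival, so people who are reached often rank as central. A simplified plain-Groovy sketch of that idea on a toy graph follows. The `emailed` adjacency list and its names are hypothetical, and it runs a fixed number of rounds rather than reproducing the `c++ < 1000` bound used above.

```groovy
// Hypothetical "who emailed whom" adjacency list; one entry per email sent.
def emailed = [
    ann: ['bob', 'cat'],
    bob: ['cat'],
    cat: ['ann']
]

def m = [:]                               // person -> number of times reached
def frontier = emailed.keySet().toList()  // start from every person

3.times {
    // Follow every sent email one hop, then record each arrival.
    frontier = frontier.collectMany { p -> emailed[p] ?: [] }
    frontier.each { p -> m[p] = (m[p] ?: 0) + 1 }
}

// Most frequently reached people first.
println m.sort { a, b -> b.value <=> a.value }
```

Fixed rounds keep the sketch easy to follow; the console version instead caps the total number of loop-backs with a shared counter.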

I want names!

m = [:]; c = 0;
g.idx(T.v)[[type:'Person']].out('USED_EMAIL_ADDRESS').out('SENT').out('RECEIVED_BY').in('USED_EMAIL_ADDRESS').transform{it['firstName'] + ' ' + it['lastName']}.groupCount(m).back(2).loop(5){c++ < 1000}.filter{false}

Mark Taylor was Enron's head attorney. Looks like Enron had some trouble with the law.