history of data
TRANSCRIPT
History of the Info: Part II
Nick DucoffCEO and Co-Founder, Infochimps
Early 2000s
Mid 2000s
Present Day
3000 BC
Recording
3000 BC 1200 BC
Recording
Aggregating
3000 BC 1200 BC 300 BC
Recording
Aggregating
Storing at Scale
300s AD – Random Access
3000 BC 1200 BC 300 BC 300 AD
Recording
Aggregating
Storing at Scale
Random Access
3000 BC 1200 BC 300 BC 300 AD 1400 AD
Recording
Aggregating
Storing at Scale
Random Access
Mass Distribution
3000 BC 1200 BC 300 BC 300 AD 1400 AD 1700 AD
Recording
Aggregating
Storing at Scale
Random Access
Mass Distribution
Infographics
3000 BC 1200 BC 300 BC 300 AD 1400 AD 1700 AD
Recording
Aggregating
Storing at Scale
Random Access
Mass Distribution
Infographics
1930s – Computation theory (Turing)1940s – Information theory (Shannon)1950s – Computer languages (1GL,2GL,3GL)1960s – Standardized metadata (Avram)1970s – Relational databases (IBM)1980s – WWW (Al Gore )
1990s – Internet archive (Kahle)
Tables on web pages
Open APIs
Commercialdata sources
Name ZIP Average Rent
Walter Cureton 78701 $400-$599
Ivy Caldwell 94103 >$1500
Regina Wootton 10027 $1000-$1499
Name Address City ZIP
Brian James 901 Red River Austin 78701
Terri Becraft 262 7th St. San Francisco 94103
Paz Brummit 603 W. 114th St. New York 10027
Name Address Normalized Address
Cecil Bartz 901 red river austin texas901 Red River, Austin, TX 78701
Genaro Luz 702 w. 32nd st austin702 W. 32nd St., Austin, TX 78705
Ruth Brown 114th + broadway, nycW. 114th St. & Broadway, New York, NY 10027
Augmentation
Completion
Normalization