webscraping with python - wordpress.com · 2015-11-04 · webscraping with python michael babington...

14
Motivation Example Summary Webscraping With Python Michael Babington Christopher Clapp James Freeland Department of Economics Florida State University Strozier Library, 2015 Babington, Clapp, Freeland Webscraping

Upload: others

Post on 11-May-2020

28 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Webscraping With Python - WordPress.com · 2015-11-04 · Webscraping With Python Michael Babington Christopher Clapp James Freeland Department of Economics Florida State University

MotivationExample

Summary

Webscraping With Python

Michael Babington Christopher Clapp James Freeland

Department of EconomicsFlorida State University

Strozier Library, 2015

Babington, Clapp, Freeland Webscraping

Page 2: Webscraping With Python - WordPress.com · 2015-11-04 · Webscraping With Python Michael Babington Christopher Clapp James Freeland Department of Economics Florida State University

MotivationExample

SummaryWhy Webscrape?

Uses Of WebscrapingData Gathering

Source: www.phdcomics.com

Webscraping is used to extract information from websitesIt has been used to collect data on everything from airlineseat price and availability to journal article citations

Babington, Clapp, Freeland Webscraping

Page 3: Webscraping With Python - WordPress.com · 2015-11-04 · Webscraping With Python Michael Babington Christopher Clapp James Freeland Department of Economics Florida State University

MotivationExample

SummaryWhy Webscrape?

Uses of Webscraping

The basic method is to find patterns in the HTML code,then use a Python program to methodically extract the datayou want

Source: www.python.org

Our example will collect data on FSU Football statistics

Babington, Clapp, Freeland Webscraping

Page 4: Webscraping With Python - WordPress.com · 2015-11-04 · Webscraping With Python Michael Babington Christopher Clapp James Freeland Department of Economics Florida State University

MotivationExample

Summary

Get A Feel For The DataCoding Outline

Webscrape FSU Football Statistics (seminoles.com)

Babington, Clapp, Freeland Webscraping

Page 5: Webscraping With Python - WordPress.com · 2015-11-04 · Webscraping With Python Michael Babington Christopher Clapp James Freeland Department of Economics Florida State University

MotivationExample

Summary

Get A Feel For The DataCoding Outline

Webscrape FSU Football Statistics (seminoles.com)

Babington, Clapp, Freeland Webscraping

Page 6: Webscraping With Python - WordPress.com · 2015-11-04 · Webscraping With Python Michael Babington Christopher Clapp James Freeland Department of Economics Florida State University

MotivationExample

Summary

Get A Feel For The DataCoding Outline

Webscrape FSU Football Statistics (seminoles.com)

Babington, Clapp, Freeland Webscraping

Page 7: Webscraping With Python - WordPress.com · 2015-11-04 · Webscraping With Python Michael Babington Christopher Clapp James Freeland Department of Economics Florida State University

MotivationExample

Summary

Get A Feel For The DataCoding Outline

Webscrape FSU Football Statistics (seminoles.com)

Babington, Clapp, Freeland Webscraping

Page 8: Webscraping With Python - WordPress.com · 2015-11-04 · Webscraping With Python Michael Babington Christopher Clapp James Freeland Department of Economics Florida State University

MotivationExample

Summary

Get A Feel For The DataCoding Outline

Webscrape FSU Football Statistics (seminoles.com)

Babington, Clapp, Freeland Webscraping

Page 9: Webscraping With Python - WordPress.com · 2015-11-04 · Webscraping With Python Michael Babington Christopher Clapp James Freeland Department of Economics Florida State University

MotivationExample

Summary

Get A Feel For The DataCoding Outline

Webscrape FSU Football Statistics (seminoles.com)

Babington, Clapp, Freeland Webscraping

Page 10: Webscraping With Python - WordPress.com · 2015-11-04 · Webscraping With Python Michael Babington Christopher Clapp James Freeland Department of Economics Florida State University

MotivationExample

Summary

Get A Feel For The DataCoding Outline

Webscrape FSU Football Statistics (seminoles.com)

Babington, Clapp, Freeland Webscraping

Page 11: Webscraping With Python - WordPress.com · 2015-11-04 · Webscraping With Python Michael Babington Christopher Clapp James Freeland Department of Economics Florida State University

MotivationExample

Summary

Get A Feel For The DataCoding Outline

Coding Outline

Program OutlineUse requests to import HTML code into PythonUse Beautiful Soup to make HTML code “readable”PythonPoint Python to the location of the data you wantLoop over the data to get it into a usable form

Full code will be available online

Babington, Clapp, Freeland Webscraping

Page 12: Webscraping With Python - WordPress.com · 2015-11-04 · Webscraping With Python Michael Babington Christopher Clapp James Freeland Department of Economics Florida State University

MotivationExample

Summary

Get A Feel For The DataCoding Outline

Results

Babington, Clapp, Freeland Webscraping

Page 13: Webscraping With Python - WordPress.com · 2015-11-04 · Webscraping With Python Michael Babington Christopher Clapp James Freeland Department of Economics Florida State University

MotivationExample

SummaryResources

Contact Information And Resources

Contact InformationMichael Babington: [email protected]

James Freeland: [email protected]

ResourcesYoutube tutorial using Yellow Pages

Beautiful Soup Documentation

Download Python

Babington, Clapp, Freeland Webscraping

Page 14: Webscraping With Python - WordPress.com · 2015-11-04 · Webscraping With Python Michael Babington Christopher Clapp James Freeland Department of Economics Florida State University

MotivationExample

SummaryResources

Conclusion

Webscraping is a useful research tool

It gives you access to new and exciting data

Presentation and Code will be available athttp://chrisclapp.org/teaching/

Babington, Clapp, Freeland Webscraping