libby peterek, m.s.info.st. division of instructional innovation and assessment
DESCRIPTION
Maximizing Online Information Retrieval: How Theological Librarians Can Best Access the Gnostic Areas of the Internet. Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment The University of Texas at Austin. Surface v. Deep Web. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/1.jpg)
Maximizing Online Information Retrieval: How Theological Librarians Can Best Access the Gnostic Areas of the Internet
Libby Peterek, M.S.Info.St.Division of Instructional Innovation and AssessmentThe University of Texas at Austin
![Page 2: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/2.jpg)
Surface v. Deep Web
• Surface Web - estimated between 1% and 20% of Internet
• Deep Web - content commercial search engines can’t reach (i.e., Google and Yahoo)– Unindexed
![Page 3: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/3.jpg)
Unindexed Web content
• Databases / dynamically generated content
• File types (Flash, php, etc.)• Institution sites• “Gated” content
– Require password / registration
![Page 4: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/4.jpg)
Theological Librarianship
• Underserved user group• Specialized content
– Hidden– Database driven– Newly added
• Potential to add richness to research
![Page 5: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/5.jpg)
Mining the Deep Web
• Deep Web search engines• Federated searching• RSS
![Page 6: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/6.jpg)
Deep Web Search Engines
• Look like commercial engines• Utilize different algorithms• Vary in quality and result relevance• Many free, growing number fee-based
and subscription-based– You get what you pay for…
![Page 7: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/7.jpg)
Deep Web Search Engines
• http://www.invisible-web.net• http://www.dipsie.com/ (later this year)• http://www.brightplanet.com
– The leader and most expensive– Mainly competitive intelligence
• http://www.profusion.com/
![Page 8: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/8.jpg)
Deep Web Search Engines
![Page 9: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/9.jpg)
Deep Web Search Engines
![Page 10: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/10.jpg)
Deep Web Issues
• Deep Web search engines underdeveloped
• Many of the same issues as commercial engines– Wait for search engines to improve?
• Federated Searching• RSS
![Page 11: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/11.jpg)
Federated Searching
• Programs written to connect catalogs and databases
• No need for same code• Specialized search
– Access to different information– Aggregated based on user preference– One simple interface
![Page 12: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/12.jpg)
Federated Searching
• Theological library databases, listservs, and indexes– Different form of content management– Access to all the tools available
![Page 13: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/13.jpg)
Sherlock
![Page 14: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/14.jpg)
Indeed
![Page 15: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/15.jpg)
Library Use
• New York State Library Pilot Project• http://www.nysl.nysed.gov/library/novel/pilot/
• University of Toronto & British Columbia– Endeavor ENCompass
• http://www.endinfosys.com/
• Library of Congress vendor list• http://www.loc.gov/catdir/lcpaig/portalproducts.
html
![Page 16: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/16.jpg)
Federated Searching Issues
• Need access to databases– Owned or agreed
• Can be expensive– Divide cost among interested parties or
content holders
![Page 17: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/17.jpg)
RSS
• Really Simple Syndication• Rich Site Summary• RDF Site Summary
• Comparable to personalized library “alerts”
![Page 18: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/18.jpg)
RSS
• Application of eXtensible Markup Language (XML), using W3C’s Resource Description Framework (RDF)
• What does this mean?– Metadata meets hyperlinks– Automates tasks
![Page 19: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/19.jpg)
How RSS is used
• Feeds combine metadata and links– “Syndicate (XML)” or
• Typical sites with RSS– News– Blogs
• Explosion of “bloggers” opens arena for valid material from a wide user base and links to relevant resources
![Page 20: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/20.jpg)
UT & RSS
• UT Austin strongly considering campus-wide blogging initiative– Content management– Content sharing– Archive– RSS
![Page 21: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/21.jpg)
Aggregating RSS Feeds
• Browsers– Mozilla Firefox (Mac & PC)– Safari (Mac)
• Aggregators / News Readers (full list)– NetNewsWire Lite (Mac)
• Email• Web
![Page 22: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/22.jpg)
NetNewsWire Lite
![Page 23: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/23.jpg)
How it works
• Library Jobs RSS feed from Chronicle of Higher Education– Blog– Organization site
• Elf– Library borrower RSS
![Page 24: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/24.jpg)
Feedster
• RSS search engine• Generates a unique RSS feed for each
search to copy to an aggregator• Notifications each time your subject is
updated• The better your search terms, the better
your results
![Page 25: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/25.jpg)
Feedster
![Page 26: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/26.jpg)
Feedster
![Page 27: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/27.jpg)
Why RSS at your library?
• Two-way information exchange– Information retrieval and dissemination
• For patrons and librarians– Filter information overload
• You designate the boundaries– Time sensitive
• Be notified first when something is posted in your area of interest
![Page 28: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/28.jpg)
Online Content / Search Issues
• Information creation and migration speeds• Standards - or lack thereof• Competition v. collaboration
![Page 29: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/29.jpg)
Looking forward
• Deep Web diminishing– XML– Commercial search engines
• Sophistication• File types
– Internet publishing increasing• More care about pages being indexed• Links
![Page 30: Libby Peterek, M.S.Info.St. Division of Instructional Innovation and Assessment](https://reader036.vdocument.in/reader036/viewer/2022070502/568145c7550346895db2d032/html5/thumbnails/30.jpg)
SourcesBergman, M. 2001. The Deep Web: Surfacing Hidden Value. Journal of Electronic
Publishing. http://www.press.umich.edu/jep/07-01/bergman.htmlBrightPlanet. Deep Web FAQs.
http://www.brightplanet.com/deepcontent/deep_web_faq.aspDevine, J. and Egger-Sider, F. 2004. Beyond Google:The Invisible Web in the
Academic Library. The Journal of Academic Librarianship. 30(4), 265-269.Olsen, S. 2004. Yahoo crawls deep into the Web. http://news.com.com/2100-1024-
5167931.html Smith, C. Invisible Web. http://www.libraryspot.com/features/invisibleweb.htmWired. 2005. Surfing the Deep Web.
http://www.wired.com/news/business/0,1367,67883,00.htmlUniversity at Albany. 2005. The Deep Web.
http://library.albany.edu/internet/deepweb.htmlWebster, P. 2004. Breaking Down Information Silos. Online. 30-34.Wright, A. 2004. In Search of the Deep Web. Salon.
http://www.salon.com/tech/feature/2004/03/09/deep_web/index_np.html