![Page 1: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/1.jpg)
The future of search: How to stay relevant in Sourcing
Greg Lindahl CTO, Blekko
October 13, 2011 - SourceCon
![Page 2: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/2.jpg)
A li@le about me
• Technologist, not a sourcer
• I did get sourced once, in 1995
• I’m proud to have the ugliest slides at the conference!
![Page 3: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/3.jpg)
A li@le about you
• You guys are heavy, sophisJcated users of Google & specialized engines like Topsy
• You’re eager to learn about new things
• You quickly form opinions about what’s useful
![Page 4: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/4.jpg)
Challenges in Sourcing
• The order of search results is based on incoming links & Page Rank
• You guys are heavy users of advanced features such as boolean search
• Social networks are “walled gardens”
![Page 5: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/5.jpg)
PageRank Today
![Page 6: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/6.jpg)
Advanced interfaces
• No one uses the advanced search interface of Google
• You’re lucky that it hasn’t disappeared!
• New Google algorithms trying to guess your intent are probably a net minus for Sourcers
![Page 7: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/7.jpg)
Social Networks
• Facebook is mostly a “walled garden” -‐ many users don’t want their personal info to be public
• Facebook mixes work and play • LinkedIn is a pure play, but younger people don’t use it
• Twi@er is open, but a mixture of work and play
![Page 8: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/8.jpg)
Search implicaJons of social
• General search engines won’t be good enough, now or in the future
• It will remain hard to try to match up candidates with several social accounts plus a web presence
![Page 9: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/9.jpg)
Things you should know: BiGrams
• Google counts: – java programmer OR developer: 80 million – “java programmer” OR “java devloper”: 23 million
• Why? – Everyone indexes pairs and triples of words which are thought to be related • Names, job Jtles, common word pairs
![Page 10: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/10.jpg)
Future search direcJons
• “semanJc search”
• 2 parts: – understanding the source documents be@er • bigrams of names just a start
– understanding your query intent be@er
• This will hurt advanced search and boolean queries!
![Page 11: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/11.jpg)
Future search direcJons
• “real-‐Jme” search: twi@er and non-‐twi@er
• Non-‐twi@er real-‐Jme incorporated directly into the major search engines
• Twi@er search best in specialized engines
![Page 12: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/12.jpg)
New market entrants
• blekko
• yandex
• duckduckgo
![Page 13: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/13.jpg)
slash the web "
![Page 14: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/14.jpg)
Silicon Valley handshake
• blekko was founded in 2007
• $55mm in financing, 29 employees
• Backers: USVP, CMEA Ventures, Yandex (strategic), Ron Conway, Marc Andreesen, …, Ashton Kutcher
![Page 15: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/15.jpg)
Curated Search – No Spam, High Quality
![Page 16: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/16.jpg)
ü Wikipedia model: Users idenJfy top sites for every category
ü Technology: blekko uses social data plus algorithms to make more relevant, spam-‐free search results
Algorithms + People = Better Search"
![Page 17: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/17.jpg)
Slashtag basics
• Both algorthmic and human-‐curated
• Curated slashtags developed in conjuncJon with outside partners, such as Stack Overflow
• Type ‘em directly into the search box: Greg Lindahl /date
![Page 18: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/18.jpg)
Slashtags
• Sort order: /date, /relevance • Narrow your search – Algorthmic: /forum /blog /gov /edu – Site: /foxnews.com – Curated list of websites: /health
• Human-‐edited by groups (think dmoz or wikipedia)
• Every user has their own namespace, plus there’s a namespace for /blekko/ tags
![Page 19: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/19.jpg)
Tips:
• Use /findslashtags to find tags: python /findslashags
• Use /web to get rid of any unwanted autoslashtags
• Watch the suggesJons for slashtag suggesJons as you type
• We have some API outcalls that might save you Jme: /twi@er = Twi@er search API, /video = YouTube, /imges = bing, /bing, /google
![Page 20: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/20.jpg)
Advanced Slashtags
• Can be negated: -‐/foxnews.com • Implicit -‐/spam on every searce – and you can add any result to it with 1 click – use this to get rid of all those “people finders”
• Can use mulJple tags to intersect /linux /blogs /date
• Can include tags in other tags to “OR” them
![Page 21: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/21.jpg)
Sourcer Slashtags
• /people -‐-‐ algorithmic a@empt to find resumes, bios, etc
• /blogs and /forums, -‐/blogs and -‐/forums
• Develop a /spam slashtag, or maybe even several of them that you manually add to various searches
![Page 22: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/22.jpg)
Programming slashtags
• /open-‐source -‐-‐ aliases /oss /foss /floss • /linux, /lkml, /bsd, /windows • /repo -‐-‐ open source public repositories • /apache, /fsf -‐-‐ umbrella organizaJons • /perl, /cpan, /php, /javascript, /python, /django, /ruby, /rails, /java, /erlang, /scheme -‐-‐ languages
• /hpc, /make, /hacker, /hakerspaces
![Page 23: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/23.jpg)
A few examples
• Java programers with high performance compuJng experience: java hpc /people
• Followup on a candidate “marcus wa@s” java /blogs “marcus wa@s” java -‐/blogs “marcus wa@s” java -‐/blogs /date “marcus wa@s” java -‐/blogs /date=2008-‐2009
• Ok, more “marcus wa@s” /twi@er “marcus wa@s” /youtube -‐-‐ oops, basketball guy
![Page 24: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/24.jpg)
The bad stuff
• blekko’s crawl is fairly small, 2 billion pages today – increasing this Fall
• Some of the programming slashtags aren’t as good as others – this will improve over Jme
![Page 25: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/25.jpg)
For more info
• help.blekko.com – Add blekko to the list on the upper right search box
• blekko toolbar
![Page 26: The Future of Search: How to Stay Relevant in Sourcing](https://reader034.vdocument.in/reader034/viewer/2022051608/5455dceab1af9f39378b4b3f/html5/thumbnails/26.jpg)
To Sum Up
• “Let me explain. No, there is too much. Let me sum up.”
• Search is evolving in good and bad ways • New tools pop up every year
• I would love to hear feedback: [email protected]