making sense out of things on the web

71

Upload: pradeep-varadaraja-banavara

Post on 20-Jan-2015

958 views

Category:

Technology


4 download

DESCRIPTION

 

TRANSCRIPT

Page 1: Making sense out of things on the web
Page 2: Making sense out of things on the web

MAKING SENSE OUT OF THINGS ON THE WEB@pradeepbv

Page 3: Making sense out of things on the web

3

We have been accumulating a lot of information

Page 4: Making sense out of things on the web

4http://en.wikipedia.org/wiki/File:Jingangjing.jpg

Page 5: Making sense out of things on the web

5

http://en.wikipedia.org/wiki/File:Printer_in_1568-ce.png

http://en.wikipedia.org/wiki/File:BuxheimStChristopher.jpg

Page 6: Making sense out of things on the web

6http://en.wikipedia.org/wiki/Odhecaton

Page 7: Making sense out of things on the web

7

http://upload.wikimedia.org/wikipedia/commons/f/f1/The_First_Telegraph.jpg

What hath God wrought

Page 8: Making sense out of things on the web

8

http://en.wikipedia.org/wiki/File:1891_Telegraph_Lines.jpg

1891 Telegraph Lines

Page 9: Making sense out of things on the web

9

Mr Watson—Come hereI want to see you

http://www.boerner.net/jboerner/?p=9396

Page 10: Making sense out of things on the web

10

radioRadio

Page 11: Making sense out of things on the web

11http://www.elon.edu/e-web/predictions/150/1930.xhtml

Page 12: Making sense out of things on the web

12

Page 13: Making sense out of things on the web

13

Page 14: Making sense out of things on the web

14

Page 15: Making sense out of things on the web

15

www

Page 16: Making sense out of things on the web

16http://en.wikipedia.org/wiki/File:NCSA_Mosaic.PNG

Page 17: Making sense out of things on the web

17

the Internet had an estimated 16 million users by 1995

Page 18: Making sense out of things on the web

18http://en.wikipedia.org/wiki/Venture_capital

Page 19: Making sense out of things on the web

19

People from all over the world started sharing their interests,

hopes and dreams online

Page 20: Making sense out of things on the web

20

Page 21: Making sense out of things on the web

21http://electrokami.com/wp-content/uploads/2010/09/the-internet-in-real-life.jpg

Page 22: Making sense out of things on the web

22

The number of devices connected to IP networks will be nearly three times as high as the global population in 2016

Page 23: Making sense out of things on the web

23

The Zettabyte Era

http://www.cisco.com/en/US/solutions/collateral/ns341/ns525/ns537/ns705/ns827/VNI_Hyperconnectivity_WP.html

kilomegateragigapitaexazettayotta9,444,732,965,739,290,427,392 bits (1024 exbibytes)

Page 24: Making sense out of things on the web

24

“Reports that say that something hasn't happened are always interesting to me, because as we know, there are known knowns; there are things we know we know. We also know there are known unknowns; that is to say we know there are some things we do not know. But there are also unknown unknowns – the ones we don't know we don't know.”

Donald Rumsfeld, US Defense Secretaryat a press conference at NATO Headquarters, Brussels, Belgium, June 6, 2002Image: planetization.org

Page 25: Making sense out of things on the web

25

Nicholas Carr worries that the flood of digital information is changing not only our habits, but even our mental capacities: Forced to scan and skim to keep up, we are losing our abilities to pay sustained attention, reflect deeply, or remember what we’ve learned.

Page 26: Making sense out of things on the web

26http://blogs.tusc.k12.al.us/bhslibrary/files/2012/01/Information_overload.jpg

Information overload?

Page 27: Making sense out of things on the web

27http://www.teachersdiary.com/.a/6a0115703931fc970c0128765537ba970c-800wi

DO YOU KNOW WHAT ARE YOU LOOKING FOR?

Page 28: Making sense out of things on the web

28http://www.flickr.com/photos/special/1597251/

DO YOU KNOW WHERE TO FIND WHAT YOU WANT?

Page 29: Making sense out of things on the web

29http://www.flickr.com/photos/sumrow/1267682594/sizes/l/

REGULAR SEARCH #FAIL?

Page 30: Making sense out of things on the web

30http://www.flickr.com/photos/sumrow/1267682594/sizes/l/

IS THERE A SUPERHEROWHO CAN HELP?

Page 31: Making sense out of things on the web

BUILD YOUR OWN SEARCH SERVICE

Yes, you are the superhero

Page 32: Making sense out of things on the web

BOSS IS BUILD YOUR OWN SEARCH SERVICE

http://developer.yahoo.com/search/boss/

Page 33: Making sense out of things on the web

BOSS PROVIDES APIS

TO OUR SEARCHDATA STORES

Page 34: Making sense out of things on the web

TO BUILD YOUR OWNPOWERFUL

SEARCH APPLICATIONS

Page 35: Making sense out of things on the web

BOSS allows you to search over

Web, images, news & Blogs

Page 36: Making sense out of things on the web

You can even monetize yourapplications using Search Ads from BOSS and get support.

Page 37: Making sense out of things on the web

What can be done on top of BOSS?• Blend and re-rank search results

• Your own look and feel

• Mix it with other APIs

Page 38: Making sense out of things on the web

BOSS Pricing

Page 39: Making sense out of things on the web

Free for building your hacks!!

Page 40: Making sense out of things on the web

Where do I start?

Page 41: Making sense out of things on the web

Restful XML and JSON API

Web

Image

Spelling

News

Search Ads

What’s in it?

http//www.flickr.com/photos/joeshlabotnik/419914250/sizes/o/in/photostream/.jpg

Page 42: Making sense out of things on the web

Oauth based Autentication

http//www.flickr.com/photos/friarsbalsam/5736126308/sizes/o/in/photostream/.jpg

Page 43: Making sense out of things on the web

What else do I get?

Web and Limited Web results

Image attributes

like height, width, etc

Time span filtering

for News Search

Document type filtering

Extended abstracts

http//www.flickr.com/photos/acidpix/6021203584/sizes/o/in/photostream/.jpg

Page 44: Making sense out of things on the web

BOSS + YQL

• Table Name: boss.search

• e.g. select * from boss.search where ck=… and secret=… and q=‘openhackindia’

Parameters Example

Consumer Key ck -

Consumer Secret secret -

Query Term q ‘iitd’

Page 45: Making sense out of things on the web

Searching “The Dark Knight”

Page 46: Making sense out of things on the web
Page 47: Making sense out of things on the web

Finding images of “The Dark Knight Rises”

select * from boss.search where q="The Dark Knight Rises" and service="images" and

ck="..." and secret="..."

Page 48: Making sense out of things on the web

Finding “The Dark Knight Rises” in IMDB, movies.yahoo.com

select * from boss.search where q="The Dark Knight Rises" and

sites="imdb.com,movies.yahoo.com" and ck="..." and secret="..."

Page 49: Making sense out of things on the web

Spell Check and Correction

select * from boss.search where q="The Dark Knight Rises" and service="spelling" and

ck="..." and secret="..."

Page 50: Making sense out of things on the web

Finding news on “The Dark Knight Rises”

select * from boss.search where q="The Dark Knight Rises" and service="news" and ck="..."

and secret="..."

Page 51: Making sense out of things on the web

And through the BOSS API

Getting multiple data sets /ysearch/web,images,news?q=anna

/ysearch/web,images,news?web.q=anna&images.q=anna&news.q=lokpal

Searching through sites A Simple Movie Search

/ysearch/web?q=“Dark Knight”&

sites=movies.yahoo.com,netflix.com,imdb.com

AND/OR operators /ysearch/web?q="steve jobs"AND((ipad)OR(iphone))&sites=bestbuy.com,newegg.com

Important: Use Braces or quotes

Page 52: Making sense out of things on the web

Unary Operators Search for Batman but not “Dark Knight”

q=(batman -“Dark Knight")

Find pages with “Heath Ledger” but not “Dark Knight”

q=+”heath ledger”–”Dark

Knight”&sites=movies.yahoo.com

Force auto-spelling off

q=+”drk knight”

AND OR

Page 53: Making sense out of things on the web

Searching in body and in title

Searching for Dark Knight in the Title on Yahoo moviesq=reviews intitle:"dark knight"&sites=movies.yahoo.com

Searching for Dark Knight in the Title in Yahoo movies containing Christian Baleq=reviews intitle:"dark knight" inbody:"christian

bale"&sites=movies.yahoo.com

Page 54: Making sense out of things on the web

Market and document specific Filters

Search for “Dark Knight” in India specific sites q=“Dark Knight”&market=en-in

Search for “PDF’s containing “Dark Knight” q=“Dark Knight”&type=pdf

Search for MS Office type (except PPT’s) containing “Dark Knight” q=“Dark Knight”&type=msoffice,-ppt

Page 55: Making sense out of things on the web

Output

Page 56: Making sense out of things on the web

Image search parameters

Search for images that are not offensive

/ysearch/images?q=“san francisco”&filter=yes

Search for images that are wallpaper size

/ysearch/images?q=“san francisco”&dimensions=wallpaper

Search for a image at a certain refer URL

/ysearch/images?q=yahoo&refererurl=http://www.flickr.com

• Interesting Output Fields

format, file size, height, width, title, total result count

Page 57: Making sense out of things on the web

News search parameters

Search news that is less than 7 days old/ysearch/news?q=lokpal&age=7d

Search news that is between 20hrs and 2 days old

/ysearch/news?q=lokpal&age=20h2d

Re-rank news results by date

/ysearch/news?q=lokpal&ranking=true

Interesting Output Fields

Source, Date, Source URL

Page 58: Making sense out of things on the web

EXAMPLE HACKS

Page 59: Making sense out of things on the web

Duckduckgo.com

Page 60: Making sense out of things on the web

Interceder

Page 61: Making sense out of things on the web

Ask-boss (v1)

Hack: http://ask-boss.appspot.com Code: https://github.com/saurabhsahni/Hacks/tree/master/askBOSS

Page 62: Making sense out of things on the web

webmeme.in

Page 63: Making sense out of things on the web

http://hackyourworld.org/~iitb_pacman/search/

Page 64: Making sense out of things on the web

I did BOSS and got data, now how to extract information of out it?

Page 65: Making sense out of things on the web

make sense out of it?

Page 66: Making sense out of things on the web
Page 67: Making sense out of things on the web

Content Analysis

select * from contentanalysis.analyze where text="Yahoo! kicks off hackday”

Page 68: Making sense out of things on the web

Content Analysis from a URL

select * from contentanalysis.analyze where url="http://www.cnn.com/"

Page 69: Making sense out of things on the web

Term Exraction

select * from search.termextract where context in (select description from rss where url=‘’)

Page 70: Making sense out of things on the web

More resources Yahoo! BOSS: http://developer.yahoo.com/boss

BOSS Technical Documentation:

http://developer.yahoo.com/search/boss/boss_api_guide/

YQL: http://developer.yahoo.com/yql

Amazon Web Services: http://aws.amazon.com

oAuth: http://oauth.net/

Open Data: http://theinfo.org

Alt Search Engines: http://www.altsearchengines.com/

Page 71: Making sense out of things on the web

Happy hacking!