how search engine works ( mr. mirza)

21
How Search Engine Works

Upload: mirza-jr

Post on 25-Jan-2017

200 views

Category:

Internet


2 download

TRANSCRIPT

Page 1: How search engine works ( Mr. Mirza)

How Search Engine Works

Page 2: How search engine works ( Mr. Mirza)

The Main Parts of a Search Engine

• Spider (or “web crawler”)

• Indexer

• Search software (an algorithm)

Page 3: How search engine works ( Mr. Mirza)

Lets us Start with SPIDERS

Page 4: How search engine works ( Mr. Mirza)

• A web crawler (also known as a web spider or web robot) is a program or automated script which browses (crawls) the World Wide Web in a methodical, automated manner.This process is called Web crawling or spidering.

• In short, All websites are codes and these codes are read by spiders.

• So basically a Spider or Spiders are software programs that crawl throughout the internet and select relative data to be INDEXED.

Page 5: How search engine works ( Mr. Mirza)

• When we Search in a search engine is actually searching the index of that search engine and not the whole internet.

• Spiders start by fetching few web pages then they follow the links on those pages and fetch the pages they point to, and follow all the links on those pages and fetch the pages they link to and so on.

• Until they've indexed a pretty big chunk of the web, many billions of pages stored across thousands of machines.

Page 6: How search engine works ( Mr. Mirza)
Page 7: How search engine works ( Mr. Mirza)

Let US SEE what is

INDEXING ?

Page 8: How search engine works ( Mr. Mirza)

Is It Easy to Search here?

Page 9: How search engine works ( Mr. Mirza)

Or Here?

Page 10: How search engine works ( Mr. Mirza)

• This is the equivalent form of the information that our spiders acquire after they finish with crawling.

• Random organization and NO STRUCTURE.

• The information available throughout the world wide web is in all kinds of shapes sizes and FORMATS.

Page 11: How search engine works ( Mr. Mirza)

• AND This is what indexing does.

• Makes data accessible in a Structured format, easily accessible through search.

Page 12: How search engine works ( Mr. Mirza)

Indexer

• Search engine indexing is the process of a search engine collecting, parses and stores data for use by the search engine.

• The actual search engine index is the place where all the data the search engine has collected is stored.

• It is the search engine index that provides the results for search queries, and pages that are stored within the search engine index that appear on the search engine results page.

Page 13: How search engine works ( Mr. Mirza)

What is a Search Algorithms ?

Page 14: How search engine works ( Mr. Mirza)

• Basically, a search engine algorithm is a set of rules, or a unique formula, that the search engine uses to determine the significance or rankings of a web page, and each search engine has its own set of rules.

• The algorithms, as they are different for each search engine, are also closely guarded secrets.

• Search algorithm sorts on the basis of many things like location of keyword, synonyms, adjacent words, etc

• But there are certain things that all search engine algorithms have in common.

Page 15: How search engine works ( Mr. Mirza)
Page 16: How search engine works ( Mr. Mirza)

• Relevancy

• Individual Factors

• Off-Page Factors

Page 17: How search engine works ( Mr. Mirza)

Relevancy• This is the First thing every search engine

checks.• The algorithm will determine whether this web

page has any relevancy at all for the particular keyword.

• Location of keywords in that page is also important for the relevancy of that website.

• Web pages that have the keywords in the title, as well as within the headline or the first few lines of the text will rank better for that keyword than websites that do not have these features

Page 18: How search engine works ( Mr. Mirza)

Individual Factors •  A second part of search engine algorithms are

the individual factors that make that particular search engine different from every other search engine out there.

• Each search engine has unique algorithms, and the individual factors of these algorithms are why a search query turns up different results on Google than MSN or Yahoo!.

• One of the most common individual factors is the number of pages a search engine indexes.

• They may just have more pages indexed, or index them more frequently, but this can give different results for each search engine.

• Some search engines also penalize for spamming, while others do not.

Page 19: How search engine works ( Mr. Mirza)

Off-Page Factors • Another part of algorithms that is still individual

to each search engine are off-page factors. • Off-page factors are such things as click-through

measurement and linking. • The frequency of click-through rates and linking

can be an indicator of how relevant a web page is to actual users and visitors, and this can cause an algorithm to rank the web page higher.

• Off-page factors are harder for web masters to craft, but can have an enormous effect on page rank depending on thesearch engine algorithm.

Page 20: How search engine works ( Mr. Mirza)
Page 21: How search engine works ( Mr. Mirza)

• Search engine algorithms are the mystery behind search engines, sometimes even amusingly called the search engine’s

“Secret Sauce”. • Beyond the basic functions of a search

engine, the relevancy of a web page, the off-page factors, and the unique factors of each search engine help make the algorithms of each engine an important part of the search engine optimization design.