how search engine index a website and provides relevant output

Presentation by: ADMEC MULTIMEDIA INSTITUTE

www.admecindia.co.in

Indexing and Working Processof Search Engines

The first basic truths, thats you need to understand in SEO that searchengines are not a human.

While this might be obvious for everybody, the differences betweenhow humans and search engines view web pages aren't. Search enginesare text-driven, voice driven and image driven.

Although now a days technology advances rapidly grow, search enginesare far from intelligent creatures that can feel the beauty of a cool designor enjoy the sounds and movement in movies.

Instead, search engines crawl the web pages, looking at particular sitecontent (mainly text) to get an idea about a site.

Firstly, search engines crawl the website to see what is on the website. Thistask is performed by software, called a crawler or a spider.

Spiders go to website and follow links from one page to another and indexall things, whatever they find on their way. More than 20 billion pages onthe web available, so it is impossible for a spider to visit all site daily just tosee if a new pages is added or any existing page is modified on the web. Soit may be possible that crawlers may not end up visiting your site for amonth or two.

Crawling-Crawling is a process by which searchengines discover publicly availableweb pages. Google uses softwarename web crawlers for crawling.The crawl process begins with a list ofweb address from past crawls andsitemaps provided by website owners.

What you can do is to check what a crawler sees from your site. As abovementioned, crawlers are not humans and they do not see images, Flashmovies, JavaScript, frames, password-protected pages and directories, so ifyou have added these on your site, you'd better run the SpiderSimulator below to see if these goodies are viewable by the spider. If theyare not viewable, they will not be spidered, not indexed, not processed,etc. - in a word they will be non-existent for search engines.

Spider-Spider is a program (set of instructions) thatautomatically fetches Web pages. Spidersare used to feed pages to search engines.It's crawls over the Web, so its calledspider. Another term for these programs isknown as WebCrawler.Example:Name of Google Spider is Googlebot.Name of Bing Spider is Bingbot.Name of Alta Vista Spider is Scooter.

a) When page is crawled by crawler the next step is to index its all thecontent.

b) The index page stored in a giant database, from where it can beaccess or retrieved later as per requirement.

c) Essentially, the process of indexing is identifying the words that bestdescribe the page and provides the page to particular keywordswhich search on the web.

d) So typical work is very difficult for a human to process suchamounts of information but generally search engines manage justfine with this task within a few time.

e) Sometimes search engine not get the meaning of a page right but ifwe help them by optimizing it, it will be easier for to search engineto classify your pages correctly and for you to get higher rankingsand better results.

When anybody Query anything in search engine, the searchengine processes it i.e. it compares the search keywords or string in thesearch request with the indexed pages in the stored database.

Since it is likely that more than one page (practically it is millions of pages)contains the search string or keyword, the search engine starts calculatingthe relevancy of each of the pages in its index as the keywords or stringsearched and provides best result after calculating the relevancy.

1. The Web server sends the query to the index servers. The contentinside the index server is similar to the index in the back of a book-ittells which pages contain the words that match the query.

2. The query travels to the doc servers , which actually retrieve the storeddocuments. Snippets are generated to describe each search result.

3. The search result are returned to the user in a fraction of a second.

Contact Us:ADMEC MULTIMEDIA INSTITUTEC-7/114, IInd Floor, Sector- 7, Rohini, Delhi- 85Landmark: Near Rohini East Metro StationHelpline 1: +91 9811 818 122Helpline 2: +91 9911 782 350

ADMEC MULTIMEDIA INSTITUTE

For More information you can visit :http://www.admecindia.co.in

Or email : [email protected]

Slide Number 1Slide Number 2Slide Number 3Slide Number 4Slide Number 5Slide Number 6Slide Number 7Slide Number 8Slide Number 9

how search engine index a website and provides relevant output

Documents