a search engine is a web site that collects and organizes content from all over the internet search...
TRANSCRIPT
![Page 1: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/1.jpg)
![Page 2: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/2.jpg)
GROUP MEMBERS
NIDA IRFAN AHSAN AKRAM USMAN BASHIR AMBER SHAHEEN GHULAM HASSAN
![Page 3: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/3.jpg)
SEARCH ENGINE
![Page 4: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/4.jpg)
![Page 5: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/5.jpg)
INTRODUCTION A search engine is a web site that collects and
organizes content from all over the internet
Search engines look through their own databases of information in order to find what it is that you are looking for.
“Search engine” is the popular term for an Information Retrieval (IR) system.
![Page 6: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/6.jpg)
INTRODUCTION
Search engines match queries against an index that they create. The index consists of the words in each document, plus pointers to their locations within the documents. This is called an inverted file.
![Page 7: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/7.jpg)
INTRODUCTIONA search engine or IR system
comprises four essential modules:
A document processorA query processorA search and matching functionA ranking capability
![Page 8: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/8.jpg)
HIST
ORY
![Page 9: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/9.jpg)
Archie -> First search tool for the Internet
Gopher -> Indexed plain text documents
Jughead -> Searched the files stored in Gopher index
Wandex -> First Web search engine systems
HISTORY
![Page 10: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/10.jpg)
Alan Emtage created the first ever search engine called Archie in 1990.
Archie collects information by crawling the internet.
It matches the files it found with search queries and returned results from its database.
ArchieHISTORY
![Page 11: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/11.jpg)
HISTORY It was introduced in 1990 It is the protocol system It allows text files to be hierarchically
organized It is used to enable the simple sharing of
documents with which could be located in any part of the world
Gopher
![Page 12: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/12.jpg)
HISTORY Jughead
It was developed by Rhett Jones in 1993 Jug head is the search engine system for
the gopher protocol It searches the single server at a time It was officially released in 2006
![Page 13: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/13.jpg)
HISTORY Wandex
It was the first search engine introduced in 1993 by Matthew Gray
It was the first search engine that is usable for world wide web
Wandex itself was based on the world wide web wanderer which merely listed websites
![Page 14: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/14.jpg)
Mod
ules
![Page 15: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/15.jpg)
Search Engine Modules A document processor
A query processor
A search and matching function
A ranking capability
![Page 16: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/16.jpg)
The document processor prepares, processes, and inputs the documents, pages, or sites that users search against. The document processor performs some or all of the following steps:
Search Engine Modules Document Processor
![Page 17: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/17.jpg)
Search Engine Modules Normalizes the document stream to a
predefined format. Breaks the document stream into desired
retrievable units. Isolates and metatags subdocument
pieces. Identifies potential index able elements in
documents.
![Page 18: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/18.jpg)
Search Engine Modules Deletes stop words. Stems terms. Extracts index entries. Computes weights. Creates and updates the main
inverted file against which the search engine searches in order to match queries to documents.
![Page 19: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/19.jpg)
Query Processor Query processing has seven possible
steps. System can cut these steps short and
proceed to match the query to the inverted file.
Document processing shares many steps with query processing.
Search Engine Modules
![Page 20: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/20.jpg)
More steps and more documents make the process more expensive for processing
Search system designers must choose what is most important to their users, time or quality.
Publicly available search engines usually choose time over very high quality, having too many documents to search against.
Search Engine Modules
![Page 21: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/21.jpg)
Search Engine Modules
Step 1: TokenizingStep 2: ParsingStep 3: STOPLISTStep 4: StemmingStep 5: Creating the queryStep 6: Query expansionStep 7: Query term weighting
![Page 22: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/22.jpg)
Search Engine Modules Search and matching function
Searching the inverted file for documents meeting the query requirements
It is the simple binary search It follows the simple document
representation, query representation and algorithm
After computing the similarity of each document in the subset of documents, the system presents an ordered list to the user
![Page 23: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/23.jpg)
Type
s
![Page 24: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/24.jpg)
Types of Search Engine Crawler-Based Search
Engines
Human-Powered Directories
Hybrid Search Engines" Or Mixed Results
![Page 25: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/25.jpg)
Wor
king
![Page 26: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/26.jpg)
Working of Search Engine
SpidersRobots
![Page 27: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/27.jpg)
![Page 28: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/28.jpg)
![Page 29: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/29.jpg)
![Page 30: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/30.jpg)
![Page 31: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/31.jpg)
Adva
nta
ges
![Page 32: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/32.jpg)
Advantages of Search Engine
It is easy and quick
It is a quick access to information
Save of time
It help to organize internet and individual websites
![Page 33: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/33.jpg)
Good
Ch
oice
![Page 34: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/34.jpg)
Choice of Search Engine The search system development or selection should be based primarily on the local needs Consider using freeware search engines, if your requirements are met by these. Consider if the web server you are using supports indexing and search, and if this is adequate for you
![Page 35: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/35.jpg)
Choice of Search Engine
The IT Professionals should make an effort to keep themselves abreast of the current web technologies The features available within a tool
should be made use of properly to get maximum benefits
![Page 36: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/36.jpg)
Conc
lusi
on
![Page 37: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/37.jpg)
Search engine plays important role in accessing the content over the internet, it fetches the pages requested by the user.
It made the internet and accessing the information just a click away.
Conclusion
![Page 38: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/38.jpg)
ConclusionThe need for better search engines only
increases
The search engine sites are among the most popular websites.
![Page 39: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/39.jpg)
![Page 40: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of](https://reader035.vdocument.in/reader035/viewer/2022070605/5a4d1af27f8b9ab05997f107/html5/thumbnails/40.jpg)