text mininig

Upload: shashi-kant-rawat

Post on 04-Apr-2018

216 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/29/2019 TEXT MININIG

    1/16

    TO DESIGN AND DEVELOP

    A MECHANISM FOR TEXTMINING.

    Group Members:

    SANDEEP KUMAR RAI (0909710091)

    SHABINAZ ZAIDI (0909710097)

    SHASHI KANT RAWAT (0909710100)

    Mentor: Prof. Bhawna Mallick

    GALGOTIAS COLLEGE OF ENGINEERING AND TECHNOLOGYCOMPUTER SCIENCE AND ENGINEERING DEPARTMENT

  • 7/29/2019 TEXT MININIG

    2/16

    Abstract

    Searching and dealing with search engines like Google,yahoo, Bing etc. as a common user provides us with a tremendous amountof additive data and links which are challenging and industrious work due tothe relatively rare occurrence of desired links within a gigantic pool ofsearch results especially when we want more user friendly environment suchas mobile phone.

    While active search tools and optimizing tools arecommonly used to improving the visibility of a website or a web page ina search engine's search results, there has been little analysis on the analysisof tools used to filter the quantity of search. The objective of our study is tounderstand how to reduce the burden of finding the appropriate resultsamong the millions of the search results quickly and with the help of a well

    versed interface.It has been analyzed under research that even in case of simpler

    search we have to deal with large amount of search results about millions ,which sometimes leads to users discontentment and thus there is a need tohave some work on it.

  • 7/29/2019 TEXT MININIG

    3/16

    IntroductionBasic purpose of this project is to facilitate the users with an

    effective way of using search engine to get rid of the burden of finding theuseful results among data pool. Particular importance here is to remove the bulkof results, for this we need to filter the incoming results by using effective

    algorithm .

    We need to develop an interactive tool called MbSeMa(Mobile SearchMaster) This tool basically comprises of an interactive user interface in whichthe preferences of the search can be gathered and accordingly the search can befiltered. The tool uses the commands from users and accordingly perform the

    search.

    Here we have used concepts like concept of Search Engine AdvancedOperators. Some of the Operators have been directly employed as radio buttonin the interactive tool to filter the search directly.

  • 7/29/2019 TEXT MININIG

    4/16

    Proposed SystemThe purpose of our study was to understand how to reduce the burden

    of searching complexity for search engine users.

    The goal of our study is to understand how to develop a mechanism

    so that we can interact with users to know what search exactly.

    Specifically, our method involves an effective and hybrid use of

    advanced search engine operators to filter the search.

  • 7/29/2019 TEXT MININIG

    5/16

    Work Flow Diagram

  • 7/29/2019 TEXT MININIG

    6/16

    USE-CASE DIAGRAM

  • 7/29/2019 TEXT MININIG

    7/16

    Basic Operators Some important things to keep in mind

    Google queries are not case sensitive.

    The * wildcard represents anyword

    Example: * firewall

    Google stems words automatically

    Example: firewall DMZ configuration

  • 7/29/2019 TEXT MININIG

    8/16

    Basic Operators The + symbol forces inclusion of a certain word.

    firewall+DMZ+Configuration

    The - symbol forces exclusion of a certain word. (Site:google.com site:www.google.co.in)

    The | symbol provides boolean OR logic. firewall + inurl:(firewall| cisco)

  • 7/29/2019 TEXT MININIG

    9/16

    Advance Operators link:URL= lists other pages that link to the URL. related:URL = lists other pages that are related to the URL. site:domain.com search term = restricts search results to the

    given domain. allinurl:WORDS = shows only pages with all search terms in the

    url. inurl:WORD = like allinurl: but filters the URL based on the first

    term only. allintitle:WORD = shows only results with terms in title. intitle:WORD = similar to allintitle, but only for the next word. cache:URL =will show the Google cached version of the URL.

  • 7/29/2019 TEXT MININIG

    10/16

  • 7/29/2019 TEXT MININIG

    11/16

    ILLUSTRATION

  • 7/29/2019 TEXT MININIG

    12/16

    ILLUSTRATION

  • 7/29/2019 TEXT MININIG

    13/16

    USER INTERATIVE TOOL

  • 7/29/2019 TEXT MININIG

    14/16

    Modules of the Project

    Designing tool Interface

    Designing Filtering Algorithm

    Connection Establishment

    Employing Operators on search Engines

  • 7/29/2019 TEXT MININIG

    15/16

    System Requirement

    SpecificationHardwares:

    PROCESSOR : PENTIUM IV 2.6 GHz

    RAM : 512 MB

    MONITOR : 15HARD DISK : 20 GB

    CDDRIVE : 52X

    KEYBOARD : STANDARD 102 KEYS

    MOUSE : 3 BUTTONS

    Softwares:

    FRONT END :

    OPERATING SYSTEM : WINDOWS 7

  • 7/29/2019 TEXT MININIG

    16/16

    THANKING YOU!!