comparable entity mining from comparative questions (1)

Upload: lavanya-basker

Post on 04-Feb-2018

215 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/21/2019 Comparable Entity Mining From Comparative Questions (1)

    1/5

    Comparable Entity Mining from Comparative Questions

    Abstract:

    Comparing one thing with another is a typical part of human decision making

    process. However, it is not always easy to know what to compare and what are the

    alternatives. To address this difficulty, we present a novel way to automatically

    mine comparable entities from comparative questions that users posted online.To

    ensure high precision and high recall, we develop a weakly-supervised

    bootstrapping method for comparative question identification and comparableentity extraction by leveraging a large online question archive. The experimental

    results show our method achieves !-measure of "#.$% in comparative question

    identification and "&.&% in comparable entity extraction. 'oth significantly

    outperform an existing state-of-the-art method.

    Architecture Diagram:

  • 7/21/2019 Comparable Entity Mining From Comparative Questions (1)

    2/5

    Objectives

    (e develop a weakly-supervised bootstrapping method for

    comparative question identification and comparable entity extraction by leveraging

    a large online question archive. The experimental results show our methodachieves !-measure of "#.$% in comparative question identification and "&.&% in

    comparable entity extraction. 'oth significantly outperform an existing state-of-

    the-art method.

    Existing system:

    comparator mining is related to the research on entity and relation extraction in

    information extraction )pecifically, the most relevant work is mining comparative

    sentences and relations. Their methods applied class sequential rules *C)+ and

    label sequential rules *)+ learned from annotated corpora to identify

    comparative sentences and extract comparative relations respectively in the news

    and review domains. The same techniques can be applied to comparative question

    identification and comparator mining from questions.

    Disadvantages:

    This methods typically can achieve high precision but suffer from low recall.

    Proposed system:

    we present a novel weakly supervised method to identify comparative questions

    and extract comparator pairs simultaneously. (e rely on the key insight that a good

    comparative question identification pattern should extract good comparators, and a

    good comparator pair should occur in good comparative questions to bootstrap the

  • 7/21/2019 Comparable Entity Mining From Comparative Questions (1)

    3/5

    extraction and identification process. 'y leveraging large amount of unlabeled data

    and the bootstrapping process with slight supervision to determine four parameters.

    Advantages:

    To ensure high precision and high recall, we develop a weakly-supervised

    bootstrapping method for comparative question identification and comparable

    entity extraction by leveraging a large online question archive

    cope of t!e Project

    (e rely on the key insight that a good comparative questionidentification pattern should extract good comparators, and a good comparator pair

    should occur in good comparative questions to bootstrap the extraction and

    identification process.

    Main Modules:

    Pattern "eneration#comparable Entity$:

    %& 'exical patterns

    #. "enerali(ed patterns&. peciali(ed patterns

    Pattern Evaluation#comparable )uestions$:

    'exical patterns:

    exical patterns indicate sequential patterns consisting of only words and symbols

    *C, /start, and /end. They are generated by suffix tree algorithm with two

    constraints0 1 pattern should contain more than one C, and its frequency in

    collection should be more than an empirically determined number.

    "enerali(ed patterns:

  • 7/21/2019 Comparable Entity Mining From Comparative Questions (1)

    4/5

    1 lexical pattern can be too specific. Thus, we generali2e lexical patterns by

    replacing one or more words with their 34) tags. 12 generali2ed patterns can

    be produced from a lexical pattern containingN words excluding $Cs.

    peciali(ed patterns:

    5n some cases, a pattern can be too general. or example, although a question

    ipod or zune? is comparative, the pattern

  • 7/21/2019 Comparable Entity Mining From Comparative Questions (1)

    5/5

    HARDWARE REQUIREMENTS:

    PROCESSOR : PENTIUM IV 2.6 GHz, Intel Core 2 Duo.

    RM : 2 G! DD RM

    MONITOR : 1"# CO$OR

    HRD DIS% : &' G!

    CDDRIVE : $G "2(

    %E)!ORD : STNDRD 1'2 %E)S

    MOUSE : * !UTTONS

    SOFTWARE REQUIREMENTS:

    O+ert-n /0/te : 3-n4o5/ '7 (P Pro8e//-onl

    9ront En4 : V-/ul Stu4-o 2'12, C.Net.

    Dt;/e : S