16860 lab manual

Upload: ankur-singh

Post on 04-Apr-2018

213 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/30/2019 16860 Lab Manual

    1/46

    Lab Manual Prepared by: Prof (Dr). V. SARAVANAN,School of Computer Applications, LPU

    S.No TITLE

    1 IMPORTING DATA INTO RAPIDMINER

    2 STORING AND RETRIEVING DATA

    3 GRAPHICAL REPRESENTATION OF DATA

    4 APPLYING MODEL FOR PREDICTION

    5IMPLEMENTATION OF BAYESIAN MODEL ON

    IMPORTED DATA

    6 CROSS VALIDATION

    7CREATION OF GENERIC OPTIMIZATION

    PREPROCESSOR

    8 REPRESENTING DATA USING DECISION TREE

    9EVOLUTIONARY WEIGHTING OF THE

    ATTRIBUTES

    10 TEXTMINING USING RAPIDMINER

    1

  • 7/30/2019 16860 Lab Manual

    2/46

    IMPORTING DATA INTO RAPIDMINER

    AIM:To import files into RapidMiner tool using three methods.

    ALGORITHM:

    Step 1: Open the RapidMiner tool.Step 2: In the welcome perspective that opens, select the new icon.

    Method 1:Step 3: Select the repository location and click ok.

    Step 4: In the design perspective, click operators view import data Read Excel.

    Step 5: The Read Excel operator will be displayed in the process view.

    Step 6: Connect the out of the operator to the res of the process.

    Step 7: Click on the operator to view the parameters and browse the file to be imported.

    Step 8: Run the process. The result can be viewed in metadata, data and plot views.

    Method 2:Step 9: In the design perspective, click on the repositories view.

    Step 10: Click on the Import drop down, select import Excel sheet.Step 11: Select the file to be imported and` click Next.

    Step 12: Specify a repository location, click Finish and view the result.

    Method 3:Step 13: Drag the file to be imported and drop it in the NewLocalRepository of the repositories

    View and click Next.

    Step 14: Specify a repository location, click Finish and view the result.

    SCREENSHOTS:Method 1:

    2

  • 7/30/2019 16860 Lab Manual

    3/46

    3

  • 7/30/2019 16860 Lab Manual

    4/46

    4

  • 7/30/2019 16860 Lab Manual

    5/46

    5

  • 7/30/2019 16860 Lab Manual

    6/46

    6

  • 7/30/2019 16860 Lab Manual

    7/46

    Method 2:

    7

  • 7/30/2019 16860 Lab Manual

    8/46

    8

  • 7/30/2019 16860 Lab Manual

    9/46

    9

  • 7/30/2019 16860 Lab Manual

    10/46

    Method 3:

    10

  • 7/30/2019 16860 Lab Manual

    11/46

    11

  • 7/30/2019 16860 Lab Manual

    12/46

    RESULT:The files are imported into the RapidMiner tool using the three methods.

    12

  • 7/30/2019 16860 Lab Manual

    13/46

    STORING AND RETRIEVING DATA

    AIM:To store and retrieve data into RapidMiner tool.

    ALGORITHM:Step 1: Open the RapidMiner tool.

    Step 2: In the welcome perspective that opens, select the new icon.

    Step 3: Select the repository location and click ok.

    Step 4: In the design perspective, click operators view import data Read Excel.

    Step 5: The Read Excel operator will be displayed in the process view.

    Step 6: In the design perspective, click operators view Repository AccessStore.

    Step 7: The Store operator will be displayed in the process view.

    Step 8: Connect the out of the operator to the inp of the Store and thr of store to the res of

    process.

    Step 9: Click on the operator to view the parameters and browse the file to be imported.

    Step 10: Run the process. The result can be viewed in metadata, data and plot views.

    SCREENSHOTS:Storing data:

    13

  • 7/30/2019 16860 Lab Manual

    14/46

    14

  • 7/30/2019 16860 Lab Manual

    15/46

    15

  • 7/30/2019 16860 Lab Manual

    16/46

    Retrieving data:

    16

  • 7/30/2019 16860 Lab Manual

    17/46

    RESULT:The data are stored and retrieved into the RapidMiner tool.

    17

  • 7/30/2019 16860 Lab Manual

    18/46

    GRAPHICAL REPRESENTATION OF DATA

    AIM:To graphically represent the data

    ALGORITHM:

    STEP 1: Open the rapid miner tool.STEP2: Select file->new, Repository browser dialog box appears , In that selectthe

    repository and give a name and click ok.STEP 3:In the operator window select import->data->read excel.STEP 4:In the parameter window .select the excel file and click open.STEP 5:Run the process.STEP 6: Select the plot viewSTEP 7:Select the type of plotter and set the parameters.STEP 8: The corresponding graph appear will appear in the right side window.

    18

  • 7/30/2019 16860 Lab Manual

    19/46

    SCREEN SHOTS:

    19

  • 7/30/2019 16860 Lab Manual

    20/46

    RESULT:Thus the data will be displayed graphically successfully.

    20

  • 7/30/2019 16860 Lab Manual

    21/46

    APPLYING MODEL FOR PREDICTIONAIM:

    To predict the unknown data using neural net and apply models in RapidMiner tool.

    ALGORITHM:Step 1: Open the RapidMiner tool.

    Step 2: In the welcome perspective that opens, select the new icon.

    Step 3: Select the repository location and click ok.

    Step 4: Import two csv files containing the mark and result details.

    Step 5: The result field of first file is given as label and of the second as prediction.

    Step 6: Retrieve both the files into the design view of the process.

    Step 7: In the design perspective, click operators view Modeling Classification and

    RegressionNeural Net trainingNeural Net.

    Step 8: In the design perspective, click operators view ModelingModel

    ApplicationApply Model.

    Step 9: Connect the out of the retrieve to the tra of the neural net. The mod of neural net is

    connected to the mod of apply model.

    Step 10: Connect the out of the retrieve to the unl of the apply model. The mod and lab of apply

    model is connected to the res of the process.

    Step 11: Run the process to view the predicted data and improved neural net.

    SCREENSHOTS:

    21

  • 7/30/2019 16860 Lab Manual

    22/46

    22

  • 7/30/2019 16860 Lab Manual

    23/46

    23

  • 7/30/2019 16860 Lab Manual

    24/46

    24

  • 7/30/2019 16860 Lab Manual

    25/46

    RESULT:The unknown data are predicted using neural net and apply model of RapidMiner tool.

    25

  • 7/30/2019 16860 Lab Manual

    26/46

    IMPLEMENTATION OF NAVE BAYESIAN MODEL ON IMPORTED DATAAIM:

    To implement the nave baysian on imported data.

    ALGORITHM:STEP 1: Open the rapid miner tool.

    STEP 2 : In the process window place the retrieve operator and select the dataset you want to

    import.

    STEP 3: Place the nave bayes operator and connect it with the retrieve operator.

    STEP 4:Place the Apply model operator and connect it with the nave bayes .

    STEP 5: Run the process

    26

  • 7/30/2019 16860 Lab Manual

    27/46

    SCREEN SHOTS:

    27

  • 7/30/2019 16860 Lab Manual

    28/46

    RESULT:Thus the naive baysian has been implemented on the imported data successfully.

    28

  • 7/30/2019 16860 Lab Manual

    29/46

    CROSS VALIDATIONAIM:

    To perform cross validation in RapidMiner tool using validation operator.

    ALGORITHM:Step 1: Open the RapidMiner tool.

    Step 2: In the welcome perspective that opens, select the new icon.Step 3: Select the repository location and click ok.

    Step 4: Import an excel file and retrieve it.

    Step 5: In the design perspective, click operators view Evaluation Validation X-

    Validation.

    Step 6: Click on the validation process operator which will lead to the training and testing

    process.

    Step 7: Click operators viewModelingClassification and RegressionBayesian

    ModelingNave Bayes and place it in training process.

    Step 8: Click ModelingModel ApplicationApply Model and EvaluationPerformance,

    MeasurementPerformance and place them in testing process and make connections.

    Step 9: Connect the ave of the validation operator to the res of the process.

    Step 10: Run the process. The result can be viewed in performance vector view.

    SCREENSHOTS:

    29

  • 7/30/2019 16860 Lab Manual

    30/46

    30

  • 7/30/2019 16860 Lab Manual

    31/46

    31

  • 7/30/2019 16860 Lab Manual

    32/46

    RESULT:The cross validation is performed using validation operator of RapidMiner tool.

    32

  • 7/30/2019 16860 Lab Manual

    33/46

    CREATION OF GENERIC OPTIMIZATION PREPROCESSORAIM:

    To create a generic optimization preprocessor in RapidMiner tool.

    ALGORITHM:Step 1: Open the RapidMiner tool.

    Step 2: Select a location and name for the local repository.Step 3: Select Operators view Utility Data Generation Generate Data

    Step 4: Select Operators view Data TransformationAttribute set Reduction and

    Transformation SelectionOptimizationOptimize selection

    Step 5: Connect the operators and click on the process which then leads to the validation process.

    Step 6: Select operators view EvaluationValidationSplit validation and make the

    connections.

    Step 7: Click on the process which proceeds to the training and testing process.

    Step 8: Select operatorsModelingClassification and Regression Support Vector

    Modeling Support Vector Machine in the training phase.

    Step 9: Select operators

    Modeling

    Model Application

    Apply Model and Evaluation

    Performance and Measurement Performance in the testing phase.

    Step 10: Make the appropriate connections.

    Step 11: Run the process in forward selection and backward elimination mode.

    SCREENSHOTS:

    33

  • 7/30/2019 16860 Lab Manual

    34/46

    34

  • 7/30/2019 16860 Lab Manual

    35/46

    35

  • 7/30/2019 16860 Lab Manual

    36/46

    Forward Selection:

    36

  • 7/30/2019 16860 Lab Manual

    37/46

    Backward Elimination:

    RESULT:

    The forward and backward optimization is created using optimize selection operator of the

    RapidMiner tool.

    37

  • 7/30/2019 16860 Lab Manual

    38/46

    REPRESENTING DATA USING DECISION TREE

    AIM:

    To represent data using decision tree in the RapidMiner tool.

    ALGORITHM:

    Step 1: Open the RapidMiner tool.

    Step 2: Select a location and name for the local repository.

    Step 3: Select Operators view Utility Data Generation Generate Direct Mailing Data

    Step 4: Select operators view EvaluationValidationSplit validation and make the

    connections.

    Step 5: Open the process which proceeds to the training and testing process.

    Step 6: In the training phase, select operatorsModelingClassification and Regression

    Tree Induction Decision tree

    Step 7: Place the apply model and performance operators in the testing phase.

    Step 8: Make the appropriate connections.

    Step 9: Run the process to view the performance and decision tree.

    SCREENSHOTS:

    38

  • 7/30/2019 16860 Lab Manual

    39/46

    39

  • 7/30/2019 16860 Lab Manual

    40/46

    RESULT:Thus the data is represented using decision tree in the RapidMiner tool.

    40

  • 7/30/2019 16860 Lab Manual

    41/46

    EVOLUTIONARY WEIGHTING OF THE ATTRIBUTES

    AIM:

    To perform the evolutionary weighting of the attributes using RapidMiner.

    ALGORITHM:

    Step 1: Open the RapidMiner tool.

    Step 2: Select OperatorsModelingAttribute WeightingOptimize Weights (evolutionary)

    and place it in the main process.

    Step 3: Import and retrieve a dataset whose character field is label.

    Step 4: On double clicking, Optimize weights it leads to Evaluation Process. Place Split

    Validation operator in it.

    Step 5: Again on double clicking the process, leads to Training and Testing phase.

    Step 6: In Training phase, place Neuralnet.

    Step 7: In testing phase, place ApplyModel and Performance.

    Step 8: In the main process, click on the optimize weights operator. The right side of the window

    have several options to be modified.

    Step 9: Give the population size as 10, check early stopping and show population plotter, set the

    selection scheme as roulette wheel, p crossover as 0.2 and crossover type as shuffle.

    Step 10: Make the necessary connections and run the process.

    RESULT:

    Thus the evolutionary weighting of the attributes are obtained using RapidMiner.

    41

  • 7/30/2019 16860 Lab Manual

    42/46

    TEXT MINING USING RAPID MINER:

    AIM:

    To mine the text data using the rapid miner tool.

    ALGORITHM:

    STEP 1:Open the rapid miner tool.

    STEP 2:Place the process document operator .

    STEP 3:Select that operator and click Edit List .

    STEP 4 :In the dialog box select the text file and click ok

    STEP 5:Double Click the operator .The vector process appears.

    STEP 6:Place the tokenize operator and set the parameter .Right click and

    select set breakpoint.

    STEP 7:Place the Filter stopword (English ) and connect it with the tokenize.

    STEP 8:Run the process.

    42

  • 7/30/2019 16860 Lab Manual

    43/46

    SCREEN SHOTS:

    43

  • 7/30/2019 16860 Lab Manual

    44/46

    44

  • 7/30/2019 16860 Lab Manual

    45/46

    45

  • 7/30/2019 16860 Lab Manual

    46/46

    RESULT:

    Thus the text document has been mined by using the rapid miner

    successfully.