data classification using svm & quickcog
TRANSCRIPT
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
Institute of Integrated Sensor Systems
Dept. of Electrical Engineering and Information Technology
Data Classification using SVM & Quickcog
Mahesh PoolakkaparambilWinter Semester 2007/08
Under the guidance ofProf. Dr.-Ing. Andreas König
&Kuncup Iswandy
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
Contents
IntroductionLS-SVM for Repository dataOther classifiers for Repository dataClassification stepsClassification resultsClassification with Robot Vision dataLS-SVM & other Classifiers for Robot vision dataRobot Vision data resultsClassification using SVM on Robot vision dataConclusion
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
Introduction
Basic idea of this study project is to carry out the classification taskand compare classification techniques K-NN, PNN & RNN with LS-SVM (Least Square Support Vector Machine) &SVMData sets used are the Repository data from the website ,http://www.ics.uci.edu/~mlearn/MLRepository.html and Robot visiondata from Prof. Dr. Ing Andreas KönigThe repository data used for this project are Breast cancer data, Segment data, Wine data and the Robot vision data (benchmark data)Entire experiment has been carried out on a Intel Celeron processor(1.73Ghz)
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
Introduction
We used Quick-cog for the K-NN , PNN and RNN classifiersFor SVM we used the LS-SVM MATLAB tool & SVM tool boxData division through out using Holdout (Source Mr.Iswandy)Seed used for division is ‘1‘ for all data70% Training and 30% Testing (Division is proportional)We used normalized data through out our task( normalized btw -1 to +1 ).Experiments are carried out with different classifier parameters in atrial and error fashion to obtain better classification results (manualtuning)
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
Classification using LS-SVM
In our experiment the Regularization parameter (Gamma) andKernel parameter (Sigma) are varied btw 0.1 to 200 in different steps ofvarious step size and tested each time for better results.Varied in steps of 1 or 5 till we get a good CR and then tuned carefullyaround the obtained values again with minute step sizeValues of these parameters with best classification rate are optedEncoding scheme used is code_MOCImplementation used is CMEXKernel used is RBFDecoding schemes used is Hamming distance
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
LSSVM Classification
45133178133 Wine
57417362310197Segment
113456569302Cancer
# ofTesting sets
# ofTrainingsets
Total # ofSamples
# of Features
# ofClasses
Data
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
LSSVM Classification & Results
93.33100.45Wine
93.030.40.70Segment
97.5014.4423.14Cancer
Result(%)
GammaSigmaData
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
Other Classifiers (K-NN, PNN, RNN)
Data sets are converted into .NIF format using MATLABThe classification methods are carried out without feature selection(Same as in LS-SVM)
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
Classifier parameters (K-NN, PNN, RNN)
K-NNThrough out the Repository data we discussed the result for K=3and K=5 for Robot vision dataDistance and Radius values used were default (10000 , 1)
PNNThreshold and Sigma parameters are clearly mentioned with the ResultsWe used Euclidean distance in PNN
RNNWe used Default values of Distance and radius as in case of K-NNWe used Euclidean distance
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
K-NN(Cancer data)
Training Testing(K=3)Euclidean distance
Confusion Matrix,1Class1 (89): 0 (R) 95.500 4.490Class2 (53): 0 (R) 3.770 96.220
Classification rate: 95.775 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
K-NN(Segment data)
Training Testing(K=3)Euclidean distance
Confusion Matrix,1 Class1 (82): 0 (R) 98.780 0 0 0 1.210 0
0Class2 (82): 0 (R) 0 100 0 0 0 0
0Class3 (82): 0 (R) 0 0 89.020 0 10.970 0
0Class4 (82): 0 (R) 0 0 6.090 92.680 1.210 0
0Class5 (82): 0 (R) 0 0 2.430 4.870 92.680 0
0Class6 (82): 0 (R) 0 0 0 0 0 100
0Class7 (82): 0 (R) 0 0 0 1.210 0 0
98.780
Classification rate: 95.993 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
K-NN(Wine data)
Training Testing(K=3)Euclidean distance
Confusions Matrix,1Class1 (15): 0 (R) 100 0 0Class2 (18): 0 (R) 5.550 94.440 0Class3 (12): 0 (R) 0 0 100
Classification rate: 97.778 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
PNN(Cancer data)
Training Testing(Sigma=0.1, Threshold=1)
Confusions Matrix,1Class1 (89): 0 (R) 96.620 3.370Class2 (53): 0 (R) 3.770 96.220
Classification rate: 96.479 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
PNN(Segment data)
Training Testing(Sigma=0.1, threshold=1)
Confusion Matrix,1Class1 (82): 0 (R) 98.780 0 0 0 1.210 0 0Class2 (82): 0 (R) 0 100 0 0 0 0 0Class3 (82): 0 (R) 0 0 91.460 0 8.530 0 0Class4 (82): 0 (R) 2.430 0 2.430 89.020 6.090 0 0Class5 (82): 0 (R) 1.210 0 8.530 2.430 87.800 0 0Class6 (82): 0 (R) 0 0 0 0 0 100 0Class7 (82): 0 (R) 0 0 0 1.210 0 0 98.78
Classification rate: 95.122 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
PNN(Wine data)
Training Testing(Sigma=0.5, Threshold=1)
Confusion Matrix,1
Class1 (15): 0 (R) 100 0 0Class2 (18): 0 (R) 5.550 94.44 0Class3 (12): 0 (R) 0 0 100
Classification srate: 97.778
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
RNN(Cancer data)
Training Testing
Confusion Matrix,1Class1 (89): 0 (R) 93.250 6.740Class2 (53): 0 (R) 3.770 96.220
Classification rate: 94.366 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
RNN(Segment data)
Training Testing
Confusion Matrix,1Class1 (82): 0 (R) 98.780 0 0 0 1.210 0 0Class2 (82): 0 (R) 0 100 0 0 0 0 0Class3 (82): 0 (R) 0 0 82.920 2.430 14.630 0 0Class4 (82): 0 (R) 0 0 4.870 91.460 1.210 2.430 0Class5 (82): 0 (R) 1.210 0 2.430 6.090 90.240 0 0Class6 (82): 0 (R) 0 0 0 1.210 0 98.780 0Class7 (82): 0 (R) 0 0 0 1.210 0 0 98.7
Classification rate: 94.425 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
RNN(Wine data)
Training Testing
Confusion Matrix,1Clase1 (15): 0 (R) 100 0 0Clase2 (18): 0 (R) 0 100 0Class3 (12): 0 (R) 0 8.330 91.660
Classification rate: 97.778 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
Classification Results(Comparison with other classifiers)
93.3397.7897.7897.78Wine
93.03394.42595.12295.993Segment
97.8894.36696.47995.75Cancer
LS-SVM(%)
RNN(%)
PNN(%)
K-NN(%)
Data
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
Classification of Robot vision
LS-SVM data preparations:Data is converted from .ewd format into .mat format using MATLABClasses are re-orderedNormalization is same as before
Other classifier :Same as before & SVM
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
Classification data (Division and Features)
6013634196OLA
102238313340VA
78182311260Dorthrecht
282663337945SPTH
264616335880Tiger(top)
285661340946Tiger
# of testing
# of training
# of features
# of classes# of datasamples
Data
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
K-NN (Tiger cap set)
Training Testing
Classification rate: 67.368 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
KNN (Tiger cap set top only)
Training Testing
Classification rate: 72.348 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
K-NN (Special Tube holder)
Training Testing
Classification rate: 92.199 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
K-NN (Dortrecht)
Training Testing
Classification rate: 100.000 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
K-NN (Validierung & Analyse)
Training Testing
Classification rate: 100.000 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
K-NN (OLA Goldach)
Training Testing
Confusion Matrix,1Class1 (15): 0 (R) 100 0 0 0Class2 (15): 0 (R) 0 100 0 0Class3 (15): 0 (R) 0 0 100 0Class4 (15) 0 (R) 0 0 0 100
Classification rate: 100.000 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
PNN (Tiger cap set )
Training
Classification rate: 65.965 %
Testing
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
PNN (Tiger cap set top only)
Training Testing
Classification rate: 62.500 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
PNN (Special Tube holder)
Training Testing
Classification rate: 87.589 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
PNN (Dortrecht)
Training Testing
Classification rate: 100.000 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
PNN (Validierung & Analyse)
Training Testing
Classification rate: 100.000 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
PNN (OLA Goldach)
Training Testing
Confusion Matrix,1Class1 (15): 0 (R) 100 0 0 0Class2 (15): 0 (R) 0 100 0 0Class3 (15): 0 (R) 0 0 100 0Class4 (15) 0 (R) 0 0 0 100
Classification rate: 100.000 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
RNN (Tiger cap set )
Training Testing
Classification rate: 68.070 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
RNN (Tiger cap set top only)
Training Testing
Classification rate: 68.561 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
RNN (Special Tube holder)
Training Testing
Classification rate: 90.426 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
RNN (Dortrecht)
Training Testing
Classification rate: 100.000 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
RNN (Validierung & Analyse)
Training Testing
Classification rate: 100.000 %
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
RNN (OLA Goldach)
Confusion Matrix,1Class1 (15): 0 (R) 100 0 0 0Class2 (15): 0 (R) 0 100 0 0Class3 (15): 0 (R) 0 0 100 0Class4 (15) 0 (R) 0 0 0 100
Classification rate: 100.000 %
Training Testing
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
Classsification on Robot Vision data
100%T=1,Sig=0.1
100%100%100%24OLA
Goldach
100%T=1,Sig=0.1
100%100%98.039%0.00625232.0554VA
100%T=1,Sig=0.1
100%100%100%0.012Dortrecht
90.426%T=1, Sig=0.1
87.58%92.19%78.723%0.020.8SPTH
68.56%T=1,Sig=0.1
62.5%72,348%55.303%0.020.7Tigercap-
top only
68.07%T=1 ,Sig=0.1
65.965%67.368%56.14%0.0052Tiger-
capset
RNNPNNK-NN(K=5)
LS-SVMSigmaGammaData
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
Classification using SVM
Unsatisfied result using LSSVM lead us to SVMSVM uses Quadratic Programming We used the same data for classificationSVM has different parameters We used Manual tuning Tuning steps as same as LSSVM
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
Classification results using SVM
1000.05 10OLA Goldach(One Vs All)
1000.0320Validierund &Analyse
1000.0510Dorthrecht
92.1990.0515Special tube holder
74.24 0.0515Tiger cap set toponly
70.1750.0510Tiger cap set
CR(%)SigmaCData
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
LIBSVM Vs Other Classifiers
100%100%T=1,Sig=0.1
100%100%100%OLA Goldach
100%100%T=1,Sig=0.1
100%100%98.039%VA
100%100%T=1,Sig=0.1
100%100%100%Dortrecht
92.199%90.426%T=1, Sig=0.1
87.58%92.19%78.723%SPTH
74.24% 68.56%T=1,Sig=0.1
62.5%72,348%55.303%Tigercap-top
only
70.175%68.07%T=1 ,Sig=0.1
65.965%67.368%56.14%Tiger-
capset
SVMRNNPNNK-NN(K=5)
LS-SVMData
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
Conclusion
Even though LS-SVM is a powerful tool for classification and regression, in our experiment, only with Repository data LS-SVM seems to be competitive with other classifiers For the Robot vision data, for Dortrecht data set, VA and OLA, LS-SVM is comparable with other classifiersFor rest of the data sets LS-SVM is seems to be not well generalizingAs the complexity of the data set increases, generalizing capability ofLS-SVM has to be checked SVM with QP proved to be the best for our multi class data
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
References
Lecture documents of Sensorsignalverarbeitung from Prof.Dr.-Ing Andreas KönigQuickcog user manualHold out algorithm from Mr. Kuncup IswandyLS-SVM MATLAB toolbox and user manualSVM toolbox (Source Mr.Iswandy)Support Vector Machines Andrew W. Moore Professor (School of Computer Science Carnegie Mellon University)
Prof: Dr. Ing Andreas König Sensorsignalverarbeitung
Data Classification using LS-SVM and Quickcog classifiers& SVM
THANK YOU FOR YOR ATTENTION
????????????????