csc 4510 – machine learning
DESCRIPTION
CSC 4510 – Machine Learning. 7: Neural Networks – Exercises with AISpace. Dr. Mary-Angela Papalaskari Department of Computing Sciences Villanova University Course website: www .csc.villanova.edu /~map/4510/. The slides in this presentation are adapted from: - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/1.jpg)
CSC 4510 – Machine LearningDr. Mary-Angela PapalaskariDepartment of Computing SciencesVillanova University
Course website: www.csc.villanova.edu/~map/4510/
7: Neural Networks – Exercises with AISpace
1
The slides in this presentation are adapted from:Prof Paula Matuszek’s Knowledge Based Systems class at Villanovahttp://www.csc.villanova.edu/~matuszek/spring2011/index.html CSC 4510 - M.A. Papalaskari - Villanova University
![Page 2: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/2.jpg)
2
Neural Nets tools in AI Space• Sample Data and Graph: Mail– Examples– Set properties– Initialize the parameters– Solve– How do we use it? Calculate output
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 3: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/3.jpg)
3
A small example: “Which class to take”• Inputs?• Outputs?• Sample data
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 4: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/4.jpg)
4
Some Examples• Example 1: – 3 inputs, 1 output, all binary
• Example 2:– same inputs, output inverted
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 5: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/5.jpg)
5
Getting the right inputs• Example 3– Same inputs as 1 and 2– Same output as 1– Outcomes reversed for half the cases
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 6: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/6.jpg)
6
Getting the right inputs• Example 3
– Same inputs as 1 and 2– Same output as 1– Outcomes reversed for half the cases
• Network is not converging• The output here cannot be predicted from these inputs. • Whatever is determining whether to take the class, we
haven’t captured it
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 7: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/7.jpg)
7
Representing non-numeric values• Example 4– Required is represented as “yes” and “no”
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 8: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/8.jpg)
Representing non-numeric values• Example 4
– Required is represented as “yes” and “no”
• Actual model still uses 1 and 0; transformation is done by applet.
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 9: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/9.jpg)
9
More non-numeric values• Example 5• Workload is low, med, high: text values but
they can be ordered.
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 10: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/10.jpg)
10
More non-numeric values• Example 5• Workload is low, med, high: text values but they can be
ordered.
• Applet asks us to assign values. 1, 0.5, 0 is typical.
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 11: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/11.jpg)
Unordered values• Example 6– Input variables here include professor– Non-numeric, can’t be ordered.
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 12: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/12.jpg)
Unordered values• Example 6
– Input variables here include professor– Non-numeric, can’t be ordered.
• Still need numeric values• Solution is to treat n possible values as n separate binary values• Again, applet does this for us
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 13: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/13.jpg)
13
Variables with more values• Example 7
– GPA and number of classes taken are integer values
• Takes considerably longer to solve• Looks for a while like it’s not converging
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 14: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/14.jpg)
14
And Reals• Example 8
– GPA is a real.
• Takes about 20,000 steps to converge
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 15: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/15.jpg)
And multiple outputs• Small Car database from AIspace• For any given input case, you will get a value
for each possible outcome.• Typical for, for instance, character recognition.
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 16: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/16.jpg)
16
Training and Test Cases• The basic training approach will fit the training data as closely as
possible.• But we really want something that will generalize to other cases• This is why we have test cases.
– The training cases are used to compute the weights– The test cases tell us how well they generalize
• Both training and test cases should represent the overall population as well as possible.
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 17: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/17.jpg)
Representative Training Cases• Example 9– Training cases and test cases are similar
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 18: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/18.jpg)
18
Representative Training Cases• Example 9
– Training cases and test cases are similar– (actually identical...)
• Training error and test error are comparable
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 19: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/19.jpg)
19
Non-Representative Training Cases• Example 10
– Training cases and test cases represent different circumstances– We’ve missed including any cases involving Lee in the training
• Training error goes down, but test error goes up.• In reality these are probably bad training AND test cases;
neither seems representative.
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 20: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/20.jpg)
20
So:• Getting a good ANN still involves understanding your domain and
capturing knowledge about it– choosing the right inputs and outputs– choosing representative training and test sets– Beware “convenient” training sets
• You can represent any kind of variable: numeric or not, ordered or not.
• Not every set of variables and training cases will produce something that can be trained.
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 21: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/21.jpg)
21
Once it’s trained...• When your ANN is trained, you can feed it a specific set of inputs and get
one or more outputs.• These outputs are typically interpreted as some decision:
– take the class– this is probably a “5”
• The network itself is black box.• If the situation changes the ANN should be retrained
– new variables– new values for some variables– new patterns of cases
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 22: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/22.jpg)
22
One last note• These have all been simple cases, as examples• Most of these examples could in fact be predicted
much more easily and cleanly with a decision tree (or even a couple of IF statements!)
• A more typical use for any neural net system has many more inputs and many more training cases
CSC 4510 - M.A. Papalaskari - Villanova University
![Page 23: CSC 4510 – Machine Learning](https://reader036.vdocument.in/reader036/viewer/2022081505/56816664550346895dd9f85f/html5/thumbnails/23.jpg)
And one last example:• Our class data example (apples or oranges?)• Homework: Use what you learned today with
this example, write up your comments and include some screenshots showing your work.
• Due Tuesday 3/20/12
CSC 4510 - M.A. Papalaskari - Villanova University 23