distributed*datamining*lab* · 2014. 10. 9. · ddm lab ws14/15 conclusion* we*are*a...

12
DDM Lab WS14/15 Distributed Data Mining Lab Winter Term 2014/15 Chair for Bioinforma>cs L. Richter

Upload: others

Post on 12-Oct-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Distributed*DataMining*Lab* · 2014. 10. 9. · DDM Lab WS14/15 Conclusion* We*are*a kind*of*project*manager*where*you*are* the*technical*specialist* The*supervisors*workin collaboraon

DDM Lab WS14/15

Distributed  Data  Mining  Lab  

Winter  Term  2014/15  Chair  for  Bioinforma>cs  

L.  Richter  

Page 2: Distributed*DataMining*Lab* · 2014. 10. 9. · DDM Lab WS14/15 Conclusion* We*are*a kind*of*project*manager*where*you*are* the*technical*specialist* The*supervisors*workin collaboraon

DDM Lab WS14/15

About  the  Course  

●  Wedneday,  10-­‐12am  ●  15  mee>ngs  from  Oct.  8th  to  Jan  28th  

●  Highly  innova>ve  

●  Explora>on  of  distributed  data  mining  technologies  

●  First  >me  of  this  class  

●  No  pre-­‐arranged  script  available  

Page 3: Distributed*DataMining*Lab* · 2014. 10. 9. · DDM Lab WS14/15 Conclusion* We*are*a kind*of*project*manager*where*you*are* the*technical*specialist* The*supervisors*workin collaboraon

DDM Lab WS14/15

Contents  

Topics,  a.o.:  ●  Hadoop  File  System  

●  Hadoop  

●  Spark  ●  Mahout  

●  MLlib  

Page 4: Distributed*DataMining*Lab* · 2014. 10. 9. · DDM Lab WS14/15 Conclusion* We*are*a kind*of*project*manager*where*you*are* the*technical*specialist* The*supervisors*workin collaboraon

DDM Lab WS14/15

Grading  

What  is  Graded?:  ●  The  grade  is  based  on:  the  whole  semester  performance  i.e.  the  complete  wiki  entries,  all  presenta>ons  of  a  team  

●  Each  group  reports  weekly  in  the  wiki  

●  If  there  are  no  objec>ons  the  group  is  awarded  the  same  grade,  minor  differen>a>on  is  possible  

●  Individual  grading/report  only  on  request  

Page 5: Distributed*DataMining*Lab* · 2014. 10. 9. · DDM Lab WS14/15 Conclusion* We*are*a kind*of*project*manager*where*you*are* the*technical*specialist* The*supervisors*workin collaboraon

DDM Lab WS14/15

Grading  

Criteria:  ●  Presenta>on  criteria:  focus  to  things  that  ma\er,  communicate  you  message  in  an  easy  and  understandable  way,  it  is  important,  that  the  audience  can  follow  

●  Wiki:  volume,  conciseness,  clearness.  

 

Page 6: Distributed*DataMining*Lab* · 2014. 10. 9. · DDM Lab WS14/15 Conclusion* We*are*a kind*of*project*manager*where*you*are* the*technical*specialist* The*supervisors*workin collaboraon

DDM Lab WS14/15

Grading  

Criteria  in  Detail:  ●  The  volume  should  reflect  the  amount  of  >me  and  work  you  spend  on  the  topic  

●  The  conciseness  refers  to  the  level  of  details  and  the  precision  (e.g.  do  things  really  work  using  the  given  instruc>ons)  

●  The  clearness:  Is  the  red  line  of  the  work  visible,  is  it  easy  to  understand.  

Page 7: Distributed*DataMining*Lab* · 2014. 10. 9. · DDM Lab WS14/15 Conclusion* We*are*a kind*of*project*manager*where*you*are* the*technical*specialist* The*supervisors*workin collaboraon

DDM Lab WS14/15

Realiza>on  

Conduc>on  Hints:  ●  Therefore  the  wiki  entries  should  have  meaningful  structure  and  cover  different  levels  of  abstrac>on.  Referenced  white  papers,  manuals,  tutorials  and  further  reading.  

●  Since  this  course  is  new,  problems  and  challenges  should  adequately  be  reflected.  

●  We  plan  to  record  weekly  notes  which  leads  then  to  the  final  grade.    

Page 8: Distributed*DataMining*Lab* · 2014. 10. 9. · DDM Lab WS14/15 Conclusion* We*are*a kind*of*project*manager*where*you*are* the*technical*specialist* The*supervisors*workin collaboraon

DDM Lab WS14/15

Organiza>on  

●  Register  par>cipants  ●  Form  teams  

●  Discuss  the  mee>ng  >me  

●  Iden>fy  interest  in  /  find  slots  for  coding  sessions  ●  Answer  ques>ons  about  prerequisites  

Page 9: Distributed*DataMining*Lab* · 2014. 10. 9. · DDM Lab WS14/15 Conclusion* We*are*a kind*of*project*manager*where*you*are* the*technical*specialist* The*supervisors*workin collaboraon

DDM Lab WS14/15

Infrastructure  

●  Bunch  of  virtual  machines  running  Debian  ●  you  have  root  access  to  these  machines  

●  Provide  a  wiki  for  reports  and  documenta>on  

●  Provide  a  forum  for  off-­‐line  discussion  

Page 10: Distributed*DataMining*Lab* · 2014. 10. 9. · DDM Lab WS14/15 Conclusion* We*are*a kind*of*project*manager*where*you*are* the*technical*specialist* The*supervisors*workin collaboraon

DDM Lab WS14/15

Conclusion  

●  New  course    ●  No  previous  experience  with  ma\er  yet  -­‐>  you  are  free  to  explore  the  opportuni>es  given  by  the  technology  -­‐>  you  are  free  to  suggest  addi>onal  topics  as  the  course  goes  on  

●  you  have  to  do  reading  and  technology  research  on  your  own  

Page 11: Distributed*DataMining*Lab* · 2014. 10. 9. · DDM Lab WS14/15 Conclusion* We*are*a kind*of*project*manager*where*you*are* the*technical*specialist* The*supervisors*workin collaboraon

DDM Lab WS14/15

Conclusion  

●  We  are  a  kind  of  project  manager  where  you  are  the  technical  specialist  

●  The  supervisors  work  in  collabora>on  with  the  students  and  refine  strategies  and  direc>ons  on  the  fly  as  the  course  is  going  on.  This  is  based  on  the  feedback  from  the  students  and  done  in  collabora>on  with  the  students  -­‐>  clear  communica>on  is  essen>al  on  both  sides  

Page 12: Distributed*DataMining*Lab* · 2014. 10. 9. · DDM Lab WS14/15 Conclusion* We*are*a kind*of*project*manager*where*you*are* the*technical*specialist* The*supervisors*workin collaboraon

DDM Lab WS14/15

Have  fun!