10/6/2015b.ramamurthy110/6/2015b.ramamurthy1 midterm review cse4/587 b.ramamurthy

8
07/20/22 B.Ramamurthy 1 07/20/22 B.Ramamurthy 1 Midterm Review CSE4/587 B.Ramamurthy

Upload: allyson-francis

Post on 30-Dec-2015

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 10/6/2015B.Ramamurthy110/6/2015B.Ramamurthy1 Midterm Review CSE4/587 B.Ramamurthy

04/19/23 B.Ramamurthy 104/19/23 B.Ramamurthy 1

Midterm Review

CSE4/587B.Ramamurthy

Page 2: 10/6/2015B.Ramamurthy110/6/2015B.Ramamurthy1 Midterm Review CSE4/587 B.Ramamurthy

04/19/23 B.Ramamurthy 204/19/23 B.Ramamurthy 2

Exam Date

October 22, 2010Please bring Pencils, pens and erasers.

This is a closed book exam.You can bring 4 sheets of any information you need.

Page 3: 10/6/2015B.Ramamurthy110/6/2015B.Ramamurthy1 Midterm Review CSE4/587 B.Ramamurthy

04/19/23 B.Ramamurthy 3

TopicsDefining data intensive computingRelationship between data-intensive computing and cloud computingLuceneEnabling Technologies (ET): ET1: Web services ET2: Virtualization ET3: Special data structures and algorithms

(MapReduce Engine, HDFS)Google App EngineProject 1

04/19/23 B.Ramamurthy 3

Page 4: 10/6/2015B.Ramamurthy110/6/2015B.Ramamurthy1 Midterm Review CSE4/587 B.Ramamurthy

04/19/23 B.Ramamurthy 404/19/23 B.Ramamurthy 4

Data-intensive computing competencies

1. Describe data-intensive computing concepts2. Compute with data-intensive computing concepts3. Recognize a data-intensive problem.4. Identify the scale of data.5. Analyze the data requirements of a problem.6. Describe the data layout and define the data repository format 7. Decide the algorithms (Ex: MapReduce)8. Define application-specific algorithms and analytics 9. Design the data-intensive program solution and system configuration.10. Implement the data-intensive solution and test the solution for functional correctness and non-functional requirements.11. Research algorithmic improvements and experiment with them.12. Write a report summarizing the solution and results.13. Incorporate services from cloud computing platforms.14. Integrate semantic information in organizing data for context-awareness.15. Apply collective intelligence methods for diverse data sources.16. Formulate data-intensive visualization solutions for presenting the results.

Page 5: 10/6/2015B.Ramamurthy110/6/2015B.Ramamurthy1 Midterm Review CSE4/587 B.Ramamurthy

04/19/23 B.Ramamurthy 5

Take home questions: Q1

Read the list of competencies and check whether you have learned this competency in your project 1 and/or in the course lessons.Item#

from slide 4

Covered in Prj1

Covered in lessons

Where? How?

What extent (%)

Page 6: 10/6/2015B.Ramamurthy110/6/2015B.Ramamurthy1 Midterm Review CSE4/587 B.Ramamurthy

04/19/23 B.Ramamurthy 6

Take home questions: Q2

In order handle the massive load we had in out recent deployment of the Pop!World application Google App Engine (GAE), we deployed 20 instances of the application and gave the users the 20 links and let them choose their link that was available when they it ran out of quota.Design a front end (method) using GAE resources or others that will (i) hide the 20 instances or n instances and (ii) direct the user to the one of the unloaded app. (The policy could even be a simple round robin).

Page 7: 10/6/2015B.Ramamurthy110/6/2015B.Ramamurthy1 Midterm Review CSE4/587 B.Ramamurthy

04/19/23 B.Ramamurthy 704/19/23 B.Ramamurthy 7

Format of in-class exam

4 questionsEnabling technologies: Webservices, virtualization, Mapreduce, HDFS, infrastructure managementAmazon cloud environment (read the blue book)

Page 8: 10/6/2015B.Ramamurthy110/6/2015B.Ramamurthy1 Midterm Review CSE4/587 B.Ramamurthy

04/19/23 B.Ramamurthy 8

How to study?

04/19/23 B.Ramamurthy 8