amy apon, ph.d. director, arkansas high performance computing center

28
Investment in High Performance Computing A Predictor of Research Competitiveness in U.S. Academic Institutions Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center Professor, CSCE, University of Arkansas Stan Ahalt, Ph.D. Director RENCI Professor, Computer Science, UNC-CH Work supported by the NSF through Grant #0946726 University of Arkansas and RENCI/UNC-CH Reall y

Upload: aine

Post on 23-Feb-2016

40 views

Category:

Documents


0 download

DESCRIPTION

Investment in High Performance Computing A Predictor of Research Competitiveness in U.S. Academic Institutions. Really. Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center Professor, CSCE, University of Arkansas Stan Ahalt , Ph.D. Director RENCI - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Investment in High Performance Computing

A Predictor of Research Competitiveness

in U.S. Academic InstitutionsAmy Apon, Ph.D.

Director, Arkansas High Performance Computing CenterProfessor, CSCE, University of Arkansas

Stan Ahalt, Ph.D.Director RENCI

Professor, Computer Science, UNC-CHWork supported by the NSF through Grant #0946726

University of Arkansas and RENCI/UNC-CH

Reall

y

Page 2: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Collaborators

Amy AponUniversity of

Arkansas 

Stanley AhaltRENCI,

University of North

Carolina

Vijay DantuluriRENCI,

University of North

Carolina

Constantin Gurdgiev

IBM

Moez LimayemUniversity of

Arkansas 

Linh NgoUniversity of

Arkansas 

Michael StealeyRENCI, University of

North Carolina

Page 3: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Research Study

• Background and motivation• Research hypothesis• Data acquisition• Analysis and Results• Discussion

Page 4: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Research and Computing

Cyberinfrastructure Ecosystem Foundation

Computational and Data Driven Science

Nan

otec

hnol

ogy

Hig

h E

nerg

y Ph

ysic

s

Hea

lth

Scie

nces

Glo

bal

Clim

ate

Mod

elin

g

Page 5: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Credit: NSF OCI

$ $ $$

$ $$$$$$$

Page 6: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Conversation with a Chancellor

• HPC guys, “This is a great investment! We think we can run the HPC center with only $1M/year in hardware and $1M/year in staffing.”

Chancellor, “Which 20 faculty do you want me to fire?”

Page 7: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

HPC: High rePeating Cost

• Computer equipment is usually treated as a capital expense, with costs for substantial clusters in the range of $1M+

• Warranties on these generally last 3 years, or 5 years at most, after which repairs become prohibitive

• Even without that, the pace of technology advances require refreshing every 3-5 years

• Staffing is a long term repeating cost!

Page 8: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

6/1/19

93

11/1/

1993

6/1/19

94

11/1/

1994

6/1/19

95

11/1/

1995

6/1/19

96

11/1/

1996

6/1/19

97

11/1/

1997

6/1/19

98

11/1/

1998

6/1/19

99

11/1/

1999

6/1/20

00

11/1/

2000

6/1/20

01

11/1/

2001

6/1/20

02

11/1/

2002

6/1/20

03

11/1/

2003

6/1/20

04

11/1/

2004

6/1/20

05

11/1/

2005

6/1/20

06

11/1/

2006

6/1/20

07

11/1/

2007

6/1/20

08

11/1/

2008

6/1/20

09

11/1/

2009

0

50

100

150

200

250

300

350

400

450

500

Ranks of Top 500 Computers and Appearances in Succeeding Lists

HPC: High rePeating Cost

Page 9: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Some Observations

0 20 40 60 80 100 120 140 160 180 2000

20,000,000

40,000,000

60,000,000

80,000,000

100,000,000

120,000,000

140,000,000

Tflops versus Core Hours UsedAcademic HPC Centers

Tflops 3GHz

Tflops

Core

Hou

rs U

sed

Page 10: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

What is the ROI?

• Can I convince my VPR that the funds invested in HPC add value to the institution and create opportunity?

What if this is not true?

Page 11: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Hypothesis• Investment in high performance

computing, as measured by entries on the Top 500 list, is a predictive factor in the research competitiveness of U.S. academic institutions.

We study Carnegie Foundation institutions with “Very High” and “High” research activity – about 200 institutions

Page 12: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Data AcquisitionIndependent variables• Top 500 List count and rank of entries

o Mapped from “supercomputer site” to “institution”o We note that entries are voluntary – the absence of an

entry does not mean that an institution does not have HPC

Dependent variables• NSF and other federal funding summary

and award information• Publication counts• U.S. News and World Report rankings

Page 13: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Data from the Top 500 List

An historical record without comparison of supercomputers

Page 14: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Data from the Top 500 List

June

1993

June

1994

June

1995

June

1996

June

1997

June

1998

June

1999

June

2000

June

2001

June

2002

June

2003

June

2004

June

2005

June

2006

June

2007

June

2008

June

2009

0

20

40

60

80

100

120

institutions as they appear cumulativelyno. of academic institutionsno. of machine entries

About 100 U.S. institutions have appeared on a Top 500 List

Page 15: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Analysis• Examples• Correlation analysis• Regression analysis

Page 16: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Simple Example of ROI• Evidence based on 2006 NSF

funding

$0

$20

$40

$60

$80

$100

$120

Fund

ing

in M

illio

ns o

f Dol

lars

$0

$20

$40

$60

$80

$100

$120

Fund

ing

in M

illio

ns o

f Dol

lars

Average NSF funding: $30,354,000

Average NSF funding: $7,781,000

95 of Top NSF-funded Universities with HPC 98 of Top NSF-funded Universities w/out HPC

With HPC Without HPC

Page 17: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

• More evidence, 1993-2009 NSF funding

Longer Example of ROI

Page 18: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Correlation Analysis  Count

sNSF Pubs All

FedDOE DOD NIH USNew

sdRankSum

0.8198 0.6545

0.2643

0.2566

0.2339

0.1418

0.1194

-0.243

Counts   0.6746

0.4088

0.3601

0.3486

0.1931

0.2022

-0.339

NSF     0.7123

0.6542

0.5439

0.2685

0.4830

-0.540

Pubs       0.8665

0.4846

0.3960

0.8218

-0.588

All Fed         0.4695

0.6836

0.9149

-0.543

DOE           0.1959

0.3763

-0.384

DOD             0.4691

-0.252

NIH               -0.500

Page 19: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Regression Analysis• Two Stage Least Squares (2SLS) regression is used

to analyze the research-related returns to investment in HPC

• We model two relationships • Model 1: NSF Funding as a function of

contemporaneous and lagged Appearance (APP) on the Top 500 List Count and Publication Count (PuC), and

• Model 2: Publication Count (PuC) as a function of contemporaneous and lagged Appearance on the Top 500 List Count (APP) and NSF Funding

Page 20: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Endogeneity• Funding allows an institution to

acquire resources• Resources are used to perform

research, which leads to more funding

• Resources are also cited in the argument for research funding

• NSF funding begats HPC resources which begats NSF funding …

Page 21: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Regression Analysis• Original tests revealed significant problems with

endogeneity of Publication Counts (PuC) and NSF Funding.

• To correct for this, we deployed a 2SLS estimation method, with number of undergraduate Student Enrollments (SN) acting as an instrumental variable in the first stage regression for PuC (Model 1) and NSF (Model 2).

• In both cases, SN was found to be a suitable instrument for endogenous regressors.

Page 22: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

First Result• A single HPC investment yields

statistically significant immediate returns in terms of new NSF funding • An entry on a list results in an

increase of yearly NSF funding of $2.4MoConfidence level 95%oConfidence interval $769K-$4M

Page 23: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Second Result• A single HPC investment yields

statistically significant immediate returns in terms of increased academic publications• An entry results in an increase in

yearly publications of 60o Confidence level 95%o Confidence interval 19-100

Page 24: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Third Result• Analysis on the rank of the system

shows that rank has a positive impact to competiveness, but with reduced confidence.

• We have not studied returns to other institutions of investments by resource providers, or returns to overall U.S. competitiveness.

Page 25: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Fourth Result• HPC investments suffer from fast

depreciation over a 2 year horizon• Consistent investments in HPC,

even at modest levels, are strongly correlated to research competitiveness.

• Inconsistent investments have a significantly less positive ROI

Page 26: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Discussion• More study is needed to precisely

determine the rate of depreciation of HPC investments

• The publication counts include all publications, not just those related to HPC

• More study is needed regarding how use of national systems, such as Teragrid, may impact research competitiveness

Page 27: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Data from Teragrid Usage

Page 28: Amy Apon, Ph.D. Director, Arkansas High Performance Computing Center

Questions?