harnessing the datarevolution - nsf · • clear and compelling science-and engineering-driven...
TRANSCRIPT
![Page 1: HARNESSING THE DATAREVOLUTION - NSF · • clear and compelling science-and engineering-driven goals – enable significant progress within a 3-5 year time period • convergent –](https://reader035.vdocument.in/reader035/viewer/2022062507/5fc2ff940b9aab2c544f35cb/html5/thumbnails/1.jpg)
HARNESSING THE DATA REVOLUTION
“Engage NSF’s research community in the pursuit of fundamental research in data science and engineering, the development of a cohesive, federated, national-scale approach to research data infrastructure, and the
development of a 21st-century data-capable workforce.”
MPSAC, August 14-15 2018
![Page 2: HARNESSING THE DATAREVOLUTION - NSF · • clear and compelling science-and engineering-driven goals – enable significant progress within a 3-5 year time period • convergent –](https://reader035.vdocument.in/reader035/viewer/2022062507/5fc2ff940b9aab2c544f35cb/html5/thumbnails/2.jpg)
MATERIALS LENGTH SCALES• “ARRANGEMENT OF PHASES AND
DEFECTS” – MAY BE COMPLEX
• RICH IN DATA FROM IMAGINGPROBES
• DATA ANALYTICS WITHEXPERIMENT AND COMPUTATIONTO SPAN THE MICROSTRUCTUREPROPERTY CHASM
MATERIALS PROPERTIES ARE CONTROLLED BY STRUCTURE AT DIFFERENT SCALES
CHALLENGE: • DISCOVER MICROSTRUCTURE PROPERTY RELATIONSHIPS
• CONTROL MICROSTRUCTURE DESIRED PROPERTY
![Page 3: HARNESSING THE DATAREVOLUTION - NSF · • clear and compelling science-and engineering-driven goals – enable significant progress within a 3-5 year time period • convergent –](https://reader035.vdocument.in/reader035/viewer/2022062507/5fc2ff940b9aab2c544f35cb/html5/thumbnails/3.jpg)
DYE SENSITIZED SOLAR CELLSCAN FOSSIL FUEL ELECTRICITY GENERATION PRICE/PERFORMANCE BE ACHIEVED?
APPLY CONCEPTS– ALGEBRAIC TOPOLOGY AND GEOMETRY– APPLIED STATISTICS– ALGORITHMS– GRAPH THEORY
UNDERSTAND TOPOLOGICAL INTERCONNECTIONS, SHAPES, AND DYNAMICS
INVESTIGATE TOPOLOGICAL CONCEPTS– FINITE DATA, – APPROXIMATIONS– NOISE– CONSTRAINTS
THESE ARE ALWAYS ENCOUNTERED IN REAL MATERIALSCHARACTERIZATION!
DATA SCIENCEMATERIALS SCIENCESYNERGY ENABLES MATERIALS TO DEVICE DESIGN
HIGH SURFACEAREA FOR DYE
NANOPORES
HOW CAN MICROSTRUCTUREBE OPTIMIZED FOR
MAXIMUM EFFICIENCY?
![Page 4: HARNESSING THE DATAREVOLUTION - NSF · • clear and compelling science-and engineering-driven goals – enable significant progress within a 3-5 year time period • convergent –](https://reader035.vdocument.in/reader035/viewer/2022062507/5fc2ff940b9aab2c544f35cb/html5/thumbnails/4.jpg)
THE MATERIALS GENOME INITIATIVEDISCOVERY-TO-MARKET IN LESS THAN HALF THE TIME AT HALF THE COST
A NEW PARADIGM FOR DISCOVERY:THE SYNERGISTIC INTERACTION AMONG
COMPUTATION, DATA, EXPERIMENT, AND THEORY
THEORY
EXPERIMENT
COMPUTATION
SYNERGY
Data
![Page 5: HARNESSING THE DATAREVOLUTION - NSF · • clear and compelling science-and engineering-driven goals – enable significant progress within a 3-5 year time period • convergent –](https://reader035.vdocument.in/reader035/viewer/2022062507/5fc2ff940b9aab2c544f35cb/html5/thumbnails/5.jpg)
HDR ROADMAP HAS 5 MAJOR COMPONENTS
• THEORETICAL FOUNDATIONS
• SYSTEMS FOUNDATIONS
• DATA-INTENSIVE RESEARCH ACROSS ALL S&E• DATA CYBERINFRASTRUCTURE
• EDUCATION & WORKFORCE DEVELOPMENT
![Page 6: HARNESSING THE DATAREVOLUTION - NSF · • clear and compelling science-and engineering-driven goals – enable significant progress within a 3-5 year time period • convergent –](https://reader035.vdocument.in/reader035/viewer/2022062507/5fc2ff940b9aab2c544f35cb/html5/thumbnails/6.jpg)
HARNESSING THE DATA REVOLUTION
DATA-INTENSIVESCIENCE &
ENGINEERING
SYSTEMS&
ALGORITHMS
FOUNDATIONS
ADVANCEDCYBER
INFRASTRUCTURE
EDUCATION,WORKFORCE
SCIENCE-DRIVENHDR INSTITUTES
GENOTYPETO
PHENOTYPE
NOVELCATALYSISDESIGN
REAL-TIMEENGINEERING
SYSTEMS
ECOSYSTEMFORECASTING
MULTIMESSENGER
ASTROPHYSICS
MATERIALSGENOME
?????????
![Page 7: HARNESSING THE DATAREVOLUTION - NSF · • clear and compelling science-and engineering-driven goals – enable significant progress within a 3-5 year time period • convergent –](https://reader035.vdocument.in/reader035/viewer/2022062507/5fc2ff940b9aab2c544f35cb/html5/thumbnails/7.jpg)
HARNESSING THE DATA REVOLUTION
DATA-INTENSIVESCIENCE &
ENGINEERING
SYSTEMS&
ALGORITHMS
FOUNDATIONS
ADVANCEDCYBER
INFRASTRUCTURE
EDUCATION,WORKFORCE
SCIENCE-DRIVENHDR INSTITUTES
RULES OF LIFE
FUTURE OFWORK
QUANTUMLEAP
WINDOWSON THE
UNIVERSE
?????????
![Page 8: HARNESSING THE DATAREVOLUTION - NSF · • clear and compelling science-and engineering-driven goals – enable significant progress within a 3-5 year time period • convergent –](https://reader035.vdocument.in/reader035/viewer/2022062507/5fc2ff940b9aab2c544f35cb/html5/thumbnails/8.jpg)
Foundations
Systems,Algorithms
HDR SYSTEMS AND ALGORITHMS• OPEN KNOWLEDGE NETWORK (OKN)
– AN OPEN WEB-SCALE KNOWLEDGE NETWORK OF SEMANTICALLY-LINKED CONCEPTS AND DATA
– TO FOSTER RESEARCH ON A NEW GENERATION OF APPLICATIONS LEVERAGING DATA, CONTEXT, AND INFERENCES FROM DATA
• MODELCOMMONS– SHARING AND REUSE OF MACHINE LEARNING AND OTHER DATA-INTENSIVE MODELS
– SUPPORT FOR REPRODUCIBILITY AND REUSE (TRANSFER LEARNING…)
HDR FOUNDATIONS• TRIPODS: Transdisciplinary Research in Principles of Data Science
– COLLABORATION AMONG COMPUTER AND COMPUTATIONAL SCIENTISTS, STATISTICIANSAND MATHEMATICIANS TO DEVELOP THE PRINCIPLES OF DATA SCIENCE
• TRIPODS+X– COLLABORATION AMONG DOMAIN RESEARCH AND TRIPODS PROJECTS, SO THAT
FOUNDATIONAL APPROACHES ARE INFORMED BY REAL SCIENCE & ENGINEERING PROBLEMS
![Page 9: HARNESSING THE DATAREVOLUTION - NSF · • clear and compelling science-and engineering-driven goals – enable significant progress within a 3-5 year time period • convergent –](https://reader035.vdocument.in/reader035/viewer/2022062507/5fc2ff940b9aab2c544f35cb/html5/thumbnails/9.jpg)
SCIENCE AND ENGINEERINGCHALLENGES:• NEAR-TERM ECOLOGICAL FORECASTING
• REAL-TIME SENSING, LEARNING, ANDDECISION MAKING
• CLIMATE, WEATHER, HYDROLOGICAL, AND HAZARD FORECASTING
• NOVEL MATERIALS AND CHEMICALDESIGNS
• MULTI-MESSENGER ASTROPHYSICS
DATA CHALLENGES:• MACHINE LEARNING
• DATA PROVENANCE
• DATA HETEROGENEITY
• DATA SECURITY
• DATA ETHICS
• DATA STORAGE & ACCESS
DATA-INTENSIVESCIENCE &
ENGINEERING
HDR INSTITUTES: COUPLING SCIENCE AND ENGINEERINGCHALLENGES WITH DATA CHALLENGES
CO-DESIGNED HDR INFRASTRUCTURE
![Page 10: HARNESSING THE DATAREVOLUTION - NSF · • clear and compelling science-and engineering-driven goals – enable significant progress within a 3-5 year time period • convergent –](https://reader035.vdocument.in/reader035/viewer/2022062507/5fc2ff940b9aab2c544f35cb/html5/thumbnails/10.jpg)
LEVERAGE CI INVESTMENTS:• CSSI – CYBERINFRASTRUCTURE FOR
SUSTAINED SCIENTIFIC INNOVATION• SI2 – SOFTWARE INFRASTRUCTURE
FOR SUSTAINED INNOVATION• DIBBS – DATA INFRASTRUCTURE
BUILDING BLOCKS• EARTHCUBE• BIGDATA • BIG DATA HUBS AND SPOKES• ….
HDR THEMES:• THEORETICAL AND
SYSTEMS FOUNDATIONS• DATA INTENSIVE
RESEARCH• CYBERINFRASTRUCTURE• LEARNING AND
WORKFORCEDEVELOPMENT
DATA-INTENSIVESCIENCE &
ENGINEERING
HDR – BUILDING ON EXISTING PROGRAMS
CO-DESIGNED HDR INFRASTRUCTURE
HDR FOUNDATIONS:• TRIPODS• TRIPODS+X
![Page 11: HARNESSING THE DATAREVOLUTION - NSF · • clear and compelling science-and engineering-driven goals – enable significant progress within a 3-5 year time period • convergent –](https://reader035.vdocument.in/reader035/viewer/2022062507/5fc2ff940b9aab2c544f35cb/html5/thumbnails/11.jpg)
EDUCATION & WORKFORCE DEVELOPMENT AND EVALUATION
• HDR ACADEMY– CATALOG, COLLECT, CREATE EDUCATION/TRAINING MATERIALS
– HDR POSTDOCS, HDR BOOTCAMPS
• DATA SCIENCE CORP CONNECTING DATA SCIENTISTS/SCIENCE STUDENTS TODATA SCIENCE PROJECTS
– SPECIAL FOCUS ON DATA SCIENCE PROGRAMS AT COMMUNITY COLLEGES, 4-YEARCOLLEGES, MSIS, ETC.
– DATA SCIENCE CORPS WORKSHOP, DEC 7-8, 2017, GEORGETOWN UNIVERSITY
• PROGRAM EVALUATION– METRICS FOR SUCCESS, EVALUATING CONVERGENCE
• SOCIOTECHNICAL STUDY– SIMILAR TO WORK BEING CONDUCTED FOR THE NSF BIG DATA HUBS
Education,Workforce
![Page 12: HARNESSING THE DATAREVOLUTION - NSF · • clear and compelling science-and engineering-driven goals – enable significant progress within a 3-5 year time period • convergent –](https://reader035.vdocument.in/reader035/viewer/2022062507/5fc2ff940b9aab2c544f35cb/html5/thumbnails/12.jpg)
CURRENT STATUS
TRIPODS 12 PHASE I PROJECTS FUNDED
TRIPODS + X PROPOSALS IN REVIEW
HDR ADVANCED CYBERINFRASTRUCTURE OPEN STORAGE NETWORK AWARD (JUNE 2018)
STEERING COMMITTEE AND WORKING GROUP IN DISCUSSIONS OPEN KNOWLEDGE NETWORK
DATA SCIENCE CORPS
MODELCOMMONS
HDR INSTITUTES
![Page 13: HARNESSING THE DATAREVOLUTION - NSF · • clear and compelling science-and engineering-driven goals – enable significant progress within a 3-5 year time period • convergent –](https://reader035.vdocument.in/reader035/viewer/2022062507/5fc2ff940b9aab2c544f35cb/html5/thumbnails/13.jpg)
• CO-CHAIRS: CHAITAN BARU, CISE, JUAN MEZA, MPS• STEERING GROUP : JIM DESHLER, BIO; ROBIN WRIGHT, EHR; FIL BARTOLI,
ENG; ANJULI BAMZAI, GEO; MANISH PARASHAR, OAC; DANIEL SUI, SBE
• WORKING GROUP: PETER MCCARTNEY, BIO; JOHN CHERNIAVSKY, EHR; TONY KUH, AKBAR SAYEED, ENG; EVA ZANZERKIA, GEO; DARYL HESS, LIN HE, NANDINI KANNAN, SLAVALUKIN, ANGELA WILSON, MPS; AMY WALTON, OAC; PAUL MORRIS, OIA; CHARLES ESTABROOK, OISE; CHERYLEAVEY, CASSIDY SUGIMOTO, SBE
• EXEC SECRETARY: VANDANA JANEJA
HDR STRUCTURE
![Page 14: HARNESSING THE DATAREVOLUTION - NSF · • clear and compelling science-and engineering-driven goals – enable significant progress within a 3-5 year time period • convergent –](https://reader035.vdocument.in/reader035/viewer/2022062507/5fc2ff940b9aab2c544f35cb/html5/thumbnails/14.jpg)
THANK YOU
![Page 15: HARNESSING THE DATAREVOLUTION - NSF · • clear and compelling science-and engineering-driven goals – enable significant progress within a 3-5 year time period • convergent –](https://reader035.vdocument.in/reader035/viewer/2022062507/5fc2ff940b9aab2c544f35cb/html5/thumbnails/15.jpg)
EXTRA SLIDES
![Page 16: HARNESSING THE DATAREVOLUTION - NSF · • clear and compelling science-and engineering-driven goals – enable significant progress within a 3-5 year time period • convergent –](https://reader035.vdocument.in/reader035/viewer/2022062507/5fc2ff940b9aab2c544f35cb/html5/thumbnails/16.jpg)
DATA-INTENSIVESCIENCE &
ENGINEERING
SCIENCE-DRIVEN HDR INSTITUTES
• CLEAR AND COMPELLING SCIENCE- AND ENGINEERING-DRIVEN GOALS– ENABLE SIGNIFICANT PROGRESS WITHIN A 3-5 YEAR TIME PERIOD
• CONVERGENT– TEAMS OF DOMAIN SCIENTISTS AND COMPUTER SCIENTISTS, MATHEMATICIANS AND STATISTICIANS
• CO-DESIGN OF HDR “INFRASTRUCTURE”– COORDINATION WITH OTHER HDR COMPONENTS: FOUNDATIONS, OPEN KNOWLEDGE NETWORK,
MODELCOMMONS, ETC.
• LEVERAGE OTHER NSF INVESTMENTS:– CYBERINFRASTRUCTURE: CSSI, SI2, DIBBS, EARTHCUBE, ETC.– BIGDATA, BIG DATA HUBS
• ENHANCE EDUCATION, DIVERSITY, AND PUBLIC OUTREACH
![Page 17: HARNESSING THE DATAREVOLUTION - NSF · • clear and compelling science-and engineering-driven goals – enable significant progress within a 3-5 year time period • convergent –](https://reader035.vdocument.in/reader035/viewer/2022062507/5fc2ff940b9aab2c544f35cb/html5/thumbnails/17.jpg)
HYPOTHESESINFORMATION
DATA IN CONTEXT
DISCOVERY
THEORY
DATA
ENABLING AND ACCELERATING DISCOVERY: CONVERGENCE & CO-DESIGNAccess
Visualization Data Quality
Collaboration ToolsExploratory Analysis
AnalyticsHigh Performance ComputingComputational-Mathematical -Statistical Methods/Models
InterpretationModel ValidationRedesign
ExperimentsData CollectionBenchmark Data Sets
Domain
Algorithms/ Systems
Cyberinfrastructure
Foundations WORKFORCE
![Page 18: HARNESSING THE DATAREVOLUTION - NSF · • clear and compelling science-and engineering-driven goals – enable significant progress within a 3-5 year time period • convergent –](https://reader035.vdocument.in/reader035/viewer/2022062507/5fc2ff940b9aab2c544f35cb/html5/thumbnails/18.jpg)
HYPOTHESIS:Bigger root systems =>
better water use and grain yield
DISCOVERY:Some root features affect yield under
drought.
THEORY:Root variables influence yield,
but … How…? What if…?
DATA: Genome Sequences Trait MeasurementsEnvironmental Data
FROM GENOTYPES TO PHENOTYPES
AnalyticsHigh Performance ComputingModels/Methods
InterpretationModel ValidationRedesign
ExperimentsData CollectionBenchmark Data Sets
Domain
Algorithms/ Systems
Cyberinfrastructure
Foundations WORKFORCE
Access Visualization Data Quality
Collaboration ToolsExploratory Analysis
Digital Imaging of Root Traits