grid component model and platform: an overview workshop on evolutions of grids towards sokus ogf20,...
TRANSCRIPT
Managed by
Grid Component Model and Platform: An Overview
Workshop on Evolutions of GRIDs Towards SOKUsOGF20, Manchester, 8 May 2007
Vladimir GetovCoreGRID STE Institute Leader
University of Westminster, London, U.K.http://[email protected]
European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies 2
WaveWave 2 – start 20062 – start 2006
Degree
DataminingGrid
data, knowledge, data, knowledge, semanticssemantics
OntoGrid
InteliGridK-WF Grid
Chemomen tum
A-Ware Sorma
platforms, user platforms, user environmentsenvironments
CoreGRIDvirtual laboratories
UniGrids HPC4U
g-Eclipse
Gredia
GridComp
QosCosGrid
Grid4all
Provenance
AssessGridGridTrust
trust, securitytrust, security
Grid services, Grid services, business modelsbusiness models
ArguGrid Edutain@ Grid
GridEconGridCoord
Nessi-GridChallengers
NextGRIDservice
architecture
Akogrimomobile
services
BREINagents &
semantics
BeinGridbusiness
experiments
supporting the Grid communitysupporting the Grid community
SIMDATindustrial
simulations
XtreemOS
Linux based Grid
operating system
BeinGridbusiness
experiments
KnowArc
EC-GinBridge
Grid@Asia EchoGrid
international cooperationinternational cooperation
Specific support action
Integrated project
Network of excellence
Specific targeted research project
Wave 1 – start 2004Wave 1 – start 2004
EU Funding: 130 M€
Grid Research Projects under FP6
European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies 3
Preventing research fragmentation Developing world-class scientific and
technological excellence Achieving sustainable integration Contributing to the realisation of the
European Research Area for Grid Research
Knowledge and data management
Programming models
System architecture
Grid Information, Resource and Workflow Monitoring
Resource Management & Scheduling
Grid Systems, Tools and Environments
Six highly-focused Research Institutesacross 41 Research Labs
Gathering 145 researchers & 169 PhD students into a single EU Laboratory on Grid Technologies
European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies 4
CoreGRID Definition of Future Grids
A fully distributed, dynamically reconfigurable, scalable and autonomous infrastructure to provide location
independent, pervasive, reliable, secure and efficient access to a coordinated set of services encapsulating and virtualizing resources (computing power, storage,
instruments, data, etc.) in order to generate knowledge.
European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies 5
From Grids to SOKU
Next
Generation
Grids
SoftwareTechnologies
KnowledgeTechnologies
Service-Service-OrientedOrientedKnowledge Knowledge UtilityUtility
Evolution of HPCN
CurrentGrids
SOA Methodologies
AutonomicComputing
Evolution of the Web
…
…
European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies 6
Background: How to build Grid Middleware
• Proprietary middleware (Globus 1.0, Legion, Unicore, …)– Resources exposed through an API– Non interoperable !
• Object-based middleware– Resources exposed through distributed objects (Java,
CORBA, etc.)– Some interoperability issues with the communication
protocols (CORBA IIOP)– Not anymore at the top of the hype !
• Service-based middleware– Resources exposed through services– Strong support from the Industry– At the top of the hype !– Need some extensions (stateful Web services)
European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies 7
One of the Main Research Challenges for Future Grids
To develop the software design and development methodology of a generic component-based Grid platform for both applications and tools/systems/PSEs to have a single, seamless, “invisible” Grid software services infrastructure.
Possible Solution: Grid Component Model (GCM):• Proposal for a Grid Component Model - DPM02 • Basic Features of the Grid Component Model (assessed) -
DPM04
GoreGRID Institute on Programming Models
European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies 8
GCM: Main Features
– Component hierarchy – Extensibility of the model– Support for adaptivity– Language neutrality– Interoperability– Reflexivity
Lightweight portable and compact implementations Well-defined semantics (allow future formalization)
European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies 9
GCM Technical Structure
Component Specification as an XML schema
Run-Time API defined in several languagesC, Java, etc.
Packaging described as an XML schema
Information for Deployment(Virtual Nodes, … Variables, File Transfer, …)
European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies 10
Collective Interfaces
Simplify the design and configuration of component systems
Expose the collective nature of interfaces
– Multicast, Gathercast, gather-multicast
The framework handles collective behaviour at the level of the interface
Based on Fractal API : – Dedicated controller – Interface typing Verifications
European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies 11
Content and Main Activities: GCM Reference Implementation
1 - Primitive Component Programming
2 - Legacy Code Wrapping, Interoperability
3 - Composition and Composites, Deployment
4 – Autonomic features
5 – IDE for GCM (Composition GUI, etc.)
European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies 12
Research Example: Componentising an Application for the GridINRIA and University of Westminster
Jem3D– numerical solver for the 3D
Maxwell’s equations modelling the time domain propagation of electromagnetic waves
– follows typical “geometric decomposition” parallelisation
– distributed object application using ProActive
ProActive library
–Java distributed object middleware for parallel and concurrent programming
–Main features: Active objects, Asynchronous method invocation, Group communications, Descriptor-based deployment
European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies 13
Research Example: Componentisation Process
Recover Original Architecture
Design Component Architecture
Restructure Original System
Implement Component-based System
Original Architecture
Component Architecture
RestructuredSystem
Component -based System
Source code,Documentation
Artefact Flow
General, architecture-based process
Object-based system
Component-based system
Early GCM using ProActive – extends Fractal with: •distributed components•multicast interfaces•configurable deployment on the Grid
European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies 14
Research Example: Performance Evaluation
Comparison: Object-based vs. Component-based
Component based vs Object based
0
2
4
6
8
10
12
14
16
18
1 2 3 4 5 6 7
Experiment
Tim
e (
min
)
"component based - initialization time"
component based - computation time
"object based - initialization time
object based - computation time
– experiments on Grid’5000 using up to 308 processors, allocated on up to 3 clusters
– 7 experiments using different problem size and number of processors
– execution times of two versions are similar
European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies 15
Domain-Specific Metadata for Model Validation and Performance Optimisation – Legacy Applications
GENIE is an interactive, legacy code for Earth system modelling. Our hypothesis is that componentising the application and using domain-specific metadata will
help transforming it into a scalable yet efficient Grid system.
European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies 16
Domain-Specific Metadata for Model Validation and Performance Optimisation
Motivation: Enable legacy applications to evolve as a part of the scalable problem solving environments within modern Grid systems.
Framework: Componentising existing applications along with domain-specific metadata so that issues arising thereof can be addressed using this metadata.
Result: Used GENIE (Earth Simulation System) as a motivating example. Derived different domain- and component-specific metadata and optimisation strategies.
Further Work: The principles we outlined are application-specific. A generic, but domain-restricted approach is required and potential performance benefits need to be demonstrated.
Partners:- University of Westminster (UK) - Imperial College - London (UK)- Ongoing work: CoreGRID TR-0068 and a chapter in a CoreGRID Springer
volume
European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies 17
Some Future Research Goals
• Adoption of GCM for Grid applications development• Generic, lightweight component-based Grid platform
design methodology• Use of GCM for Grid system software design • Integration of application and system components
into a single adaptable Grid platform• Interoperability between peer-to-peer and client-
server paradigms in hybrid Grid systems
European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies 18
New Spin-off Project - GridCOMP
- FP6 Call5
- GridCOMP nvolves 6/12 CoreGRID partners:INRIA, ERCIM, UNIPI, UOW, ISTI/CNR, UCHILE
- Main goal: develop a GCM prototype platform
- Strong industrial involvement: IBM, Atos Origin, Grid Systems
- Worldwide partners: Tsinghua University (China), Melbourne University (Australia), and University of Chile (Chile)
European Research Network on Foundations, Software Infrastructures and Applications for large scale distributed, GRID and Peer-to-Peer Technologies 19
Conclusions
• Research work ongoing in both CoreGRID and GridCOMP projects
• Some other CoreGRID Institutes adopting GCM• More research results to be expected soon• Strong interest from the US community – joint
focused yearly workshop• High industrial interest via the GridCOMP project• ETSI interest to initiate further work on the GCM
specification• An OGF activity - ?