Public-resource computing for CEPC Simulation
Wenxiao Kan
Computing Center, Institute of High Energy Physics, Chinese Academy of Sciences
CEPC2014 Scientific program, September 12, 2014, Shanghai
Outline
• What is public-resource computing
• The BOINC middleware
• A list of public-resource computing projects using the BOINC platform
• Our work with BOINC
• CEPC Simulation implementation
Public-resource computing
• AKA global computing or P2P computing
• Combines the resources of personal computers and game consoles belonging to the general public to perform scientific computations
• Started with:
  • Great Internet Mersenne Prime Search (GIMPS) (1996)
  • Distributed.net (1997)
• Contrast with grid computing:
  • Grid computing involves "organizationally-owned resources"
  • Centrally managed by IT professionals
  • Powered on most of the time
  • Connected by high-speed links
  • Malicious behavior handled by organization
• Of course, none of that is true for public-resource computing.
BOINC
• Berkeley Open Infrastructure for Network Computing
• Developed at the UC Berkeley Space Sciences Laboratory by the SETI@home group
• Reduce the barriers of entry to public-resource computing:
  • A project can be run from a single computer running standard open-source software
• Share resources among autonomous projects:
  • Each PC owner can join multiple projects
  • Results in better resource utilization
• Support diverse applications:
  • Offer various data distribution mechanisms
  • Support various programming languages
  • …
• Reward participants:
  • Mostly by giving them credits
    • System must be cheating-resistant
  • Also by offering nice graphics
    • Great screensavers!
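The cheating-resistance requirement is commonly met by redundant computing: the same workunit is sent to several hosts, and credit is granted only when enough of the returned results agree. The sketch below is a minimal illustration of that idea, not BOINC's actual validator. The function name `validate_quorum`, the `min_quorum` parameter, and the result format are assumptions made for this example.

```python
from collections import Counter


def validate_quorum(results, min_quorum=2):
    """Return (canonical_output, credited_hosts), or (None, []) without a quorum.

    `results` maps a host id to the output that host reported for one
    workunit. This format is hypothetical, chosen only for illustration.
    """
    if not results:
        return None, []
    # Count how many hosts reported each distinct output.
    counts = Counter(results.values())
    output, votes = counts.most_common(1)[0]
    if votes < min_quorum:
        return None, []
    # Only hosts that agree with the canonical output receive credit.
    credited = sorted(h for h, out in results.items() if out == output)
    return output, credited


# Two honest hosts agree; the third reports a different (possibly forged) result.
reported = {"host_a": "0x3fa2", "host_b": "0x3fa2", "host_c": "0xdead"}
canonical, credited = validate_quorum(reported)
print(canonical, credited)
```

A host that returns a fabricated result gains nothing unless it matches what independent hosts computed, which is why replication makes credit hard to cheat.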
Projects using BOINC
• SETI@home: search for intelligent extra-terrestrial life (hosts: 3679294, users: 1503938, total cpu hours: ~20140050535)
• Einstein@home: search for gravitational signals emitted by pulsars (hosts: 4133384, users: 376185, total cpu hours: ~10141441049)
• Climateprediction.net: long-term climate prediction (hosts: 12658387, users: 3206474, total cpu hours: ~1836733417)
• World Community Grid: runs multiple sub-projects that focus on humanitarian research including HIV/AIDS, cancer, dengue fever, malaria, etc. (hosts: 2297367, users: 466627, total cpu hours: ~12009696112)
Projects using BOINC
• LHC@home:
  a. LHC@home Classic: SixTrack, 2003; hosts: 322580, users: 122303, total cpu hours: ~223381278
  b. vLHC@home: Test4Theory, virtualization technology
  c. Beauty@LHC: B-physics for the LHCb experiment
  d. ATLAS@home: ATLAS simulation
Our work with BOINC (I)
• CAS@home: first official volunteer computing project in mainland China
• Applications: TreeThreader, LAMMPS, etc.
• Address: http://casathome.ihep.ac.cn
Our work with BOINC (II)
• ATLAS@home: ATLAS simulation
• A lot of CPUs are needed for simulation
• Key point: virtualization
• Job statistics for Aug. 2014: ~24000 jobs completed
Our work with BOINC (III)
• Desktop Grid: integrates DIRAC and BOINC to provide public resources to grid users
• Test work is completed and promotion is in progress
• Application: BESIII simulation
Discussion: CEPC Simulation implementation
• Feasibility:
  a. Simulation jobs: low I/O jobs
  b. Virtualization technology: copes with heterogeneous resources
  c. Huge potential resources:
     1) Enthusiastic and strong volunteers from CAS@home
     2) Technical collaboration with the BOINC development group and other project groups
  d. Development experience with large physics experiments: ATLAS@home and the BESDIRAC project
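The "low I/O" argument can be made concrete by comparing transfer time with compute time for one job. The sketch below is only an illustration: the function name `io_fraction` and every number in the usage example (file sizes, volunteer bandwidth, CPU time) are assumptions, not measurements from CEPC simulation.

```python
def io_fraction(input_mb, output_mb, bandwidth_mbps, cpu_hours):
    """Fraction of a job's wall time spent moving data.

    Sizes are in megabytes, bandwidth in megabits per second.
    All parameters are hypothetical inputs for illustration.
    """
    transfer_s = (input_mb + output_mb) * 8 / bandwidth_mbps  # MB -> Mbit
    compute_s = cpu_hours * 3600
    return transfer_s / (transfer_s + compute_s)


# Assumed example: 10 MB in, 90 MB out, 8 Mbit/s home link, 1 CPU hour.
fraction = io_fraction(input_mb=10, output_mb=90, bandwidth_mbps=8, cpu_hours=1)
print(f"{fraction:.1%} of wall time is data transfer")
```

When this fraction stays small, slow and unreliable volunteer links matter little, which is the sense in which simulation jobs suit public-resource computing.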
Discussion: CEPC Simulation implementation
• Work flow on the BOINC side
[Figure: work flow on the BOINC side. Labels: physicist users, web portal/API, generator, BOINC server, image repository, workunits, running, PCs (BOINC client core, shared directory, VM image), outputs.]
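The work flow in the figure can be sketched as a toy model: a generator splits a physicist's request into workunits, volunteer PCs fetch them from the server, run them inside a VM image, and upload the outputs. This is a simplified illustration under stated assumptions, not the project's implementation and not the BOINC API. The class names `Workunit` and `Server`, the function `run_in_vm`, the image name, and the event counts are all hypothetical.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Workunit:
    wu_id: int
    first_event: int
    n_events: int
    image: str  # name of the VM image the client must boot (hypothetical)


@dataclass
class Server:
    queue: deque = field(default_factory=deque)
    outputs: dict = field(default_factory=dict)

    def submit(self, total_events, events_per_wu, image):
        """Generator step: split one request into workunits."""
        wu_id = 0
        for first in range(0, total_events, events_per_wu):
            n = min(events_per_wu, total_events - first)
            self.queue.append(Workunit(wu_id, first, n, image))
            wu_id += 1
        return wu_id  # number of workunits created

    def fetch(self):
        """A volunteer PC asks for work; None means the queue is empty."""
        return self.queue.popleft() if self.queue else None

    def upload(self, wu, output):
        """A volunteer PC returns the output of a finished workunit."""
        self.outputs[wu.wu_id] = output


def run_in_vm(wu):
    """Stand-in for booting the VM image and running the simulation.

    A real client would pass files to and from the VM through the
    shared directory; here only an output file name is produced.
    """
    last_event = wu.first_event + wu.n_events - 1
    return f"sim_{wu.first_event}_{last_event}.out"


server = Server()
n_wu = server.submit(total_events=1000, events_per_wu=300, image="cepc-sim-image")
while (wu := server.fetch()) is not None:  # one loop stands in for many PCs
    server.upload(wu, run_in_vm(wu))
print(n_wu, sorted(server.outputs.values()))
```

Splitting by event range keeps every workunit independent, so any PC can run any workunit in any order.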
Discussion: CEPC Simulation implementation
• Basic architecture: ongoing integration of DIRAC and BOINC for CEPC simulation
Summary & Outlook
• The technology for running CEPC simulation on volunteer computing is well in hand.
• Some test simulation jobs have been completed on the BOINC (C/S) side with good results; more test jobs are needed, and some details still need to be discussed.
• Reduce the size of the image: currently all software is loaded in the image.
• Database access is not available to the public, so the database is loaded in the image.
• Integration of DIRAC and BOINC for CEPC simulation is ongoing.
• Powerful resources are provided by volunteers from CAS@home (hosts: 40000+, CPU cores: 160000+) and other BOINC projects.