instrumenting folding@work

Instrumenting Folding@Work

Badi Abdul-Wahid, RJ NowlingCSE 60641 Operating Systems

Professor Striegel

Overview

• Problem Description– Experimental Structure– Folding@Work Workflow

• Benchmarks• Results– Weak Scaling (ns / day)– Server Capacity– Available Workers Over Time– Variability of Computation Time

• Conclusions

Experimental Structure

Folding@Work Workflow

Benchmarks

• Tasks: 1 ns generations (approx 2 hr on test machine)

• 10 consecutive generations / simulations• Weak Scaling– 10 simulations / 10 workers– 100 simulations / 100 workers– 1,000 simulations / 1,000 workers

• Condor, later added SGE jobs• 1 Trial of each; Took ~ 2 days to run

Weak Scaling of F@W

Server Capacity (Wait Time)

Available Workers over Time

Transfer Times

Variability of Computation Time

Example Execution Timeline

Performance Model

Nwu =⟨texe⟩+ ⟨tW ,wait⟩

⟨tnew⟩+ ⟨ttrans⟩+ ⟨tM ,wait⟩

Weak Scaling (updated)

Wait Times

Tasks Waiting

Identified Areas of Improvement• Availibility of Resources

– Benchmarks limited by number of sustained workers available through Condor

– New feature: WorkQueue Worker Pool can be used to start new workers• WorkQueue Limits Number of Workers

– Increasing number of file descriptors allowed up to 2,500 workers to connect– Bad behavior occuring in calls to select()– Working with WorkQueue developers to switch to poll()

• Long-Running Work Units Delay Completion of Trajectories– Some work units not returned / taking very long time– Prevents trajectories from finishing– Use fast abort feature to re-assign work units that take longer than a

specified time

Conclusion

• Accomplished– Identified key metrics (ns / day, wait time)– Developed scaling model– Tested model

• Conclusions– Real scientific applications scale well– Forcing short workunits adds load to Master– Performance model validated– “Self-correcting” behavior

instrumenting folding@work

work instrumenting folding

task queue task

reassign work units

work5weak scaling of

w simulated time

ns generations approx

workers100 simulations

condornew feature

Documents

instrumenting an abstract object

key notes in instrumenting change

folding fabric partitions - barbour product search...folding...

delphi tools update: instrumenting threaded programsdesign...

in-situ sensors instrumenting the environment gregory bonito...

instrumenting parsecs raytrace

instrumenting point-of-sale malware - def con ·...

instrumenting php applications slides

instrumenting flexible substrates for clinical diagnosis

understanding, choosing & instrumenting nosql

folding work table | table de travail pliante

building and instrumenting the next- generation security...

gauging adf application performance instrumenting your...

instrumenting your instruments

a framework for dynamically instrumenting gpu compute...

instrumenting go (gopherconindia lightning talk by bhasker...

instrumenting, analyzing, & tuning the performance of...

defcon 22-wesley-mc grew-instrumenting-point-of-sale-malware

instrumenting the planet for intelligence from blue sky to...

coarsening bias: how instrumenting for instrumental