Date & Time:
March 26, 2024 2:30 pm – 4:00 pm
Location:
JCL 298
03/26/2024 02:30 PM 03/26/2024 04:00 PM America/Chicago Anne Benoit & Yves Robert – Joint Lecture JCL 298

Variable Capacity Scheduling

Abstract: We formalize the problem of scheduling jobs on a set of machines, each having a fixed number of resources, and a probability to be alive, computed according to some probability distribution, e.g., random walk. The goal is to maximize the utilization. Several heuristics are designed to efficiently schedule jobs, cleverly deciding which machines to use (trade-off between the load and the probability that the machine will not survive until job completion). We will present simulation results, based on real traces, to compare the performance of various heuristics.

Checkpointing`a la Young/Daly

Abstract: The Young/Daly formula provides an approximation of the optimal checkpoint period for a parallel application executing on a large-scale platform. It was originally designed to handle fail-stop errors for preemptible tightly-coupled applications. It has recently been extended to other application and resilience frameworks, such as workflows, silent errors, and imprecise knowledge of key parameters (MTBF and checkpoint duration). We provide some background and survey various scenarios to assess the usefulness and limitations of the formula.

Speakers

Anne Benoit

Associate Professor, Computer Science Laboratory LIP

Anne Benoit is currently an Associate Professor in the Computer Science Laboratory LIP at ENS Lyon, France, and a Senior Member of Institut Universitaire de France. She is Editor-in-Chief of the journal Parallel Computing, Chair of the IEEE CS Technical Committee on Parallel Processing (TCPP), and a senior member of the IEEE. She has chaired the program committee of several major conferences in her field, in particular SC, IPDPS, ICPP and HiPC. Her research interests include algorithm design and scheduling techniques for parallel and distributed platforms, with a focus on energy awareness and resilience. See bit.ly/abenoit for further information.

Yves Robert

Professor, ENS Lyon

Yves Robert is a Full Professor at ENS Lyon, a Fellow of the IEEE and a former Senior Member of Institut Universitaire de France. He received the 2014 IEEE TCSC Award for Excellence in Scalable Computing, the 2016 IEEE TCPP Outstanding Service Award, and the 2020 IEEE CS Charles Babbage Award. He holds a Visiting Scientist position at the Innovative Computing Laboratory, University of Tennessee Knoxville, since 2011. His main research interests are scheduling techniques, parallel algorithms and resilient approaches for large-scale platforms. See~\url{http://graal.ens-lyon.fr/~yrobert/} for further information.

Related News & Events

UChicago CS News

UChicago Partners On New National Science Foundation Large-Scale Research Infrastructure For Education

Dec 10, 2024
UChicago CS News

Saturdays with CSIL — How Undergraduates are Transforming CS Education for Local High School Students

Dec 05, 2024
UChicago CS News

UChicago Researchers Receive Google Privacy Faculty Award for Research on AI Privacy Risks

Nov 22, 2024
UChicago CS News

The Climate App Designed to Tackle Chatham’s Flooding Crisis

Nov 21, 2024
In the News

Globus Receives Multiple Honors in 2024 HPCwire Readers’ and Editors’ Choice Awards

Nov 20, 2024
In the News

Argonne Team Breaks New Ground in AI-Driven Protein Design

Nov 15, 2024
UChicago CS News

DOE Awards Fred Chong and his National Research Team $7.5M to Develop a SMART Software Stack to Control Quantum Computer Noise

Nov 12, 2024
UChicago CS News

CS/LSSG Showcases Sustainability Research and Education

Nov 11, 2024
UChicago CS News

Ph.D. Student Jibang Wu Receives the Stigler Center Ph.D. Dissertation Award for His Work Modeling the Incentive Structures of Reward and Recommendation–Based Systems

Oct 24, 2024
UChicago CS News

Rebecca Willett Receives the SIAM Activity Group on Data Science Career Prize

Oct 23, 2024
UChicago CS News

UChicago CS Researchers Shine at UIST 2024 with Papers, Posters, Workshops and Demonstrations

Oct 10, 2024
UChicago CS News

UChicago Scientists Receive Grant to Expand Global Data Management Platform, Globus

Oct 03, 2024
arrow-down-largearrow-left-largearrow-right-large-greyarrow-right-large-yellowarrow-right-largearrow-right-smallbutton-arrowclosedocumentfacebookfacet-arrow-down-whitefacet-arrow-downPage 1CheckedCheckedicon-apple-t5backgroundLayer 1icon-google-t5icon-office365-t5icon-outlook-t5backgroundLayer 1icon-outlookcom-t5backgroundLayer 1icon-yahoo-t5backgroundLayer 1internal-yellowinternalintranetlinkedinlinkoutpauseplaypresentationsearch-bluesearchshareslider-arrow-nextslider-arrow-prevtwittervideoyoutube