BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:America/New_York
X-LIC-LOCATION:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=2SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=11;BYDAY=1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20210402T160551Z
LOCATION:Track 7
DTSTART;TZID=America/New_York:20201110T100000
DTEND;TZID=America/New_York:20201110T140000
UID:submissions.supercomputing.org_SC20_sess270_pec106@linklings.com
SUMMARY:Fault-Tolerance for High-Performance and Big Data Applications: Th
 eory and Practice: Part 2
DESCRIPTION:Tutorial\n\nFault-Tolerance for High-Performance and Big Data 
 Applications: Theory and Practice: Part 2\n\nBosilca, Bouteiller, Herault,
  Robert\n\nResilience is a critical issue for large-scale platforms. This 
 tutorial provides a comprehensive survey of fault-tolerant techniques for 
 high-performance and big-data applications, with a fair balance between th
 eory and practice.\n\nThe tutorial will include an overview of failure typ
 es and typical probability distributions, general-purpose techniques: chec
 kpoint and rollback recovery protocols, replication, prediction and silent
  error detection, application-specific techniques: user-level in-memory ch
 eckpointing, data replication (map-reduce) or fixed-point convergence for 
 iterative applications (back-propagation): practical deployment of fault t
 olerance techniques with User Level Fault Mitigation (a proposed MPI stand
 ard extension). Examples: Monte-Carlo methods, SPMD stencil, map-reduce an
 d back-propagation in neural networks. A step-by-step hands-on approach sh
 ows how to protect these routines.\n\nThe tutorial is open to all SC20 att
 endees who are interested in the current status and expected promise of fa
 ult-tolerant approaches for scientific and big data applications. No audie
 nce prerequisites: background will be provided for all protocols and proba
 bilistic models.\n\nTag: Correctness, Fault Tolerance, MPI, Reliability an
 d Resiliency, Reproducibility and Transparency\n\nRegistration Category: T
 utorial Reg Pass
END:VEVENT
END:VCALENDAR

