BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:America/New_York
X-LIC-LOCATION:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=2SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=11;BYDAY=1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20210402T160551Z
LOCATION:Track 5
DTSTART;TZID=America/New_York:20201118T150000
DTEND;TZID=America/New_York:20201118T163000
UID:submissions.supercomputing.org_SC20_sess174@linklings.com
SUMMARY:Resilience and Power Management
DESCRIPTION:Paper\n\nRuntime-Guided ECC Protection using Online Estimation
  of Memory Vulnerability\n\nJaulmes, Moretó, Valero, Erez, Casas\n\nDimini
 shing reliability of semiconductor technologies and decreasing power budge
 ts per component hinder designing next-generation high-performance computi
 ng (HPC) systems. Both constraints strongly impact memory subsystems, as D
 RAM main memory accounts for up to 30 to 50 percent of a node’s overall ..
 .\n\n---------------------\nANT-Man: Towards Agile Power Management in the
  Microservice Era\n\nHou, Li, Liu, Zhang, Hu...\n\nThe emerging trend of d
 ecomposing cloud applications into microservices has raised new questions 
 about managing the performance/power trade-off of a datacenter at microsec
 ond-scale. We introduce ANT-Man, an Agile, Native and Transparent power Ma
 nagement framework that can exploit fine-grained micros...\n\n------------
 ---------\nCRAC: Checkpoint-Restart Architecture for CUDA with Streams and
  UVM\n\nJain, Cooperman\n\nThe share of the top 500 supercomputers with Nv
 idia GPUs is now over 25% and continues to grow.  While fault tolerance is
  a critical issue for supercomputing, there does not currently exist an ef
 ficient, scalable solution for CUDA applications on Nvidia GPUs.  CRAC is 
 a new checkpoint-restart soluti...\n\n\nTag: Accelerators, FPGA, and GPUs,
  Fault Tolerance, Power, Reliability and Resiliency\n\nRegistration Catego
 ry: Tech Program Reg Pass
END:VEVENT
END:VCALENDAR

