BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:America/New_York
X-LIC-LOCATION:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=2SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=11;BYDAY=1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20210402T160554Z
LOCATION:Track 2
DTSTART;TZID=America/New_York:20201112T153000
DTEND;TZID=America/New_York:20201112T160000
UID:submissions.supercomputing.org_SC20_sess208_ws_pmbsf115@linklings.com
SUMMARY:Accelerating High-Order Stencils on GPUs
DESCRIPTION:Workshop\n\nAccelerating High-Order Stencils on GPUs\n\nSai, M
 ellor-Crummey, Meng, Araya-Polo, Meng\n\nWhile implementation strategies f
 or low-order stencils on GPUs have been well-studied in the literature, no
 t all of the techniques studied work well for high-order stencils, such as
  those used for seismic imaging. In this paper, we study practical seismic
  imaging computations on GPUs using high-order stencils on large domains w
 ith meaningful boundary conditions. We manually crafted a collection of im
 plementations of a 25-point seismic modeling stencil in CUDA along with co
 de to apply the boundary conditions. We evaluate our stencil code shapes, 
 memory hierarchy usage, data-fetching patterns, and other performance attr
 ibutes. We conducted an empirical evaluation of these stencils using sever
 al mature and emerging tools and discuss our quantitative findings. Some o
 f our implementations achieved twice the performance of a proprietary code
  developed in C and mapped to GPUs using OpenACC. Additionally, several of
  our implementations have excellent performance portability.\n\nRegistrati
 on Category: Workshop Reg Pass
END:VEVENT
END:VCALENDAR

