Zero-Overhead Parallel Scans for Multi-Core CPUs (ARRAY 2024)

Mon 24 - Fri 28 June 2024 Copenhagen, Denmark

Who

Ivo Gabe de Wolff, David van Balen, Gabriele Keller, Trevor L. McDonell

Track

ARRAY 2024

When

Tue 25 Jun 2024 14:55 - 15:20 at Stockholm - Performance

Abstract

Scans are a fundamental primitive for array languages, and enable efficient parallel implementations of many problems. Parallel scans for multi-core CPUs come at a cost, however: their single-threaded performance is around 50% to 75% of the speed of a sequential scan. We present our assisted reduce-then-scan, an adaptation of reduce-then-scan with no single-threaded overhead and slightly better multi-threaded performance than reduce-then-scan. Furthermore, we show that chained scans, the state-of-the-art scan algorithm for GPUs, are also suitable for CPUs, outperforming (assisted) reduce-then-scan. Our adaptive chained scan has zero single-threaded overhead, and multi-threaded performance equal to that of the standard chained scan. Our algorithms allow more threads to join the workload of a scan during its execution, which may happen unpredictably if the program exploits both data and task parallelism. This robustness is especially important for the implementation of parallel array languages, which should work well on a wide range of programs and hardware.
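
For readers unfamiliar with the baseline named in the abstract, the following is a minimal sketch of a plain reduce-then-scan in Python. It is not the authors' assisted or adaptive algorithm; the function name reduce_then_scan, the num_blocks parameter, and the use of a thread pool are assumptions made for illustration. The sketch shows the three-phase structure: each block is read once to reduce it and again to scan it, which is one plausible reason this scheme is slower than a sequential scan when only one thread is available.

```python
from concurrent.futures import ThreadPoolExecutor
from functools import reduce
from itertools import accumulate
import operator


def reduce_then_scan(xs, num_blocks=4, op=operator.add):
    """Inclusive scan of xs under an associative op (illustrative sketch)."""
    n = len(xs)
    if n == 0:
        return []
    num_blocks = max(1, min(num_blocks, n))
    size = -(-n // num_blocks)  # ceiling division
    blocks = [xs[i:i + size] for i in range(0, n, size)]

    def reduce_block(block):
        # Phase 1 worker: fold one (non-empty) block into a single value.
        return reduce(op, block)

    def scan_block(block, carry):
        # Phase 3 worker: scan one block, seeded with the combined value
        # of all preceding blocks. None marks "no preceding block".
        if carry is None:
            return list(accumulate(block, op))
        # accumulate emits the initial value first, so drop it.
        return list(accumulate(block, op, initial=carry))[1:]

    with ThreadPoolExecutor(max_workers=len(blocks)) as pool:
        # Phase 1: reduce every block except the last, in parallel.
        totals = list(pool.map(reduce_block, blocks[:-1]))
        # Phase 2: sequential scan over the per-block totals.
        carries = [None] + list(accumulate(totals, op))
        # Phase 3: scan every block in parallel, reading the input again.
        scanned = list(pool.map(scan_block, blocks, carries))

    return [y for block in scanned for y in block]
```

Because of CPython's global interpreter lock, this sketch illustrates the structure of the algorithm rather than its performance. The carry is always applied on the left, so the sketch also works for associative operators that are not commutative, such as string concatenation.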

File attachments

Extended Abstract (array24-paper15.pdf)	357KiB

Ivo Gabe de Wolff

Utrecht University

David van Balen

Gabriele Keller

Utrecht University

Netherlands

Trevor L. McDonell

Utrecht University

Netherlands

Session Program

Tue 25 Jun

13:40 - 15:20	Performance (ARRAY) at Stockholm

13:40 25m Talk		Apple Array Allocation. Vanessa McHale (Northern Trust)
14:05 25m Talk		Shray: an Owner-Compute Distributed Shared-Memory System. Stefan Schrijvers (Radboud University), Thomas Koopman (Radboud University), Sven-Bodo Scholz (Radboud University)
14:30 25m Talk		Work Assisting: Linking Task-Parallel Work Stealing with Data-Parallel Self Scheduling. Ivo Gabe de Wolff (Utrecht University), Gabriele Keller (Utrecht University)
14:55 25m Talk		Zero-Overhead Parallel Scans for Multi-Core CPUs. Ivo Gabe de Wolff (Utrecht University), David van Balen, Gabriele Keller (Utrecht University), Trevor L. McDonell (Utrecht University)