Upcoming Talks

40th IEEE International Parallel & Distributed Processing Symposium (IPDPS '26) - New Orleans, USA
(May 25 - 29, 2026)

Time Location Event Speaker(s)

Monday, May 25

8:30AM - 12:00PM Room K

High-Performance and Smart Networking Technologies for HPC and AI

[Tutorial]

DK Panda
B. Michalowicz

Tuesday, May 26

8:30AM - 12:00PM Room K

Principles and Practice of High Performance Deep Learning Training and Inference

[Tutorial]

DK Panda
N. Alnaasan
1:30PM - 2:30PM Salon D

Impact of AI on PDC Education Panel at the EduPar-26 Workshop

[Panel]

DK Panda

Thursday, May 28

1:30PM - 3:00PM Room F-G-H

From Skew to Symmetry: Node-Interconnect Multi-Path Balancing with Execution-time Planning for Modern GPU Clusters

[Talk]

J. Yao
K. Suresh
B. Ramesh
H. Subramoni
DK Panda
3:30PM - 5:00PM Room D

Design and Implementation of Casting Compression for GPU-Aware MPI Collectives

[Talk]

C. Chen
N. Contini
L. Xu
J. Queiser
H. Subramoni
DK Panda
5:00PM - 6:00PM Room D

One Memory-Many Paths: Early Experiences with Allocation and Data Copy Strategies on MI300A

[Talk] [Best Paper Award Finalist]

G. Kuncham
S. Zhang
DK Panda

Past Talks

NVIDIA GTC AI Conference 2026 - San Jose, California
(Mar 16 - 19, 2026)

Time Location Event Speaker(s)

Sunday, March 15

5:00PM - 7:00PM None

HyperSack: Resource-Aware Distributed Hyperparameter Optimization for Lightweight Vision and Language Models on NVIDIA GPU Systems

[Poster Presentation]

N. Alnaasan
DK Panda
5:00PM - 7:00PM None

Characterizing Communication Patterns in Distributed Large Language Model Inference

[Poster Presentation]

L. Xu
K. Suresh
Q. Anthony
N. Alnaasan
DK Panda
5:00PM - 7:00PM None

Design and Optimization of GPU-Aware MPI Allreduce Using Direct Sendrecv Communication

[Poster Presentation]

C. Chen
J. Yao
H. Subramoni
DK Panda

31st ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming & The IEEE International Symposium on High-Performance Computer Architecture - Sydney, Australia
(Jan 31 - Feb 04, 2026)

Time Location Event Speaker(s)

Saturday, January 31

1:45PM - 5:45PM Collaroy

High-Performance and Smart Networking Technologies for HPC and AI

[Tutorial]

DK Panda
B. Michalowicz

Sunday, February 01

8:45AM - 12:45PM Curl Curl

Principles and Practice of Scalable and Distributed Deep Neural Networks Training and Inference

[Tutorial]

DK Panda
N. Alnaasan

The International Conference on High Performance Computing in Asia-Pacific Region - Osaka, Japan
(Jan 26 - 29, 2026)

Time Location Event Speaker(s)

Monday, January 26

All times are listed in JST
9:30AM - 12:30PM Room 1001

High-Performance and Smart Networking Technologies for HPC and AI

[Tutorial]

DK Panda
B. Michalowicz
1:30PM - 4:30PM Room 1001

Principles and Practice of Scalable and Distributed Deep Neural Networks Training and Inference

[Tutorial]

DK Panda
N. Alnaasan

Tuesday, January 27

All times are listed in JST
3:45PM - 4:15PM Room 801

Ethernet for HPC/AI Clusters: High-Performance and Scalable Middleware for HPC and AI

[Talk]

DK Panda
5:00PM - 6:00PM Room 1002

Agriculture Empowered by Supercomputing

[BoF]

DK Panda

Wednesday, January 28

All times are listed in JST
1:30PM - 2:00PM Room 1003

Designing High-Performance, Scalable, and Converged Middleware for the Modern HPC and AI Systems, Talk at the 8th RIKEN Center for Computational Science (R-CCS) International Symposium

[Talk]

DK Panda
2:30PM - 2:50PM Room 702

Designing High-Performance and Scalable Middleware for the Modern HPC and AI Era, Talk at the 2026 APAC HPC-AI Competition – RDMA and the Networking Architecture of the AI Factory

[Talk]

DK Panda

Thursday, January 29

All times are listed in JST
1:30PM - 5:00PM 12F Conference Hall

The First International Workshop on Foundational Large Language Models Advances for HPC in Asia (LLM4HPCAsia’26)

[Workshop]

Pedro Valero Lara
William F. Godoy
DK Panda

The International Conference for High Performance Computing, Networking, Storage, and Analysis 2025 - St. Louis, Missouri
(Nov 16 - 21, 2025)

Time Location Event Speaker(s)

Thursday, November 20

10:30AM - 11:00AM Booth #414

Powering Performance: Inside TACC Vista and Its Communication Libraries

[Talk]

John Cazes and Amit Ruhela, TACC
10:30AM - 10:45AM 230

Designing GPU-Aware Collective Communication for Heterogeneous Clusters with Diverse GPUs and Interconnects | Presentations

[Doctoral Showcase]

C. Chen
11:30AM - 12:00PM Booth #414

MPI Communication Performance on AMD MI300A: Microbenchmarks and Applications

[Talk]

G. Kuncham
12:30PM - 1:00PM Booth #414

HPC and AI Workloads on C-DAC’s Trinetra Network Using MVAPICH4

[Talk]

Parikshit Godbole, CDAC, India
1:30PM - 2:00PM Booth #414

Enhanced MPI Intra-node Communication Framework with Cooperative DMA-based Data Transfer

[Talk]

S. Xu
2:15PM - 2:45PM 275

A Streaming Collectives Interface Targeting Dataflow Acceleration and HPC Workloads

[Talk]

N. Contini
J. Queiser
B. Ramesh
H. Subramoni
DK Panda

Friday, November 21

10:30AM - 11:00AM 261

MPI Communication Performance on AMD MI300A: Microbenchmarks and Applications, IPDRM workshop

[Workshop]

G. Kuncham
S. Zhang
S. Mohammad
C. Chen
DK Panda