OS4C: An Open-Source SR-IOV System for SmartNIC-based Cloud Platforms | Scott Smith, Yuan Ma, Marissa Lanz Kate, Bill Dai (University of Illinois Urbana-Champaign); Martin Ohmacht, Bharat Sukhwani, Hubertus Franke (IBM Research); Volodymyr Kindratenko (University of Illinois at Urbana-Champaign); Deming Chen (University of Illinois, Urbana-Champaign) |
Variable Bit-width Random Number Generation for Implementing Direct Simulation Monte Carlo on Field-Programmable Gate Arrays | Saleen Bhattarai, Andrew Lambert, David Petty (The University of New South Wales Canberra); Sean O’Byrne (The Australian National University) |
High-Performance Reconfigurable Accelerator for Knowledge Graph Reasoning | Hanning Chen, Ali Zakeri, Yang Ni (University of California, Irvine); Fei Wen (Texas A&M University); Behnam Khaleghi (University of California San Diego); Hugo Latapie (Cisco Systems); Mohsen Imani (University of California Irvine) |
BitSys: Bitwise Systolic Array Architecture for Multi-precision Quantized Hardware Accelerators | Yuhao Liu, Salim Ullah, Akash Kumar (Chair of Processor Design, TU Dresden) |
RingTK: A Ring, Parallel and High Performance Top-K Sorter on FPGA | Huawen Liang, Qizhe Wu, Wei Yuan, Teng Tian, Xi Jin (University of Science and Technology of China) |
Stay Flexible: FPGA Acceleration of Graph Neural Networks on a High-Performance NPU Overlay | Taikun Zhang, Andrew Boutros (University of Toronto); Sergey Gribok, Kwadwo Boateng (Intel Corp.); Vaughn Betz (University of Toronto) |
AXI SmartDisconnect: Guiding Memory Performance in Multi-Tenant FPGAs | Kristiyan Manev (EnduroSat) |
The BRAM is the Limit: Shattering Myths, Shaping Standards, and Building Scalable PIM Accelerators | MD Arafat Kabir, Tendayi Kamucheka, Nathaniel Fredricks (University of Arkansas); Joel Mandebi (Advanced Micro Devices, Inc. (AMD)); Jason Bakos (University of South Carolina); Miaoqing Huang, David Andrews (University of Arkansas) |
Best of Both Worlds: Integrating Scalable Analytical Placement into the Flexible VTR Framework | Rachel Selina Rajarathnam (University of Texas at Austin, Austin, TX, USA); Kate Thurmer, Vaughn Betz (University of Toronto, Toronto, ON, Canada); Mahesh A. Iyer (Intel Corporation, San Jose, CA, USA); David Z. Pan (University of Texas at Austin, Austin, TX, USA) |
Etna: MLIR-Based System-Level Design and Optimization for Transparent Application Execution on CPU-FPGA Nodes | Stephanie Soldavini (Politecnico di Milano); Felix Suchert (TU Dresden); Serena Curzel, Michele Fiorito (Politecnico di Milano); Karl Friedrich Alexander Friebel (TU Dresden); Fabrizio Ferrandi (Politecnico di Milano); Radim Cmar (Sygic); Jeronimo Castrillon (TU Dresden); Christian Pilato (Politecnico di Milano) |
Efficient profiling of HLS code | Kimberley Stonehouse, Jose Lopes, Benoit Pradelle (AMD) |
Fitop-Trans: Maximizing Transformer Pipeline Efficiency through Fixed-Length Token Pruning on FPGA | Kejia Shi, Manting Zhang, Keqing Zhao, Xiaoxing Wu, Jun Yu, Kun Wang (Fudan University) |