oneAPI (compute acceleration)
oneAPI is a cross-industry initiative for an open, standards-based unified programming model that creates a common developer experience across compute accelerator architectures. Its objective is to deliver an efficient, performant programming model that eliminates the need for developers to maintain separate code bases, multiple programming languages, and different tools and workflows for each architecture.
The oneAPI Specification
The oneAPI specification[1] extends existing developer programming models to enable a diverse set of hardware through a data-parallel language, a set of library APIs, and a low-level hardware interface to support cross-architecture programming. It builds upon industry standards and provides an open, cross-platform developer stack.
The Language – Data Parallel C++
DPC++[2] is an open, cross-architecture language built upon the ISO C++ and Khronos SYCL standards. DPC++ extends these standards with explicit parallel constructs like sub-groups and unified shared memory offload interfaces to support a broad range of computing architectures and processors, including CPUs and accelerators. Extensions are contributed back to standards bodies.
The oneAPI Libraries
The set of APIs spans several domains that benefit from acceleration, including an interface for deep learning; general libraries for linear algebra math, video, and media processing; and others.
Library Name | Short
Name |
Description |
---|---|---|
oneAPI DPC++ Library | oneDPL | Algorithms and functions to speed DPC++ kernel programming |
oneAPI Math Kernel Library | oneMKL | Math routines including matrix algebra, FFT, and vector math |
oneAPI Data Analytics Library | oneDAL | Machine learning and data analytics functions |
oneAPI Deep Neural Network Library | oneDNN | Neural networks functions for deep learning training and inference |
oneAPI Collective Communications Library | oneCCL | Communication patterns for distributed deep learning |
oneAPI Threading Building Blocks | oneTBB | Threading and memory management template library |
oneAPI Video Processing Library | oneVPL | Real-time video encode, decode, transcode, and processing |
The Hardware Abstraction Layer
oneAPI Level Zero, the low-level hardware interface, defines a set of capabilities and services that a hardware accelerator needs to interface with the broad set of languages in support of consumer to Deep/Machine Learning and HPC class solutions.
Intel Reference Implementation
Intel has released a oneAPI Beta Product[3] that serves as a reference implementation of the specification and adds migration, analysis, and debug tools.
External links
- Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems Using C++ and SYCL
- oneAPI / DPC++ / SYCL Will Run Atop NVIDIA GPUs
- Intel Debuts New GPU – Ponte Vecchio – and Outlines Aspirations for oneAPI
- oneAPI Industry Specification
- Intel oneAPI Product
- ^ "The oneAPI Specification". oneAPI.
{{cite web}}
: CS1 maint: url-status (link) - ^ "Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems Using C++ and SYCL". Apress.
{{cite web}}
: CS1 maint: url-status (link) - ^ "Intel oneAPI Product". Intel oneAPI Toolkits.
{{cite web}}
: CS1 maint: url-status (link)