@hackage accelerate0.9.0.0

An embedded language for accelerated array processing

This library defines an embedded language for regular, multi-dimensional array computations with multiple backends to facilitate high-performance implementations. Currently, there are two backends: (1) an interpreter that serves as a reference implementation of the intended semantics of the language and (2) a CUDA backend generating code for CUDA-capable NVIDIA GPUs.

To use the CUDA backend, you need to have CUDA version 3.x installed. The CUDA backend currently doesn't support Char and Bool arrays.

Known bugs: https://github.com/mchakravarty/accelerate/issues

  • New in 0.9.0.0: Streaming, precompilation, Repa-style indices, stencils, more scans, rank-polymorphic fold, generate, block I/O & many bug fixes

  • New in 0.8.1.0: bug fixes and some performance tweaks

  • New in 0.8.0.0: replicate, slice and foldSeg supported in the CUDA backend; frontend and interpreter support for stencil; bug fixes

  • New in 0.7.1.0: the CUDA backend and a number of scalar functions

For documentation, see the homepage and https://github.com/mchakravarty/accelerate/wiki.