@hackage streamly-lmdb 0.5.0

Stream data to or from LMDB databases using the streamly library.

Categories
- Streaming
- Databases
License
BSD-3-Clause
Maintainer
sd-haskell@quant.is
Links
Versions
- 0.7.0 Fri, 5 May 2023
- 0.6.0 Thu, 20 Apr 2023
- 0.5.0 Tue, 12 Jul 2022
- 0.4.0 Thu, 27 Jan 2022
- 0.3.0 Sat, 24 Jul 2021
- 0.2.1 Sun, 2 May 2021

streamly-lmdb

Stream data to or from LMDB databases using the Haskell streamly library.

Requirements

Install LMDB on your system:

- Debian Linux: sudo apt-get install liblmdb-dev
- macOS: brew install lmdb

Quick start

{-# LANGUAGE OverloadedStrings #-}

module Main where

import Streamly.External.LMDB
  ( Limits (mapSize),
    WriteOptions (writeTransactionSize),
    defaultLimits,
    defaultReadOptions,
    defaultWriteOptions,
    getDatabase,
    openEnvironment,
    readLMDB,
    tebibyte,
    writeLMDB,
  )
import qualified Streamly.Prelude as S

main :: IO ()
main = do
  -- Open an environment. There should already exist a file or
  -- directory at the given path. (Empty for a new environment.)
  env <-
    openEnvironment "/path/to/lmdb-database" $
      defaultLimits {mapSize = tebibyte}

  -- Get the main database.
  -- Note: It is common practice with LMDB to create the database
  -- once and reuse it for the remainder of the program’s execution.
  db <- getDatabase env Nothing

  -- Stream key-value pairs into the database.
  let fold' = writeLMDB db defaultWriteOptions {writeTransactionSize = 1}
  let writeStream = S.fromList [("baz", "a"), ("foo", "b"), ("bar", "c")]
  _ <- S.fold fold' writeStream

  -- Stream key-value pairs out of the
  -- database, printing them along the way.
  -- Output:
  --     ("bar","c")
  --     ("baz","a")
  --     ("foo","b")
  let unfold' = readLMDB db Nothing defaultReadOptions
  let readStream = S.unfold unfold' undefined
  S.mapM_ print readStream
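
The quick start commits one pair per write transaction (writeTransactionSize = 1). For bulk loads, batching more pairs into each transaction may be worthwhile. The following is a minimal sketch, not a tuned recommendation: it assumes writeTransactionSize is the number of key-value pairs written per transaction, and the generated keys, the count of 10,000, and the batch size of 1000 are purely illustrative.

```haskell
module Main where

import qualified Data.ByteString.Char8 as B
import Streamly.External.LMDB
  ( Limits (mapSize),
    WriteOptions (writeTransactionSize),
    defaultLimits,
    defaultWriteOptions,
    getDatabase,
    openEnvironment,
    tebibyte,
    writeLMDB,
  )
import qualified Streamly.Prelude as S

main :: IO ()
main = do
  -- Same setup as in the quick start; the path is a placeholder.
  env <-
    openEnvironment "/path/to/lmdb-database" $
      defaultLimits {mapSize = tebibyte}
  db <- getDatabase env Nothing

  -- Hypothetical bulk data: 10,000 generated key-value pairs.
  let pairs =
        S.map
          (\i -> (B.pack ("key" ++ show i), B.pack (show i)))
          (S.fromList [1 :: Int .. 10000])

  -- Assumption: each write transaction covers up to 1000 pairs,
  -- instead of one pair per transaction as in the quick start.
  let fold' = writeLMDB db defaultWriteOptions {writeTransactionSize = 1000}
  _ <- S.fold fold' pairs
  pure ()
```

Whether a larger transaction size helps depends on your data and durability needs, so it is worth measuring on your own workload.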

Benchmarks

See bench/README.md. Summary (with rough figures from our machine†):

- Reading. For reading a fully cached LMDB database, this library (when unsafeReadLMDB is used instead of readLMDB) has roughly a 15 ns/pair overhead compared to plain Haskell IO code, which in turn has roughly another 10 ns/pair overhead compared to C. (That the first two are so close bears out the promise of streamly and stream fusion.) We deduce that if your total workload per pair takes longer than around 25 ns, the overhead of using this library rather than C will not be your bottleneck.
- Writing. Writing with plain Haskell IO code and with this library is around 30% and 50% slower, respectively, than writing with C. We have not investigated these differences further because this write performance is currently good enough for our purposes.

† Linode; Debian 10, Dedicated 32GB: 16 CPU, 640GB Storage, 32GB RAM.
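
To make the 25 ns figure from the reading summary concrete, the sketch below adds the two quoted overheads (15 ns + 10 ns) and computes what share of the per-pair time they represent for a given amount of your own work. The function names are hypothetical, and the calculation is a simplification that ignores the baseline cost of reading a pair with C itself.

```haskell
module Main where

-- Rough per-pair read overheads quoted in the summary (nanoseconds).
libraryOverPlainHaskellNs :: Double
libraryOverPlainHaskellNs = 15

plainHaskellOverCNs :: Double
plainHaskellOverCNs = 10

-- Combined overhead of this library relative to C: 15 + 10 = 25 ns.
totalOverheadNs :: Double
totalOverheadNs = libraryOverPlainHaskellNs + plainHaskellOverCNs

-- Share of the per-pair time that is overhead, given the time (in ns)
-- your own code spends on each pair. Simplification: the baseline
-- cost of the C read itself is not included.
overheadShare :: Double -> Double
overheadShare workNs = totalOverheadNs / (workNs + totalOverheadNs)

main :: IO ()
main = mapM_ report [25, 100, 1000]
  where
    report w =
      putStrLn
        ( show w
            ++ " ns of work per pair: overhead share "
            ++ show (overheadShare w)
        )
```

Under these assumptions, at 25 ns of work per pair the overhead is half of the total, at 100 ns it is one fifth, and it keeps shrinking as the per-pair work grows.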

Installation
In your cabal file, add streamly-lmdb to build-depends.
Dependencies (4)
- base >=4.7 && <5
- bytestring >=0.10.10.0 && <0.11
- async >=2.2.2 && <2.3
- streamly >=0.8 && <0.9
Dependents (0)
