@hackage pandoc-pyplot2.2.0.0

A Pandoc filter to include figures generated from Python code blocks

pandoc-pyplot - A Pandoc filter to generate Matplotlib/Plotly figures directly in documents

Hackage version Stackage version (LTS) Windows Build status macOS and Linux Build Status GitHub

pandoc-pyplot turns Python code present in your documents into embedded figures via Matplotlib or Plotly.

Usage

Markdown

The filter recognizes code blocks with the .pyplot or .plotly classes present in Markdown documents. It will run the script in the associated code block in a Python interpreter and capture the generated Matplotlib/Plotly figure.

Here is a basic example using the scripting matplotlib.pyplot API:

```{.pyplot}
import matplotlib.pyplot as plt

plt.figure()
plt.plot([0,1,2,3,4], [1,2,3,4,5])
plt.title('This is an example figure')
```

Putting the above in input.md, we can then generate the plot and embed it:

pandoc --filter pandoc-pyplot input.md --output output.html

or

pandoc --filter pandoc-pyplot input.md --output output.pdf

or any other output format you want.

LaTeX

The filter works slightly differently in LaTeX documents. In LaTeX, the minted environment must be used, with the pyplot class.

\begin{minted}{pyplot}
import matplotlib.pyplot as plt

plt.figure()
plt.plot([0,1,2,3,4], [1,2,3,4,5])
plt.title('This is an example figure')
\end{minted}

Note that you do not need to have minted installed.

Examples

There are more examples in the source repository, in the \examples directory.

Features

Captions

You can also specify a caption for your image. This is done using the optional caption parameter.

Markdown:

```{.pyplot caption="This is a simple figure"}
import matplotlib.pyplot as plt

plt.figure()
plt.plot([0,1,2,3,4], [1,2,3,4,5])
plt.title('This is an example figure')
```

LaTex:

\begin{minted}[caption=This is a simple figure]{pyplot}
import matplotlib.pyplot as plt

plt.figure()
plt.plot([0,1,2,3,4], [1,2,3,4,5])
plt.title('This is an example figure')
\end{minted}

Caption formatting is either plain text or Markdown. LaTeX-style math is also support in captions (using dollar signs \(...\)).

In case of an output format that supports links (e.g. HTML), the embedded image generated by pandoc-pyplot will be a link to the source code which was used to generate the file. Therefore, other people can see what Python code was used to create your figures. A high resolution image will be made available in a caption link.

(New in version 2.1.3.0) For cleaner output (e.g. PDF), you can turn this off via the links=false key:

Markdown:

```{.pyplot links=false}
...
```

LaTex:

\begin{minted}[links=false]{pyplot}
...
\end{minted}

or via a configuration file.

Including scripts

If you find yourself always repeating some steps, inclusion of scripts is possible using the include parameter. For example, if you want all plots to have the ggplot style, you can write a very short preamble style.py like so:

import matplotlib.pyplot as plt
plt.style.use('ggplot')

and include it in your document as follows:

```{.pyplot include=style.py}
plt.figure()
plt.plot([0,1,2,3,4], [1,2,3,4,5])
plt.title('This is an example figure')
```

Which is equivalent to writing the following markdown:

```{.pyplot}
import matplotlib.pyplot as plt
plt.style.use('ggplot')

plt.figure()
plt.plot([0,1,2,3,4], [1,2,3,4,5])
plt.title('This is an example figure')
```

The equivalent LaTeX usage is as follows:

\begin{minted}[include=style.py]{pyplot}

\end{minted}

This include parameter is perfect for longer documents with many plots. Simply define the style you want in a separate script! You can also import packages this way, or define functions you often use.

Customization of figures beyond what is available in pandoc-pyplot can also be done through the include script. For example, if you wanted to figures with a black background, you can do so via matplotlib.pyplot.rcParams:

import matplotlib.pyplot as plt

plt.rcParams['savefig.facecolor'] = 'k'
...

You can take a look at all available matplotlib parameters here.

Multiple backends

(new in version 2.2.0.0) Both Matplotlib and Plotly are supported!

To render Plotly figures in Markdown:

```{.plotly caption="This is a Plotly figure"}
import plotly.graph_objects as go
figure = go.Figure(
    data=[go.Bar(y=[2, 1, 3])],
)

Here is the LaTeX equivalent:

\begin{minted}[caption=This is a Plotly figure]{plotly}
import plotly.graph_objects as go
figure = go.Figure(
    data=[go.Bar(y=[2, 1, 3])],
)
\end{minted}

pandoc-pyplot will render and capture your figure automagically.

No wasted work

pandoc-pyplot minimizes work, only generating figures if it absolutely must. Therefore, you can confidently run the filter on very large documents containing dozens of figures --- like a book or a thesis --- and only the figures which have changed will be re-generated.

Compatibility with pandoc-crossref

pandoc-crossref is a pandoc filter that makes it effortless to cross-reference objects in Markdown documents.

You can use pandoc-crossref in conjunction with pandoc-pyplot for the ultimate figure-making pipeline. You can combine both in a figure like so:

```{#fig:myexample .pyplot caption="This is a caption"}
# Insert figure script here
```

As you can see in @fig:myexample, ...

If the above source is located in file myfile.md, you can render the figure and references by applying pandoc-pyplot first, and then pandoc-crossref. For example:

pandoc --filter pandoc-pyplot --filter pandoc-crossref -i myfile.md -o myfile.html

Configurable

(New in version 2.1.0.0) To avoid repetition, pandoc-pyplot can be configured using simple YAML files. pandoc-pyplot will look for a .pandoc-pyplot.yml file in the current working directory:

# You can specify any or all of the following parameters
interpreter: python36
directory: mydirectory/
include: mystyle.py
format: jpeg
links: false
dpi: 150            # Matplotlib only
tight_bbox: true    # Matplotlib only
transparent: false  # Matplotlib only
flags: [-O, -Wignore]

These values override the default values, which are equivalent to:

# Defaults if no configuration is provided.
# Note that the default interpreter name on MacOS and Unix is 'python3'
# and 'python' on Windows.
interpreter: python
flags: []
directory: generated/
format: png
links: true
dpi: 80
tight_bbox: false
transparent: false
flags: []

Using pandoc-pyplot --write-example-config will write the default configuration to a file .pandoc-pyplot.yml, which you can then customize.

Configuration-only parameters

There are a few parameters that are only available via the configuration file .pandoc-pyplot.yml:

  • interpreter is the name of the interpreter to use. For example, interpreter: python36;
  • flags is a list of strings, which are flags that are passed to the python interpreter. For example, flags: [-O, -Wignore];
  • (New in version 2.1.5.0) tight_bbox is a boolean that determines whether to use bbox_inches="tight" or not when saving Matplotlib figures. For example, tight_bbox: true. See here for details. This is ignored for Plotly figures.
  • (New in version 2.1.5.0) transparent is a boolean that determines whether to make Matplotlib figure background transparent or not. This is useful, for example, for displaying a plot on top of a colored background on a web page. High-resolution figures are not affected. For example, transparent: true. This is ignored for Plotly figures.

Installation

Binaries

Windows binaries are available on GitHub. Place the executable in a location that is in your PATH to be able to call it.

If you can show me how to generate binaries for other platform using e.g. Azure Pipelines, let me know!

Installers (Windows)

Windows installers are made available thanks to Inno Setup. You can download them from the release page.

From Hackage/Stackage

pandoc-pyplot is available on Hackage. Using the cabal-install tool:

cabal update
cabal install pandoc-pyplot

Similarly, pandoc-pyplot is available on Stackage:

stack update
stack install pandoc-pyplot

From source

Building from source can be done using stack or cabal:

git clone https://github.com/LaurentRDC/pandoc-pyplot
cd pandoc-pylot
stack install # Alternatively, `cabal install`

Running the filter

Requirements

This filter requires a Python interpreter and at least Matplotlib or Plotly installed. The name of the Python interpreter to use can be specified in a .pandoc-pyplot.yml file; by default, pandoc-pyplot will use the "python" name on Windows, and "python3" otherwise.

Use the filter with Pandoc as follows:

pandoc --filter pandoc-pyplot input.md --output output.html

in which case, the output is HTML. Another example with PDF output:

pandoc --filter pandoc-pyplot input.md --output output.pdf

Python exceptions will be printed to screen in case of a problem.

pandoc-pyplot has a limited command-line interface. Take a look at the help available using the -h or --help argument:

pandoc-pyplot --help

Usage as a Haskell library

To include the functionality of pandoc-pyplot in a Haskell package, you can use the makePlot :: Block -> IO Block function (for single blocks) or plotTransform :: Pandoc -> IO Pandoc function (for entire documents). Variations of these functions exist for more advanced configurations. Take a look at the documentation on Hackage.

Usage with Hakyll

This filter was originally designed to be used with Hakyll. In case you want to use the filter with your own Hakyll setup, you can use a transform function that works on entire documents:

import Text.Pandoc.Filter.Pyplot (plotTransform)

import Hakyll

-- Unsafe compiler is required because of the interaction
-- in IO (i.e. running an external Python script).
makePlotPandocCompiler :: Compiler (Item String)
makePlotPandocCompiler =
  pandocCompilerWithTransformM
    defaultHakyllReaderOptions
    defaultHakyllWriterOptions
    (unsafeCompiler . plotTransform)

The plotTransformWithConfig is also available for a more configurable set-up.

Warning

Do not run this filter on unknown documents. There is nothing in pandoc-pyplot that can stop a Python script from performing evil actions.