@hackage dom-selector0.2.0.0

DOM traversal by CSS selectors for xml-conduit package

CSS selector support for xml-conduit/html-conduit. This package supports compile-time checking of CSS selectors using quasiquotes. All DOM traversals are purely functional.

  • Quick start

-- The following pragmas should be put first (Haddock does not accept a pragma notation.)
-- LANGUAGE OverloadedStrings, QuasiQuotes

module Main (main) where

import Text.XML.Cursor (fromDocument)
import Text.HTML.DOM (parseLBS)
import qualified Data.Text.Lazy.IO as TI (putStrLn)

import Control.Monad (mapM_)

import Text.XML.Scraping (innerHtml)
import Text.XML.Selector.TH

import Network.HTTP.Conduit
import Data.Conduit.Binary

main :: IO ()
main = do
   root <- fmap (fromDocument . parseLBS) $ simpleHttp "https://news.google.com/"
   let cs = queryT [jq| h2 span.titletext |] root
   mapM_ (TI.putStrLn . innerHtml) cs

You can use some elementary CSS selectors for traversing a DOM tree.

  • Other examples

https://github.com/nebuta/dom-selector/tree/master/examples

Changes:

Ver 0.2: All scraping functions return lazy text now. It is implemented with a type class.