License | BSD-3-Clause |
---|---|
Maintainer | Jamie Willis, Gigaparsec Maintainers |
Stability | stable |
Safe Haskell | Trustworthy |
Language | Haskell2010 |
Text.Gigaparsec
Description
This module contains the core combinators and parser type: all parsers will require something from within!
Since: 0.1.0.0
Synopsis
- data Parsec a
- data Result e a
- result :: (e -> b) -> (a -> b) -> Result e a -> b
- parse :: ErrorBuilder err => Parsec a -> String -> Result err a
- parseFromFile :: ErrorBuilder err => Parsec a -> FilePath -> IO (Result err a)
- parseRepl :: Show a => Parsec a -> String -> IO ()
- atomic :: Parsec a -> Parsec a
- lookAhead :: Parsec a -> Parsec a
- notFollowedBy :: Parsec a -> Parsec ()
- eof :: Parsec ()
- unit :: Parsec ()
- pure :: Applicative f => a -> f a
- empty :: Alternative f => f a
- ($>) :: Parsec a -> b -> Parsec b
- (<$>) :: Functor f => (a -> b) -> f a -> f b
- (<$) :: Functor f => a -> f b -> f a
- void :: Functor f => f a -> f ()
- (<~>) :: Parsec a -> Parsec b -> Parsec (a, b)
- (<:>) :: Parsec a -> Parsec [a] -> Parsec [a]
- (<*>) :: Applicative f => f (a -> b) -> f a -> f b
- liftA2 :: Applicative f => (a -> b -> c) -> f a -> f b -> f c
- (*>) :: Applicative f => f a -> f b -> f b
- (<*) :: Applicative f => f a -> f b -> f a
- (<**>) :: Applicative f => f a -> f (a -> b) -> f b
- (<+>) :: Parsec a -> Parsec b -> Parsec (Either a b)
- (<|>) :: Alternative f => f a -> f a -> f a
- select :: Selective f => f (Either a b) -> f (a -> b) -> f b
- branch :: Selective f => f (Either a b) -> f (a -> c) -> f (b -> c) -> f c
- filterS :: (a -> Bool) -> Parsec a -> Parsec a
- mapMaybeS :: (a -> Maybe b) -> Parsec a -> Parsec b
- many :: Alternative f => f a -> f [a]
- manyl :: (b -> a -> b) -> b -> Parsec a -> Parsec b
- manyr :: (a -> b -> b) -> b -> Parsec a -> Parsec b
- manyMap :: Monoid m => (a -> m) -> Parsec a -> Parsec m
- some :: Alternative f => f a -> f [a]
- somel :: (b -> a -> b) -> b -> Parsec a -> Parsec b
- somer :: (a -> b -> b) -> b -> Parsec a -> Parsec b
- someMap :: Semigroup s => (a -> s) -> Parsec a -> Parsec s
The Parsec
type
This type represents parsers as a first-class value.
Values of this type are constructed using the library's combinators, to build
up a final Parsec
value that can be passed to parse
or one
of the similar functions. This is implemented internally similar to other
libraries like parsec
and gigaparsec
.
Since: 0.1.0.0
Instances
Alternative Parsec Source # | Since: 0.1.0.0 |
Applicative Parsec Source # | Since: 0.1.0.0 |
Functor Parsec Source # | Since: 0.1.0.0 |
Monad Parsec Source # | Since: 0.1.0.0 |
Selective Parsec Source # | Since: 0.1.0.0 |
Monoid m => Monoid (Parsec m) Source # | Since: 0.1.0.0 |
Semigroup m => Semigroup (Parsec m) Source # | Since: 0.1.0.0 |
Similar to Either
, this type represents whether a parser has failed or not.
This is chosen instead of Either
to be more specific about the naming.
Since: 0.1.0.0
Constructors
Success a | The parser succeeded with a result of type |
Failure e | The parser failed with an error of type |
Instances
Arguments
:: (e -> b) |
|
-> (a -> b) |
|
-> Result e a |
|
-> b | applies either |
Arguments
:: ErrorBuilder err | |
=> Parsec a | the parser to execute |
-> String | the input to parse |
-> Result err a | result of the parse, either an error or result |
Runs a parser against some input.
Given a parser, some input, and a way of producing parse errors of a desired
type (via ErrorBuilder
), this function runs a parser to produce either a
successful result or an error. Note that the err
type parameter is first,
which allows for parse @String
to make use of the defaultly formated String
error messages. This may not be required if it is clear from context. To make
this process nicer within GHCi, consider using parseRepl
.
Since: 0.2.0.0
Arguments
:: ErrorBuilder err | |
=> Parsec a | the parser to execute |
-> FilePath | the file to source the input from |
-> IO (Result err a) | the result of the parse, error or otherwise |
Runs a parser against some input obtained from a given file.
Given a parser, a filename, and a way of producing parse errors of a desired
type (via ErrorBuilder
), this function runs a parser to produce either a
successful result or an error. First, input is collected by reading the file,
and then the result is returned within IO
; the filename is forwarded on to
the ErrorBuilder
, which may mean it forms part of the generated error messages.
Note that the err
type parameter is first,
which allows for parseFromFile @String
to make use of the defaultly formated String
error messages. This may not be required if it is clear from context.
Since: 0.2.1.0
Arguments
:: Show a | |
=> Parsec a |
|
-> String |
|
-> IO () | An |
Runs a parser against some input, pretty-printing the result to the terminal.
Compared, with parse
, this function will always generate error messages as
strings and will print them nicely to the terminal (on multiple lines). If the
parser succeeds, the result will also be printed to the terminal. This is useful
for playing around with parsers in GHCi, seeing the results more clearly.
Since: 0.2.0.0
Primitive Combinators
These combinators are specific to parser combinators. In one way or another, they influence how a parser consumes input, or under what conditions a parser does or does not fail. These are really important for most practical parsing considerations, although lookAhead is much less well used.
Arguments
:: Parsec a | the parser, |
-> Parsec a | a parser that tries |
This combinator parses its argument p
, but rolls back any consumed input on failure.
If the parser p
succeeds, then atomic p
has no effect. However, if p
failed,
then any input that it consumed is rolled back. This has two uses: it ensures that
the parser p
is all-or-nothing when consuming input, and it allows for
parsers that consume input to backtrack when they fail (with (<|>)
). It should be
used for the latter purpose sparingly, however, since excessive backtracking in a
parser can result in much lower efficiency.
Examples
>>>
parse (string "abc" <|> string "abd") "abd"
Failure .. -- first parser consumed a, so no backtrack>>>
parse (atomic (string "abc") <|> string "abd") "abd"
Success "abd" -- first parser does not consume input on failure now
Since: 0.1.0.0
Arguments
:: Parsec a | the parser, |
-> Parsec a | a parser that parses |
This combinator parses its argument p
, but does not consume input if it succeeds.
If the parser p
succeeds, then lookAhead p
will roll back any input consumed
whilst parsing p
. If p
fails, however, then the whole combinator fails and
any input consumed remains consumed. If this behaviour is not desirable,
consider pairing lookAhead
with atomic
.
Examples
>>>
parse (lookAhead (string "aaa") *> string "aaa") "aaa"
Success "aaa">>>
parse (lookAhead (string "abc") <|> string "abd" "abd"
Failure .. -- lookAhead does not roll back input consumed on failure
Since: 0.1.0.0
Arguments
:: Parsec a | the parser, |
-> Parsec () | a parser which fails when |
This combinator parses its argument p
, and succeeds when p
fails and vice-versa, never consuming
input.
If the parser p
succeeds, then notFollowedBy p
will fail, consuming no input.
Otherwise, should p
fail, then notFollowedBy p
will succeed, consuming no input
and returning ()
.
Examples
One use for this combinator is to allow for "longest-match" behaviour. For instance, keywords are normally only considered keywords if they are not part of some larger valid identifier (i.e. the keyword "if" should not parse successfully given "ifp"). This can be accomplished as follows:
keyword :: String -> Parsec () keyword kw = atomic $ string kw *> notFollowedBy letterOrDigit
Since: 0.1.0.0
This parser only succeeds at the end of the input.
Equivalent to `notFollowedBy(item)`.
Examples
>>>
parse eof "a"
Failure ..>>>
parse eof ""
Success ()
Since: 0.1.0.0
Consumptionless Parsers
These combinators and parsers do not consume input: they are the most primitive ways of
producing successes and failures with the minimal possible effect on the parse. They are,
however, reasonably useful; in particular, pure
and unit
can be put to good use in
injecting results into a parser without needing to consume anything, or mapping another parser.
This parser produces ()
without having any other effect.
When this parser is ran, no input is required, nor consumed, and the given value will always be successfully returned. It has no other effect on the state of the parser.
Since: 0.1.0.0
pure :: Applicative f => a -> f a #
Lift a value into the Structure.
Examples
>>>
pure 1 :: Maybe Int
Just 1
>>>
pure 'z' :: [Char]
"z"
>>>
pure (pure ":D") :: Maybe [String]
Just [":D"]
empty :: Alternative f => f a #
The identity of <|>
empty <|> a == a a <|> empty == a
Result Changing Combinators
These combinators change the result of the parser they are called on into a value of a different type. This new result value may or may not be derived from the previous result.
Arguments
:: Parsec a |
|
-> b |
|
-> Parsec b | a parser that runs |
This combinator, pronounced "as", replaces the result of the given parser, ignoring the old result.
Similar to ($)
, except the old result of the given parser is not required to
compute the new result. This is useful when the result is a constant value (or function!).
Functionally the same as p *> pure x
or const x $ p
.
In scala parsley, this combinator is known as #>
or 'as
'.
Examples
>>>
parse (string "true" $> true) "true"
Success true
Since: 0.1.0.0
(<$>) :: Functor f => (a -> b) -> f a -> f b infixl 4 #
An infix synonym for fmap
.
The name of this operator is an allusion to $
.
Note the similarities between their types:
($) :: (a -> b) -> a -> b (<$>) :: Functor f => (a -> b) -> f a -> f b
Whereas $
is function application, <$>
is function
application lifted over a Functor
.
Examples
Convert from a
to a Maybe
Int
using Maybe
String
show
:
>>>
show <$> Nothing
Nothing
>>>
show <$> Just 3
Just "3"
Convert from an
to an
Either
Int
Int
Either
Int
String
using show
:
>>>
show <$> Left 17
Left 17
>>>
show <$> Right 17
Right "17"
Double each element of a list:
>>>
(*2) <$> [1,2,3]
[2,4,6]
Apply even
to the second element of a pair:
>>>
even <$> (2,2)
(2,True)
void :: Functor f => f a -> f () #
discards or ignores the result of evaluation, such
as the return value of an void
valueIO
action.
Examples
Replace the contents of a
with unit:Maybe
Int
>>>
void Nothing
Nothing
>>>
void (Just 3)
Just ()
Replace the contents of an
with unit, resulting in an Either
Int
Int
:Either
Int
()
>>>
void (Left 8675309)
Left 8675309
>>>
void (Right 8675309)
Right ()
Replace every element of a list with unit:
>>>
void [1,2,3]
[(),(),()]
Replace the second element of a pair with unit:
>>>
void (1,2)
(1,())
Discard the result of an IO
action:
>>>
mapM print [1,2]
1 2 [(),()]
>>>
void $ mapM print [1,2]
1 2
Sequencing Combinators
These combinators all combine two parsers in sequence. The first argument of the combinator will be executed first, then the second argument second. The results of both parsers are combined in some way (depending on the individual combinator). If one of the parsers fails, the combinator as a whole fails.
Arguments
:: Parsec a |
|
-> Parsec b |
|
-> Parsec (a, b) | a parser that runs |
This combinator, pronounced "zip", first parses p
then parses q
:
if both succeed the result of p
is paired with that of q
.
First, runs p
, yielding x
on success, then runs q
, yielding y
on success.
If both are successful then (x, y)
is returned.
If either fail then the entire combinator fails.
Examples
>>>
p = char 'a' <~> char 'b'
>>>
parse p "ab"
Success ('a', 'b')>>>
parse p "b"
Failure ..>>>
parse p "a"
Failure ..
Since: 0.1.0.0
Arguments
:: Parsec a |
|
-> Parsec [a] |
|
-> Parsec [a] | a parser that runs |
This combinator, pronounced "cons", first parses p
and then ps
: if both succeed the result of this
parser is prepended onto the result of ps
.
First, runs p
, yielding x
on success, then runs ps
, yielding xs
on success.
If both are successful then x : xs
is returned.
If either fail then the entire combinator fails.
Examples
some p = p <:> many(p)
Since: 0.1.0.0
(<*>) :: Applicative f => f (a -> b) -> f a -> f b infixl 4 #
Sequential application.
A few functors support an implementation of <*>
that is more
efficient than the default one.
Example
Used in combination with
, (<$>)
can be used to build a record.(<*>)
>>>
data MyState = MyState {arg1 :: Foo, arg2 :: Bar, arg3 :: Baz}
>>>
produceFoo :: Applicative f => f Foo
>>>
produceBar :: Applicative f => f Bar
>>>
produceBaz :: Applicative f => f Baz
>>>
mkState :: Applicative f => f MyState
>>>
mkState = MyState <$> produceFoo <*> produceBar <*> produceBaz
liftA2 :: Applicative f => (a -> b -> c) -> f a -> f b -> f c #
Lift a binary function to actions.
Some functors support an implementation of liftA2
that is more
efficient than the default one. In particular, if fmap
is an
expensive operation, it is likely better to use liftA2
than to
fmap
over the structure and then use <*>
.
This became a typeclass method in 4.10.0.0. Prior to that, it was
a function defined in terms of <*>
and fmap
.
Example
>>>
liftA2 (,) (Just 3) (Just 5)
Just (3,5)
>>>
liftA2 (+) [1, 2, 3] [4, 5, 6]
[5,6,7,6,7,8,7,8,9]
(*>) :: Applicative f => f a -> f b -> f b infixl 4 #
Sequence actions, discarding the value of the first argument.
Examples
If used in conjunction with the Applicative instance for Maybe
,
you can chain Maybe computations, with a possible "early return"
in case of Nothing
.
>>>
Just 2 *> Just 3
Just 3
>>>
Nothing *> Just 3
Nothing
Of course a more interesting use case would be to have effectful computations instead of just returning pure values.
>>>
import Data.Char
>>>
import GHC.Internal.Text.ParserCombinators.ReadP
>>>
let p = string "my name is " *> munch1 isAlpha <* eof
>>>
readP_to_S p "my name is Simon"
[("Simon","")]
(<*) :: Applicative f => f a -> f b -> f a infixl 4 #
Sequence actions, discarding the value of the second argument.
(<**>) :: Applicative f => f a -> f (a -> b) -> f b infixl 4 #
A variant of <*>
with the types of the arguments reversed. It differs from
in that the effects are resolved in the order the arguments are
presented.flip
(<*>)
Examples
>>>
(<**>) (print 1) (id <$ print 2)
1 2
>>>
flip (<*>) (print 1) (id <$ print 2)
2 1
>>>
ZipList [4, 5, 6] <**> ZipList [(+1), (*2), (/3)]
ZipList {getZipList = [5.0,10.0,2.0]}
Branching Combinators
These combinators allow for parsing one alternative or another. All of these combinators are left-biased, which means that the left-hand side of the combinator is tried first: the right-hand side of the combinator will only be tried when the left-hand one failed (and did not consume input in the process).
Arguments
:: Parsec a |
|
-> Parsec b |
|
-> Parsec (Either a b) | a parser which either parses |
This combinator, pronounced "sum", wraps the given parser's result in Left
if it succeeds, and parses q
if it failed without consuming input,
wrapping the result in Right
.
If the given parser p
is successful, then its result is wrapped up using Left
and no further action is taken.
Otherwise, if p
fails without consuming input, then q
is parsed instead and its result is
wrapped up using Right
.
If p
fails having consumed input, this combinator fails.
This is functionally equivalent to Left $ p | Right $ q
.
The reason for this behaviour is that it prevents space leaks and improves error messages.
If this behaviour is not desired, use atomic p
to rollback any input consumed on failure.
Examples
>>>
p = string "abc" <+> char "xyz"
>>>
parse p "abc"
Success (Left "abc")>>>
parse p "xyz"
Success (Right "xyz")>>>
parse p "ab"
Failure .. -- first parser consumed an 'a'!
Since: 0.1.0.0
(<|>) :: Alternative f => f a -> f a -> f a infixl 3 #
An associative binary operation
Selective Combinators
These combinators will decide which branch to take next based on the result of another parser.
This differs from combinators like (<|>)
which make decisions based on the success/failure of
a parser: here the result of a successful parse will direct which option is done.
branch :: Selective f => f (Either a b) -> f (a -> c) -> f (b -> c) -> f c #
The branch
function is a natural generalisation of select
: instead of
skipping an unnecessary effect, it chooses which of the two given effectful
functions to apply to a given argument; the other effect is unnecessary. It
is possible to implement branch
in terms of select
, which is a good
puzzle (give it a try!).
We can also implement select
via branch
:
selectB :: Selective f => f (Either a b) -> f (a -> b) -> f b selectB x y = branch x y (pure id)
Filtering Combinators
These combinators perform filtering on the results of a parser. This means that, given the result of a parser, they will perform some function on that result, and the success of that function effects whether or not the parser fails.
Arguments
:: (a -> Bool) | the predicate that is tested against the parser result. |
-> Parsec a | the parser to filter, |
-> Parsec a | a parser that returns the result of |
This combinator filters the result of the given parser p
using the given predicate,
succeeding only if the predicate returns True
.
First, parse p
.
If it succeeds then take its result x
and apply it to the predicate pred
.
If pred x
is true, then return x
.
Otherwise, the combinator fails.
Examples
>>>
keywords = Set.fromList ["if", "then", "else"]
>>>
identifier = filterS (\v -> not (Set.member v keywords)) (some letter)
>>>
parse identifier "hello"
Success "hello">>>
parse identifier "if"
Failure ..
Since: 0.2.2.0
Arguments
:: (a -> Maybe b) | the function used to both filter the result of |
-> Parsec a | the parser to filter, |
-> Parsec b | a parser that returns the result of |
This combinator applies a function f
to the result of the given parser p
:
if it returns a Just y
, y
is returned, otherwise this combinator fails.
First, parse p
. If it succeeds, apply the function f
to the result x
. If
f x
returns Just y
, return y
. If f x
returns Nothing
, or p
failed
then this combinator fails. Is a more efficient way of performing a filterS
and fmap
at the same time.
Examples
>>>
int = ...
>>>
safeDiv = mapMaybeS (\(x, y) -> if y /= 0 then Just (div x y) else Nothing) (int <~> (char ' ' *> int))
>>>
parse safeDiv "7 0"
Failure .. -- y cannot be 0!>>>
parse safeDiv "10 2"
Success 5
Since: 0.2.2.0
Folding Combinators
These combinators repeatedly execute a parser (at least zero or one times depending on the -- specific combinator) until it fails. The results of the successes are then combined together -- using a folding function. An initial value for the accumulation may be given (for the folds), -- or the first successful result is the initial accumulator (for the reduces). These are -- implemented efficiently and do not need to construct any intermediate list with which to store -- the results.
The many
Combinators
The many
combinators will repeatedly parse a given parser zero or more times, and collect each result in a list.
many :: Alternative f => f a -> f [a] #
Zero or more.
Examples
>>>
many (putStr "la")
lalalalalalalalala... * goes on forever *
>>>
many Nothing
Just []
>>>
take 5 <$> many (Just 1)
* hangs forever *
Note that this function can be used with Parsers based on
Applicatives. In that case many parser
will attempt to
parse parser
zero or more times until it fails.
Arguments
:: (b -> a -> b) |
|
-> b |
|
-> Parsec a |
|
-> Parsec b | a parser which parses |
This combinator will parse the given parser zero or more times combining the results with the function f
and base value k
from the left.
The given parser p
will continue to be parsed until it fails having not consumed input.
All of the results generated by the successful parses are then combined in a left-to-right
fashion using the function f
: the left-most value provided to f
is the value k
.
If p
does fail at any point having consumed input, this combinator will fail.
Since: 0.3.0.0
Arguments
:: (a -> b -> b) |
|
-> b |
|
-> Parsec a |
|
-> Parsec b | a parser which parses |
This combinator will parse the given parser p
zero or more times,
combining the results with the function f
and base value k
from the right.
p
will continue to be parsed until it fails having not consumed input.
All of the results generated by the successful parses are then combined in a right-to-left
fashion using the function f
: the right-most value provided to f
is the value k
.
If this parser does fail at any point having consumed input, this combinator will fail.
Examples
many = manyr (:) []
Since: 0.3.0.0
Arguments
:: Monoid m | |
=> (a -> m) |
|
-> Parsec a |
|
-> Parsec m | a parser that repeatedly parses |
This combinator acts like the foldMap
function but with a parser.
The parser manyMap f p
, will parse p
zero or more times, then
adapt each result with the function f
to produce a bunch of values
in some Monoid
m
. These values are then combined together to form a
single value; if p
could not be parsed, it will return the mempty
for m
.
Examples
>>>
parse (manyMap Set.singleton item) "aaaab"
Success (Set.fromList ['a', 'b'])
Since: 0.2.2.0
The some
Combinators
The some
combinators will repeatedly parse a given parser one or more times, and collect each result in a list.
If successful, the list returned by these combinators is always non-empty.
See also: Text.Gigaparsec.Combinator.NonEmpty for variants of these combinators that return a NonEmpty
list.
some :: Alternative f => f a -> f [a] #
One or more.
Examples
>>>
some (putStr "la")
lalalalalalalalala... * goes on forever *
>>>
some Nothing
nothing
>>>
take 5 <$> some (Just 1)
* hangs forever *
Note that this function can be used with Parsers based on
Applicatives. In that case some parser
will attempt to
parse parser
one or more times until it fails.
Arguments
:: (b -> a -> b) |
|
-> b |
|
-> Parsec a |
|
-> Parsec b | a parser which parses |
This combinator will parse the given parser one or more times combining the results with the function f
and base value k
from the left.
The given parser p
will continue to be parsed until it fails having not consumed input.
All of the results generated by the successful parses are then combined in a left-to-right
fashion using the function f
: the left-most value provided to f
is the value k
.
If p
fails at any point having consumed input, this combinator fails.
Examples
natural = somel (\x d -> x * 10 + digitToInt d) 0 digit
Since: 0.3.0.0
Arguments
:: (a -> b -> b) | function to apply to each value produced by this parser, starting at the right. |
-> b |
|
-> Parsec a |
|
-> Parsec b | a parser which parses |
This combinator will parse the given parser p
one or more times,
combining the results with the function f
and base value k
from the right.
p
will continue to be parsed until it fails having not consumed input.
All of the results generated by the successful parses are then combined in a right-to-left
fashion using the function f
: the right-most value provided to f
is the value k
.
If this parser does fail at any point having consumed input, this combinator will fail.
Examples
some = somer (:) []
Since: 0.3.0.0
Arguments
:: Semigroup s | |
=> (a -> s) |
|
-> Parsec a |
|
-> Parsec s | a parser that parses |
This combinator acts like the foldMap
function but with a parser.
The parser manyMap f p
, will parse p
one or more times, then
adapt each result with the function f
to produce a bunch of values
in some Semigroup
s
. These values are then combined together to form a
single value.
Examples
>>>
parse (someMap Max item) "bdcjb"
Success (Max 'j')>>>
parse (someMap Min item) "bdcjb"
Success (Max 'b')
Since: 0.2.2.0