Copyright | (c) The University of Glasgow 2001 (c) David Roundy 2003-2005 (c) Simon Marlow 2005 (c) Bjorn Bringert 2006 (c) Don Stewart 2005-2008 (c) Duncan Coutts 2006-2013 |
---|---|
License | BSD-style |
Maintainer | dons00@gmail.com, duncan@community.haskell.org |
Stability | stable |
Portability | portable |
Safe Haskell | Trustworthy |
Language | Haskell2010 |
Data.ByteString
Contents
Description
A time- and space-efficient implementation of byte vectors using packed Word8 arrays, suitable for high performance use, both in terms of large data quantities and high speed requirements. Byte vectors are encoded as strictWord8
arrays of bytes, held in aForeignPtr
, and can be passed between C and Haskell with little effort.
The recomended way to assemble ByteStrings from smaller parts is to use the builder monoid fromData.ByteString.Builder.
This module is intended to be importedqualified
, to avoid name clashes withPrelude functions. eg.
import qualified Data.ByteString as B
Original GHC implementation by Bryan O'Sullivan. Rewritten to useUArray
by Simon Marlow. Rewritten to support slices and useForeignPtr
by David Roundy. Rewritten again and extended by Don Stewart and Duncan Coutts.
ByteString
A space-efficient representation of aWord8
vector, supporting many efficient operations.
AByteString
contains 8-bit bytes, or by using the operations fromData.ByteString.Char8 it can be interpreted as containing 8-bit characters.
typeStrictByteString =ByteStringSource#
Type synonym for the strict flavour ofByteString
.
Since: 0.11.2.0
ByteString
sO(1) The emptyByteString
singleton ::Word8 ->ByteStringSource#
O(1) Convert aWord8
into aByteString
pack :: [Word8] ->ByteStringSource#
O(n) Convert a[
into aWord8
]ByteString
.
For applications with large numbers of string literals,pack
can be a bottleneck. In such cases, consider usingunsafePackAddress
(GHC only).
unpack ::ByteString -> [Word8]Source#
O(n) Converts aByteString
to a[
.Word8
]
fromStrict ::ByteString ->ByteStringSource#
O(1) Convert a strictByteString
into a lazyByteString
.
toStrict ::ByteString ->ByteStringSource#
O(n) Convert a lazyByteString
into a strictByteString
.
Note that this is anexpensive operation that forces the whole lazy ByteString into memory and then copies all the data. If possible, try to avoid converting back and forth between strict and lazy bytestrings.
fromFilePath ::FilePath ->IOByteStringSource#
Convert aFilePath
to aByteString
.
TheFilePath
type is expected to use the file system encoding as reported bygetFileSystemEncoding
. This encoding allows for round-tripping of arbitrary data on platforms that allow arbitrary bytes in their paths. This conversion function does the same thing thatopenFile
would do when decoding theFilePath
.
This function is inIO
because the file system encoding can be changed. If the encoding can be assumed to be constant in your use case, you may invoke this function viaunsafePerformIO
.
Since: 0.11.2.0
toFilePath ::ByteString ->IOFilePathSource#
Convert aByteString
to aFilePath
.
This function uses the file system encoding, and resultingFilePath
s can be safely used with standard IO functions and will reference the correct path in the presence of arbitrary non-UTF-8 encoded paths.
This function is inIO
because the file system encoding can be changed. If the encoding can be assumed to be constant in your use case, you may invoke this function viaunsafePerformIO
.
Since: 0.11.2.0
cons ::Word8 ->ByteString ->ByteStringinfixr 5Source#
O(n)cons
is analogous to (:) for lists, but of different complexity, as it requires making a copy.
snoc ::ByteString ->Word8 ->ByteStringinfixl 5Source#
O(n) Append a byte to the end of aByteString
append ::ByteString ->ByteString ->ByteStringSource#
O(n) Append two ByteStrings
head ::HasCallStack =>ByteString ->Word8Source#
O(1) Extract the first element of a ByteString, which must be non-empty. An exception will be thrown in the case of an empty ByteString.
uncons ::ByteString ->Maybe (Word8,ByteString)Source#
O(1) Extract the head and tail of a ByteString, returning Nothing if it is empty.
last ::HasCallStack =>ByteString ->Word8Source#
O(1) Extract the last element of a ByteString, which must be finite and non-empty. An exception will be thrown in the case of an empty ByteString.
tail ::HasCallStack =>ByteString ->ByteStringSource#
O(1) Extract the elements after the head of a ByteString, which must be non-empty. An exception will be thrown in the case of an empty ByteString.
init ::HasCallStack =>ByteString ->ByteStringSource#
O(1) Return all the elements of aByteString
except the last one. An exception will be thrown in the case of an empty ByteString.
null ::ByteString ->BoolSource#
O(1) Test whether a ByteString is empty.
map :: (Word8 ->Word8) ->ByteString ->ByteStringSource#
O(n)map
f xs
is the ByteString obtained by applyingf
to each element ofxs
.
reverse ::ByteString ->ByteStringSource#
O(n)reverse
xs
efficiently returns the elements ofxs
in reverse order.
intersperse ::Word8 ->ByteString ->ByteStringSource#
O(n) Theintersperse
function takes aWord8
and aByteString
and `intersperses' that byte between the elements of theByteString
. It is analogous to the intersperse function on Lists.
intercalate ::ByteString -> [ByteString] ->ByteStringSource#
O(n) Theintercalate
function takes aByteString
and a list ofByteString
s and concatenates the list after interspersing the first argument between each element of the list.
transpose :: [ByteString] -> [ByteString]Source#
Thetranspose
function transposes the rows and columns of itsByteString
argument.
ByteString
s (folds)foldl :: (a ->Word8 -> a) -> a ->ByteString -> aSource#
foldl
, applied to a binary operator, a starting value (typically the left-identity of the operator), and a ByteString, reduces the ByteString using the binary operator, from left to right.
foldl' :: (a ->Word8 -> a) -> a ->ByteString -> aSource#
foldl1 ::HasCallStack => (Word8 ->Word8 ->Word8) ->ByteString ->Word8Source#
foldl1
is a variant offoldl
that has no starting value argument, and thus must be applied to non-emptyByteString
s. An exception will be thrown in the case of an empty ByteString.
foldl1' ::HasCallStack => (Word8 ->Word8 ->Word8) ->ByteString ->Word8Source#
foldr :: (Word8 -> a -> a) -> a ->ByteString -> aSource#
foldr
, applied to a binary operator, a starting value (typically the right-identity of the operator), and a ByteString, reduces the ByteString using the binary operator, from right to left.
foldr' :: (Word8 -> a -> a) -> a ->ByteString -> aSource#
foldr1 ::HasCallStack => (Word8 ->Word8 ->Word8) ->ByteString ->Word8Source#
foldr1
is a variant offoldr
that has no starting value argument, and thus must be applied to non-emptyByteString
s An exception will be thrown in the case of an empty ByteString.
foldr1' ::HasCallStack => (Word8 ->Word8 ->Word8) ->ByteString ->Word8Source#
concat :: [ByteString] ->ByteStringSource#
O(n) Concatenate a list of ByteStrings.
concatMap :: (Word8 ->ByteString) ->ByteString ->ByteStringSource#
Map a function over aByteString
and concatenate the results
any :: (Word8 ->Bool) ->ByteString ->BoolSource#
O(n) Applied to a predicate and a ByteString,any
determines if any element of theByteString
satisfies the predicate.
all :: (Word8 ->Bool) ->ByteString ->BoolSource#
O(n) Applied to a predicate and aByteString
,all
determines if all elements of theByteString
satisfy the predicate.
maximum ::HasCallStack =>ByteString ->Word8Source#
O(n)maximum
returns the maximum value from aByteString
An exception will be thrown in the case of an empty ByteString.
minimum ::HasCallStack =>ByteString ->Word8Source#
O(n)minimum
returns the minimum value from aByteString
An exception will be thrown in the case of an empty ByteString.
Arguments
:: (Word8 ->Word8 ->Word8) | accumulator -> element -> new accumulator |
->Word8 | starting value of accumulator |
->ByteString | input of length n |
->ByteString | output of length n+1 |
scanl1 :: (Word8 ->Word8 ->Word8) ->ByteString ->ByteStringSource#
Arguments
:: (Word8 ->Word8 ->Word8) | element -> accumulator -> new accumulator |
->Word8 | starting value of accumulator |
->ByteString | input of length n |
->ByteString | output of length n+1 |
scanr1 :: (Word8 ->Word8 ->Word8) ->ByteString ->ByteStringSource#
mapAccumL :: (acc ->Word8 -> (acc,Word8)) -> acc ->ByteString -> (acc,ByteString)Source#
mapAccumR :: (acc ->Word8 -> (acc,Word8)) -> acc ->ByteString -> (acc,ByteString)Source#
replicate ::Int ->Word8 ->ByteStringSource#
O(n)replicate
n x
is a ByteString of lengthn
withx
the value of every element. The following holds:
replicate w c = unfoldr w (\u -> Just (u,u)) c
This implementation usesmemset(3)
unfoldr :: (a ->Maybe (Word8, a)) -> a ->ByteStringSource#
O(n), wheren is the length of the result. Theunfoldr
function is analogous to the List 'unfoldr'.unfoldr
builds a ByteString from a seed value. The function takes the element and returnsNothing
if it is done producing the ByteString or returnsJust
(a,b)
, in which case,a
is the next byte in the string, andb
is the seed value for further production.
Examples:
unfoldr (\x -> if x <= 5 then Just (x, x + 1) else Nothing) 0== pack [0, 1, 2, 3, 4, 5]
unfoldrN ::Int -> (a ->Maybe (Word8, a)) -> a -> (ByteString,Maybe a)Source#
O(n) Likeunfoldr
,unfoldrN
builds a ByteString from a seed value. However, the length of the result is limited by the first argument tounfoldrN
. This function is more efficient thanunfoldr
when the maximum length of the result is known.
The following equation relatesunfoldrN
andunfoldr
:
fst (unfoldrN n f s) == take n (unfoldr f s)
take ::Int ->ByteString ->ByteStringSource#
drop ::Int ->ByteString ->ByteStringSource#
splitAt ::Int ->ByteString -> (ByteString,ByteString)Source#
takeWhile :: (Word8 ->Bool) ->ByteString ->ByteStringSource#
Similar totakeWhile
, returns the longest (possibly empty) prefix of elements satisfying the predicate.
takeWhileEnd :: (Word8 ->Bool) ->ByteString ->ByteStringSource#
Returns the longest (possibly empty) suffix of elements satisfying the predicate.
is equivalent totakeWhileEnd
p
.reverse
.takeWhile
p .reverse
Since: 0.10.12.0
dropWhile :: (Word8 ->Bool) ->ByteString ->ByteStringSource#
Similar todropWhile
, drops the longest (possibly empty) prefix of elements satisfying the predicate and returns the remainder.
dropWhileEnd :: (Word8 ->Bool) ->ByteString ->ByteStringSource#
Similar todropWhileEnd
, drops the longest (possibly empty) suffix of elements satisfying the predicate and returns the remainder.
is equivalent todropWhileEnd
p
.reverse
.dropWhile
p .reverse
Since: 0.10.12.0
span :: (Word8 ->Bool) ->ByteString -> (ByteString,ByteString)Source#
spanEnd :: (Word8 ->Bool) ->ByteString -> (ByteString,ByteString)Source#
Returns the longest (possibly empty) suffix of elements satisfying the predicate and the remainder of the string.
spanEnd
p
is equivalent to
and tobreakEnd
(not . p)(
.takeWhileEnd
p &&&dropWhileEnd
p)
We have
spanEnd (not . isSpace) "x y z" == ("x y ", "z")
and
spanEnd (not . isSpace) ps ==let (x, y) = span (not . isSpace) (reverse ps) in (reverse y, reverse x)
break :: (Word8 ->Bool) ->ByteString -> (ByteString,ByteString)Source#
Similar tobreak
, returns the longest (possibly empty) prefix of elements whichdo not satisfy the predicate and the remainder of the string.
break
p
is equivalent to
and tospan
(not . p)(
.takeWhile
(not . p) &&&dropWhile
(not . p))
Under GHC, a rewrite rule will transform break (==) into a call to the specialised breakByte:
break ((==) x) = breakByte xbreak (==x) = breakByte x
breakEnd :: (Word8 ->Bool) ->ByteString -> (ByteString,ByteString)Source#
Returns the longest (possibly empty) suffix of elements whichdo not satisfy the predicate and the remainder of the string.
breakEnd
p
is equivalent to
and tospanEnd
(not . p)(
.takeWhileEnd
(not . p) &&&dropWhileEnd
(not . p))
group ::ByteString -> [ByteString]Source#
Thegroup
function takes a ByteString and returns a list of ByteStrings such that the concatenation of the result is equal to the argument. Moreover, each string in the result contains only equal elements. For example,
group "Mississippi" = ["M","i","ss","i","ss","i","pp","i"]
It is a special case ofgroupBy
, which allows the programmer to supply their own equality test. It is about 40% faster thangroupBy (==)
groupBy :: (Word8 ->Word8 ->Bool) ->ByteString -> [ByteString]Source#
inits ::ByteString -> [ByteString]Source#
O(n) Return all initial segments of the givenByteString
, shortest first.
tails ::ByteString -> [ByteString]Source#
O(n) Return all final segments of the givenByteString
, longest first.
stripPrefix ::ByteString ->ByteString ->MaybeByteStringSource#
O(n) ThestripPrefix
function takes two ByteStrings and returnsJust
the remainder of the second iff the first is its prefix, and otherwiseNothing
.
Since: 0.10.8.0
stripSuffix ::ByteString ->ByteString ->MaybeByteStringSource#
O(n) ThestripSuffix
function takes two ByteStrings and returnsJust
the remainder of the second iff the first is its suffix, and otherwiseNothing
.
split ::Word8 ->ByteString -> [ByteString]Source#
O(n) Break aByteString
into pieces separated by the byte argument, consuming the delimiter. I.e.
split 10 "a\nb\nd\ne" == ["a","b","d","e"] -- fromEnum '\n' == 10split 97 "aXaXaXa" == ["","X","X","X",""] -- fromEnum 'a' == 97split 120 "x" == ["",""] -- fromEnum 'x' == 120split undefined "" == [] -- and not [""]
and
intercalate [c] . split c == idsplit == splitWith . (==)
As for all splitting functions in this library, this function does not copy the substrings, it just constructs newByteString
s that are slices of the original.
splitWith :: (Word8 ->Bool) ->ByteString -> [ByteString]Source#
O(n) Splits aByteString
into components delimited by separators, where the predicate returns True for a separator element. The resulting components do not contain the separators. Two adjacent separators result in an empty component in the output. eg.
splitWith (==97) "aabbaca" == ["","","bb","c",""] -- fromEnum 'a' == 97splitWith undefined "" == [] -- and not [""]
isPrefixOf ::ByteString ->ByteString ->BoolSource#
O(n) TheisPrefixOf
function takes two ByteStrings and returnsTrue
if the first is a prefix of the second.
isSuffixOf ::ByteString ->ByteString ->BoolSource#
O(n) TheisSuffixOf
function takes two ByteStrings and returnsTrue
iff the first is a suffix of the second.
The following holds:
isSuffixOf x y == reverse x `isPrefixOf` reverse y
However, the real implementation uses memcmp to compare the end of the string only, with no reverse required..
isInfixOf ::ByteString ->ByteString ->BoolSource#
Check whether one string is a substring of another.
isValidUtf8 ::ByteString ->BoolSource#
O(n) Check whether aByteString
represents valid UTF-8.
Since: 0.11.2.0
Arguments
::ByteString | String to search for |
->ByteString | String to search in |
-> (ByteString,ByteString) | Head and tail of string broken at substring |
Break a string on a substring, returning a pair of the part of the string prior to the match, and the rest of the string.
The following relationships hold:
break (== c) l == breakSubstring (singleton c) l
For example, to tokenise a string, dropping delimiters:
tokenise x y = h : if null t then [] else tokenise x (drop (length x) t) where (h,t) = breakSubstring x y
To skip to the first occurrence of a string:
snd (breakSubstring x y)
To take the parts of a string before a delimiter:
fst (breakSubstring x y)
Note that calling `breakSubstring x` does some preprocessing work, so you should avoid unnecessarily duplicating breakSubstring calls with the same pattern.
elem ::Word8 ->ByteString ->BoolSource#
O(n)elem
is theByteString
membership predicate.
filter :: (Word8 ->Bool) ->ByteString ->ByteStringSource#
O(n)filter
, applied to a predicate and a ByteString, returns a ByteString containing those characters that satisfy the predicate.
partition :: (Word8 ->Bool) ->ByteString -> (ByteString,ByteString)Source#
O(n) Thepartition
function takes a predicate a ByteString and returns the pair of ByteStrings with elements which do and do not satisfy the predicate, respectively; i.e.,
partition p bs == (filter p xs, filter (not . p) xs)
index ::HasCallStack =>ByteString ->Int ->Word8Source#
O(1)ByteString
index (subscript) operator, starting from 0.
elemIndex ::Word8 ->ByteString ->MaybeIntSource#
O(n) TheelemIndex
function returns the index of the first element in the givenByteString
which is equal to the query element, orNothing
if there is no such element. This implementation uses memchr(3).
elemIndices ::Word8 ->ByteString -> [Int]Source#
O(n) TheelemIndices
function extendselemIndex
, by returning the indices of all elements equal to the query element, in ascending order. This implementation uses memchr(3).
elemIndexEnd ::Word8 ->ByteString ->MaybeIntSource#
O(n) TheelemIndexEnd
function returns the last index of the element in the givenByteString
which is equal to the query element, orNothing
if there is no such element. The following holds:
elemIndexEnd c xs = case elemIndex c (reverse xs) of Nothing -> Nothing Just i -> Just (length xs - 1 - i)
findIndex :: (Word8 ->Bool) ->ByteString ->MaybeIntSource#
O(n) ThefindIndex
function takes a predicate and aByteString
and returns the index of the first element in the ByteString satisfying the predicate.
findIndices :: (Word8 ->Bool) ->ByteString -> [Int]Source#
O(n) ThefindIndices
function extendsfindIndex
, by returning the indices of all elements satisfying the predicate, in ascending order.
findIndexEnd :: (Word8 ->Bool) ->ByteString ->MaybeIntSource#
O(n) ThefindIndexEnd
function takes a predicate and aByteString
and returns the index of the last element in the ByteString satisfying the predicate.
Since: 0.10.12.0
count ::Word8 ->ByteString ->IntSource#
count returns the number of times its argument appears in the ByteString
count = length . elemIndices
But more efficiently than using length on the intermediate list.
zip ::ByteString ->ByteString -> [(Word8,Word8)]Source#
zipWith :: (Word8 ->Word8 -> a) ->ByteString ->ByteString -> [a]Source#
packZipWith :: (Word8 ->Word8 ->Word8) ->ByteString ->ByteString ->ByteStringSource#
A specialised version ofzipWith
for the common case of a simultaneous map over two ByteStrings, to build a 3rd.
Since: 0.11.1.0
unzip :: [(Word8,Word8)] -> (ByteString,ByteString)Source#
sort ::ByteString ->ByteStringSource#
O(n) Sort a ByteString efficiently, using counting sort.
copy ::ByteString ->ByteStringSource#
O(n) Make a copy of theByteString
with its own storage. This is mainly useful to allow the rest of the data pointed to by theByteString
to be garbage collected, for example if a large string has been read in, and only a small part of it is needed in the rest of the program.
CString
s and pointerspackCString ::CString ->IOByteStringSource#
O(n). Construct a newByteString
from aCString
. The resultingByteString
is an immutable copy of the originalCString
, and is managed on the Haskell heap. The originalCString
must be null terminated.
packCStringLen ::CStringLen ->IOByteStringSource#
O(n). Construct a newByteString
from aCStringLen
. The resultingByteString
is an immutable copy of the originalCStringLen
. TheByteString
is a normal Haskell value and will be managed on the Haskell heap.
CString
suseAsCString ::ByteString -> (CString ->IO a) ->IO aSource#
O(n) construction Use aByteString
with a function requiring a null-terminatedCString
. TheCString
is a copy and will be freed automatically; it must not be stored or used after the subcomputation finishes.
useAsCStringLen ::ByteString -> (CStringLen ->IO a) ->IO aSource#
O(n) construction Use aByteString
with a function requiring aCStringLen
. As foruseAsCString
this function makes a copy of the originalByteString
. It must not be stored or used after the subcomputation finishes.
ByteString
sRead a line from stdin.
getContents ::IOByteStringSource#
getContents. Read stdin strictly. Equivalent to hGetContents stdin TheHandle
is closed after the contents have been read.
putStr ::ByteString ->IO ()Source#
Write a ByteString to stdout
interact :: (ByteString ->ByteString) ->IO ()Source#
The interact function takes a function of typeByteString -> ByteString
as its argument. The entire input from the standard input device is passed to this function as its argument, and the resulting string is output on the standard output device.
readFile ::FilePath ->IOByteStringSource#
Read an entire file strictly into aByteString
.
writeFile ::FilePath ->ByteString ->IO ()Source#
Write aByteString
to a file.
appendFile ::FilePath ->ByteString ->IO ()Source#
Append aByteString
to a file.
hGetContents ::Handle ->IOByteStringSource#
Read a handle's entire contents strictly into aByteString
.
This function reads chunks at a time, increasing the chunk size on each read. The final string is then reallocated to the appropriate size. For files > half of available memory, this may lead to memory exhaustion. Consider usingreadFile
in this case.
The Handle is closed once the contents have been read, or if an exception is thrown.
hGet ::Handle ->Int ->IOByteStringSource#
Read aByteString
directly from the specifiedHandle
. This is far more efficient than reading the characters into aString
and then usingpack
. First argument is the Handle to read from, and the second is the number of bytes to read. It returns the bytes read, up to n, orempty
if EOF has been reached.
hGet
is implemented in terms ofhGetBuf
.
If the handle is a pipe or socket, and the writing end is closed,hGet
will behave as if EOF was reached.
hGetSome ::Handle ->Int ->IOByteStringSource#
LikehGet
, except that a shorterByteString
may be returned if there are not enough bytes immediately available to satisfy the whole request.hGetSome
only blocks if there is no data available, and EOF has not yet been reached.
hGetNonBlocking ::Handle ->Int ->IOByteStringSource#
hGetNonBlocking is similar tohGet
, except that it will never block waiting for data to become available, instead it returns only whatever data is available. If there is no data available to be read,hGetNonBlocking
returnsempty
.
Note: on Windows and with Haskell implementation other than GHC, this function does not work correctly; it behaves identically tohGet
.
hPut ::Handle ->ByteString ->IO ()Source#
Outputs aByteString
to the specifiedHandle
.
hPutNonBlocking ::Handle ->ByteString ->IOByteStringSource#
Similar tohPut
except that it will never block. Instead it returns any tail that did not get written. This tail may beempty
in the case that the whole string was written, or the whole original string if nothing was written. Partial writes are also possible.
Note: on Windows and with Haskell implementation other than GHC, this function does not work correctly; it behaves identically tohPut
.
Produced byHaddock version 2.24.0