module Utf8:sig
..end
Given a module pp
functions produced by one of the functors in Xml_print
, this modules is used as following:
let encode x = fst (Xml_print.Utf8.normalize_html x) in
Format.printf "%a" (Html.pp ~encode ()) document
typeutf8 =
string
val normalize : string -> utf8 * bool
normalize str
take a possibly invalid utf-8 string
and return a valid utf-8 string
where invalid bytes have been replaced by
the replacement character U+FFFD
.
The returned boolean is true if invalid bytes were foundval normalize_html : string -> utf8 * bool
normalize
plus some extra work :
It encode '<' , '>' , '"' , '&' characters with
corresponding entities and replaced invalid html
character by U+FFFD