public final class UTF8Util
extends java.lang.Object
Note that when the skip
methods mention detection of invalid
UTF-8 encodings, it only checks the first byte of a character. For multibyte
encodings, the second and third byte are not checked for correctness, just
skipped and ignored.
DataInput
Modifier and Type | Class | Description |
---|---|---|
private static class |
UTF8Util.SkipCount |
Helper class to hold skip counts; one for chars and one for bytes.
|
Modifier | Constructor | Description |
---|---|---|
private |
UTF8Util() |
This class cannot be instantiated.
|
Modifier and Type | Method | Description |
---|---|---|
private static UTF8Util.SkipCount |
internalSkip(java.io.InputStream in,
long charsToSkip) |
Skip characters in the stream.
|
static long |
skipFully(java.io.InputStream in,
long charsToSkip) |
Skip the requested number of characters from the stream.
|
static long |
skipUntilEOF(java.io.InputStream in) |
Skip until the end-of-stream is reached.
|
public static final long skipUntilEOF(java.io.InputStream in) throws java.io.IOException
in
- byte stream with UTF-8 encoded charactersjava.io.IOException
- if reading from the stream failsjava.io.UTFDataFormatException
- if an invalid UTF-8 encoding is detectedpublic static final long skipFully(java.io.InputStream in, long charsToSkip) throws java.io.EOFException, java.io.IOException
in
- byte stream with UTF-8 encoded characterscharsToSkip
- number of characters to skipjava.io.EOFException
- if end-of-stream is reached before the requested
number of characters are skippedjava.io.IOException
- if reading from the stream failsjava.io.UTFDataFormatException
- if an invalid UTF-8 encoding is detectedprivate static final UTF8Util.SkipCount internalSkip(java.io.InputStream in, long charsToSkip) throws java.io.IOException
Note that a smaller number than requested might be skipped if the
end-of-stream is reached before the specified number of characters has
been decoded. It is up to the caller to decide if this is an error
or not. For instance, when determining the character length of a stream,
Long.MAX_VALUE
could be passed as the requested number of
characters to skip.
in
- byte stream with UTF-8 encoded characterscharsToSkip
- the number of characters to skipCHAR_COUNT
, the bytes skipped at position
BYTE_COUNT
. Note that the number of characters skipped
may be smaller than the requested number.java.io.IOException
- if reading from the stream failsjava.io.UTFDataFormatException
- if an invalid UTF-8 encoding is detectedApache Derby V10.14 Internals - Copyright © 2004,2018 The Apache Software Foundation. All Rights Reserved.