Package org.jsoup.helper
Class DataUtil
java.lang.Object
org.jsoup.helper.DataUtil
public final class DataUtil extends Object
Internal static utilities for handling data.
-
Field Summary
-
Method Summary
Modifier and Type Method Description static Documentload(File file, String charsetName, String baseUri)Loads and parses a file to a Document, with the HtmlParser.static Documentload(File file, String charsetName, String baseUri, Parser parser)Loads and parses a file to a Document.static Documentload(InputStream in, String charsetName, String baseUri)Parses a Document from an input steam.static Documentload(InputStream in, String charsetName, String baseUri, Parser parser)Parses a Document from an input steam, using the provided Parser.static ByteBufferreadToByteBuffer(InputStream inStream, int maxSize)Read the input stream into a byte buffer.
-
Field Details
-
Method Details
-
load
public static Document load(File file, @Nullable String charsetName, String baseUri) throws IOExceptionLoads and parses a file to a Document, with the HtmlParser. Files that are compressed with gzip (and end in.gzor.z) are supported in addition to uncompressed files.- Parameters:
file- file to loadcharsetName- (optional) character set of input; specifynullto attempt to autodetect. A BOM in the file will always override this setting.baseUri- base URI of document, to resolve relative links against- Returns:
- Document
- Throws:
IOException- on IO error
-
load
public static Document load(File file, @Nullable String charsetName, String baseUri, Parser parser) throws IOExceptionLoads and parses a file to a Document. Files that are compressed with gzip (and end in.gzor.z) are supported in addition to uncompressed files.- Parameters:
file- file to loadcharsetName- (optional) character set of input; specifynullto attempt to autodetect. A BOM in the file will always override this setting.baseUri- base URI of document, to resolve relative links againstparser- alternateparserto use.- Returns:
- Document
- Throws:
IOException- on IO error- Since:
- 1.14.2
-
load
public static Document load(InputStream in, @Nullable String charsetName, String baseUri) throws IOExceptionParses a Document from an input steam.- Parameters:
in- input stream to parse. The stream will be closed after reading.charsetName- character set of input (optional)baseUri- base URI of document, to resolve relative links against- Returns:
- Document
- Throws:
IOException- on IO error
-
load
public static Document load(InputStream in, @Nullable String charsetName, String baseUri, Parser parser) throws IOExceptionParses a Document from an input steam, using the provided Parser.- Parameters:
in- input stream to parse. The stream will be closed after reading.charsetName- character set of input (optional)baseUri- base URI of document, to resolve relative links againstparser- alternateparserto use.- Returns:
- Document
- Throws:
IOException- on IO error
-
readToByteBuffer
Read the input stream into a byte buffer. To deal with slow input streams, you may interrupt the thread this method is executing on. The data read until being interrupted will be available.- Parameters:
inStream- the input stream to read frommaxSize- the maximum size in bytes to read from the stream. Set to 0 to be unlimited.- Returns:
- the filled byte buffer
- Throws:
IOException- if an exception occurs whilst reading from the input stream.
-