All Classes
| Class | Description |
|---|---|
| Attribute |
A single key + value attribute.
|
| Attributes |
The attributes of an Element.
|
| CDataNode |
A Character Data node, to support CDATA sections.
|
| ChangeNotifyingArrayList<E> |
Implementation of ArrayList that watches out for changes to the contents.
|
| CharacterReader |
CharacterReader consumes tokens off a string.
|
| Cleaner |
The safelist based HTML cleaner.
|
| Collector |
Collects a list of elements that match the supplied criteria.
|
| Comment |
A comment node.
|
| Connection |
The Connection interface is a convenient HTTP client and session object to fetch content from the web, and parse them
into Documents.
|
| Connection.Base<T extends Connection.Base<T>> |
Common methods for Requests and Responses
|
| Connection.KeyVal |
A Key:Value tuple(+), used for form data.
|
| Connection.Method |
GET and POST http methods.
|
| Connection.Request |
Represents a HTTP request.
|
| Connection.Response |
Represents a HTTP response.
|
| ConstrainableInputStream |
A jsoup internal class (so don't use it as there is no contract API) that enables constraints on an Input Stream,
namely a maximum read size, and the ability to Thread.interrupt() the read.
|
| DataNode |
A data node, for contents of style, script tags etc, where contents should not show in text().
|
| DataUtil |
Internal static utilities for handling data.
|
| Document |
A HTML Document.
|
| Document.OutputSettings |
A Document's output settings control the form of the text() and html() methods.
|
| Document.OutputSettings.Syntax |
The output serialization syntax.
|
| Document.QuirksMode | |
| DocumentType |
A
<!DOCTYPE> node. |
| Element |
A HTML element consists of a tag name, attributes, and child nodes (including text nodes and
other elements).
|
| Elements |
A list of
Elements, with methods that act on every element in the list. |
| Entities |
HTML entities, and escape routines.
|
| Entities.EscapeMode | |
| Evaluator |
Evaluates that an element matches the selector.
|
| Evaluator.AllElements |
Evaluator for any / all element matching
|
| Evaluator.Attribute |
Evaluator for attribute name matching
|
| Evaluator.AttributeKeyPair |
Abstract evaluator for attribute name/value matching
|
| Evaluator.AttributeStarting |
Evaluator for attribute name prefix matching
|
| Evaluator.AttributeWithValue |
Evaluator for attribute name/value matching
|
| Evaluator.AttributeWithValueContaining |
Evaluator for attribute name/value matching (value containing)
|
| Evaluator.AttributeWithValueEnding |
Evaluator for attribute name/value matching (value ending)
|
| Evaluator.AttributeWithValueMatching |
Evaluator for attribute name/value matching (value regex matching)
|
| Evaluator.AttributeWithValueNot |
Evaluator for attribute name != value matching
|
| Evaluator.AttributeWithValueStarting |
Evaluator for attribute name/value matching (value prefix)
|
| Evaluator.Class |
Evaluator for element class
|
| Evaluator.ContainsData |
Evaluator for matching Element (and its descendants) data
|
| Evaluator.ContainsOwnText |
Evaluator for matching Element's own text
|
| Evaluator.ContainsText |
Evaluator for matching Element (and its descendants) text
|
| Evaluator.CssNthEvaluator | |
| Evaluator.Id |
Evaluator for element id
|
| Evaluator.IndexEquals |
Evaluator for matching by sibling index number (e = idx)
|
| Evaluator.IndexEvaluator |
Abstract evaluator for sibling index matching
|
| Evaluator.IndexGreaterThan |
Evaluator for matching by sibling index number (e > idx)
|
| Evaluator.IndexLessThan |
Evaluator for matching by sibling index number (e < idx)
|
| Evaluator.IsEmpty | |
| Evaluator.IsFirstChild |
Evaluator for matching the first sibling (css :first-child)
|
| Evaluator.IsFirstOfType | |
| Evaluator.IsLastChild |
Evaluator for matching the last sibling (css :last-child)
|
| Evaluator.IsLastOfType | |
| Evaluator.IsNthChild |
css-compatible Evaluator for :eq (css :nth-child)
|
| Evaluator.IsNthLastChild |
css pseudo class :nth-last-child)
|
| Evaluator.IsNthLastOfType | |
| Evaluator.IsNthOfType |
css pseudo class nth-of-type
|
| Evaluator.IsOnlyChild | |
| Evaluator.IsOnlyOfType | |
| Evaluator.IsRoot |
css3 pseudo-class :root
|
| Evaluator.Matches |
Evaluator for matching Element (and its descendants) text with regex
|
| Evaluator.MatchesOwn |
Evaluator for matching Element's own text with regex
|
| Evaluator.MatchText | |
| Evaluator.Tag |
Evaluator for tag name
|
| Evaluator.TagEndsWith |
Evaluator for tag name that ends with
|
| FieldsAreNonnullByDefault | |
| FormElement |
A HTML Form Element provides ready access to the form fields/controls that are associated with it.
|
| HtmlToPlainText |
HTML to plain-text.
|
| HtmlTreeBuilder |
HTML Tree Builder; creates a DOM from Tokens.
|
| HttpConnection |
Implementation of
Connection. |
| HttpConnection.KeyVal | |
| HttpConnection.Request | |
| HttpConnection.Response | |
| HttpStatusException |
Signals that a HTTP request resulted in a not OK HTTP response.
|
| Jsoup |
The core public access point to the jsoup functionality.
|
| ListLinks |
Example program to list links from a URL.
|
| Node |
The base, abstract Node model.
|
| NodeFilter |
Node filter interface.
|
| NodeFilter.FilterResult |
Filter decision.
|
| NodeTraversor |
Depth-first node traversor.
|
| NodeVisitor |
Node visitor interface.
|
| NonnullByDefault | |
| Normalizer |
Util methods for normalizing strings.
|
| ParseError |
A Parse Error records an error in the input HTML that occurs in either the tokenisation or the tree building phase.
|
| ParseErrorList |
A container for ParseErrors.
|
| Parser |
Parses HTML into a
Document. |
| ParseSettings |
Controls parser settings, to optionally preserve tag and/or attribute name case.
|
| PseudoTextElement | |
| QueryParser |
Parses a CSS selector into an Evaluator tree.
|
| ReturnsAreNonnullByDefault | |
| Safelist |
Safe-lists define what HTML (elements and attributes) to allow through the cleaner.
|
| Selector |
CSS-like element selector, that finds elements matching a query.
|
| Selector.SelectorParseException | |
| SerializationException |
A SerializationException is raised whenever serialization of a DOM element fails.
|
| StringUtil |
A minimal String utility class.
|
| StringUtil.StringJoiner |
A StringJoiner allows incremental / filtered joining of a set of stringable objects.
|
| Tag |
HTML Tag capabilities.
|
| TextNode |
A text node.
|
| TokenQueue |
A character queue with parsing helpers.
|
| UncheckedIOException | |
| UnsupportedMimeTypeException |
Signals that a HTTP response returned a mime type that is not supported.
|
| Validate |
Simple validation methods.
|
| W3CDom |
Helper class to transform a
Document to a org.w3c.dom.Document,
for integration with toolsets that use the W3C DOM. |
| W3CDom.W3CBuilder |
Implements the conversion by walking the input.
|
| Whitelist | Deprecated.
As of release
v1.14.1, this class is deprecated in favour of Safelist. |
| Wikipedia |
A simple example, used on the jsoup website.
|
| XmlDeclaration |
An XML Declaration.
|
| XmlTreeBuilder |
Use the
XmlTreeBuilder when you want to parse XML without any of the HTML DOM rules being applied to the
document. |