All Classes and Interfaces
Class
Description
A single key + value attribute.
The attributes of an Element.
Handles per request Authenticator-based authentication.
On Java 8 we install a system-wide Authenticator, which pulls the delegating Auth from a ThreadLocal pool.
A Character Data node, to support CDATA sections.
CharacterReader consumes tokens off a string.
The
Safelist-based HTML cleaner.Collects a list of elements that match the supplied criteria.
Base combining (and, or) evaluator.
A comment node.
The Connection interface is a convenient HTTP client and session object to fetch content from the web, and parse them
into Documents.
Common methods for Requests and Responses
A Key:Value tuple(+), used for form data.
GET and POST http methods.
Represents a HTTP request.
Represents a HTTP response.
A jsoup internal class (so don't use it as there is no contract API) that enables controls on a buffered input stream,
namely a maximum read size, and the ability to Thread.interrupt() the read.
Helper functions to support the Cookie Manager / Cookie Storage in HttpConnection.
A data node, for contents of style, script tags etc, where contents should not show in text().
Internal static utilities for handling data.
A struct to return a detected charset, and a document (if fully read).
A HTML Document.
A Document's output settings control the form of the text() and html() methods.
The output serialization syntax.
A
<!DOCTYPE> node.An HTML Element consists of a tag name, attributes, and child nodes (including text nodes and other elements).
A list of
Elements, with methods that act on every element in the list.HTML entities, and escape routines.
Holds packed data that represents Entity name=value pairs.
An Evaluator tests if an element (or a node) meets the selector's requirements.
Evaluator for any / all element matching
Evaluator for attribute name matching
Abstract evaluator for attribute name/value matching
Evaluator for attribute name prefix matching
Evaluator for attribute name/value matching
Evaluator for attribute name/value matching (value containing)
Evaluator for attribute name/value matching (value ending)
Evaluator for attribute name/value matching (value regex matching)
Evaluator for attribute name != value matching
Evaluator for attribute name/value matching (value prefix)
Evaluator for element class
Evaluator for matching Element (and its descendants) data
Evaluator for matching Element's own text
Evaluator for matching Element (and its descendants) text
Evaluator for matching Element (but not its descendants) wholeText.
Evaluator for matching Element (and its descendants) wholeText.
Evaluator for element id
Evaluator for matching by sibling index number (e = idx)
Abstract evaluator for sibling index matching
Evaluator for matching by sibling index number (e > idx)
Evaluator for matching by sibling index number (e < idx)
Evaluator for matching the first sibling (css :first-child)
Evaluator for matching the last sibling (css :last-child)
css-compatible Evaluator for :eq (css :nth-child)
css pseudo class :nth-last-child)
css pseudo class nth-of-type
css3 pseudo-class :root
Evaluator for matching Element (and its descendants) text with regex
Evaluator for matching Element's own text with regex
Evaluator for matching Element's own whole text with regex.
Evaluator for matching Element (and its descendants) whole text with regex.
Deprecated.
This selector is deprecated and will be removed in jsoup 1.24.1.
Evaluator for tag name
Evaluator for tag name that ends with suffix; used for *|el
Evaluator for tag name that starts with prefix; used for ns|*
An HTML Form Element provides ready access to the form fields/controls that are associated with it.
Deprecated.
for removal in jsoup 1.23.1.
HTML to plain-text.
HTML Tree Builder; creates a DOM from Tokens.
The Tree Builder's current state.
Implementation of
Connection.Signals that a HTTP request resulted in a not OK HTTP response.
The core public access point to the jsoup functionality.
A node that does not hold any children.
Example program to list links from a URL.
The base, abstract Node model.
Matches nodes with no value or only whitespace.
A controllable Node visitor interface.
Traversal action.
Iterate through a Node and its tree of descendants, in document order, and returns nodes of the specified type.
A list of
Node objects, with methods that act on every node in the list.A depth-first node traversor.
Internal helpers for Nodes, to keep the actual node APIs relatively clean.
Node visitor interface, used to walk the DOM and visit each node.
Util methods for normalizing strings.
A Parse Error records an error in the input HTML that occurs in either the tokenisation or the tree building phase.
A container for ParseErrors.
Parses HTML or XML into a
Document.Controls parser case settings, to optionally preserve tag and/or attribute name case.
Base Printer
Outline Printer
Pretty Printer
Deprecated.
use
Element.selectNodes(String, Class) instead, with selector of ::textnode and class TextNode;
will be removed in jsoup 1.24.1.Parses a CSS selector into an Evaluator tree.
A jsoup internal class to wrap an Appendable and throw IOExceptions as SerializationExceptions.
A version that wraps a StringBuilder, and so doesn't need the exception wrap.
A Range object tracks the character positions in the original input source where a Node starts or ends.
A Position object tracks the character position in the original input source where a Node starts or ends.
A regular expression abstraction.
A
RequestAuthenticator is used in Connection to authenticate if required to proxies and web
servers.Provides details for the request, to determine the appropriate credentials to return.
Handles requests using either HttpClient (available in JVM 11+) or HttpURLConnection.
A shim interface to support both HttpURLConnection and HttpClient implementations, in a multi-version jar.
Safelists define what HTML (elements and attributes) to allow through a
Cleaner.CSS element selector, that finds elements matching a query.
A SerializationException is raised whenever serialization of a DOM element fails.
jsoup constants used between packages.
A simple implementation of a buffered input stream, in which we can control the byte[] buffer to recycle it.
A simple decoding InputStreamReader that recycles internal buffers.
A SoftPool is a ThreadLocal that holds a SoftReference to a pool of initializable objects.
A StreamParser provides a progressive parse of its input.
A minimal String utility class.
A StringJoiner allows incremental / filtered joining of a set of stringable objects.
Base structural evaluator.
Any Ancestor (i.e., ascending parent chain.).
Holds a list of evaluators for one > two > three immediate parent matches, and the final direct evaluator under
test.
Implements the :is(sub-query) pseudo-selector
A Tag represents an Element's name and configured options, common throughout the Document.
A TagSet controls the
Tag configuration for a Document's parse, and its serialization.A text node.
Parse tokens for the Tokeniser.
XmlDeclaration - extends Tag for pseudo attribute support
A value holder for Tokens, as the stream is Tokenized.
Readers the input stream into tokens.
States and transition activations for the Tokeniser.
A character reader with helpers focusing on parsing CSS selectors.
Signals that a HTTP response returned a mime type that is not supported.
A utility class to normalize input URLs.
Execute HTTP requests using the HttpURLConnection implementation.
Validators to check that method arguments meet expectations.
Validation exceptions, as thrown by the methods in
Validate.Helper class to transform a
Document to a org.w3c.dom.Document,
for integration with toolsets that use the W3C DOM.Implements the conversion by walking the input.
A simple example, used on the jsoup website.
An XML Declaration.
Use the
XmlTreeBuilder when you want to parse XML without any of the HTML DOM rules being applied to the
document.