Package org.jsoup.parser
Contains the HTML parser, tag specifications, and HTML tokeniser.
Class descriptions
CharacterReader consumes tokens off a string.
HTML Tree Builder; creates a DOM from Tokens.
The Tree Builder's current state.
A Parse Error records an error in the input HTML that occurs in either the tokenisation or the tree building phase.
A container for ParseErrors.
Parses HTML or XML into a Document.
Controls parser case settings, to optionally preserve tag and/or attribute name case.
A StreamParser provides a progressive parse of its input.
A Tag represents an Element's name and configured options, common throughout the Document.
A TagSet controls the Tag configuration for a Document's parse, and its serialization.
Parse tokens for the Tokeniser.
XmlDeclaration - extends Tag for pseudo attribute support.
A value holder for Tokens, as the stream is tokenized.
Reads the input stream into tokens.
States and transition activations for the Tokeniser.
A character reader with helpers focusing on parsing CSS selectors.
Use the XmlTreeBuilder when you want to parse XML without any of the HTML DOM rules being applied to the document.
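
The descriptions above mention parsing HTML or XML into a Document and optionally preserving tag name case. The following is a minimal sketch of how those pieces might be used together, assuming the jsoup library is on the classpath. The class name ParserSketch and its helper methods are hypothetical and exist only for illustration; the jsoup calls used are Parser.htmlParser(), Parser.xmlParser(), Parser.settings(ParseSettings), Parser.parseInput(String, String) and ParseSettings.preserveCase.

```java
import org.jsoup.nodes.Document;
import org.jsoup.parser.ParseSettings;
import org.jsoup.parser.Parser;

// Hypothetical helper class, not part of jsoup.
public class ParserSketch {

    // Parse with the HTML tree builder and default settings,
    // which normalize tag names to lower case.
    static Document parseHtml(String input) {
        return Parser.htmlParser().parseInput(input, "");
    }

    // Parse with the HTML tree builder, but keep the tag and
    // attribute name case found in the input.
    static Document parseHtmlPreservingCase(String input) {
        Parser parser = Parser.htmlParser().settings(ParseSettings.preserveCase);
        return parser.parseInput(input, "");
    }

    // Parse with the XML tree builder, so no HTML DOM rules
    // (such as adding html/head/body elements) are applied.
    static Document parseXml(String input) {
        return Parser.xmlParser().parseInput(input, "");
    }

    public static void main(String[] args) {
        String markup = "<P>Hello</P>";
        System.out.println(parseHtml(markup).body().child(0).tagName());
        System.out.println(parseHtmlPreservingCase(markup).body().child(0).tagName());
        System.out.println(parseXml("<Item>One</Item>").child(0).tagName());
    }
}
```

The empty string passed as the second argument is the base URI, which only matters when relative links need to be resolved. Each helper builds a fresh Parser, so settings applied in one call do not leak into another.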