Enum Class TokeniserState

java.lang.Object
java.lang.Enum<TokeniserState>
org.jsoup.parser.TokeniserState
All Implemented Interfaces:
Serializable, Comparable<TokeniserState>, Constable

enum TokeniserState extends Enum<TokeniserState>
States and transition activations for the Tokeniser.
  • Enum Constant Details

    • Data

      public static final TokeniserState Data
    • CharacterReferenceInData

      public static final TokeniserState CharacterReferenceInData
    • Rcdata

      public static final TokeniserState Rcdata
    • CharacterReferenceInRcdata

      public static final TokeniserState CharacterReferenceInRcdata
    • Rawtext

      public static final TokeniserState Rawtext
    • ScriptData

      public static final TokeniserState ScriptData
    • PLAINTEXT

      public static final TokeniserState PLAINTEXT
    • TagOpen

      public static final TokeniserState TagOpen
    • EndTagOpen

      public static final TokeniserState EndTagOpen
    • TagName

      public static final TokeniserState TagName
    • RcdataLessthanSign

      public static final TokeniserState RcdataLessthanSign
    • RCDATAEndTagOpen

      public static final TokeniserState RCDATAEndTagOpen
    • RCDATAEndTagName

      public static final TokeniserState RCDATAEndTagName
    • RawtextLessthanSign

      public static final TokeniserState RawtextLessthanSign
    • RawtextEndTagOpen

      public static final TokeniserState RawtextEndTagOpen
    • RawtextEndTagName

      public static final TokeniserState RawtextEndTagName
    • ScriptDataLessthanSign

      public static final TokeniserState ScriptDataLessthanSign
    • ScriptDataEndTagOpen

      public static final TokeniserState ScriptDataEndTagOpen
    • ScriptDataEndTagName

      public static final TokeniserState ScriptDataEndTagName
    • ScriptDataEscapeStart

      public static final TokeniserState ScriptDataEscapeStart
    • ScriptDataEscapeStartDash

      public static final TokeniserState ScriptDataEscapeStartDash
    • ScriptDataEscaped

      public static final TokeniserState ScriptDataEscaped
    • ScriptDataEscapedDash

      public static final TokeniserState ScriptDataEscapedDash
    • ScriptDataEscapedDashDash

      public static final TokeniserState ScriptDataEscapedDashDash
    • ScriptDataEscapedLessthanSign

      public static final TokeniserState ScriptDataEscapedLessthanSign
    • ScriptDataEscapedEndTagOpen

      public static final TokeniserState ScriptDataEscapedEndTagOpen
    • ScriptDataEscapedEndTagName

      public static final TokeniserState ScriptDataEscapedEndTagName
    • ScriptDataDoubleEscapeStart

      public static final TokeniserState ScriptDataDoubleEscapeStart
    • ScriptDataDoubleEscaped

      public static final TokeniserState ScriptDataDoubleEscaped
    • ScriptDataDoubleEscapedDash

      public static final TokeniserState ScriptDataDoubleEscapedDash
    • ScriptDataDoubleEscapedDashDash

      public static final TokeniserState ScriptDataDoubleEscapedDashDash
    • ScriptDataDoubleEscapedLessthanSign

      public static final TokeniserState ScriptDataDoubleEscapedLessthanSign
    • ScriptDataDoubleEscapeEnd

      public static final TokeniserState ScriptDataDoubleEscapeEnd
    • BeforeAttributeName

      public static final TokeniserState BeforeAttributeName
    • AttributeName

      public static final TokeniserState AttributeName
    • AfterAttributeName

      public static final TokeniserState AfterAttributeName
    • BeforeAttributeValue

      public static final TokeniserState BeforeAttributeValue
    • AttributeValue_doubleQuoted

      public static final TokeniserState AttributeValue_doubleQuoted
    • AttributeValue_singleQuoted

      public static final TokeniserState AttributeValue_singleQuoted
    • AttributeValue_unquoted

      public static final TokeniserState AttributeValue_unquoted
    • AfterAttributeValue_quoted

      public static final TokeniserState AfterAttributeValue_quoted
    • SelfClosingStartTag

      public static final TokeniserState SelfClosingStartTag
    • BogusComment

      public static final TokeniserState BogusComment
    • MarkupDeclarationOpen

      public static final TokeniserState MarkupDeclarationOpen
    • MarkupProcessingOpen

      public static final TokeniserState MarkupProcessingOpen
    • CommentStart

      public static final TokeniserState CommentStart
    • CommentStartDash

      public static final TokeniserState CommentStartDash
    • Comment

      public static final TokeniserState Comment
    • CommentEndDash

      public static final TokeniserState CommentEndDash
    • CommentEnd

      public static final TokeniserState CommentEnd
    • CommentEndBang

      public static final TokeniserState CommentEndBang
    • Doctype

      public static final TokeniserState Doctype
    • BeforeDoctypeName

      public static final TokeniserState BeforeDoctypeName
    • DoctypeName

      public static final TokeniserState DoctypeName
    • AfterDoctypeName

      public static final TokeniserState AfterDoctypeName
    • AfterDoctypePublicKeyword

      public static final TokeniserState AfterDoctypePublicKeyword
    • BeforeDoctypePublicIdentifier

      public static final TokeniserState BeforeDoctypePublicIdentifier
    • DoctypePublicIdentifier_doubleQuoted

      public static final TokeniserState DoctypePublicIdentifier_doubleQuoted
    • DoctypePublicIdentifier_singleQuoted

      public static final TokeniserState DoctypePublicIdentifier_singleQuoted
    • AfterDoctypePublicIdentifier

      public static final TokeniserState AfterDoctypePublicIdentifier
    • BetweenDoctypePublicAndSystemIdentifiers

      public static final TokeniserState BetweenDoctypePublicAndSystemIdentifiers
    • AfterDoctypeSystemKeyword

      public static final TokeniserState AfterDoctypeSystemKeyword
    • BeforeDoctypeSystemIdentifier

      public static final TokeniserState BeforeDoctypeSystemIdentifier
    • DoctypeSystemIdentifier_doubleQuoted

      public static final TokeniserState DoctypeSystemIdentifier_doubleQuoted
    • DoctypeSystemIdentifier_singleQuoted

      public static final TokeniserState DoctypeSystemIdentifier_singleQuoted
    • AfterDoctypeSystemIdentifier

      public static final TokeniserState AfterDoctypeSystemIdentifier
    • BogusDoctype

      public static final TokeniserState BogusDoctype
    • DoctypeInternalSubset

      public static final TokeniserState DoctypeInternalSubset
    • CdataSection

      public static final TokeniserState CdataSection
  • Field Details

    • nullChar

      static final char nullChar
      See Also:
    • attributeNameCharsSorted

      static final char[] attributeNameCharsSorted
    • attributeValueUnquoted

      static final char[] attributeValueUnquoted
    • replacementChar

      private static final char replacementChar
      See Also:
    • replacementStr

      private static final String replacementStr
    • eof

      private static final char eof
      See Also:
  • Constructor Details

    • TokeniserState

      private TokeniserState()
  • Method Details

    • values

      public static TokeniserState[] values()
      Returns an array containing the constants of this enum class, in the order they are declared.
      Returns:
      an array containing the constants of this enum class, in the order they are declared
    • valueOf

      public static TokeniserState valueOf(String name)
      Returns the enum constant of this class with the specified name. The string must match exactly an identifier used to declare an enum constant in this class. (Extraneous whitespace characters are not permitted.)
      Parameters:
      name - the name of the enum constant to be returned.
      Returns:
      the enum constant with the specified name
      Throws:
      IllegalArgumentException - if this enum class has no constant with the specified name
      NullPointerException - if the argument is null
    • read

      abstract void read(Tokeniser t, CharacterReader r)
    • handleDataEndTag

      private static void handleDataEndTag(Tokeniser t, CharacterReader r, TokeniserState elseTransition)
      Handles RawtextEndTagName, ScriptDataEndTagName, and ScriptDataEscapedEndTagName. Same body impl, just different else exit transitions.
    • readRawData

      private static void readRawData(Tokeniser t, CharacterReader r, TokeniserState current, TokeniserState advance)
    • readCharRef

      private static void readCharRef(Tokeniser t, TokeniserState advance)
    • readEndTag

      private static void readEndTag(Tokeniser t, CharacterReader r, TokeniserState a, TokeniserState b)
    • handleDataDoubleEscapeTag

      private static void handleDataDoubleEscapeTag(Tokeniser t, CharacterReader r, TokeniserState primary, TokeniserState fallback)
    • readDoctypeInternalSubset

      private static void readDoctypeInternalSubset(Tokeniser t, CharacterReader r, TokeniserState current)
      Reads an XML doctype internal subset as opaque text so it can be re-emitted without parsing declarations. Only used when in XML mode; HTML spec will drop these as Bogus.