Class Tokeniser


  • final class Tokeniser
    extends java.lang.Object
    Readers the input stream into tokens.
    • Field Detail

      • notCharRefCharsSorted

        private static final char[] notCharRefCharsSorted
      • win1252Extensions

        static final int[] win1252Extensions
      • emitPending

        private Token emitPending
      • isEmitPending

        private boolean isEmitPending
      • charsString

        private java.lang.String charsString
      • charsBuilder

        private java.lang.StringBuilder charsBuilder
      • dataBuffer

        java.lang.StringBuilder dataBuffer
      • lastStartTag

        private java.lang.String lastStartTag
      • codepointHolder

        private final int[] codepointHolder
      • multipointHolder

        private final int[] multipointHolder
    • Method Detail

      • emit

        void emit​(Token token)
      • emit

        void emit​(java.lang.String str)
      • emit

        void emit​(char[] chars)
      • emit

        void emit​(int[] codepoints)
      • emit

        void emit​(char c)
      • consumeCharacterReference

        int[] consumeCharacterReference​(java.lang.Character additionalAllowedCharacter,
                                        boolean inAttribute)
      • createTagPending

        Token.Tag createTagPending​(boolean start)
      • emitTagPending

        void emitTagPending()
      • createCommentPending

        void createCommentPending()
      • emitCommentPending

        void emitCommentPending()
      • createDoctypePending

        void createDoctypePending()
      • emitDoctypePending

        void emitDoctypePending()
      • createTempBuffer

        void createTempBuffer()
      • isAppropriateEndTagToken

        boolean isAppropriateEndTagToken()
      • appropriateEndTagName

        java.lang.String appropriateEndTagName()
      • characterReferenceError

        private void characterReferenceError​(java.lang.String message)
      • error

        void error​(java.lang.String errorMsg)
      • currentNodeInHtmlNS

        boolean currentNodeInHtmlNS()
      • unescapeEntities

        java.lang.String unescapeEntities​(boolean inAttribute)
        Utility method to consume reader and unescape entities found within.
        Parameters:
        inAttribute - if the text to be unescaped is in an attribute
        Returns:
        unescaped string from reader