Class DictionaryBreakEngine

java.lang.Object
com.ibm.icu.impl.breakiter.DictionaryBreakEngine
All Implemented Interfaces:
LanguageBreakEngine
Direct Known Subclasses:
BurmeseBreakEngine, CjkBreakEngine, KhmerBreakEngine, LaoBreakEngine, LSTMBreakEngine, ThaiBreakEngine

public abstract class DictionaryBreakEngine extends Object implements LanguageBreakEngine
  • Field Details

  • Constructor Details

    • DictionaryBreakEngine

      public DictionaryBreakEngine()
      Constructor
  • Method Details

    • handles

      public boolean handles(int c)
      Specified by:
      handles in interface LanguageBreakEngine
      Parameters:
      c - A Unicode codepoint value
      Returns:
      true if the engine can handle this character, false otherwise
    • findBreaks

      public int findBreaks(CharacterIterator text, int startPos, int endPos, DictionaryBreakEngine.DequeI foundBreaks, boolean isPhraseBreaking)
      Description copied from interface: LanguageBreakEngine
      Implements the actual breaking logic. Find any breaks within a run in the supplied text.
      Specified by:
      findBreaks in interface LanguageBreakEngine
      Parameters:
      text - The text to break over. The iterator is left at the end of the run of characters which the engine has handled.
      startPos - The index of the beginning of the range
      endPos - The index of the possible end of our range. It is possible, however, that the range ends earlier
      foundBreaks - A data structure to receive the break positions.
      Returns:
      the number of breaks found
    • setCharacters

      void setCharacters(UnicodeSet set)
    • divideUpDictionaryRange

      abstract int divideUpDictionaryRange(CharacterIterator text, int rangeStart, int rangeEnd, DictionaryBreakEngine.DequeI foundBreaks, boolean isPhraseBreaking)

      Divide up a range of known dictionary characters handled by this break engine.

      Parameters:
      text - A UText representing the text
      rangeStart - The start of the range of dictionary characters
      rangeEnd - The end of the range of dictionary characters
      foundBreaks - Output of break positions. Positions are pushed. Pre-existing contents of the output stack are unaltered.
      Returns:
      The number of breaks found