Class LimitedTextContentExtractor

  • All Implemented Interfaces:
    Extractor2

    public class LimitedTextContentExtractor
    extends BaseAttachmentContentExtractor
    A subclass of BaseAttachmentContentExtractor which places a limit on how many bytes of the input stream are read into memory. This prevents it from potentially reading in huge attachment streams that trigger memory starvation.

    This may have the side-effect of some content not being indexed if it is to be found "beyond" the limit, but that's preferable to an OOME.

    The default value was changed from fixed 10Mb to be in line with the value set for Attachments:

    Since:
    7.17
    See Also:
    AttachmentExtractedTextExtractor
    • Constructor Detail

      • LimitedTextContentExtractor

        public LimitedTextContentExtractor()