com.atlassian.bonnie.search.extractor
Class PdfContentExtractor

java.lang.Object
  extended bycom.atlassian.bonnie.search.extractor.BaseAttachmentContentExtractor
      extended bycom.atlassian.bonnie.search.extractor.PdfContentExtractor
All Implemented Interfaces:
Extractor

public class PdfContentExtractor
extends BaseAttachmentContentExtractor


Field Summary
static org.apache.log4j.Category log
           
 
Constructor Summary
PdfContentExtractor()
           
 
Method Summary
protected  java.lang.String extractText(java.io.InputStream is, SearchableAttachment attachment)
          Package access for unit testing only.
protected  java.lang.String[] getMatchingContentTypes()
           
protected  java.lang.String[] getMatchingFileExtensions()
           
 
Methods inherited from class com.atlassian.bonnie.search.extractor.BaseAttachmentContentExtractor
addFields, shouldExtractFrom
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

log

public static final org.apache.log4j.Category log
Constructor Detail

PdfContentExtractor

public PdfContentExtractor()
Method Detail

getMatchingContentTypes

protected java.lang.String[] getMatchingContentTypes()
Overrides:
getMatchingContentTypes in class BaseAttachmentContentExtractor

getMatchingFileExtensions

protected java.lang.String[] getMatchingFileExtensions()
Overrides:
getMatchingFileExtensions in class BaseAttachmentContentExtractor

extractText

protected java.lang.String extractText(java.io.InputStream is,
                                       SearchableAttachment attachment)
                                throws ExtractorException
Description copied from class: BaseAttachmentContentExtractor
Package access for unit testing only. Do not use this method directly. Use #addFields().

Specified by:
extractText in class BaseAttachmentContentExtractor
Parameters:
is - a stream containing the attachment contents
attachment - contains useful attachment metadata, e.g. filename
Returns:
a String with a textual representation of the attachment's contents
Throws:
ExtractorException - if there is a problem with converting the attachment content into text. A wrapper around the original exception.


Copyright © 2006-2009 Atlassian Software Systems Pty Ltd. All Rights Reserved.