com.atlassian.bonnie.search.extractor
Class MsWordContentExtractor

java.lang.Object
  extended bycom.atlassian.bonnie.search.extractor.BaseAttachmentContentExtractor
      extended bycom.atlassian.bonnie.search.extractor.MsWordContentExtractor
All Implemented Interfaces:
Extractor

public class MsWordContentExtractor
extends BaseAttachmentContentExtractor


Field Summary
static org.apache.log4j.Category log
           
 
Constructor Summary
MsWordContentExtractor()
           
 
Method Summary
protected  java.lang.String extractText(java.io.InputStream is, SearchableAttachment attachment)
          Package access for unit testing only.
protected  java.lang.String[] getMatchingContentTypes()
           
protected  java.lang.String[] getMatchingFileExtensions()
           
 
Methods inherited from class com.atlassian.bonnie.search.extractor.BaseAttachmentContentExtractor
addFields, shouldExtractFrom
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

log

public static final org.apache.log4j.Category log
Constructor Detail

MsWordContentExtractor

public MsWordContentExtractor()
Method Detail

getMatchingContentTypes

protected java.lang.String[] getMatchingContentTypes()
Overrides:
getMatchingContentTypes in class BaseAttachmentContentExtractor

getMatchingFileExtensions

protected java.lang.String[] getMatchingFileExtensions()
Overrides:
getMatchingFileExtensions in class BaseAttachmentContentExtractor

extractText

protected java.lang.String extractText(java.io.InputStream is,
                                       SearchableAttachment attachment)
                                throws ExtractorException
Description copied from class: BaseAttachmentContentExtractor
Package access for unit testing only. Do not use this method directly. Use #addFields().

Specified by:
extractText in class BaseAttachmentContentExtractor
Parameters:
is - a stream containing the attachment contents
attachment - contains useful attachment metadata, e.g. filename
Returns:
a String with a textual representation of the attachment's contents
Throws:
ExtractorException - if there is a problem with converting the attachment content into text. A wrapper around the original exception.


Copyright © 2006-2009 Atlassian Software Systems Pty Ltd. All Rights Reserved.