degu.util.textstripper
Interface PageSupport

All Known Implementing Classes:
PageSupportedTextStripper, PDFBoxTextStripper

public interface PageSupport

implemented by all text strippers for mime types with a certain concept of "pages" (e.g. pdf)


Method Summary
 java.lang.String stripAsString(java.io.InputStream is, int page)
          strips a single page out of the document
 java.lang.String stripAsString(java.io.InputStream is, int begin, int end)
          strips all pages from begin to end into a String
 java.lang.String[] stripAsStringArray(java.io.InputStream is)
          strips the document into a string array, each entry represents one page
 java.lang.String[] stripAsStringArray(java.io.InputStream is, int begin, int end)
          strips the document pages beginning from page nr.
 

Method Detail

stripAsString

java.lang.String stripAsString(java.io.InputStream is,
                               int page)
                               throws FailedToStripText
strips a single page out of the document

Parameters:
is -
page - the page to be stripped
Returns:
Throws:
FailedToStripText

stripAsString

java.lang.String stripAsString(java.io.InputStream is,
                               int begin,
                               int end)
                               throws FailedToStripText,
                                      InvalidPageIndexException
strips all pages from begin to end into a String

Parameters:
is -
begin -
end -
Returns:
Throws:
InvalidPageIndexException
FailedToStripText

stripAsStringArray

java.lang.String[] stripAsStringArray(java.io.InputStream is)
                                      throws FailedToStripText
strips the document into a string array, each entry represents one page

Parameters:
is -
Returns:
Throws:
FailedToStripText

stripAsStringArray

java.lang.String[] stripAsStringArray(java.io.InputStream is,
                                      int begin,
                                      int end)
                                      throws FailedToStripText,
                                             InvalidPageIndexException
strips the document pages beginning from page nr. "begin" to page nr. "end" into a string array, each entry represents one page, respectively

Parameters:
is -
Returns:
Throws:
FailedToStripText
InvalidPageIndexException