|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
public interface TextExtract
The interface for external modules to extract the text from a document.
Such modules are declared in /configuration/maps.xml
using the text-extract element as in the following example:
<text-extract>
<mime-type>text/*</mime-type>
<!-- Additional mime-type elements can come here. -->
<!-- Several path elements can come here. -->
<class>
<name>be.re.repo.mod.TextExtractPlainText</name>
<!-- Optional URL relative to maps.xml.
<jar>modules/my_module.jar</jar>
-->
</class>
</text-extract>
| Method Summary | |
|---|---|
Reader |
get(String vcr,
String version,
InputStream in,
String mimeType,
Context context)
The method should return the complete text from the document. |
| Method Detail |
|---|
Reader get(String vcr,
String version,
InputStream in,
String mimeType,
Context context)
throws IOException
vcr - the local path of the resource.version - the version path of the resource.in - the inputstream for the document.mimeType - the MIME type of the document.context - the repository context.
IOException
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||