How to remove UIMA annotations?

Question

How to remove UIMA annotations?

I use some UIMA annotators in the pipeline. It runs tasks such as:

tokenizer
offer separator
gazetizer
My annotator

The problem is that I don’t want to write all annotations (tokens, sentences, subtitles, time, myAnnotations, etc.) to disk, because the files become very fast.

I want to delete all annotations and keep only those created by My Annotator .

I work with the following libraries:

uimaFIT 2.0.0
ClearTK 1.4.1
Maven

And I use org.apache.uima.fit.pipeline.SimplePipelinewith:

SimplePipeline.runPipeline(
    UriCollectionReader.getCollectionReaderFromDirectory(filesDirectory), //directory with text files
    UriToDocumentTextAnnotator.getDescription(),
    StanfordCoreNLPAnnotator.getDescription(),//stanford tokenize, ssplit, pos, lemma, ner, parse, dcoref
    AnalysisEngineFactory.createEngineDescription(//
        XWriter.class, 
        XWriter.PARAM_OUTPUT_DIRECTORY_NAME, outputDirectory,
        XWriter.PARAM_FILE_NAMER_CLASS_NAME, ViewURIFileNamer.class.getName())
);

What I'm trying to do is use the Standford NLP annotator (from ClearTK) and remove the useless annotation.

How to do it?

From what I know, you can use the method removeFromIndexes();from an Annotation instance.

UIMA- ?

+5

java nlp uima

German Attanasio 30 . '14 17:14

3

: MyAnnotator XWriter , , .

+2

Renaud 31 . '14 12:30

German Attanasios java 8 , - TypePrefix:

public void filterAnnotations(JCas jcas, String annotationTypePrefix) {

    JCasUtil.selectAll(jcas)
            .stream()
            .filter(t -> !t.getType().getName().startsWith(annotationTypePrefix))
            .forEach(TOP::removeFromIndexes);
}

0

nadre 06 . '18 14:40

German Attanasio · Accepted Answer · 2014-01-01T23:11:10+0000

, :

public class AnnotationRemover extends JCasAnnotator_ImplBase {
    public static AnalysisEngineDescription getDescription() throws ResourceInitializationException {
        return AnalysisEngineFactory.createEngineDescription(AnnotationRemover.class);
    }

    public void initialize(UimaContext context) throws ResourceInitializationException {
        super.initialize(context);
    }

    public void process(JCas jCas) throws AnalysisEngineProcessException {
        List<TOP> tops = new ArrayList<TOP>(JCasUtil.selectAll(jCas));
        for (TOP t : tops) {
            if (!t.getType().getName().equals("mypackage.MyAnnotation")) 
                t.removeFromIndexes();
            }
        }
}

, mypackage.MyAnnotation

How to remove UIMA annotations?

More articles: