tosca (version 0.3-4)

filterCount: Subcorpus With Count Filter

Description

Generates a subcorpus by restricting the corpus to texts that contain at least a given number of words.

Usage

filterCount(...)

# S3 method for default
filterCount(text, count = 1L, out = c("text", "bin", "count"), ...)

# S3 method for textmeta
filterCount(object, count = 1L, out = c("text", "bin", "count"), filtermeta = TRUE, ...)

Value

A textmeta object if object is specified, otherwise only the filtered text. If a textmeta object is returned, its meta data are by default (filtermeta = TRUE) filtered to those texts that remain in the corpus.
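The effect of filtering the meta data can be illustrated with a small base R sketch. It does not use tosca; the `corpus` list below is a hypothetical stand-in for a textmeta object, and the assumption that the meta data frame has an `id` column matching the text names is made for illustration only.

```r
# Hypothetical stand-in for a textmeta-like object (not created with tosca):
# a named list of texts plus a meta data frame keyed by an assumed id column.
corpus <- list(
  text = list(A = "one two three", B = "one"),
  meta = data.frame(id = c("A", "B"),
                    title = c("First", "Second"),
                    stringsAsFactors = FALSE)
)

# Assumed outcome of some count filter: keep text A, drop text B.
keep <- c(A = TRUE, B = FALSE)
corpus$text <- corpus$text[keep]

# Restrict the meta data to the texts that are still in the corpus.
corpus$meta <- corpus$meta[corpus$meta$id %in% names(corpus$text), , drop = FALSE]

corpus$meta$id
```

With filtermeta = FALSE the analogous step on the meta data would simply be skipped, so the meta data would still describe texts that are no longer in the corpus.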

Arguments

...

Not used.

text

Not necessary if object is specified; otherwise it should be object$text, a list of article texts.

count

An integer giving the minimum number of words a text must contain to be kept.

out

Type of output: "text" returns the filtered corpus, "bin" a logical vector for all texts, "count" the word counts.

object

A textmeta object

filtermeta

Logical: Should the meta component be filtered, too?
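How count and out interact can be sketched in base R. This is a minimal illustration and not the package's implementation: the helper names `count_words` and `filter_count_sketch` are hypothetical, and splitting on whitespace is an assumed tokenization that may differ from what tosca does.

```r
# Hypothetical helper: number of whitespace-separated tokens per text.
count_words <- function(text) {
  vapply(text, function(x) {
    tokens <- unlist(strsplit(paste(x, collapse = " "), "\\s+"))
    sum(nzchar(tokens))  # ignore empty strings from leading whitespace
  }, integer(1))
}

# Hypothetical sketch of a count filter with three output modes.
filter_count_sketch <- function(text, count = 1L,
                                out = c("text", "bin", "count")) {
  out <- match.arg(out)
  counts <- count_words(text)
  keep <- counts >= count          # "at least count words"
  switch(out,
         text  = text[keep],       # filtered corpus
         bin   = keep,             # logical vector for all texts
         count = counts)           # word count for all texts
}

docs <- list(A = "one two three",
             B = "one",
             C = c("one two", "three four five"))

filter_count_sketch(docs, count = 3L)                 # keeps A and C
filter_count_sketch(docs, count = 3L, out = "bin")    # TRUE, FALSE, TRUE
filter_count_sketch(docs, count = 3L, out = "count")  # 3, 1, 5
```

Note that "bin" and "count" describe every input text, while "text" returns only the texts that pass the threshold.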

Examples

texts <- list(A="Give a Man a Fish, and You Feed Him for a Day.
Teach a Man To Fish, and You Feed Him for a Lifetime",
B="So Long, and Thanks for All the Fish",
C="A very able manipulative mathematician, Fisher enjoys a real mastery
in evaluating complicated multiple integrals.")

filterCount(text=texts, count=10L)
filterCount(text=texts, count=10L, out="bin")
filterCount(text=texts, count=10L, out="count")