sentenceSimil

sentenceId

A character vector of sentence IDs corresponding to the <code>docId</code> and <code>token</code> arguments.

token

A character vector of tokens corresponding to the <code>docId</code> and <code>sentenceId</code> arguments.

docId

A character vector of document IDs corresponding to the <code>sentenceId</code> and <code>token</code> arguments. Can be <code>NULL</code> if <code>sentencesAsDocs</code> is <code>TRUE</code>.

sentencesAsDocs

<code>TRUE</code> or <code>FALSE</code>, indicating whether to treat sentences as documents when calculating tf-idf scores. If <code>TRUE</code>, inverse document frequency is calculated as inverse sentence frequency (useful for single-document extractive summarization).

Compute distance between sentences using modified idf cosine distance from "LexRank: Graph-based Lexical Centrality as Salience in Text Summarization". Output can be used as input to <code><a rd-options="" href="/link/lexRankFromSimil?package=lexRankr&version=0.5.2" data-mini-rdoc="lexRankr::lexRankFromSimil">lexRankFromSimil</a></code>.
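To make the measure concrete, here is a minimal base R sketch of the idf-modified cosine as defined in the LexRank paper. It is not lexRankr's internal code: the helper name <code>idf_modified_cosine</code> is hypothetical, the idf definition <code>log(N / n_w)</code> is taken from the paper and may differ from the package's weighting, and the result is returned as a square matrix rather than in the package's output format.

```r
# Illustrative sketch only: idf_modified_cosine() is a hypothetical helper,
# not part of lexRankr, and its idf weighting is an assumption from the paper.
idf_modified_cosine <- function(sentenceId, token, docId) {
  # idf_w = log(N / n_w): N documents, n_w documents containing token w
  n_docs <- length(unique(docId))
  doc_freq <- vapply(split(docId, token),
                     function(d) length(unique(d)), numeric(1))
  idf <- log(n_docs / doc_freq)

  # Term-frequency matrix: one row per sentence, one column per token
  tf <- unclass(table(sentenceId, token))

  # Weight every token column by its idf
  w <- sweep(tf, 2, idf[colnames(tf)], "*")

  # Cosine between every pair of weighted sentence vectors.
  # A sentence whose tokens all have idf 0 has norm 0 and yields NaN.
  norms <- sqrt(rowSums(w^2))
  (w %*% t(w)) / outer(norms, norms)
}

# Three one-sentence documents, two tokens each
simil <- idf_modified_cosine(
  sentenceId = c("s1", "s1", "s2", "s2", "s3", "s3"),
  token      = c("cat", "sat", "cat", "ran", "dog", "ran"),
  docId      = c("d1", "d1", "d2", "d2", "d3", "d3")
)
simil
```

Sentences that share no token (here <code>s1</code> and <code>s3</code>) score zero, and a token that appears in every document contributes nothing because its idf is zero under this definition.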

An R implementation of the LexRank algorithm described by G. Erkan and D. R. Radev (2004) <DOI:10.1613/jair.1523>.

Adam Spannbauer

lexRankr

Extractive Summarization of Text with the LexRank Algorithm

sentenceSimil: Compute distance between sentences

Description

Usage

Arguments

Value

References

Examples
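A minimal call, assuming the lexRankr package is installed, might look like the following. The three vectors are parallel (one element per token), and the toy data are made up for illustration.

```r
library(lexRankr)

# Four tokens drawn from two one-sentence documents.
# Each position describes one token: its sentence, the token itself,
# and the document that sentence belongs to.
simil <- sentenceSimil(
  sentenceId = c("d1_1", "d1_1", "d2_1", "d2_1"),
  token      = c("i", "ran", "jane", "ran"),
  docId      = c("d1", "d1", "d2", "d2")
)
simil
```

For single-document summarization, the argument descriptions indicate that <code>sentencesAsDocs = TRUE</code> can be passed instead, in which case <code>docId</code> may be <code>NULL</code>.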