cbind_dependencies

Annotated results of <code>udpipe_annotate</code> contain dependency parsing results which indicate
how each word is linked to another word and the relation between these 2 words. 
This information is available in the fields token_id, head_token_id and dep_rel which indicates how each token
is linked to the parent. The type of relation (dep_rel) is defined at 
<a href="https://universaldependencies.org/u/dep/index.html">https://universaldependencies.org/u/dep/index.html</a>. 
For example in the text 'The economy is weak but the outlook is bright', the term economy is linked to weak
as the term economy is the nominal subject of weak. 
This function adds the parent or child information to the annotated data.frame.

This natural language processing toolkit provides language-agnostic
'tokenization', 'parts of speech tagging', 'lemmatization' and 'dependency
parsing' of raw text. Next to text parsing, the package also allows you to train
annotation models based on data of 'treebanks' in 'CoNLL-U' format as provided
at <https://universaldependencies.org/format.html>. The techniques are explained
in detail in the paper: 'Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0
with UDPipe', available at <doi:10.18653/v1/K17-3009>.
The toolkit also contains functionalities for commonly used data manipulations on texts
which are enriched with the output of the parser. Namely functionalities and algorithms
for collocations, token co-occurrence, document term matrix handling,
term frequency inverse document frequency calculations,
information retrieval metrics (Okapi BM25), handling of multi-word expressions,
keyword detection (Rapid Automatic Keyword Extraction, noun phrase extraction, syntactical patterns)
sentiment scoring and semantic similarity analysis.

Jan Wijffels

udpipe

Tokenization, Parts of Speech Tagging, Lemmatization and
Dependency Parsing with the 'UDPipe' 'NLP' Toolkit

BNOSAC 

Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague, Czech Republic 

Milan Straka 

Jana Straková 

cbind_dependencies function

<dl><dt>x</dt>
<dd>a data.frame or data.table as returned by <code>as.data.frame(udpipe_annotate(...))</code></dd>
<dt>type</dt>
<dd>either one of 'parent', 'child', 'parent_rowid', 'child_rowid'. 
Look to the return value section for more information on the difference in logic. 
Defaults to 'parent', indicating to add the information of the head_token_id to the dataset</dd>
<dt>recursive</dt>
<dd>in case when <code>type</code> is set to 'parent_rowid' or 'child_rowid', do you want the parent of the parent of the parent, ... or the child of the child of the child ... included. Defaults to FALSE indicating to only have the direct parent or children.</dd></dl>

Arguments

Annotated results of <code>udpipe_annotate</code> contain dependency parsing results which indicate
how each word is linked to another word and the relation between these 2 words. 
This information is available in the fields token_id, head_token_id and dep_rel which indicates how each token
is linked to the parent. The type of relation (dep_rel) is defined at 
<a href='https://universaldependencies.org/u/dep/index.html'>https://universaldependencies.org/u/dep/index.html</a>. 
For example in the text 'The economy is weak but the outlook is bright', the term economy is linked to weak
as the term economy is the nominal subject of weak. 
This function adds the parent or child information to the annotated data.frame.

Add the dependency parsing information to an annotated dataset — cbind_dependencies

<dl>

<dt>x</dt>
<dd>a data.frame or data.table as returned by <code>as.data.frame(udpipe_annotate(...))</code></dd>


<dt>type</dt>
<dd>either one of 'parent', 'child', 'parent_rowid', 'child_rowid'. 
Look to the return value section for more information on the difference in logic. 
Defaults to 'parent', indicating to add the information of the head_token_id to the dataset</dd>


<dt>recursive</dt>
<dd>in case when <code>type</code> is set to 'parent_rowid' or 'child_rowid', do you want the parent of the parent of the parent, ... or the child of the child of the child ... included. Defaults to FALSE indicating to only have the direct parent or children.</dd>

</dl>

cbind_dependencies: Add the dependency parsing information to an annotated dataset

Description

Usage

Value

Arguments

Details

Examples