fdm2id (version 0.9.9)

Data Mining and R Programming for Beginners

Description

Contains functions that simplify the use of data mining methods (classification, regression, clustering, etc.) for students and beginners in R programming. Wrappers are built around the main functions of various R packages to standardize the input and output of data mining methods: this costs some flexibility but gains simplicity. The package name comes from the French "Fouille de Données en Master 2 Informatique Décisionnelle".

Install

install.packages('fdm2id')

Monthly Downloads

439

Version

0.9.9

License

GPL-3

Maintainer

Alexandre Blansché

Last Published

June 12th, 2023

Functions in fdm2id (0.9.9)

DBSCAN

DBSCAN clustering method
EM

Expectation-Maximization clustering method
APRIORI

Classification using APRIORI
LR

Classification using Logistic Regression
MCA

Multiple Correspondence Analysis (MCA)
MEANSHIFT

MeanShift method
MLP

Classification using Multilayer Perceptron
LINREG

Linear Regression
NMF

Non-negative Matrix Factorization
LDA

Classification using Linear Discriminant Analysis
HCA

Hierarchical Cluster Analysis method
KERREG

Kernel Regression
STUMP

Classification using one-level decision tree
KMEANS

K-means method
SPECTRAL

Spectral clustering method
SVM

Classification using Support Vector Machine
POLYREG

Polynomial Regression
RANDOMFOREST

Classification using Random Forest
SVD

Singular Value Decomposition
KNN

Classification using k-NN
PCA

Principal Component Analysis (PCA)
boosting-class

Boosting methods model
britpop

Population and location of 18 major British cities.
cartdepth

Depth
boxclus

Clustering Box Plots
SVR

Regression using Support Vector Machine
QDA

Classification using Quadratic Discriminant Analysis
SVMl

Classification using Support Vector Machine with a linear kernel
SVMr

Classification using Support Vector Machine with a radial kernel
data.target1

Target1 dataset
data.xor

XOR dataset
data.parabol

Parabol dataset
evaluation.recall

Recall of classification predictions
data1

"data1" dataset
evaluation.r2

R2 evaluation of regression predictions
SVRl

Regression using Support Vector Machine with a linear kernel
compare.jaccard

Comparison of two sets of clusters, using Jaccard index
compare.kappa

Comparison of two sets of clusters, using kappa
apriori-class

APRIORI classification model
alcohol

Alcohol dataset
cookplot

Plot the Cook's distance of a linear regression model
cartplot

CART Plot
cartnodes

Number of Nodes
correlated

Correlated variables
SOM

Self-Organizing Maps clustering method
data3

"data3" dataset
evaluation

Evaluation of classification or regression predictions
data2

"data2" dataset
evaluation.accuracy

Accuracy of classification predictions
data.target2

Target2 dataset
data.twomoons

Two moons dataset
em-class

Expectation-Maximization model
eucalyptus

Eucalyptus dataset
kmeans.getk

Estimation of the number of clusters for K-means
exportgraphics

Open a graphics device
factorial-class

Factorial analysis results
MLPREG

Multi-Layer Perceptron Regression
NB

Classification using Naive Bayes
cartinfo

CART information
SVRr

Regression using Support Vector Machine with a radial kernel
evaluation.msep

MSEP evaluation of regression predictions
evaluation.precision

Precision of classification predictions
intern

Clustering evaluation through internal criteria
TEXTMINING

Text mining
model-class

Generic classification or regression model
intern.dunn

Clustering evaluation through Dunn's index
cartleafs

Number of Leaves
TSNE

t-distributed Stochastic Neighbor Embedding
accident2014

Sample of car accident locations in the UK during 2014.
knn-class

K Nearest Neighbours model
performance

Performance estimation
beetles

Flea beetles dataset
data.diag

Square dataset
evaluation.adjr2

Adjusted R2 evaluation of regression predictions
data.gauss

Gaussian mixture dataset
evaluation.fmeasure

F-measure
params-class

Learning Parameters
ozone

Ozone dataset
augmentation

Duplicate and add noise to a dataset
loadtext

Load a text file
meanshift-class

MeanShift model
leverageplot

Plot the leverage points of a linear regression model
linsep

Linsep dataset
plotcloud

Plot word cloud
print.apriori

Print a classification model obtained by APRIORI
plotavsp

Plot actual vs. predictions
predict.boosting

Model predictions
plot.cda

Plot function for cda-class
resplot

Plot the studentized residuals of a linear regression model
roc.curves

Plot ROC Curves
print.factorial

Print function for factorial-class
rotation

Rotation
titanic

Titanic dataset
autompg

Auto MPG dataset
predict.model

Model predictions
predict.meanshift

Predict function for MeanShift
predict.selection

Model predictions
predict.cda

Model predictions
predict.textmining

Model predictions
exportgraphics.off

Toggle graphic exports
cda-class

Canonical Discriminant Analysis model
birth

Birth dataset
credit

Credit dataset
cost.curves

Plot Cost Curves
closegraphics

Close a graphics device
splitdata

Splits a dataset into training set and test set
spine

Spine dataset
dataset-class

Training set and test set
plotzipf

Plot rank versus frequency
movies

Movies dataset
vectorizer-class

Document vectorization object
dbs-class

DBSCAN model
compare

Comparison of two sets of clusters
evaluation.jaccard

Jaccard index
query.docs

Document query
predict.apriori

Model predictions
pseudoF

Pseudo-F
scatterplot

Clustering Scatter Plots
selectfeatures

Feature selection for classification
cookies

Cookies dataset
confusion

Confusion matrix
compare.accuracy

Comparison of two sets of clusters, using accuracy
decathlon

Decathlon dataset
spectral-class

Spectral clustering model
som-class

Self-Organizing Maps model
treeplot

Dendrogram Plots
evaluation.kappa

Kappa evaluation of classification predictions
runningtime

Running time
general.rules

Remove redundancy in a set of rules
universite

University dataset
snore

Snore dataset
distplot

Plot a k-distance graphic
selection-class

Feature selection
ionosphere

Ionosphere dataset
kaiser

Kaiser rule
frequentwords

Frequent words
evaluation.fowlkesmallows

Fowlkes–Mallows index
filter.rules

Filtering a set of rules
evaluation.goodness

Goodness
plotclus

Generic Plot Method for Clustering
plotdata

Advanced plot function
vowels

Vowels dataset
zoo

Zoo dataset
predict.knn

Model predictions
predict.kmeans

Predict function for K-means
getvocab

Extract words and phrases from a corpus
reg2

reg2 dataset
regplot

Plot function for a regression model
stability

Clustering evaluation through stability
intern.intraclass

Clustering evaluation through intraclass inertia
intern.interclass

Clustering evaluation through interclass inertia
plot.factorial

Plot function for factorial-class
plot.som

Plot function for som-class
query.words

Word query
predict.em

Predict function for EM
predict.dbs

Predict function for DBSCAN
reg1

reg1 dataset
summary.apriori

Print summary of a classification model obtained by APRIORI
wheat

Wheat dataset
wine

Wine dataset
temperature

Temperature dataset
textmining-class

Text mining object
vectorize.docs

Document vectorization
vectorize.words

Word vectorization
BAGGING

Classification using Bagging
FEATURESELECTION

Classification with Feature selection
CA

Correspondence Analysis (CA)
GRADIENTBOOSTING

Classification using Gradient Boosting
CART

Classification using CART
ADABOOST

Classification using AdaBoost
CDA

Classification using Canonical Discriminant Analysis