Learn R Programming

⚠️There's a newer version (0.9.9) of this package.Take me there.

fdm2id (version 0.9.6)

Data Mining and R Programming for Beginners

Description

Contains functions to simplify the use of data mining methods (classification, regression, clustering, etc.), for students and beginners in R programming. Various R packages are used and wrappers are built around the main functions, to standardize the use of data mining methods (input/output): it brings a certain loss of flexibility, but also a gain of simplicity. The package name came from the French "Fouille de Données en Master 2 Informatique Décisionnelle".

Copy Link

Version

Install

install.packages('fdm2id')

Monthly Downloads

439

Version

0.9.6

License

GPL-3

Maintainer

Alexandre Blansch<c3><a9>

Last Published

October 29th, 2021

Functions in fdm2id (0.9.6)

CART

Classification using CART
BAGGING

Classification using Bagging
STUMP

Classification using one-level decision tree
SVRl

Regression using Support Vector Machine with a linear kernel
britpop

Population and location of 18 major british cities.
SVR

Regression using Support Vector Machine
boxclus

Clustering Box Plots
KERREG

Kernel Regression
APRIORI

Classification using APRIORI
LR

Classification using Logistic Regression
HCA

Hierarchical Cluster Analysis method
ADABOOST

Classification using AdaBoost
TSNE

t-distributed Stochastic Neighbor Embedding
CA

Correspondence Analysis (CA)
accident2014

Sample of car accident location in the UK during year 2014.
compare.kappa

Comparison of two sets of clusters, using kappa
SVRr

Regression using Support Vector Machine with a radial kernel
confusion

Confuion matrix
TEXTMINING

Text mining
MCA

Multiple Correspondence Analysis (MCA)
alcohol

Alcohol dataset
data.parabol

Parabol dataset
apriori-class

APRIORI classification model
data.gauss

Gaussian mixture dataset
data1

"data1" dataset
cartdepth

Depth
cartinfo

CART information
closegraphics

Close a graphics device
compare

Comparison of two sets of clusters
PCA

Principal Component Analysis (PCA)
SVD

Singular Value Decomposition
NMF

Non-negative Matrix Factorization
cookies

Cookies dataset
cookplot

Plot the Cook's distance of a linear regression model
data2

"data2" dataset
decathlon

Decathlon dataset
dbs-class

DBSCAN model
EM

Expectation-Maximization clustering method
cartleafs

Number of Leafs
DBSCAN

DBSCAN clustering method
eucalyptus

Eucalyptus dataset
data.target1

Target1 dataset
boosting-class

Boosting methods model
birth

Birth dataset
cartplot

CART Plot
SVM

Classification using Support Vector Machine
cda-class

Canonical Disciminant Analysis model
exportgraphics

Open a graphics device
factorial-class

Factorial analysis results
ionosphere

Ionosphere dataset
cartnodes

Number of Nodes
intern.intraclass

Clustering evaluation through intraclass inertia
intern.interclass

Clustering evaluation through interclass inertia
filter.rules

Filtering a set of rules
evaluation

Evaluation of classification or regression predictions
frequentwords

Frequent words
dataset-class

Training set and test set
evaluation.msep

MSEP evaluation of regression predictions
data3

"data3" dataset
data.target2

Target2 dataset
keiser

Keiser rule
evaluation.precision

Precision of classification predictions
KNN

Classification using k-NN
MLPREG

Multi-Layer Perceptron Regression
KMEANS

K-means method
FEATURESELECTION

Classification with Feature selection
LINREG

Linear Regression
LDA

Classification using Linear Discriminant Analysis
GRADIENTBOOSTING

Classification using Gradient Boosting
NB

Classification using Naive Bayes
evaluation.r2

R2 evaluation of regression predictions
data.twomoons

Two moons dataset
SVMr

Classification using Support Vector Machine with a radial kernel
SVMl

Classification using Support Vector Machine with a linear kernel
correlated

Correlated variables
POLYREG

Polynomial Regression
QDA

Classification using Quadratic Discriminant Analysis
compare.accuracy

Comparison of two sets of clusters, using accuracy
beetles

Flea beetles dataset
autompg

Auto MPG dataset
compare.jaccard

Comparison of two sets of clusters, using Jaccard index
data.diag

Square dataset
credit

Credit dataset
plot.factorial

Plot function for factorial-class
evaluation.recall

Recall of classification predictions
plotcloud

Plot word cloud
plotdata

Advanced plot function
plot.som

Plot function for som-class
plotzipf

Plot rank versus frequency
distplot

Plot a k-distance graphic
cost.curves

Plot Cost Curves
ozone

Ozone dataset
predict.cda

Model predictions
params-class

Learning Parameters
snore

Snore dataset
som-class

Self-Organizing Maps model
data.xor

XOR dataset
evaluation.fowlkesmallows

Fowlkes–Mallows index
em-class

Expectation-Maximization model
plotclus

Generic Plot Method for Clustering
evaluation.goodness

Goodness
predict.boosting

Model predictions
predict.apriori

Model predictions
predict.dbs

Predict function for DBSCAN
predict.selection

Model predictions
predict.model

Model predictions
roc.curves

Plot ROC Curves
spectral-class

Spectral clustering model
vowels

Vowels dataset
spine

Spine dataset
leverageplot

Plot the leverage points of a linear regression model
evaluation.fmeasure

F-measure
evaluation.accuracy

Accuracy of classification predictions
reg1

reg1 dataset
reg2

reg2 dataset
rotation

Rotation
evaluation.jaccard

Jaccard index
predict.knn

Model predictions
linsep

Linsep dataset
predict.meanshift

Predict function for MeanShift
exportgraphics.off

Toggle graphic exports
treeplot

Dendrogram Plots
query.words

Word query
query.docs

Document query
regplot

Plot function for a regression model
intern

Clustering evaluation through internal criteria
resplot

Plot the studentized residuals of a linear regression model
intern.dunn

Clustering evaluation through Dunn's index
evaluation.kappa

Kappa evaluation of classification predictions
general.rules

Remove redundancy in a set of rules
textmining-class

Text mining object
titanic

Titanic dataset
vectorize.words

Word vectorization
vectorizer-class

Document vectorization object
temperature

Temperature dataset
summary.apriori

Print summary of a classification model obtained by APRIORI
wheat

Wheat dataset
getvocab

Extract words and phrases from a corpus
kmeans.getk

Estimation of the number of clusters for K-means
knn-class

K Nearest Neighbours model
movies

Movies dataset
predict.kmeans

Predict function for K-means
print.apriori

Print a classification model obtained by APRIORI
meanshift-class

MeanShift model
model-class

Generic classification or regression model
predict.em

Predict function for EM
loadtext

load a text file
predict.textmining

Model predictions
wine

Wine dataset
performance

Performance estimation
zoo

Zoo dataset
print.factorial

Plot function for factorial-class
plot.cda

Plot function for cda-class
pseudoF

Pseudo-F
scatterplot

Clustering Scatter Plots
runningtime

Running time
stability

Clustering evaluation through stability
splitdata

Splits a dataset into training set and test set
selectfeatures

Feature selection for classification
selection-class

Feature selection
universite

University dataset
vectorize.docs

Document vectorization
MLP

Classification using Multilayer Perceptron
MEANSHIFT

MeanShift method
CDA

Classification using Canonical Discriminant Analysis
SPECTRAL

Spectral clustering method
RANDOMFOREST

Classification using Random Forest
SOM

Self-Organizing Maps clustering method