Learn R Programming

⚠️There's a newer version (0.9.9) of this package.Take me there.

fdm2id (version 0.9.5)

Data Mining and R Programming for Beginners

Description

Contains functions to simplify the use of data mining methods (classification, regression, clustering, etc.), for students and beginners in R programming. Various R packages are used and wrappers are built around the main functions, to standardize the use of data mining methods (input/output): it brings a certain loss of flexibility, but also a gain of simplicity. The package name came from the French "Fouille de Donnes en Master 2 Informatique Dcisionnelle".

Copy Link

Version

Install

install.packages('fdm2id')

Monthly Downloads

439

Version

0.9.5

License

GPL-3

Maintainer

Alexandre Blansch<c3><a9>

Last Published

January 4th, 2021

Functions in fdm2id (0.9.5)

BAGGING

Classification using Bagging
EM

Expectation-Maximization clustering method
FEATURESELECTION

Classification with Feature selection
DBSCAN

DBSCAN clustering method
CART

Classification using CART
CDA

Classification using Canonical Discriminant Analysis
APRIORI

Classification using APRIORI
CA

Correspondence Analysis (CA)
ADABOOST

Classification using AdaBoost
GRADIENTBOOSTING

Classification using Gradient Boosting
HCA

Hierarchical Cluster Analysis method
SPECTRAL

Spectral clustering method
KERREG

Kernel Regression
birth

Birth dataset
NB

Classification using Naive Bayes
cartplot

CART Plot
MEANSHIFT

MeanShift method
boosting-class

Boosting methods model
KNN

Classification using k-NN
MLPREG

Multi-Layer Perceptron Regression
KMEANS

K-means method
LDA

Classification using Linear Discriminant Analysis
LR

Classification using Logistic Regression
RANDOMFOREST

Classification using Random Forest
SVD

Singular Value Decomposition
POLYREG

Polynomial Regression
QDA

Classification using Quadratic Discriminant Analysis
LINREG

Linear Regression
SVM

Classification using Support Vector Machine
MLP

Classification using Multilayer Perceptron
autompg

Auto MPG dataset
accident2014

Sample of car accident location in the UK during year 2014.
TSNE

t-distributed Stochastic Neighbor Embedding
cda-class

Canonical Disciminant Analysis model
SOM

Self-Organizing Maps clustering method
cartdepth

Depth
SVMl

Classification using Support Vector Machine with a linear kernel
MCA

Multiple Correspondence Analysis (MCA)
NMF

Non-negative Matrix Factorization
cartinfo

CART information
cost.curves

Plot Cost Curves
compare

Comparison of two sets of clusters
closegraphics

Close a graphics device
correlated

Correlated variables
PCA

Principal Component Analysis (PCA)
SVMr

Classification using Support Vector Machine with a radial kernel
data1

"data1" dataset
STUMP

Classification using one-level decision tree
beetles

Flea beetles dataset
evaluation.accuracy

Accuracy of classification predictions
cookplot

Plot the Cook's distance of a linear regression model
cookies

Cookies dataset
data2

"data2" dataset
eucalyptus

Eucalyptus dataset
SVRr

Regression using Support Vector Machine with a radial kernel
SVR

Regression using Support Vector Machine
evaluation.fmeasure

F-measure
alcohol

Alcohol dataset
SVRl

Regression using Support Vector Machine with a linear kernel
apriori-class

APRIORI classification model
data3

"data3" dataset
frequentwords

Frequent words
evaluation

Evaluation of classification or regression predictions
compare.kappa

Comparison of two sets of clusters, using kappa
filter.rules

Filtering a set of rules
compare.accuracy

Comparison of two sets of clusters, using accuracy
compare.jaccard

Comparison of two sets of clusters, using Jaccard index
loadtext

load a text file
dataset-class

Training set and test set
credit

Credit dataset
data.diag

Square dataset
confusion

Confuion matrix
boxclus

Clustering Box Plots
TEXTMINING

Text mining
britpop

Population and location of 18 major british cities.
meanshift-class

MeanShift model
plot.som

Plot function for som-class
plot.factorial

Plot function for factorial-class
predict.apriori

Model predictions
distplot

Plot a k-distance graphic
data.gauss

Gaussian mixture dataset
predict.boosting

Model predictions
em-class

Expectation-Maximization model
factorial-class

Factorial analysis results
exportgraphics

Open a graphics device
leverageplot

Plot the leverage points of a linear regression model
data.parabol

Parabol dataset
data.xor

XOR dataset
data.twomoons

Two moons dataset
evaluation.msep

MSEP evaluation of regression predictions
linsep

Linsep dataset
predict.knn

Model predictions
predict.meanshift

Predict function for MeanShift
print.apriori

Print a classification model obtained by APRIORI
predict.textmining

Model predictions
snore

Snore dataset
som-class

Self-Organizing Maps model
evaluation.jaccard

Jaccard index
evaluation.kappa

Kappa evaluation of classification predictions
cartleafs

Number of Leafs
zoo

Zoo dataset
wine

Wine dataset
temperature

Temperature dataset
cartnodes

Number of Nodes
summary.apriori

Print summary of a classification model obtained by APRIORI
data.target1

Target1 dataset
ionosphere

Ionosphere dataset
data.target2

Target2 dataset
general.rules

Remove redundancy in a set of rules
dbs-class

DBSCAN model
getvocab

Extract words and phrases from a corpus
kmeans.getk

Estimation of the number of clusters for K-means
knn-class

K Nearest Neighbours model
intern.dunn

Clustering evaluation through Dunn's index
intern

Clustering evaluation through internal criteria
evaluation.precision

Precision of classification predictions
plotclus

Generic Plot Method for Clustering
plotcloud

Plot word cloud
plotdata

Advanced plot function
performance

Performance estimation
plot.cda

Plot function for cda-class
decathlon

Decathlon dataset
pseudoF

Pseudo-F
vectorizer-class

Document vectorization object
print.factorial

Plot function for factorial-class
vectorize.words

Word vectorization
plotzipf

Plot rank versus frequency
query.docs

Document query
resplot

Plot the studentized residuals of a linear regression model
query.words

Word query
regplot

Plot function for a regression model
keiser

Keiser rule
reg1

reg1 dataset
model-class

Generic classification or regression model
evaluation.goodness

Goodness
evaluation.fowlkesmallows

Fowlkes<U+2013>Mallows index
movies

Movies dataset
selection-class

Feature selection
exportgraphics.off

Toggle graphic exports
selectfeatures

Feature selection for classification
predict.dbs

Predict function for DBSCAN
predict.cda

Model predictions
evaluation.r2

R2 evaluation of regression predictions
treeplot

Dendrogram Plots
universite

University dataset
evaluation.recall

Recall of classification predictions
vectorize.docs

Document vectorization
stability

Clustering evaluation through stability
splitdata

Splits a dataset into training set and test set
intern.interclass

Clustering evaluation through interclass inertia
ozone

Ozone dataset
predict.em

Predict function for EM
intern.intraclass

Clustering evaluation through intraclass inertia
predict.kmeans

Predict function for K-means
params-class

Learning Parameters
predict.model

Model predictions
predict.selection

Model predictions
roc.curves

Plot ROC Curves
rotation

Rotation
textmining-class

Text mining object
titanic

Titanic dataset
vowels

Vowels dataset
spectral-class

Spectral clustering model
runningtime

Running time
reg2

reg2 dataset
scatterplot

Clustering Scatter Plots
spine

Spine dataset
wheat

Wheat dataset