superml

The goal of superml is to provide scikit-learn's standard fit, predict, and transform interface for building machine learning models in R. It is built on top of recent R packages that provide optimized ways of training machine learning models.

Installation

You can install superml from GitHub with:

# install.packages("devtools")
devtools::install_github("saraswatmks/superml")

Description

In superml, every machine learning algorithm is called a trainer. The following trainers are currently available:

  • LMTrainer: Linear, logistic, ridge, and lasso models
  • RFTrainer: Random Forest Model
  • KNNTrainer: K-Nearest Neighbour Model
  • KMeansTrainer: KMeans Model
  • XGBTrainer: XGBoost Model

In addition, there are other useful functions to support modeling tasks such as:

  • CountVectorizer: Create Bag of Words model
  • TfIdfVectorizer: Create TF-IDF feature model
  • LabelEncoder: Convert categorical features to numeric
  • GridSearchTrainer: For hyperparameter optimization
  • RandomSearchTrainer: For hyperparameter optimization
  • kFoldMean: Target encoding
  • smoothMean: Target encoding
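
Target encoding replaces each category with a summary of the target values observed for that category. The sketch below illustrates the general idea in base R using additive smoothing toward the global mean. It is not superml's implementation: the helper `smooth_target_encode` and its parameter `m` are hypothetical names chosen for this illustration, and the signatures of kFoldMean and smoothMean are not shown here.

```r
# Hypothetical helper (not part of superml): smoothed target encoding.
# Each category is replaced by a blend of its own target mean and the
# global target mean; m controls how strongly small groups are pulled
# toward the global mean.
smooth_target_encode <- function(categories, target, m = 2) {
  global_mean <- mean(target)
  cat_sum <- tapply(target, categories, sum)     # per-category target sum
  cat_n <- tapply(target, categories, length)    # per-category row count
  encoded <- (cat_sum + m * global_mean) / (cat_n + m)
  # look up the encoded value for every row, dropping names and dims
  as.numeric(encoded[as.character(categories)])
}

city <- c("a", "a", "b", "b", "b", "c")
y <- c(1, 0, 1, 1, 0, 1)
enc <- smooth_target_encode(city, y, m = 2)
print(round(enc, 3))
```

Categories with few rows, such as "c" in this toy data, end up closer to the global mean than their raw category mean, which is the usual motivation for smoothing.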

Usage

Any machine learning model can be trained using the following steps:

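data(iris)
library(superml)

# create a random forest trainer with 100 trees
rf <- RFTrainer$new(n_estimators = 100)
# fit on the iris data frame; "Species" names the target column
rf$fit(iris, "Species")
# predict on a data frame with the same feature columns
pred <- rf$predict(iris)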
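
The example above predicts on the same rows it was trained on. A minimal sketch of the same steps with a held-out subset is given below. The 70/30 split, the seed, and the variable names are assumptions made for illustration; only the `RFTrainer$new()`, `$fit()`, and `$predict()` calls shown above are used, and the model step is skipped when superml is not installed.

```r
# Sketch: hold out part of iris so predictions are made on unseen rows.
data(iris)
set.seed(42)                                   # reproducible split
n <- nrow(iris)
train_idx <- sample(n, size = floor(0.7 * n))  # 70% of rows for training
train <- iris[train_idx, ]
test <- iris[-train_idx, ]

# Fit and predict only if superml is available in this R session.
if (requireNamespace("superml", quietly = TRUE)) {
  library(superml)
  rf <- RFTrainer$new(n_estimators = 100)
  rf$fit(train, "Species")
  pred <- rf$predict(test)
  print(head(pred))
}
```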

Install

install.packages('superml')

Monthly Downloads: 404

Version: 0.1.0

License: GPL-3 | file LICENSE

Maintainer: Manish Saraswat

Last Published: December 30th, 2018

Functions in superml (0.1.0)

  • cla_train: cla_train
  • kFoldMean: kFoldMean Calculator
  • smoothMean: smoothMean Calculator
  • reg_train: reg_train
  • bm25: Best Matching (BM25)
  • XGBTrainer: Extreme Gradient Boosting Model
  • RFTrainer: Random Forest Model
  • Counter: Calculate count of values in a list or vector
  • GridSearchTrainer: Grid Search Trainer
  • KMeansTrainer: K-Means Trainer
  • LabelEncoder: Label Encoder
  • CountVectorizer: Count Vectorizer
  • RandomSearchTrainer: Random Search Trainer
  • TfIdfVectorizer: TF-IDF (Term Frequency Inverse Document Frequency) Vectorizer
  • KNNTrainer: K Nearest Neighbours Trainer
  • LMTrainer: Linear Models Trainer