rdomains: Classify Domains Based on Their Content

The package provides several ways to classify domains based on their content. You can get categorizations from Shallalist (the service has shut down; the latest data you will get is from 1/14/22), Trusted (McAfee), DMOZ (the service has ended; the data is now hosted at https://curlie.org), the Alexa API (which uses the DMOZ data), or the VirusTotal API. Alternatively, you can use validated machine learning models trained on the Shallalist data.

Installation

To get the current release version from CRAN:

install.packages("rdomains")

To get the current development version from GitHub:

# install.packages("devtools")
devtools::install_github("themains/rdomains", build_vignettes = TRUE)

Usage

To learn how to use rdomains, launch the vignette within R:

vignette("rdomains", package = "rdomains")
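If the vignette was not built during installation, the call above may not find it. The sketch below shows one way to check what an installed copy of a package offers, using only base R and utils. The helper name `describe_package` is hypothetical and is not part of rdomains; `shalla_cat` is taken from the function index of this package.

```r
# Hypothetical helper (not part of rdomains): summarise what an installed
# package offers, using only base R and utils.
describe_package <- function(pkg) {
  if (!requireNamespace(pkg, quietly = TRUE)) {
    message(pkg, " is not installed; install it before running this sketch.")
    return(invisible(NULL))
  }
  list(
    # Vignettes shipped with the installed copy (empty if none were built)
    vignettes = utils::vignette(package = pkg)$results[, c("Item", "Title"), drop = FALSE],
    # Names of all exported objects
    exports = sort(getNamespaceExports(pkg))
  )
}

info <- describe_package("rdomains")
if (!is.null(info)) {
  print(info$vignettes)
  print(info$exports)
  # Formal arguments of one exported function, read from the installed code
  print(args(rdomains::shalla_cat))
}
```

Reading the arguments with `args()` avoids guessing at a signature: whatever the installed version accepts is what gets printed.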

License

Scripts are released under the MIT License.

Monthly Downloads: 198

Version: 0.2.1

License: MIT + file LICENSE

Last Published: January 15th, 2022

Functions in rdomains (0.2.1)

virustotal_cat: Get Category from Virustotal

dmoz_cat: Get Category from DMOZ

uni_cat: Get Category from University Domain List

shalla_cat: Get Category from Shallalist

rdomains-package: rdomains: Classify Domains by their Content

glm_shalla: ML Model

get_shalla_data: Get Shalla Data

get_dmoz_data: Get DMOZ Data

adult_ml1_cat: Probability that Domain Hosts Adult Content Based on features of Domain Name and Suffix alone.

brightcloud_cat: Get Category from Brightcloud

not_news: Classify News and Non-News Based on keywords in the URL

alexa_cat: Get Category from Alexa

get_alexa_data: Get Alexa Traffic Data
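
To give a feel for what keyword-based URL classification (the idea described for not_news) can look like, here is a minimal base R sketch. It is an illustration under stated assumptions, not the package's implementation: the function name `classify_url`, the keyword lists, and the output columns are all hypothetical, and the patterns rdomains actually uses may differ.

```r
# Hypothetical keyword lists chosen for illustration only;
# they are not taken from rdomains.
news_keywords <- c("politi", "world", "national", "business", "opinion")
non_news_keywords <- c("sport", "entertainment", "lifestyle", "travel", "recipe")

# Hypothetical helper: flag each URL by whether it contains any keyword
# from the news list and from the non-news list.
classify_url <- function(urls) {
  urls <- tolower(urls)
  news_pattern <- paste(news_keywords, collapse = "|")
  non_news_pattern <- paste(non_news_keywords, collapse = "|")
  data.frame(
    url = urls,
    news = grepl(news_pattern, urls),
    not_news = grepl(non_news_pattern, urls),
    stringsAsFactors = FALSE
  )
}

result <- classify_url(c(
  "http://www.bbc.com/sport",
  "http://example.com/politics/election"
))
print(result)
```

Keeping two separate flags rather than a single label makes it visible when a URL matches both lists or neither, which a simple keyword approach cannot resolve on its own.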