Learn R Programming

urltools

A package for elegantly handling and parsing URLs from within R.

Author: Oliver Keyes, Jay Jacobs License: MIT Status: Stable

Description

URLs in R are often treated as nothing more than part of data retrieval - they're used for making connections and reading data. With web analytics and research, however, URLs can be the data, and R's default handlers are not best suited to handle vectorised operations over large datasets. urltools is intended to solve this.

It contains drop-in replacements for R's URLdecode and URLencode functions, along with new functionality such as a URL parser and parameter value extractor. In all cases, the functions are designed to be content-safe (not breaking on unexpected values) and fully vectorised, resulting in a dramatic speed improvement over existing implementations - crucial for large datasets. For more information, see the urltools vignette.

Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.

Installation

The latest CRAN version can be obtained via:

install.packages("urltools")

To get the development version:

devtools::install_github("ironholds/urltools")

Dependencies

Copy Link

Version

Install

install.packages('urltools')

Monthly Downloads

28,710

Version

1.7.3

License

MIT + file LICENSE

Issues

Pull Requests

Stars

Forks

Maintainer

Last Published

April 14th, 2019

Functions in urltools (1.7.3)

port

Get or set a URL's port
param_get

get the values of a URL's parameters
tld_refresh

Retrieve a TLD dataset
url_compose

Recompose Parsed URLs
puny_encode

Encode or Decode Internationalised Domains
tld_extract

Extract TLDs
url_parse

split URLs into their component parts
suffix_refresh

Retrieve a public suffix dataset
suffix_extract

extract the suffix from domain names
tld_dataset

Dataset of top-level domains (TLDs)
scheme

Get or set a URL's scheme
suffix_dataset

Dataset of public suffixes
urltools

Tools for handling URLs
strip_credentials

Get or remove user authentication credentials
domain

Get or set a URL's domain
host_extract

Extract hosts
param_remove

Remove key-value pairs from query strings
param_set

Set the value associated with a parameter in a URL's query.
parameters

Get or set a URL's parameters
url_decode

Encode or decode a URI
path

Get or set a URL's path
fragment

Get or set a URL's fragment