The result is similar to the crawled pages report found in the raw data export (https://ahrefs.com/site-explorer/export/v2/csv/subdomains/live?target=ahrefs.com) in Site Explorer.
rah_pages(target, token = Sys.getenv("AHREFS_AUTH_TOKEN"),
mode = "domain", metrics = NULL, limit = 1000, order_by = NULL,
where = NULL, having = NULL)
character string. Aim of a request: a domain, a directory or a URL
character string. Authentication token. Should be available through enviromental variables
after authentication with function rah_auth()
character string. Mode of operation: exact, domain, subdomains or prefix. See more in Details section
character vector of columns to select. See more in Details section
integer. Number of results to return
character vector of columns to sort on. See more in Details section
character string - a condition created by rah_condition_set()
function that generates proper
"where"
condition to satisfy. See more in Details section
character string - a condition created by rah_condition_set()
function that generates proper
"having"
condition to satisfy. See more in Details section
data frame
1. available metrics - you can select which columns (metrics) you want to download and which one
would be useful in filtering, BUT not all of them can always be used in "where"
&
"having"
conditions:
Column | Type | Where | Having | Description |
url | string | + | + | URL of the crawled page. |
ahrefs_rank | int | + | + | URL Rating of the page. |
first_seen | date | + | + | The date Ahrefs' bot first found a backlink to your target website or URL on a given referring page. |
last_visited | date | + | + | Most recent date when the Ahrefs crawler was able to crawl the page. |
http_code | int | + | + | HTTP code that was last returned for the page. |
size | int | + | + | Size of the crawled page, in bytes. |
links_internal | int | + | + | Number of internal links found in the crawled page. |
links_external | int | + | + | Number of external links found in the crawled page. |
encoding | string | + | + | Character encoding of the page, for example "utf8" or "iso-8859-1" (Latin-1). |
title | string | + | + | Title of the crawled page. |
redirect_url | string | + | + | URL where the page redirects to. |
2. "mode"
parameter can take 4 different values that will affect how the results will be grouped.
Example of URL directory with folder:
Example URL: ahrefs.com/api/
exact: ahrefs.com/api/
domain: ahrefs.com/*
subdomains: *ahrefs.com/*
prefix: ahrefs.com/api/*
Example of URL directory with subdomain:
Example URL: apiv2.ahrefs.com
exact: apiv2.ahrefs.com/
domain: apiv2.ahrefs.com/*
subdomains: *apiv2.ahrefs.com/*
prefix: apiv2.ahrefs.com/*
3. "order_by"
parameter is a character string that forces sorting of the results. Structure:
Structure: "column_name
:asc|desc"
Single column example: "first_seen:asc" ~ this sorts results by first_seen
column in ascending order
Multi column example: "last_seen:desc,first_seen:asc" ~ this sorts results
by 1) last_seen
column in descending order, and next by 2) first_seen
column in
ascending order
4. "where"
& "having"
are EXPERIMENTAL parameters of condition sets
(character strings) that control filtering the results. To create arguments:
use rah_condition()
function to create a single condition, for example:
cond_1 <- rah_condition(column_name = "links", operator = "GREATER_THAN", value = "10")
use rah_condition_set()
function to group single conditions into final condition
string, for example: fin_cond <- rah_condition_set(cond_1, cond_2)
provide final condition to proper report function as a parameter, for example:
RAhrefs::rah_pages(target = "ahrefs.com", token = "0123456789",
mode = "domain", metrics = NULL, limit = 1000, where = fin_cond, order_by = "first_seen:asc")
Other Ahrefs reports: rah_ahrefs_rank
,
rah_anchors_refdomains
,
rah_anchors
,
rah_backlinks_new_lost_counters
,
rah_backlinks_new_lost
,
rah_backlinks_one_per_domain
,
rah_backlinks
,
rah_broken_backlinks
,
rah_broken_links
,
rah_domain_rating
,
rah_linked_anchors
,
rah_linked_domains_by_type
,
rah_linked_domains
,
rah_metrics_extended
,
rah_metrics
,
rah_pages_extended
,
rah_pages_info
,
rah_refdomains_by_type
,
rah_refdomains_new_lost_counters
,
rah_refdomains_new_lost
,
rah_refdomains
, rah_refips
,
rah_subscription_info
# NOT RUN {
# creating single conditions for 'where' parameter
cond_1 <- RAhrefs::rah_condition(
column_name = "ahrefs_rank",
operator = "GREATER_OR_EQUAL",
value = "10")
cond_2 <- RAhrefs::rah_condition(
column_name = "size",
operator = "GREATER_THAN",
value = "2048000")
# joining conditions into one condition set
cond_where <- RAhrefs::rah_condition_set(cond_1, cond_2)
# downloading
b <- RAhrefs::rah_pages(
target = "ahrefs.com",
limit = 2,
where = cond_where,
order_by = "ahrefs_rank:desc")
# }
Run the code above in your browser using DataLab