not_news

url_list

<p>Based on a slightly amended version of the regular expression used to classify news, and non-news in:
``Exposure to ideologically diverse news and opinion on Facebook''
by Bakshy, Messing, and Adamic. Science. 2015.</p>

Get the category of content hosted by a domain. Use Shallalist <http://shalla.de/>,
Virustotal (which provides access to lots of services) <https://www.virustotal.com/>,
Alexa <https://aws.amazon.com/awis/>, DMOZ <https://curlie.org/>, University Domain list
<https://github.com/Hipo/university-domains-list> or validated machine learning
classifiers based on Shallalist data to learn about the kind of content hosted by a domain.

Gaurav Sood

not_news: Classify News and Non-News Based on keywords in the URL

Description

Usage

Arguments

Value

Details

References

Examples