Data comes from http://www.openarchives.org/Register/BrowseSites. It includes the oai-identifier (if they have one) and the base URL. The website has the name of the data provider too, but not provided in the data pulled down here, but you can grab the name using the example below.
update_providers(path = ".", ...)
Path to put data in.
Curl options passed on to httr::GET()
This table is scraped from http://www.openarchives.org/Register/BrowseSites. I would get it from http://www.openarchives.org/pmh/registry/ListFriends, but it does not include repository names.
This function updates the table for you. Does take a while though, so go get a coffee.
load_providers()
if (FALSE) {
update_providers()
load_providers()
}
Run the code above in your browser using DataLab