Access to the Guardian open API (https://open-platform.theguardian.com/), containing all articles published in the Guardian from 1999 to the present. An API key is required; you can register for one on the Open Platform site linked above.
Installation
guardianapi is available on CRAN:
install.packages("guardianapi")
You can install the development version of guardianapi from GitHub with:
# install.packages("devtools")
devtools::install_github("evanodell/guardianapi")
Usage
Save your API key as the environment variable GU_API_KEY, or use the gu_api_key() function to set a key for a single session.
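As a minimal sketch of the environment-variable route, the key can be set and checked with base R alone. The key value below is a placeholder, not a real key, and this only illustrates the variable name given above; it makes no claim about how the package reads the variable internally.

```r
# Set the key for the current R session only.
# "your-api-key-here" is a placeholder; substitute your own key.
Sys.setenv(GU_API_KEY = "your-api-key-here")

# Confirm that the variable is visible to this session.
key <- Sys.getenv("GU_API_KEY")
key_is_set <- nzchar(key)
key_is_set
```

To keep the key across sessions, one common approach is to add a line of the form GU_API_KEY=your-api-key-here to your .Renviron file and restart R.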
guardianapi contains functions to query and retrieve articles, tags, editions and sections. For example, return all tags containing the string "apple", which includes both the technology company and Fiona Apple, amongst others.
library(guardianapi)
tags1 <- gu_tags(query = "apple")
#> Retrieving page 1
tibble::glimpse(tags1)
#> Observations: 10
#> Variables: 13
#> $ id <chr> "technology/apple", "technology/apple-tv", "tec…
#> $ type <chr> "keyword", "keyword", "keyword", "contributor",…
#> $ section_id <chr> "technology", "technology", "technology", NA, "…
#> $ section_name <chr> "Technology", "Technology", "Technology", NA, "…
#> $ web_title <chr> "Apple", "Apple TV", "Apple WWDC", "Emily Apple…
#> $ web_url <chr> "https://www.theguardian.com/technology/apple",…
#> $ api_url <chr> "https://content.guardianapis.com/technology/ap…
#> $ references <list> [<data.frame[1 x 2]>, <data.frame[0 x 0]>, <da…
#> $ bio <chr> NA, NA, NA, "<p>Emily Apple is the co-founder o…
#> $ byline_image_url <chr> NA, NA, NA, "https://static.guim.co.uk/sys-imag…
#> $ first_name <chr> NA, NA, NA, "apple", NA, NA, NA, NA, NA, "chia-…
#> $ last_name <chr> NA, NA, NA, "emily", NA, NA, NA, NA, NA, "(appl…
#> $ description <chr> NA, NA, NA, NA, NA, NA, NA, "Guardian journalis…
Return all articles with the string "relationships" and the tag "lifeandstyle/sex" from December 2018:
relations_sex <- gu_content(query = "relationships", from_date = "2018-12-01",
to_date = "2018-12-31", tag = "lifeandstyle/sex")
#> Retrieving page 1
tibble::glimpse(relations_sex)
#> Observations: 5
#> Variables: 40
#> $ id <chr> "lifeandstyle/2018/dec/10/im-in…
#> $ type <chr> "article", "article", "article"…
#> $ section_id <chr> "lifeandstyle", "lifeandstyle",…
#> $ section_name <chr> "Life and style", "Life and sty…
#> $ web_publication_date <dttm> 2018-12-10 08:00:39, 2018-12-2…
#> $ web_title <chr> "I’m in a relationship with ano…
#> $ web_url <chr> "https://www.theguardian.com/li…
#> $ api_url <chr> "https://content.guardianapis.c…
#> $ tags <list> [<data.frame[10 x 13]>, <data.…
#> $ is_hosted <lgl> FALSE, FALSE, FALSE, FALSE, FAL…
#> $ pillar_id <chr> "pillar/lifestyle", "pillar/lif…
#> $ pillar_name <chr> "Lifestyle", "Lifestyle", "Life…
#> $ headline <chr> "I’m in a relationship with ano…
#> $ standfirst <chr> "We kiss and cuddle, but he won…
#> $ trail_text <chr> "We kiss and cuddle, but he won…
#> $ byline <chr> "Pamela Stephenson Connolly", "…
#> $ main <chr> "<figure class=\"element elemen…
#> $ body <chr> "<p><strong>Until last year, I …
#> $ newspaper_page_number <chr> "7", "66", "83", NA, "44"
#> $ wordcount <chr> "387", "307", "2189", "759", "1…
#> $ comment_close_date <dttm> 2018-12-13 08:00:39, 2018-12-2…
#> $ commentable <chr> "true", "true", "true", "false"…
#> $ first_publication_date <dttm> 2018-12-10 08:00:39, 2018-12-2…
#> $ is_inappropriate_for_sponsorship <chr> "false", "false", "false", "fal…
#> $ is_premoderated <chr> "true", "true", "true", "false"…
#> $ last_modified <chr> "2018-12-10T08:00:39Z", "2018-1…
#> $ newspaper_edition_date <date> 2018-12-10, 2018-12-22, 2018-1…
#> $ production_office <chr> "UK", "UK", "UK", "UK", "UK"
#> $ publication <chr> "The Guardian", "The Guardian",…
#> $ short_url <chr> "https://gu.com/p/a5fad", "http…
#> $ should_hide_adverts <chr> "false", "false", "false", "fal…
#> $ show_in_related_content <chr> "true", "true", "true", "true",…
#> $ thumbnail <chr> "https://media.guim.co.uk/35fd3…
#> $ legally_sensitive <chr> "false", "false", "false", "fal…
#> $ sensitive <chr> "true", NA, NA, NA, "true"
#> $ lang <chr> "en", "en", "en", "en", "en"
#> $ body_text <chr> "Until last year, I identified …
#> $ char_count <chr> "2192", "1729", "12190", "4404"…
#> $ should_hide_reader_revenue <chr> "false", "false", "false", "fal…
#> $ show_affiliate_links <chr> "false", "false", "false", "fal…
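The output above shows several numeric and true/false fields (for example wordcount and commentable) returned as character columns. A minimal sketch of converting them with base R follows. It uses stand-in vectors holding only the first four values visible in the output, so it runs without an API key; the same calls could presumably be applied to the columns of relations_sex itself.

```r
# Stand-in vectors: the first four values visible in the glimpse() output above.
wordcount <- c("387", "307", "2189", "759")
commentable <- c("true", "true", "true", "false")

# Convert character columns to more convenient types.
wordcount_int <- as.integer(wordcount)
commentable_lgl <- as.logical(commentable)  # as.logical() accepts "true"/"false"

mean(wordcount_int)

# Applied to the real result, the equivalent would be (not run here):
# relations_sex$wordcount <- as.integer(relations_sex$wordcount)
# relations_sex$commentable <- as.logical(relations_sex$commentable)
```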
See the full package documentation at docs.evanodell.com/guardianapi.
Please note that the 'guardianapi' project is released with a Contributor Code of Conduct. By contributing to this project, you agree to abide by its terms.
This package is not affiliated with or supported by The Guardian.