Assorted experiments and examples working with Trove’s digitised newspapers
Tips, tools, and examples¶
Visualise Trove newspaper searches over time¶
This notebook helps you zoom out and explore how the number of Trove newspaper articles in your search results varies over time by using the decade and year facets. We then combine this approach with other search facets to see how we can slice a set of results up in different ways to investigate historical changes.
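As a rough illustration of the facet approach, the sketch below builds the parameters for a year-facet request and turns a response into sorted year/count pairs. The parameter names and the response layout are assumptions based on version 2 of the Trove API, and the function names are hypothetical, so check them against the API documentation before relying on them.

```python
def facet_params(query, decade, api_key="YOUR_API_KEY"):
    """Build parameters for a year-facet request limited to one decade.

    Assumption: year facets are only returned once a decade is selected,
    so a full timeline needs one request per decade.
    """
    return {
        "q": query,
        "zone": "newspaper",
        "facet": "year",
        "l-decade": str(decade),  # e.g. 191 for 1910-1919
        "n": 0,                   # facet counts only, no article records
        "encoding": "json",
        "key": api_key,
    }


def year_counts(data):
    """Extract sorted (year, count) pairs from a facet response.

    Assumption: terms live at response > zone[0] > facets > facet > term.
    Returns an empty list if the response has no facets in that position.
    """
    try:
        terms = data["response"]["zone"][0]["facets"]["facet"]["term"]
    except (KeyError, IndexError, TypeError):
        return []
    return sorted((int(t["display"]), int(t["count"])) for t in terms)
```

Looping over the decades of interest and concatenating the results of year_counts would give a list that can be passed straight to a charting library.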
Visualise the total number of newspaper articles in Trove by year and state¶
Trove currently includes more than 200 million digitised newspaper articles published between 1803 and 2015. In this notebook we explore how those newspaper articles are distributed over time, and by state.
Map Trove newspaper results by state¶
Uses the Trove state facet to create a choropleth map that visualises the number of search results per state.
Map Trove newspaper results by place of publication¶
Uses the Trove title facet to find the number of results per newspaper, then merges the results with a dataset of geolocated newspapers to map where articles were published.
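The merge step could look something like the sketch below, which joins per-title counts to a lookup of newspaper locations and totals the counts for each place. The field names ("place", "lat", "lon") and the function name are hypothetical; the real geolocated dataset may be structured differently.

```python
from collections import defaultdict


def counts_by_place(title_counts, title_places):
    """Sum article counts for each place of publication.

    title_counts: {title_id: number_of_results} from the title facet
    title_places: {title_id: {"place": str, "lat": float, "lon": float}}
    Titles with no known location are skipped.
    """
    totals = defaultdict(int)
    for title_id, count in title_counts.items():
        place = title_places.get(title_id)
        if place is None:
            continue
        # Several newspapers can share a place, so key on place + coordinates
        totals[(place["place"], place["lat"], place["lon"])] += count
    return dict(totals)
```

Each key in the result carries the coordinates needed to plot a marker, and each value gives a number to size or colour it by.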
Map Trove newspaper results by place of publication over time¶
Adds a time dimension to the examples in the previous notebook to create an animated heatmap.
Today’s news yesterday¶
Uses the date index and the firstpageseq parameter to find articles from exactly 100 years ago that were published on the front page. It then selects one of the articles at random and downloads and displays an image of the front page.
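A minimal sketch of how such a query string might be assembled is shown below. It assumes that a single day is selected with a date range running from the previous day to the target day, and that firstpageseq:1 limits results to front pages; both are assumptions to verify against the Trove API documentation, and the function names are hypothetical.

```python
from datetime import date, timedelta


def century_ago(today):
    """Return the same calendar day 100 years earlier."""
    try:
        return today.replace(year=today.year - 100)
    except ValueError:
        # 29 February with no leap day a century earlier: fall back to the 28th
        return today.replace(year=today.year - 100, day=28)


def front_page_query(today):
    """Build a query for front-page articles published 100 years before `today`."""
    target = century_ago(today)
    day_before = target - timedelta(days=1)
    return (
        "firstpageseq:1 "
        f"date:[{day_before.isoformat()}T00:00:00Z TO {target.isoformat()}T00:00:00Z]"
    )
```

Calling front_page_query(date.today()) gives a string that could be passed as the q parameter of a newspaper search; picking one result with random.choice would complete the selection step.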
Create a Trove OCR corrections ticker¶
Uses the has:corrections parameter to get the total number of newspaper articles with OCR corrections, then displays the results, updating every five seconds.
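The update loop behind a ticker like this can be sketched as follows. Here fetch_total is a hypothetical stand-in for whatever function sends the has:corrections search and returns the total; it is passed in so the loop itself stays independent of the API details.

```python
import time


def run_ticker(fetch_total, show, interval=5, updates=3, sleep=time.sleep):
    """Fetch and display a total repeatedly, pausing between updates.

    fetch_total: callable returning the current total (hypothetical API wrapper)
    show: callable that displays a value, e.g. print
    interval: seconds to wait between updates
    updates: how many times to refresh before stopping
    """
    for i in range(updates):
        show(fetch_total())
        if i < updates - 1:
            sleep(interval)  # no need to wait after the final update
```

For example, run_ticker(get_corrections_total, print) would print three totals five seconds apart, given a suitable get_corrections_total function.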
Save a Trove newspaper article as an image¶
This notebook grabs the page on which an article was published, and then crops the page image to the boundaries of the article. The result is an image which presents the article as it was originally published.
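The cropping step depends on knowing the article's boundaries. As a sketch, suppose the article is made up of one or more rectangular zones, each given as (left, top, right, bottom) in page coordinates. How those zones are obtained is not covered here, and the function names are hypothetical. The crop box is then the smallest rectangle containing every zone, scaled to match the resolution of the downloaded page image.

```python
def bounding_box(zones):
    """Return the smallest (left, top, right, bottom) box containing all zones."""
    if not zones:
        raise ValueError("at least one zone is required")
    lefts, tops, rights, bottoms = zip(*zones)
    return (min(lefts), min(tops), max(rights), max(bottoms))


def scale_box(box, factor):
    """Scale a box to match an image that is `factor` times larger or smaller."""
    return tuple(round(value * factor) for value in box)
```

The resulting tuple is in the order expected by the crop method of a Pillow image, so it could be used as page_image.crop(box) if Pillow is the imaging library in use.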
Upload Trove newspaper articles to Omeka-S¶
This notebook steps through the process of uploading Trove newspaper articles to your own Omeka-S instance via the API. As well as uploading the article metadata, it attaches image(s) and PDFs of the articles, and creates a linked record for the publishing newspaper. The source of the articles can be a Trove search, a Trove list, a Zotero collection, or just a list of article ids.
Beyond the copyright cliff of death¶
Most of the newspaper articles on Trove were published before 1955, but there are some from the later period. Let's find out how many, and which newspapers they were published in.
Download a page image¶
The Trove web interface doesn’t provide a way of getting high-resolution page images from newspapers. This simple app lets you download page images as complete, high-resolution JPG files.
Generate an article thumbnail¶
Generate a nice square thumbnail image for a newspaper article.
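One possible way to choose the square region is sketched below: take the largest square that fits, aligned to the top of the image so that a headline is likely to be included, and centred horizontally if the image is wider than it is tall. This is an illustrative choice, not necessarily the one the notebook makes.

```python
def square_crop_box(width, height):
    """Return a (left, top, right, bottom) box for a top-aligned square crop."""
    side = min(width, height)
    left = (width - side) // 2  # centre horizontally when the image is wide
    return (left, 0, left + side, side)
```

After cropping to this box, the image can be resized to the desired thumbnail dimensions.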
Make composite images from lots of Trove newspaper thumbnails¶
This notebook starts with a search in Trove's newspapers. It uses the Trove API to work its way through the search results. For each article it creates a thumbnail image using the code from the article thumbnail notebook above. Once this first stage is finished, you have a directory full of thumbnails. The next stage takes all those thumbnails and pastes them one by one into a BIG image to create a composite, or mosaic.
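The pasting stage comes down to working out where each thumbnail goes. A minimal sketch, assuming square thumbnails of equal size laid out left to right and top to bottom (the function names are hypothetical):

```python
def paste_positions(n_thumbs, thumb_size, columns):
    """Return the (x, y) of the top-left corner for each thumbnail in a grid."""
    return [
        ((i % columns) * thumb_size, (i // columns) * thumb_size)
        for i in range(n_thumbs)
    ]


def canvas_size(n_thumbs, thumb_size, columns):
    """Return the (width, height) of a canvas big enough to hold every thumbnail."""
    rows = -(-n_thumbs // columns)  # ceiling division
    return (columns * thumb_size, rows * thumb_size)
```

With an imaging library such as Pillow, you would create a blank image of canvas_size and paste each thumbnail at its corresponding position.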
QueryPic is a tool I created many years ago to visualise searches in Trove's digitised newspapers. It shows you the number of articles each year that match your query — instead of a page of search results, you see the complete result set. You can look for patterns and trends across time. This is a deconstructed, extended, and hackable version of QueryPic.
Get a list of Trove newspapers that doesn't include government gazettes¶
The Trove API includes an option to retrieve details of digitised newspaper titles. Version 2 of the API added a separate option to get details of government gazettes. However, the original newspaper/titles request actually returns both the newspaper and gazette titles, so there's no direct way of getting just the newspaper titles. This notebook explains the problem and provides a simple workaround.
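One plausible form of the workaround, sketched under the assumption that both title lists have already been retrieved and that each title record has an "id" field, is to remove any title that also appears in the gazette list. The function name and sample record layout are hypothetical.

```python
def newspapers_only(all_titles, gazette_titles):
    """Return the titles from all_titles that are not government gazettes.

    all_titles: records from the newspaper titles request (newspapers + gazettes)
    gazette_titles: records from the gazette titles request
    """
    gazette_ids = {title["id"] for title in gazette_titles}
    return [title for title in all_titles if title["id"] not in gazette_ids]
```

Using a set of ids keeps the comparison fast even when there are many titles.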