OTHER
Alexander Turnbull Library: Harvested Twitter data relating to Ihumātao
- Date
- 24 July - 19 August 2019
- By
- Alexander Turnbull LibrarySave Our Unique Landscape (SOUL); Twitter Inc (Firm)
- Reference
- ATL-Group-00585
- Description
Twitter crawl conducted by staff at the Alexander Turnbull Library relating to the Ihumātao protest.
This dataset contains 54,868 tweets collected between 24 July - 19 August 2019 from the public Twitter API using Twarc.
The dataset contains harvested Tweets (Twitter JSON data) and associated image files. Also includes Tweet IDs, and a ReadMe text file that details the process to capture the Tweets and the tools used. Twarc retrieves content going back several days, so some tweets prior to July 24 will have been captured.
In 2014 Auckland City, using the Special Housing Areas Act, designated 32 hectares adjacent to the Ōtuataua Stonefields Historic Reserve as a special housing area. This land, known as Puketāpapa, was confiscated under the New Zealand Settlements Act in 1863. 151 years later, the land was sold to Fletcher Residential Limited for the purpose of building a low-density subdivision with 480 high-price dwellings on the unique heritage landscape. In 2016, Pania Newton along with five cousins and other supporters, formed the group Save Our Unique Landscape (SOUL) to protest the development of land at Ihumātao in south Auckland. Since November 2016, kaitiaki peacefully occupied the land whilst campaigning to #protectihumatao.
On Tuesday 23 July, New Zealand Police arrived at Ihumātao to issue eviction notices. As a result, thousands of kaitiaki/protectors came from across Aotearoa and the world to support the struggle to reclaim the whenua, both in person at Ihumātao, as well as via social media and online activism.
Quantity: 1 data set(s). 1885 digital image(s). 5 Electronic document(s).
Processing information: The data from the crawls was combined and deduplicated to reveal a total of 54,868 Tweets over the crawl period. Images in the dataset were harvested separately by the Library using a Python script. The Library also created CSV access copies from the JSON files.
The Library unshortened shortened URLs in the Tweets, and conducted a WARC crawl to captured HTML pages referred to in the Tweets.
- Additional description
JSON dataset requires computational methods for access (eg. Python and script editor). Access copies for some material are available as text and csv files.
- Access restrictions
- Partly restricted material - Some digital material only available in the Katherine Mansfield Reading Room.
- Format
- 1 data set(s), 1885 digital image(s), 5 Electronic document(s), Data sets, Social media
Click to request to view this item, access digital version (if available), and see more information.
Copyright
UnknownReadMe text file for the Twitter harvest relating to Ihumātao
Date: July 2019
From: Alexander Turnbull Library: Harvested Twitter data relating to Ihumātao
Reference: WADL-0032
Description: ReadMe text file documenting the Library's search criteria and tools used to generate the Library's Twitter harvest relating to Ihumātao. Quantity: 1 Electronic document(s).
Tweet ID text file for the Twitter harvest relating to Ihumātao
Date: July 2019
From: Alexander Turnbull Library: Harvested Twitter data relating to Ihumātao
Reference: WADL-0033
Description: Text file containing a list of all Tweet IDs from the Library's Twitter crawl relating to Ihumātao in July 2019. This dataset can be reconstituted (rehydrated) from the Twitter Developer API using the tweet IDs. Tools for doing so include Twarc, Social Feed Manager, and DocNow, all available on Github. Because tweets may be deleted or made private, hydrating from the tweet IDs may not produce the same dataset. Quantity: 1 Electronic document(s).
Tweet objects JSON file and Twitter crawl hashed and unhashed text file
Date: July 2019
From: Alexander Turnbull Library: Harvested Twitter data relating to Ihumātao
Reference: WADL-0035
Description: Tweet objects Java Script Object Notation (JSON) file for the Library's Twitter harvest relating to Ihumātao in July 2019. JSON file generated by the Twitter API containing attribute data for tweets, including author, message, unique ID, timestamp, and geolocation data, if shared by the user, plus text file access copies of the data genereated by the Library, hashed and unhashed. Quantity: 3 Electronic document(s).
Image files for the Twitter harvest relating to Ihumātao
Date: July 2019
From: Alexander Turnbull Library: Harvested Twitter data relating to Ihumātao
Reference: WADL-0034
Description: Individual image and media files from harvested Tweets from the Library's Twitter harvest relating to Ihumātao. Images include memes, quoted text and photographs. Photographs show personalities, politicians, and various scenes at the Ihumātao occupation, and photographs taken at other places places, including Auckland City, and Parliament grounds, Wellington. Quantity: 1885 digital image(s).