These are extra datasets that aren't in the main dataset folder, but might be useful for teachers. This is from the 2026 set of extra data
These can be accessed from the extradata folder: https://grapher.nz/?folder=2026extradata
This data covers the time period: 01/10/2024 to 30/09/2025 and is sourced from data.govt.nz. The reason for this date range rather than calendar or fiscal year is because October is the Celebrant Renewal date.
It only includes data from Independent Marriage and Civil Union Celebrants.
Variable & Description:
Name: The name of the celebrant
Island: Determined from the post code if provided
Post Code: Note: location data is subject to manual entry/collection.
Total Ceremonies performed: Total number of marriage / civil union ceremonies performed
Registry ceremonies performed: Number of registry marriage / civil union ceremonies performed
Data is sourced from DIA: https://www.dia.govt.nz/Dog-Control and lists the council and the number of registered dogs.
Variable & Description:
Council: The council from which the data is sourced
Registered Dogs: Total number of reigstered dogs
Female: Number of female dogs
Male: Number of male dogs
De-Sexed: Number of dogs de-sexed
Micro Chipped: Number of dogs microchipped
Dog-related injury: Number of dog related injuries reported to ACC
Pure Bred: Number of pure pred dogs
Cross Bred: Number of cross--bred dogs
By Breed: Columns for each purebred dog
Beagle
Bichon Frise
Boxer
Cattle, Australian
Chihuahua, Long Coat
Chihuahua, Smooth Coat
Collie, Bearded
Collie, Border
Collie, Rough
Collie, Smooth
Dogo Argentino
Greyhound
Heading
Huntaway Maltese
Poodle, Miniature
Poodle, Toy
Retriever, Golden
Retriever, Labrador
Rottweiler
Schnauzer, Miniature
Shepherd, German
Shih Tzu
Spaniel, Cavalier
King Charles
Spaniel, Cocker
Spaniel, English Springer
Terrier, American Pit Bull
Terrier, Fox (Smooth)
Terrier, Jack Russell
Terrier, Staffordshire Bull
Terrier, West Highland White
zz_other (pure)
This dataset contains recorded observations for a sample from a fictional colony of yellow-eyed penguins (hoiho - megadyptes antipodes). It provides multivariate data suitable for exploring relationships between physical traits, gender differences, and age distributions.
Variable & Description:
Gender: The biological sex of the penguin (Male or Female).
Weight (kg): The weight of the penguin measured in kilograms.
Height (cm): The standing height of the penguin measured in centimeters.
Age: The age of the penguin in years.
Teacher Notes
This is a synthetic dataset generated programmatically to simulate realistic biological trends while ensuring clean data for classroom use. It was not collected from fieldwork.
Sexual Dimorphism: The data was generated with different parameters for males and females. Males are, on average, taller and heavier than females, allowing for box plot comparisons.
Correlation: Weight was generated as a function of height with added random noise (Gaussian). This results in a moderate-to-strong positive correlation r approx 0.65, making the dataset ideal for teaching scatter plots and linear regression.
Distributions:
Height and Weight follow roughly normal distributions.
Age follows a uniform distribution (ranging from 1 to 20 years).
Data is sourced and combined from Wikipedia, namely: List of countries by divorce rate and List of countries by age at first marriage and contains a sample of 100 countries.
Variable & Description:
Country/region: The country / region the data is for
Continent: The continent the country / region is in
Marriage Rate: Number of marriages per 1,000 population / year
Divorce Rate: Number of divorces per 1,000 population / year
Ratio (%): Number of Divorces ÷ Number of Marriages
Divorce Data Source Year: The year the data is from
Age at First Marriage - Men: Mean age at first marrage for men
Age at First Marriage - Women: Mean age at first marrage for women
Average Age at First Marriage: Mean age at first marrage for everyone getting married
Age gap: The age gap between men and women at first marriage
Age ratio: The ratio between men and women
Age Data Source Year: The year the data is from
This data was sourced January 2026
This data is sourced from MPI in January 2026 and covers the fishing year from 2024-2025. As this is summary data it is best used with the "Bar Chart - Summary Data" option. October fishing year runs from 1 October to 30 September.
Variable & Description:
Fisheries Management Area: The New Zealand EEZ is divided into 10 fisheries management units. They can be viewed in the National Aquatic BioDiversity System (NABIS) under 'Fisheries Management Areas' – 'General FMAs'
Fishing Method Category: Fishing method employed at time of NFPS interaction.
Seabird Capture Type: How the species was captured or interacted with
Species Category: Non-Fish or Protected Species categories are Fish, Birds, Marine Mammals and Reptiles
Species Code - Species Common Name: For a complete list of NFPS codes, refer to the currently in force Fisheries (E-logbook Users Instructions and Codes) Circular here https://www.mpi.govt.nz/fishing-aquaculture/commercial-fishing/fisheries-change-programme/digital-monitoring-resources
Attribute:
Alive Uninjured - the interaction results in the species being released alive and uninjured.
Alive Injured - the interaction results in the species being caught alive but it is injured.
Dead - the interaction results in the species being caught dead.
Value: The number of species where there is an interaction
Note: This dataset excludes Cnidaria (Corals) and Other (grouped sponges/bryozoans/corals and individual bryozoans).
The goal of this dataset is to predict who might have survived the titanic distaster. There was certainly an element of luck involved in surviving, it seems some groups of people were more likely to survive than others.
Source: https://www.kaggle.com/datasets/sakshisatre/titanic-dataset/data
Variable & Description:
Pclass: Ticket class indicating the socio-economic status of the passenger. It is categorized into three classes: 1 = Upper, 2 = Middle, 3 = Lower.
Survived: A binary indicator that shows whether the passenger survived (1) or not (0) during the Titanic disaster. This is the target variable for analysis.
Name: The full name of the passenger, including title (e.g., Mr., Mrs., etc.).
Sex: The gender of the passenger, denoted as either male or female.
Age: The age of the passenger in years.
SibSp: The number of siblings or spouses aboard the Titanic for the respective passenger.
Parch: The number of parents or children aboard the Titanic for the respective passenger.
Ticket: The ticket number assigned to the passenger.
Fare: The fare paid by the passenger for the ticket.
Cabin: The cabin number assigned to the passenger, if available.
Embarked: The port of embarkation for the passenger. It can take one of three values: C = Cherbourg, Q = Queenstown, S = Southampton.
Boat: If the passenger survived, this column contains the identifier of the lifeboat they were rescued in.
Home.dest: The destination or place of residence of the passenger.
The Tuatara (Sphenodon punctatus) is a "living fossil," the last survivor of an order of reptiles that thrived in the age of the dinosaurs. Once found all over New Zealand, they are now restricted to predator-free offshore islands. This dataset comes from a health survey conducted on two famous sanctuaries: Stephens Island (Takapourewa) in the Cook Strait and Little Barrier Island (Hauturu) in the Hauraki Gulf. Stephens Island has one of the highest densities of Tuatara in the world, which can lead to higher competition and parasite transmission. Little Barrier is a larger, more diverse forest ecosystem. Researchers are investigating whether the overcrowding on Stephens Island affects the physical condition and parasite load of the animals compared to their cousins in the north.
Variable & Description:
Island: Stephens Island or Little Barrier.
birthSex: Male or Female.
Tail Status: Original or Regrown (Tuatara drop tails when threatened; regrown tails are often shorter and discoloured).
Snout-Vent Length (mm): Body length from nose to vent (excluding tail).
Total Length (mm): Full length including the tail.
Weight (g): Total mass.
Parasite Load: Count of ticks (tuatara tick, Amblyomma sphenodonti).
Teacher Notes
This is a synthetic dataset generated programmatically to simulate realistic biological trends while ensuring clean data for classroom use. It was not collected from fieldwork.
Outliers: Students should notice that animals with "Regrown" tails have a much smaller Total Length relative to their Weight compared to those with "Original" tails.
Island Effect: Tuatara from Stephens Island have been generated to be slightly larger and have a higher parasite load (density effect) compared to Little Barrier.
Correlations: Strong positive correlation between Snout-Vent Length and Weight.