This is a list of what all the variables are and where the dataset is from for the built in datasets on NZGrapher. You can always load your own data, or have a school folder with data that your school has uploaded. Later datasets I have been adding are in the Extra Datasets.
The data on 189 births were collected at Baystate Medical Center, Springfield, Mass. during 1986. The goal of this study was to identify risk factors associated with giving birth to a low birth weight baby (weighing less than 2500 grams). Data was collected on 189 women, 59 of which had low birth weight babies and 130 of which had normal birth weight babies.
Variable & Description
LowBirthWeight: No = Birth Weight >= 2500g, Yes = Birth Weight < 2500g
MothersAge: Age of the Mother in Years
Race: Race of the mother
MotherSmoke: Smoking Status During Pregnancy
FTV: Number of Physician Visits During the First Trimester
BirthWeight: Birth Weight in Grams
Data was recorded of students going to the school ball in 2012 as to how much they spent on their clothing and accessories.
Variable & Description
Gender: Boy = new student is male, Girl = new student is female
Amount.spent: The amount spent on clothing and accessories in New Zealand Dollars.
With rising costs of owning and running a car, and environmental awareness, buyers are becoming more conscious of the features when purchasing new cars. The data supplied is for new vehicles sold in America in 1993.
Variable & Description
Vehicle Name
Origin: Country of manufacture (America, Foreign)
Price: US $1000
Type: Small, midsize, large, compact, sporty, van
City: City MPG (miles per gallon by EPA rating)
OpenRoad: Highway MPG
Drive Train: Front Wheel Drive, Rear Wheel Drive
Engine Size: Size in litres
Manual Transmission: Yes, No
Weight: Weight of car in Kg
Every diamond is unique, and there are a variety of factors which affect the price of a diamond. Insurance companies in particular are concerned that stones are valued correctly. Data on 236 round diamond stones was collected from a Singapore based retailer of diamond jewellery, who had the stones valued.
Variable & Description
Carat: Weight of diamond stones in carat units 1 carat = 0.2 grams
Colour: Numerical value given for quality of colour ranging from 1=colourless to 6=near colourless
Clarity: Average = score 1, 2 or 3, Above average = score 4, 5 or 6
Lab: Laboratory that tested & valued the diamond (1 = laboratory 1, 2 = laboratory 2)
Price: Price in US dollars
This is a blank dataset designed for entering experimental data.
A sample of kiwi birds around New Zealand was collected in order to help with conservation efforts. The original data is from: http://www.kiwisforkiwi.org/ and was sourced from the secondary school guides.
Variable & Description
Species: GS-Great Spotted, NIBr-NorthIsland Brown, Tok-Southern Tokoeka
Gender: M-Male, F-Female
Weight(kg): The weight of the kiwi bird in kg. (Note: while this variable is called 'Weight', it is actually mass)
Height(cm): The height of the kiwi bird in cm
Location: NWN-North West Nelson, SF-South Fiordland, CW-Central Westland, N-Northland, EC-Eastern Canterbury, E-East North Island, StI-Stewart Island, W-West North Island, NF-North Fiordland
Teachers note: this is a synthesised dataset based on real data. At the time of creating the data set there were around 25,000 brown, 17,000 great spotted and 34,500 southern tokoeka. These numbers formed the basis of the data set, but instead of being out of around 76,000 the data set contains around 700 birds.
The data was generated using the population parameters, including gender, location, height, weight and species in Fathom. The size of the population was so that it was too big to use all the data (when doing by hand) but not too big that it couldn’t be created for students to use as a “population” to sample from.
The data is a sample taken from marathons in NZ. It is a simple random sample of 200 athletes.
Variable & Description
Minutes: How many minutes they completed the marathon in
Gender: Male (M) or Female (F)
AgeGroup: Younger (under 40) or older (over 40)
StridelengthCM: The persons average stride length over the marathon in cm.
The data is real data and comes from http://www.rugby-sidestep-central.com/
Variable & Description
Country: New Zealand or South Africa
Position: Forward or Back
Weight: The weight of the player in kilograms (kg)
Height: The height of the player in metres (m)
The data is real and comes from the MPI centralised observer database.
http://www.fish.govt.nz/minz/Research+Services/Research+Database+Documentation/Cod/default.htm
Variable & Description
Calendar Year: Year which the data was recorded in
Fish Sex: The gender of the shark
Total Length: The total length of the shark measured in centimetres
Fork Length: The fork length of the shark measured in centimetres
The data is real data and comes from http://www.statsci.org/data/oz/ais.html. The data set provides information about 102 male athletes and 100 female athletes at the Australian Institute of Sport. You can read the full paper here.
Variable & Description
Sex: male or female
Sport: sport played
Ht: height in cm
Wt: weight in kg
LBM: lean body mass in kg
%Bfat: % body fat
BMI: body mass index (weight/height2)
RCC: red blood cell count
WCC: white blood cell count
Hc: haematocrit
Hg: haemoglobin
Ferr: plasma ferritin concentration
SSF: sum of skin folds
All datasets prefixed with a TS that are preloaded in NZGrapher are time series datasets, and not particularly well suited to bivariate or multivariate analysis.
Data on the number of births and deaths in New Zealand. The data is sourced from Statistics New Zealand.
Variable & Description
Quarter: Quarterly
Male Live Births: Number of males born during the quarter
Female Live Births: Number of females born during the quarter
Male Deaths: Number of male deaths during the quarter
Female Deaths: Number of female deaths during the quarter
The volume of wood removed from different types of forests in New Zealand. The data is sourced from the Ministry for Primary Industries.
Variable & Description
Quarter: Quarterly
Natural Forests: The volume of wood removed from Natural Forests in millions of m3
Plantation Forests: The volume of wood removed from Plantation Forests in millions of m3
Information on imports to and from New Zealand. The data is sourced from Statistics New Zealand.
Variable & Description
Month: Monthly
TotalAirportsCIF: Cost, insurance, and freight of imported goods in NZ$(000)
TotalParcelPostCIF: Cost, insurance, and freight of imported goods in NZ$(000)
TotalSeaportsCIF: Cost, insurance, and freight of imported goods in NZ$(000)
TotalAirportsWeight: Weight of imported goods in tonnes
TotalParcelPostWeight: Weight of imported goods in tonnes
TotalSeaportsWeight: Weight of imported goods in tonnes
The number of people in employment in New Zealand. The data is sourced from Statistics New Zealand.
Variable & Description
Month: Monthly
Total Filled Jobs: The number of jobs that are filled in millions
NZ Population: The population of New Zealand in millions
Data on the number of penguins at the Phillip Island Penguin Parade in Australia. This data was created by a teacher on her return from Philip Island and should not be considered real but is still useful for teaching and learning.
Variable & Description
Month: Monthly
Number: The number of penguins in the colony
The data is the surface area of sea ice in millions of square kilometres. The data is sourced from the National Snow and Ice Data Center.
Variable & Description
Time: Monthly
Arctic: Million Square Kilometres of Ice in the Arctic
Antarctica: Million Square Kilometres of Ice in Antarctica
You can find some more info about the different measurements used to calculate sea ice and why the numbers in the different versions are different here.
The main datasets folder now only contains the most recent sea ice dataset. To access historical ones go to: https://grapher.nz/?folder=seaice
The April 2017 data was sourced from http://www.climate4you.com/SeaIce.htm Others have been shared with me from various people directly from NSIDC.
More or Less did a 9 minute podcast about this in 2024 which provides a lot of interesting insights into the data: https://www.bbc.co.uk/programmes/w3ct5b7w
Data on the value of sunglasses sold.
Variable & Description
Quarter: Quarterly
Sales: Amount of sales in thousands of dollars
Temperature data from the weather station at Auckland Airport sourced from NIWA.
Variable & Description
Month: The Month of the Data
Tmax: Average Maximum Temperature for the Month
Tmin: Average Minumum Temperature for the Month
The visitors' dataset is the number of people entering New Zealand on a Visitor Visa from Australia, China, Japan and the UK. The data is sourced from Statistics New Zealand.
Variable & Description
Date: Quarterly
Australia: Number of visitors in the quarter from Australia
China, People's Republic of: Number of visitors in the quarter from China
Japan: Number of visitors in the quarter from Japan
United Kingdom: Number of visitors in the quarter from the UK
The updated version April 2020 also includes:
Variable & Description
Korea, Republic of: Number of visitors in the quarter from Korea
Germany: Number of visitors in the quarter from Germany
Canada: Number of visitors in the quarter from Canada
United States of America: Number of visitors in the quarter from USA
Total All Countries: Total Visitors to New Zealand
Extra Notes:
Data is derived from a sample of records and hence may contain sample error. Caution should be used when using data with low cell values.
Visitor arrivals are overseas residents arriving in New Zealand for a stay of less than 12 months. For arrival series, the country of residence is the country where a person last lived for 12 months or more (country of last permanent residence).
For detailed metadata about countries used in International Travel and Migration statistics, see DataInfo+ http://ow.ly/Mm9ba
Temperature data from the weather station at Auckland Airport sourced from NIWA. Each row is one day.
Variable & Description
Month: Month Number (1 = Jan, 2 = Feb, etc.) from which the temperature was collected
Decade: Decade from which the temperature was collected
Tmax(C): Maximum Temperature on the Day
Tmin(C): Minimum Temperature on the Day