Dataset Info
This is a list of what all the variables are and where the dataset is from for the built in datasets on NZGrapher. You can always load your own data, or have a school folder with data that your school has uploaded.
Babies
The data on 189 births were collected at Baystate Medical Center, Springfield, Mass. during 1986. The goal of this study was to identify risk factors associated with giving birth to a low birth weight baby (weighing less than 2500 grams). Data was collected on 189 women, 59 of which had low birth weight babies and 130 of which had normal birth weight babies.
Variable | Description |
LowBirthWeight | No = Birth Weight >= 2500g Yes = Birth Weight < 2500g |
MothersAge | Age of the Mother in Years |
Race | Race of the mother |
MotherSmoke | Smoking Status During Pregnancy |
FTV | Number of Physician Visits During the First Trimester |
BirthWeight | Birth Weight in Grams |
Ball Wear
Data was recorded of students going to the school ball in 2012 as to how much they spent on their clothing and accessories.
Variable | Description |
Gender | Boy = new student is male Girl = new student is female |
Amount.spent | The amount spent on clothing and accessories in New Zealand Dollars. |
Cars
With rising costs of owning and running a car, and environmental awareness, buyers are becoming more conscious of the features when purchasing new cars. The data supplied is for new vehicles sold in America in 1993.
Variable | Description |
Vehicle Name | |
Origin | Country of manufacture · America · Foreign |
Price | US $1000 |
Type | Small, midsize, large, compact, sporty, van |
City | City MPG (miles per gallon by EPA rating) |
OpenRoad | Highway MPG |
Drive Train | Front Wheel Drive Rear Wheel Drive |
Engine Size | Size in litres |
Manual Transmission | Yes No |
Weight | Weight of car in Kg |
Diamonds
Every diamond is unique, and there are a variety of factors which affect the price of a diamond. Insurance companies in particular are concerned that stones are valued correctly. Data on 236 round diamond stones was collected from a Singapore based retailer of diamond jewellery, who had the stones valued.
Variable | Description |
Carat | Weight of diamond stones in carat units 1 carat = 0.2 grams |
Colour | Numerical value given for quality of colour ranging from 1=colourless to 6=near colourless |
Clarity | Average = score 1, 2 or 3 Above average = score 4, 5 or 6 |
Lab | Laboratory that tested & valued the diamond 1 = laboratory 1 2 = laboratory 2 |
Price | Price in US dollars |
Empty Dataset for Editing
This is a blank dataset designed for entering experimental data.
Kiwi
A sample of kiwi birds around New Zealand was collected in order to help with conservation efforts. The original data is from:http://www.kiwisforkiwi.org/ and was sourced from the secondary school guides (http://seniorsecondary.tki.org.nz/Mathematics-and-statistics/Achievement objectives/AOs-by-level/AO-S7-1)
Variable | Description |
Species | GS-Great Spotted NIBr-NorthIsland Brown Tok-Southern Tokoeka |
Gender | M-Male F-Female |
Weight(kg) | The weight of the kiwi bird in kg Note: while this variable is called 'Weight', it is actually mass, but I have left it called Weight in the dataset due to the number of related resources based on this dataset. |
Height(cm) | The height of the kiwi bird in cm |
Location | NWN-North West Nelson SF-South Fiordland CW-Central Westland N-Northland EC-Eastern Canterbury E-East North Island StI-Stewart Island W-West North Island NF-North Fiordland |
Teachers note: this is a synthesised dataset based on real data. At the time of creating the data set there were around 25,000 brown, 17,000 great spotted and 34,500 southern tokoeka. These numbers formed the basis of the data set, but instead of being out of around 76,000 the data set contains around 700 birds.
The data was generated using the population parameters, including gender, location, height, weight and species in Fathom. The size of the population was so that it was too big to use all the data (when doing by hand) but not too big that it couldn’t be created for students to use as a “population” to sample from.
Marathon
The data is a sample taken from marathons in NZ.
It is a simple random sample of 200 athletes.
Variable | Description |
Minutes | How many minutes they completed the marathon in |
Gender | Male (M) or Female (F) |
AgeGroup | Younger (under 40) or older (over 40) |
StridelengthCM | The persons average stride length over the marathon in cm. |
Rugby
The data is real data and comes from http://www.rugby-sidestep-central.com/
Variable | Description |
Country | New Zealand or South Africa |
Position | Forward or Back |
Weight | The weight of the player in kilograms (kg) |
Height | The height of the player in metres (m) |
Sharks
The data is real and comes from the MPI centralised observer database:
http://www.fish.govt.nz/minz/Research+Services/Research+Database+Documentation/Cod/default.htm
Variable | Description |
Calendar Year | Year which the data was recorded in |
Fish Sex | The gender of the shark |
Total Length | The total length of the shark measured in centimetres |
Fork Length | The fork length of the shark measured in centimetres |
Sports Science
The data is real data and comes from http://www.statsci.org/data/oz/ais.html The data set provides information about 102 male athletes and 100 female athletes at the Australian Institute of Sport.
Variable | Description |
Sex | male or female |
Sport | sport played |
Ht | height in cm |
Wt | weight in kg |
LBM | lean body mass in kg |
%Bfat | % body fat |
BMI | body mass index (weight/height2) |
RCC | red blood cell count |
WCC | white blood cell count |
Hc | haematocrit |
Hg | haemoglobin |
Ferr | plasma ferritin concentration |
SSF | sum of skin folds |
Teachers note: this is the dataset used in the TKI Exemplar A
TS Note
All datasets prefixed with a TS that are preloaded in NZGrapher are time series datasets, and not particularly well suited to bivariate or multivariate analysis.
TS - Births and Deaths
Data on the number of births and deaths in New Zealand.
The data is sourced from Statistics New Zealand.
Variable | Description |
Quarter | Quarterly |
Male Live Births | Number of males born during the quarter |
Female Live Births | Number of females born during the quarter |
Male Deaths | Number of male deaths during the quarter |
Female Deaths | Number of female deaths during the quarter |
TS - Forestry
The volume of wood removed from different types of forests in New Zealand. The data is sourced from the Ministry for Primary Industries.
Variable | Description |
Quarter | Quarterly |
Natural Forests | The volume of wood removed from Natural Forests in millions of m3 |
Plantation Forests | The volume of wood removed from Plantation Forests in millions of m3 |
TS - Imports
Information on imports to and from New Zealand. The data is sourced from Statistics New Zealand.
Variable | Description |
Month | Monthly |
TotalAirportsCIF | Cost, insurance and freight of imported goods in NZ$(000) |
TotalParcelPostCIF | Cost, insurance and freight of imported goods in NZ$(000) |
TotalSeaportsCIF | Cost, insurance and freight of imported goods in NZ$(000) |
TotalAirportsWeight | Weight of imported goods in tonnes |
TotalParcelPostWeight | Weight of imported goods in tonnes |
TotalSeaportsWeight | Weight of imported goods in tonnes |
TS - Jobs
The number of people in employment in New Zealand. The data is sourced from Statistics New Zealand.
Variable | Description |
Month | Monthly |
Total Filled Jobs | The number of jobs that are filled in millions |
NZ Population | The population of New Zealand in millions |
TS - Penguin
Data on the number of penguins at the Phillip Island Penguin Parade in Australia. This data was created by a teacher on her return from Philip Island and should not be considered *real* but is still useful for teaching and learning.
Variable | Description |
Month | Monthly |
Number | The number of penguins in the colony |
TS - Sea Ice
The data is the surface area of sea ice in millions of square kilometres.
The data is sourced from the National Snow and Ice Data Center.
Variable | Description |
Time | Monthly |
Arctic | Million Square Kilometres of Ice in the Arctic |
Antarctica | Million Square Kilometres of Ice in Antarctica |
You can find some more info about the different measurements used to calculate sea ice and why the numbers in the different versions are different here.
The main datasets folder now only contains the most recent sea ice dataset. To access historical ones go to: https://grapher.nz/?folder=seaice
The April 2017 data was sourced from http://www.climate4you.com/SeaIce.htm Others have been shared with me from various people directly from NSIDC.
More or Less did a 9 minute podcast about this in 2024 which provides a lot of interesting insights into the data: https://www.bbc.co.uk/programmes/w3ct5b7w
TS - Sunglasses
Data on the value of sunglasses sold.
Variable | Description |
Quarter | Quarterly |
Sales | Amount of sales in thousands of dollars |
TS – Temperatures Auckland
Temperature data from the weather station at Auckland Airport sourced from NIWA.
Variable | Description |
Month | The Month of the Data |
Tmax | Average Maximum Temperature for the Month |
Tmin | Average Minumum Temperature for the Month |
TS - Visitors
The visitors’ dataset is the number of people entering New Zealand on a Visitor Visa from Australia, China, Japan and the UK.
The data is sourced from Statistics New Zealand.
Variable | Description |
Date | Quarterly |
Australia | Number of visitors in the quarter from Australia |
China, People's Republic of | Number of visitors in the quarter from China |
Japan | Number of visitors in the quarter from Japan |
United Kingdom | Number of visitors in the quarter from the UK |
The updated version April 2020 also includes:
Variable | Description |
Korea, Republic of | Number of visitors in the quarter from Korea |
Germany | Number of visitors in the quarter from Germany |
Canada | Number of visitors in the quarter from Canada |
United States of America | Number of visitors in the quarter from USA |
Total All Countries | Total Visitors to New Zealand |
Extra Notes:
Data is derived from a sample of records and hence may contain sample error. Caution should be used when using data with low cell values.
Visitor arrivals are overseas residents arriving in New Zealand for a stay of less than 12 months. For arrival series, the country of residence is the country where a person last lived for 12 months or more (country of last permanent residence).
For detailed metadata about countries used in International Travel and Migration statistics, see DataInfo+ http://ow.ly/Mm9ba
Temperatures - Auckland
Temperature data from the weather station at Auckland Airport sourced from NIWA. Each row is one day.
Variable | Description |
Month | Month Number (1 = Jan, 2 = Feb, etc.) from which the temperature was collected |
Decade | Decade from which the temperature was collected |
Tmax(C) | Maximum Temperature on the Day |
Tmin(C) | Minimum Temperature on the Day |