068. Data Types and Data Collection
Learning Intentions
- To know the meaning of the terms primary source, secondary source, census, sample and observation
- Identify variables as numerical (discrete or continuous) or categorical
- To understand that different methods are suitable for collecting different types of data, based on the size and nature of the data
Pre-requisite Summary
- Understand that data is information collected to answer a question
- Know that a variable is something that can change from one case to another
- Be able to distinguish between counting and measuring
- Understand that categories group items by type or label
- Know that a population is the full group being studied
- Understand that practical limits such as time, cost and access affect how data is collected
Worked Examples
Worked Example 1
For each term, State its meaning and give an example:
a) primary source
b) secondary source
c) observation
Worked Example 2
For each term, state its meaning and give an example:
a) census
b) sample
c) Explain one advantage of each
Worked Example 3
Classify each variable as categorical, numerical discrete, or numerical continuous:
a) favourite sport
b) number of siblings
c) height of a student
Worked Example 4
Classify each variable as categorical, numerical discrete, or numerical continuous:
a) eye colour
b) number of pets
c) time taken to run
Worked Example 5
Choose a suitable method of collecting data and justify it:
a) finding the shoe sizes of all students in one class
b) finding the daily temperature at noon for a month
c) finding the favourite music genre of students in a school of
Worked Example 6
A researcher wants data on the mass of apples sold at a market.
a) State whether the variable is categorical, numerical discrete, or numerical continuous.
b) Suggest a suitable method of collection.
c) Explain whether a census or sample is more suitable.
Problems
Problem 1
For each term, state its meaning and give an example:
a) primary source
b) secondary source
c) observation
Problem 2
For each term, state its meaning and give an example:
a) census
b) sample
c) explain one advantage of each
Problem 3
Classify each variable as categorical, numerical discrete, or numerical continuous:
a) favourite fruit
b) number of books read in a month
c) arm span of a student
Problem 4
Classify each variable as categorical, numerical discrete, or numerical continuous:
a) hair colour
b) number of cars in a household
c) amount of water in a bottle
Problem 5
Choose a suitable method of collecting data and justify it:
a) finding the hand spans of all students in one class
b) finding rainfall each day for two weeks
c) finding the favourite school subject of students in a school of
Problem 6
A researcher wants data on the lengths of leaves in a garden.
a) State whether the variable is categorical, numerical discrete, or numerical continuous.
b) Suggest a suitable method of collection.
c) Explain whether a census or sample is more suitable.
Exercises
Understanding and Fluency
Exercise 1.
Match each term to its meaning:
a) primary source
b) secondary source
c) observation
Exercise 2.
State whether each example is a primary or secondary source:
a) measuring the heights of your classmates
b) Use a government report
c) recording the colour of cars passing a school gate
Exercise 3.
State whether each study uses a census or a sample:
a) surveying every student in one class
b) surveying
c) measuring every tree in a small garden
Exercise 4.
Classify each variable:
a) type of pet
b) number of goals scored
c) temperature of a drink
Exercise 5.
Classify each variable:
a) brand of phone
b) number of siblings
c) distance travelled to school
Exercise 6.
Classify each numerical variable as discrete or continuous:
a) number of text messages sent
b) mass of a watermelon
c) time spent doing homework
Exercise 7.
Choose a suitable method for collecting each type of data:
a) favourite lunch food of a class
b) height of tomato plants over time
c) number of buses arriving in one hour
Exercise 8.
Choose whether a census or sample is more suitable:
a) finding the favourite colour of a class of
b) finding the favourite streaming service of all teenagers in Australia
c) finding the heights of all players in one netball team
Reasoning
Exercise 9.
Explain why the number of siblings is numerical discrete and not continuous.
Exercise 10.
A student says that height is numerical discrete because it can be written as a decimal. Explain the mistake.
Exercise 11.
Explain why a school might Use a sample instead of a census when surveying all students.
Exercise 12.
A student says that favourite sport is numerical because you can count how many people choose each sport. Explain why this is incorrect.
Problem-solving
Exercise 13.
A class wants to Solve the most popular fruit among students.
a) State the variable.
b) Classify the variable.
c) Suggest a suitable collection method.
Exercise 14.
A weather station records the temperature every hour for one day.
a) State the variable.
b) Classify the variable.
c) State whether observation is a suitable method.
Exercise 15.
A sports club wants to know the average number of training sessions attended each week by its members.
a) State the variable.
b) Classify it.
c) Suggest whether a census or sample is more suitable and explain.
Exercise 16.
A researcher wants to know the masses of fish in a large lake.
a) Classify the variable.
b) Suggest a suitable collection method.
c) Decide whether a census or sample is more realistic.
Exercise 17.
A school library records the number of books borrowed by each student in a term.
a) State the variable.
b) Classify it.
c) Explain why the variable is not categorical.
Exercise 18.
A council studies the colour of cars parked in a shopping centre.
a) State the variable.
b) Classify it.
c) Suggest whether direct observation is suitable.
Potential Misunderstandings
- Students may confuse primary and secondary sources by focusing on where the data is written rather than where it originally came from
- Students may think a sample and a census are the same because both involve collecting data
- Students may think any numerical variable is continuous
- Students may think any variable that can be counted from responses is numerical, even when the responses are categories
- Students may confuse discrete data with rounded continuous data
- Students may assume a census is always better, without considering time, cost and practicality
- Students may choose an unsuitable method of collection because they do not consider the size or nature of the data
- Students may think observation can only be used for categorical data, even though it can also be used to record numerical data in some contexts