Quiz Bus: Dealing with Data
Western Reserve Public Media
 
 
 
Introduction
 
Video 1
 
Video 2
 
Video 3
 
Video 4
 
Video 5
 
Resources
 
Activities
 
Teacher Materials
 
Watch Online
 
 
pbs.org
Resources
Vocabulary

Association: The trend in a graph that shows the effect on one measure by a second measure.

Biased sampling: A sample that overrepresents or underrepresents part of the population.

Bar graph: A graph of data with parallel bars used for comparing information on categories where each bar represents a category and the height or length of the bar represents the number of events in that category.

Bivariate data: Data or events described using two variables.

Box-and-whisker plot: A diagram that shows pictorially the median and measures of spread (upper and lower interquartile ranges and the range) for one set of data.

Broken-scale symbol: Is used at the end of the axis to indicate that the graph does not begin at the origin (0,0).

Causation: The relationship between two variables where a change in one variable affects the outcome of the other variable.

Categorical data: Data that can be classified by type; e.g., color, types of dogs. These types of data are typically represented using bar chart, pie charts or pictographs.

Census: When every member of the population has data collected from them.

Clusters and gaps: Numbers that tend to crowd around a particular point in a set of values. The spaces between the clusters are called gaps.

Combination: A selection of a group of items or events from a set without regard to order; e.g., the number of three-piece outfits from the set of clothes in the closet.

Complementary events: Two or more mutually exclusive events that together cover all possible outcomes. The sum of the probabilities of complementary events is one.

Compound events: Combining two or more independent events or outcomes and considering it as one single event or outcome.

Conditional probability: The probability of an event occurring given that another event has already occurred. For example: What is the probability that the total of two dice will be greater than eight given that the first die is a six?

Continuous data: Data that can be assigned an infinite number of values between whole numbers, the assigned values are approximated; e.g., the size of the apples on an apple tree is continuous data. See discrete data for a counterexample.

Convenience data: Data that has been gathered with the group at hand.

Coordinate plane: A plane determined by the intersection of two perpendicular number lines in which any point can be located.

Coordinates: An ordered pair of numbers used to show a position on a graph. The first number (X) gives the place left or right and the second number (Y) gives the place up or down (X,Y).

Correlation: The relation between two sets of data, a positive or direct correlation exists when both sets vary in the same direction (both sets decrease); a negative or inverse correlation exists when one set of data increases as the other decreases.

Correlation coefficient: A measure of the correlation between two variables or sets of data. The value of the correlation coefficient, r, is always -1 to 1, where 1 is perfect positive correlation, 0 is no correlation, and -1 is perfect negative correlation.

Cumulative frequency: A running total of frequencies.

Datum: A single piece of data. The singular form of data.

Dependent events: A statement or probability for one event affects a statement or probability for another event.

Descriptive statistics: Gathered and described data using probability, statistical methods and concepts like graphs and measures of center.

Discrete data: Data that can be counted; e.g., the number of people in a town is discrete (there is no such thing as a fractional person). See continuous data for a counterexample.

Disjoint events: Two events that have no outcomes in common.

Dispersion: How data is spread out around some central point.

Distribution: A graph or table showing how many pieces of data there are in each class, or of each type.

Equation: A statement that shows two mathematical expressions that are equal to each other.

Event: An outcome of a probability experiment.

Experiment: The collecting of data through a planned investigation.

Experimental probability: The probability based on a series of trials. The experimental probability, P, can be found using the following equation:

P(event) = # of trials w/favorable outcomes
 
  number of trials in experiment

 

Experimental results: The outcome as a result of a probability experiment or test. These outcomes are sometimes called actual results.

Extrapolation: A term used in interpreting scatter plots that predicts the location of points extending beyond the data displayed.

Frequency: The number of times an occurrence takes place as expressed in a count. The count is the frequency.

Frequency distribution: A collection of data that represents the number of times a set of numbers, items or events has occurred.

Frequency table: A table that shows how often each item, number or range of numbers occurs in a set of data.

Geometric probability (area models): The probability that a random point is located in a particular part subregion or larger region.

Glyph: A picture that represents data.

Histogram: Shows how measurements spread out across a real number line, marked off in equal intervals. The height of the bar represents the frequencies of measurements contained in that interval.

Independent event: Two events in which the outcome of the first event does not affect the outcome of the second event.

Intercepts: The value of y on the coordinate plane where x = 0, called the y-intercept. The value of x on the coordinate plane where y = 0, called the x-intercept.

Interpolation: A term used in interpreting scatter plots that predicts the location of points that would lie between those points of the data already displayed.

Interquartile range: The difference between the upper quartile and the lower quartile.

Interval: The distance on a real number scale between two consecutive tick marks or the space between the two points.

Line of best fit: A line drawn in the midst of the points on a scatter plot in an attempt to estimate the mathematical relationship between the variables used to generate the plot.

Line plot: A graph that indicates the location of data points along a segment of the real number line.

Linear equation: An equation whose graph on a coordinate grid is a straight line.

Lower extreme: The lowest value in a set of data.

Lower quartile: When data is ordered from smallest to largest and divided into four quarters, the values that are in the lower quarter of the data.

Matrix: A rectangular array of numbers or symbols.

Measurement data (quantitative): Has a numerical value and could be placed on a number line.

Measures of center: Numbers that provide information about cluster and average of a collection of data. They include the mean, mode and median.

Measures of spread or variability: A term used to refer to how much numbers are spread, varied or dispersed in a set of data. They include range, quartile and interquartile range.

Mean: The sum of a set of numbers divided by the number of elements in the set.

Median: The middle number or item in a set of numbers or objects arranged from least to greatest, or the mean of the two middle numbers when the set has two middle numbers.

Mode: The number or object that appears most frequently in a set of numbers or objects.

Mutually exclusive events: Two events that cannot occur at the same time.

Negative association: A pattern in the shape of the data that shows when one measurement grows larger, the second measure grows smaller. A negative association will be slanted downward from left to right.

Odds of an event: The ratio of favorable outcomes to unfavorable outcomes.

Ordered pairs: A pair of numbers that gives the coordinates of a point on a grid in this order (horizontal coordinate, vertical coordinate). Also called paired data or paired coordinates.

Origin: The point on a coordinate plane where the x-axis and the y-axis meet and have the ordered pair (0,0).

Outcome: Any possible result of an activity or experiment.

Outlier: A data point in a sample widely separated from the main cluster of points in the sample.

Parallel box plots: Two or more box plots using the same number line to allow comparison of the data.

Population: A whole set of individuals, items or data from which a sample is drawn.

Permutations: Possible orders or arrangements of a set of events or items.

Positive association: The pattern in the shape of data that shows when one number gets larger, the second also gets larger. It will be slanted upward from the left to right.

Probability: The chance of an event occurring. The probability of an event is equal to the number of favorable outcomes divided by the number of possible outcomes.

Pictograph: The use of a repeat picture in a bar graph form, to represent data.

Quadrants: The four separate sections formed by the two axes of a coordinate system. These are identified as the first, second, third and fourth quadrants.

Qualitative data: Data that can be assigned qualities or categories. They are nonnumerical data.

Quantitative data: Data that are numerical. The data can be discrete or continuous.

Quartile: In conjunction with the median, the quartiles divide the set of data into four groups of equal size.

Random sample: A sample in which every event has an equal chance of selection and each event is chosen by a random process.

Random variable: A variable that takes any of a range of values that cannot be predicted with certainty.

Range: The difference between the greatest and the least numbers in a set of data.

Rank: Placing a numerical order according to scale or value.

Relative frequency: The number of times an event occurs divided by the total number of trials.

Sample: A set of data taken from a larger set used to create or test theories about the data as a whole.

Sample space: A list of all possible outcomes of an activity.

Sampling method: The process used to collect data; see random sample.

Scale: The regular intervals of the number line that are chosen to represent the full range of data on a graph.

Scatter plot: A graph with one point for each item being measured. The coordinates of a point represent the measure of two attributes of each item.

Simple event: A subset of the sample space that contains only one outcome that cannot be broken down into a simpler, more basic outcome.

Simulation: The study of probability by collecting mathematically appropriate data that models an actual event.

Slope of 1: A line that contains points where the abscissa (x value) and the ordinate (Y value) are equal. (The rise equals the run.)

Stem-and-leaf plot: A frequency diagram that displays the actual data together with its frequency, by using a part of the value of each piece of data to fix the class or group (the stem), while the remainder of the value is actually listed (the leaves).

Survey: A question or set of questions answered by a population of people.

Symbolic form: A representation of something using numbers and symbols.

Target population: The set from which a sample will be selected.

Theoretical probability: Identifying, using mathematical expectations, the number of ways an event could happen compared to all the events that could happen.

Theoretical results: The expected results given the theoretical probability of an event.

Tree diagram: A graphic representation that resembles the branching out of a tree. It is used to illustrate outcomes of an event.

Trend: An emerging pattern in the shape of a data display that can be seen on a scatter plot.

Univariate data: Having one variable.

Upper extreme: The largest value in a set of data; the maximum.

Upper quartile: When data is ordered from smallest to largest and divided into four quarters, the values that are in the upper quarter of the data.

Variable: A changing quantity, usually a letter in an algebraic equation or expression, that might have one of a range of possible values.

Variance: A measure of the dispersion of the distribution of a random variable.

Variants: A quantity whose value may change or vary.

X-axis: The horizontal axis on a coordinate grid.

Y-axis: The vertical axis on a coordinate grid.

 

Definitions are from the Ohio State Content Standards for Mathematics and Exploring Statistics in the Elementary Grades by Bereska, Boster, Bolster and Scheaffer. (Dale Seymour Publications)

 
Copyright©2008, Northeastern Educational Television of Ohio, Inc. All rights reserved.