|
|
|||||||
![]() |
Making the Numbers Come Alive: Basic Data Analysis |
|
Introduction Once educators have made the commitment to become "data-driven" and enter into the continuous improvement process, they must build competency in data analysis. The act of data analysis involves transforming data into useful information that can help guide decision-making. Having the ability to organize and display data enables educators to effectively plan their work, study their progress, and communicate to stakeholders. | ||
|
Academic:
| ||
|
Attendance:
| ||
|
Participation in extracurricular activities:
| ||
|
Parent involvement:
| ||
|
Language proficiency
| ||
| Question 2: Has our performance changed over time? To respond to this question, educators will need to evaluate longitudinal outcome data that have been consistently collected over time. | ||
|
Academic:
| ||
|
Attendance:
| ||
|
Participation in extracurricular activities:
| ||
|
Parent involvement:
| ||
|
Language proficiency
| ||
|
School comparison
| ||
| There are different ways of looking at change. One way to look at change is by tracking changes within a grade level over time. As we know, within a given grade, there are different groups of students from year to year. Therefore, changes in the performance of students in a grade level may indicate issues beyond the educational programs in the school. They may also reflect significant changes in student demographic data. This is called a longitudinal analysis. | ||
| The second way to look at change is by following the same group of students, or cohort, through their educational experience. Changes in the performance of the same group of students may then be linked in time to changes in educational programming or forms of process data. This is called a cohort analysis. | ||
|
The No Child Left Behind (NCLB) legislation requires that data be disaggregated according to four demographic categories for precisely this reason. The NCLB categories are:
| ||
|
Other important demographic variables may include:
| ||
|
Where are we now?
| ||
|
Has our performance changed over time?
| ||
|
Step 2: Selecting Data to Analyze To identify the data we will use in our analyses, we must consider how the outcomes in question are best measured, the demographic variables that are important to us, and the data necessary to reflect the instructional processes at our school. The evidence we rely on to understand our needs can come in the form of quantitative or qualitative data. However, for the purposes of this lesson, we will focus primarily on measures that are quantitative. | ||
|
Goal 1: 70% of students will be proficient/advanced in Reading/Math
Outcome data:
| ||
|
Goal 2: All children will read independently at the 3rd grade level by the 3rd grade
Outcome data:
| ||
|
Goal 3: Individual student attendance will be 90% or better
Outcome data:
| ||
| Data that are reliable can be trusted to provide consistent information over time. In order for data to be reliable, we must consider the tools or measures that generate the data. Reliable tools measure the same way each time they are used under the same conditions with the same individuals. The results they yield are repeatable. For example, if we administer a math assessment on a Monday to our class, and, without additional instruction, administer the same assessment again on Tuesday, we would expect similar scores from each student. If the scores differed significantly, we would question the reliability of the test. If we have confidence in the reliability of our measures, we can have confidence in the accuracy of our data. | ||
| Data that are valid measure what we intended to measure. The degree that the data are valid contributes to the strength of our conclusions, inferences or propositions. For example, if an assessment that intended to measure math proficiency had directions that were unfamiliar and difficult for the students, the resulting data from the assessment may actually represent the child's ability to accurately read the directions more than they do the child's proficiency in math. In this case, the directions would need to be reworked to remove this bias from the results and enhance the validity of the assessment. | ||
|
| ||
|
| ||
|
| ||
| In example 1, the data have been disaggregated by grade, a demographic variable in the school. Each row is labeled with a grade level. | ||
| The columns in this example show how the scores change over the course of three years. | ||
| There may be times when data need to be disaggregated based on two demographic variables. | ||
|
Put data in the cells. Once the chart has been set up, quantitative data can be placed in each cell. These data can represent several concepts, such as:
| ||
|
Step 4: Graphing the Data The next step to data analysis is using the data that are in a chart to create a graph. | ||
| Each graph has the common elements of an "x" and "y" axis. The x-axis is the horizontal line of the graph and often reflects the categories that are being measured or a period of time. | ||
|
The y-axis is the vertical line of the graph and reflects the numeric scale for what is being measured. Graphs should be given a title in the same way as charts. There are different kinds of graphs, each more appropriate for certain guiding questions than others. Next, we will link the questions we have with the type of graphic representations that aide our analyses most. Bar graphs are helpful for presenting data from different demographic groups and facilitating comparisons. For that reason, they provide an appropriate tool to display data that respond the guiding question, "Where are we now?" In this type of graph, the data are recorded in bars. | ||
| In the example below, "Where are we now?" is applied to the subject of reading. The data represented in this bar graph have been taken from the 2002 data shown in chart example 1. Each bar represents a different category ("grade 1" or "grade 2") and the heights of the bars indicate a quantity (in this case, the percent of students in each grade that were proficient in reading according to the 2002 SAT 9 achievement test). The scale on the y-axis begins at zero and ends at 100, as that is the range of possible percentage scores. | ||
| It also shows that the percentage of proficient readers in grade 3, at 38 percent, is considerably lower than in the other grades. This should raise a "red flag" to educators. | ||
| Line graphs are used to show change, so they are well suited to respond the guiding question, "Has our performance changed over time?" A line graph is read in the same way as a bar graph. The x-axis usually reflects the time period, and the y-axis shows the numeric scale for what is being measured. A line graph is drawn by dropping points for the data values at each time period, then joining the points with a straight line. When data are disaggregated into demographic groups, lines can be drawn in different colors or the points can be designated by different symbols. In this scenario, a legend for the graph is critical for accurate interpretation. | ||
| The final type of graph that will be discussed is known as the scatterplot. This graph does not align directly with the three guiding questions previously discussed. Instead, it is used as a tool for further exploration when hypotheses begin to be generated. Scatterplots help investigate hypotheses about relationships between school phenomena and therefore respond to a new guiding question, "What is the relationship?" On a scatterplot, one data point, plotted according to numeric scales on the x- and y-axes, represents an individual with scores on two variables. In scatterplot example 1, the relationship between reading proficiency and attendance rate is shown for all students. The x-axis is labeled with a scale for attendance rate and the y-axis represents NCE scores for reading. This scatterplot may have stemmed from a hypothesis that attendance has an impact on reading performance. | ||
| Scatterplot example 1 shows a strong, positive relationship between attendance and reading achievement. The students with higher rates of attendance tend to have higher levels of reading proficiency as well. Although the relationship is not perfect, if we draw a line through the middle point of the dots (also known as the "line of best fit"), we see that the students cluster around the line pretty tightly. | ||
|
| ||
| ||||