Numbers Count: Contextual Assessment and Quantitative Measures in #PBL #DTk12

“He got one out of three!,” said Phil.

“Wow! Can you believe that?!,” responded Ann.

Did the “He” in this short story experience success or failure? Context makes all the difference in the world, doesn’t it?

I can imagine one context: A teacher on a team is reviewing assessment data, and Phil announces to his team that a student “got one out of three.” The tone could be disappointment and disbelief, indicating that Phil thought the student had more command over what had been assessed. The teammate, Ann, knowing how hard the team has been working on the lesson study and assessment echoes Phil’s consternation. In fact, I’ve heard just such a conversation.

I can also imagine a second context: A young boy relatively new to baseball is talking to his mom about a player hitting one out of three at-bats during a season, as the boy figures out what batting averages of .333 mean versus averages of .250. In this context, the exclamations indicate wild excitement at reviewing the success of the young boy’s friend who made the All-Star team. The mom is reflecting the excitement with a big smile on her face, saying, “Can you believe that?!” In fact, I’ve heard just such a conversation.

As schools examine and employ strategies like project-based learning and design thinking, I believe the stories above can be catalysts for talking about quantitative feedback in context. Why is it that the same fraction and decimal is called “failure” in one context and “success” (great success!) in another? Could it be that many of us have a “movie in our mind” playing – one that shapes our beliefs about what it means to get a one out of three based on experience with traditional quizzes or formative assessments? Could it be that we have come to assume that the content and skills on such assessments should be evaluated in such a way that only 70% and above would be considered “passing?” Considering an ed psych concept like Vygotsky’s ZPD (zone of proximal development) might lead us to believe that the scaffolding and instruction is in misalignment with the student’s learning. In context one, many might view one out of three as a problem.

But in the context of baseball, a 33% means something very different. It involves a mental movie that tells us that one out of three is grounds for Hall of Fame induction if the player can do that consistently over a career. Why is 33% so different in this context? Could it be that the high-quality activity of being face-to-face with a pitcher throwing serious heat causes us to shift our expectations and see 33% in an entirely new perspective and point of view? In context two, many might view one out of three as a celebration.

As schools, when we design project-based learning and design-thinking exercises, how might they be informed, in terms of assessment, by the contrasting contexts of taking a quiz versus standing at bat? Are we putting new wine into old wine skins (please forgive the mix of metaphors) when we apply traditional grading practices and certain quantitative measures to more high-quality, intensive contexts that refuse to be assessed with the same mindsets that have historically been applied in the classroom?

How might we be more purposeful and intentional about the interpretation and context of mathematical feedback?

About 14 months ago, I counseled a group of four boys who said to a colleague and me that they had failed.

“Why do you think you’ve failed, guys?”

“Well, Mr. Adams, we only got 2 out of 10 – 20%. In school, 20% is seriously failing!”

“But in your case, through your project, you helped 2 out of 10 unemployed human beings get a job! In your case, your point of view of 20% might need to shift a bit. Just because 20% on a quiz or a test might have indicated real disappointments and ‘disasters’ to you in the past, a 20% employment-bump statistic in your job-fair project could be seen as a wildly successful outcome. It’s more like a batting average than a vocab quiz. That’s how Ms. G and I see it. You positively changed 2 people’s lives this week. Your ’20%’ will cause ripples that will send significantly positive waves throughout that community.”

When we in schools apply quantitative measures – 100 point scales, 4 point Guskey scales, whatever kind of scales – I believe we need to do so very thoughtfully and carefully. We need to be proactive about our strategic communications surrounding these assessment measures. Students, teachers, parents – we all bring existing mental movies with us into the school setting.

Even if we don’t apply numerical measures – we did not do so in Synergy in the case of the food-desert, job-fair project – we must be aware of the mental movies and previous experiences that students bring with them to these contexts of project-based learning and design thinking. Those four boys did not receive any kind of “final grade” on that project (our course was non-graded, but heavily assessed), yet they applied previous context to a new situation and drew some profound conclusions about their perceived success. It was a powerful learning moment for me. One that has likely taken me the entire 14 months to fully process.

During the past few years, as I’ve consulted with a number of schools, more than a few are applying relatively traditional grading practices to the assessment of skill sets and dispositions. For example, on a report card or progress report, one might find a column or row labeled “Collaboration” and another labeled “Critical Thinking.” Next to the categories one might find an “82″ or a “2 on a four-point scale.” One might also see a “B-” in the scoring cell. Or one might see initials like “PG” – “Progressing.”

I realize I am telling a very incomplete story here. I imagine some readers writing to me in the comments or email or Twitter and saying, “Bo, you’re missing the whole point! High-quality PBL shouldn’t even be getting a quantitative measure. It should be performance-task assessed with only narrative, negotiated feedback. No numbers at all! What’s wrong with you?!” With this post, I really mean to provide a catalysts for thinking and doing with those readers and schools who ARE trying to marry quantitative-assessment measures with high-quality PBL and DT. I, too, have serious questions about the “Why?,” and I am also deeply interested in the “How?” if a school just will not consider non-numerical assessment reporting, even for certain courses, strands, projects, assignments, etc.

Are the challenges we are curating or creating causing us to think deeply about the nature of the challenges relative to assessment? Are we orchestrating experiences that are more like the intensive match up between a super pitcher and a batter – ones in which the quantitative measures we apply communicate All-Star results at “33%?” Or are we trying to place new wine into old wine skins and facilitating experiences that challenge kids so slightly that it’s assured most will “pass” or view their Herculean efforts as failure because we’ve neglected to help everyone involved reconceptualize and pivot perspectives on what “one out of three” might really mean in our context?

Are traditional report cards “vanity metrics?”

Are traditional report cards “vanity metrics?”

In the business world, we talk about the difference between vanity metrics and meaningful metrics. Vanity metrics are like dandelions – they might look pretty, but to most of us, they’re weeds, using up resources, and doing nothing for your property value.

from “Know the Difference Between Your Data and Your Metrics” by Jeff Bladt and Bob Filbin, Harvard Business Review  |  11:00 AM March 4, 2013

English – 92
Math – 89
Science – 91
History – 88
PE – 93
Art – 90

Such a report card might make the refrigerator. But does it really say very much about the student’s growing capacities in writing (ideas, organization, word choice, sentence fluency, conventions, voice)? And did you assume from that last sentence that I was referring to English, or did you assume that such metrics could cut across the departmentalized curricular landscape and comprise parts of all the subject grades? Do we know if the student asks probing questions and demonstrates curiosity for understanding more deeply? What do those dandelions tell us about the student’s application of such thinking skills as divergent, emergent, and convergent explorations? What do we know of the student’s perseverance, resilience, risk taking, and grit? Can we make any deductions about the student’s observation and experimentation capabilities? Do we know how the student has demonstrated integrity and empathy from those pretty flowers on the refrigerator door?

From that data set, the one that will live indefinitely in the fireproof cabinets and ethernets of schools, what do we know about the student’s growth and emergence in the 4, 5, or 7Cs of 21st century skills? What do we see on that report card about mindset?

What are we grading? What are we measuring? What are we commenting on? What are we collecting and recording and archiving data on? What do we say matters most about our children? Do our report cards shed light on what we say we value most?

Is your school asking these questions? Why or why not? Are these questions among more systemic considerations that you are examining?

#QuestionsAreTheWaypointsOnThePathOfWisdom

#PedagogicalMasterPlanning

#TED Andreas Schleicher: Use data to build better schools #MustSee

How can we measure what makes a school system work? Andreas Schleicher walks us through the PISA test, a global measurement that ranks countries against one another — then uses that same data to help schools improve. Watch to find out where your country stacks up, and learn the single factor that makes some systems outperform others.

TED – “Andreas Schleicher: Use data to build better schools”

These are not direct quotes from the end of the talk, but I tried to capture the basic messages (the TED transcript is not posted yet):

Range of factors true of high-performing PISA schools:

  • Leaders have convinced citizens to value education more than consumerism – consumption today.
  • Belief and practices that all children are capable of success. Expect all children to succeed.
  • Growth mindsets.
  • Diversified and differentiated instruction. Personalized learning.
  • Clear and ambitious standards that are known, understood and pursued.
  • High quality, highly trained, collaborative teachers. Intelligent pathways for growth.
  • Clear on good performance, and enable teachers level of autonomy to reach those performance standards. Not prescribed what to teach. Autonomy is not independence, though.
  • Shift from delivered knowledge to enabling user-generated wisdom.
  • Moved from administrative forms of accountability and control to professional forms of work organizations. Provide development for pedagogical innovation.
  • Looking outwards.
  • Achievement across entire system. Every school is part of success – it’s systemic.
  • Align policies and make them coherent. Consistent implementation.

A truly great talk. Schleicher admits that there are no “copy and paste” approaches to school innovation and transformation, and I love his use of the data and analysis to inspire thinking about what can work for other schools. It’s not the usual “country competition” talk using PISA. It’s so much more.

2012 in review – the auto-prepared annual report of It’s About Learning

The WordPress.com stats helper monkeys prepared a 2012 annual report for this blog.

Here’s an excerpt:

4,329 films were submitted to the 2012 Cannes Film Festival. This blog had 23,000 views in 2012. If each view were a film, this blog would power 5 Film Festivals

Click here to see the complete report.

[Note: I've read a few blog self-analysis, "year in review" posts from other bloggers that I follow. However, I have resisted doing the same exercise for this blog. Then, yesterday, I received an email from WordPress explaining that their "stats helper monkeys prepared a 2012 annual report for this blog." At the conclusion of the annual report was an option to make the stats public and add as a blog post. I'm just trying out this feature.]

[Note #2: Here's the automated annual report for PLC-Facilitators: Learning is the Focus, a blog that @jgough and I maintained for our work with the PLC facilitators at The Westminster Schools.]

PROCESS POST: Dreaming about learning apps that use data collection and dashboard displays

I love dreaming about the future of education. From dreams come possibilities and innovations. To stretch my own thinking, I seek inspiration from a number of sources. Frog design and TED are two of my favorites.

In the past 24 hours, I’ve thought more about data collection and dashboard display, all in the service of continuing to develop systems that visualize and enhance individual student learning. Recent inspirations for this dreaming have come from:

  • Advancing the Future of Healthcare: frog’s Connected Care Solution. I particularly love the images and visuals of the individual health dashboard. Where frog is showing dashboard items for blood pressure, BMI, physical activity, etc., I see a translation to education that could be dashboard items for oral communication, collaborative problem solving, and project management success.
  • Matt Killingsworth: Want to be happier? Stay in the moment (TED talk). In the talk, Killingsworth describes his app that collects information from a huge data set of people. In translation to education, I could see such an app being used on student smart phones, so that learners could real-time report on what they’re doing in class or at home to learn, if they are enjoying and/or benefiting from what they are doing, if they feel deeply engaged or confused or bored. That individual data could be aggregated to see a clearer picture of an individual learner’s preferences, proficiencies, etc. That data could also be aggregated on a larger set to see what types of activities are working best for various learner profiles, age groups, etc. And all of this data could feed into the dashboards imagined above.

Does any such app, data collection, and dash boarding already exist? If so, I would love for you to leave a comment and a link about what’s already out there. The closest thing I have seen personally is the data tool being developed with Khan Academy that provides individual and class data sets of learning targets, time spent on modules, etc.

Just like frog design’s CCS could reveal what’s working and what to address with a person’s health, and just like Matt Killingsworth’s app could reveal what is leading to greater happiness for folks, a comparable learning app could make tangible so much about what is working and what to address with individual learners and groups of learners.

If we can dream it, we can build it.