Does mixed ability teaching harm pupils progress?

A new study comparing groupings of year 7 and year 8 pupils has found that teaching mathematics in mixed ability classes is less effective than placing pupils in sets based on ability or prior attainment.

The research – conducted by a team from the University College London (UCL), the Institute of Education, Brunel University and Queens University Belfast – finds that mixed attainment teaching leads to one months less progress overall for pupils (although they admit this is not statistically significant) and that the impact is particularly pronounced on high prior attainers, with mixed ability grouping costing them two months of maths progress when compared to setting.

As well as attainment, the study also surveyed pupils’ self-confidence in maths learning. It found that being taught in mixed ability classes negatively affected the self-confidence of pupils studying mathematics, in particular those with the lowest prior attainment.

This smacks in the face of established thinking in certain areas of maths education, which is that any cost to high attainers from mixed-ability teaching is more than made up for by the positive impact on outcomes and attitudes for the lower attainers. Proponents of mixed ability teaching often use this as a 'social justice' argument, stating (rightly) that low prior-attainers and those in lower sets tend to come from disadvantaged backgrounds and/or certain ethnic backgrounds and so the positive benefits for these pupils are worth and adverse impact on those that are more likely to have a more privileged upbringing.

I remember in the relatively early days of the the maths hubs hearing stories about certain hubs also promoting mixed ability teaching as being 'essential' for teaching for mastery. This seemed to coincide with key principles of teaching for mastery like 'all pupils can learn and enjoy mathematics' and all pupils working on the same mathematical idea.

Interestingly though, the heads of maths survey revealed that both those in charge of mixed ability departments and those in charge of setting departments both thought that their approach to grouping benefited lower attainers indicating a more divided opinion.

Now, I am sure that proponents of setting will take this as a victory, saying that 'if it helps the most able and doesn't have a negative impact on the least able, we should all be setting'.

Meanwhile I am sure that the adherents to mixed ability teaching will point to possible flaws in the research. Some of these are admitted in report, such as the admission that, due to the number of schools that either dropped out of the study or had to be withdrawn because they couldn’t be matched with a school with the alternative grouping approach, the study ‘may not fully represent the national population of schools’. Or the fact that the pupils in the setting schools had slightly higher prior attainment on entry, or more experienced teachers. Or more of the setting schools being rated 'outstanding' by Ofsted. Although Professor Allen has called for this to be the end of the mixed-attainment debate in her recent blog, I can see there being enough ammunition in the studies low evidence strength rating and own admitted limitations for die-hard MA proponents to continue the fight.

In my last school (an eight form entry school), we taught mixed ability in year 7 and year 8. This started many years ago when the school had its first KS4 cohort, and could no longer organise year 7 into two bands due to timetabling constraints. The best they could do is put them into quarters, with two classes in each quarter. I made the decision at this point to simply teach them in form groups instead. I knew we were a couple of years away from introducing a new scheme at KS3 that would be suitable for mixed ability teaching. Once this rolled out, we actually found that so many pupils were doing well (scoring over 80% on the end of year 7 testing) that setting in year 8 wasn't always necessary - we would sometimes put a top set or nurture group in place where the data from year 7 suggested it might be required. Ultimately, we had no real options in how we grouped year 7, and we were flexible enough to adapt to what the data told us (within reason) for when these pupils moved on to year 8.

For me, the way that pupils were grouped was much less important than the provision they were getting, and it is here that the study offers some real concerns. During the case study observations that the research team did they found that the majority of lessons rated low on the 'Teaching for Robust Understanding' framework. They commented things like ‘Typically, the focus was on procedures and methods with little focus on developing understandings of the underlying concepts’ and 'Although there were opportunities for challenging extension activities for high-prior attaining students [in mixed ability classrooms], these opportunities frequently involved mathematics that was not directly related to the lesson content and were rarely discussed in class.’

The teacher survey also paints a bleak picture in parts. Despite significant work over the last decade or more by the NCETM to embed practices such as mathematical thinking and the use of representations and manipulatives, the use of problem solving as anything but extension work, particularly those problems for which there is no obvious solution, as well as the use of manipulatives in lessons for these year groups remaining disappointing low. This is despite nearly 60% of pupils saying that they understand mathematics better when they can use objects to help them, rising to nearly 65% for the lowest prior attainers.

The study also contains some further concerning (although not necessarily surprising) statistics. Boys being about 50% more likely than girls to report that they feel comfortable talking about their mistakes in class adds to the evidence that maths teaching practice needs to change to be more inclusive of girls and support the building of girls confidence in maths (as was also highlighted in this recent education brief from UNESCO looking at recent TIMMS data). High prior attainers being 50% more likely that low prior attainers to say their parents like maths highlights the continuing theme of parental issues and anxieties around maths impacting the progress of their children.

Whatever happens with grouping of pupils, whether more schools move to setting, whether schools try and deal with the lack of challenge for high prior attainers by having a top set and mixed attainment for rest, these are, for me, secondary concerns. If this research is to be believed, the lower prior attainers aren't being served by either setting or mixed ability - they are struggling when they start secondary school and still struggling two years later no matter how they are grouped. This is what we need to deal with as a sector, improving practice so that it supports all learners in making progress, providing access and entitlement to the highest quality CPD to ensure teachers continue to develop their craft throughout their careers, and addressing wider societal issues and attitudes around mathematics. These are not entirely solvable by schools.


They can learn the lyrics...

“They can learn the lyrics to loads of songs, but they can’t remember…”

I have heard a phrase like this many times in my career in schools, usually from one of two sources:

  1. People who are advocating for more “fun” in lessons – if we make lessons as engaging as the popular songs of the day then kids will remember the content as well as they remember the lyrics to songs.
  2. People bemoaning the lack of students’ retention of the subject matter they are teaching.

I am by no means a musical person. I never learned to read music, or to play any instrument beyond very basic keyboard skills. I typically have no clue if something is in tune, or ability to harmonise. I do, however, very much enjoy singing along to songs on the radio (much to my wife’s chagrin). It was whilst doing this recently on a drive that I reflected on the fallacy of the idea that learning song lyrics is akin to learning the content of a lesson or curriculum at school.

In my reflections I identified four types of songs that I can associate with my ability to reproduce the lyrics of: 

  1. Songs that I know the lyrics of completely – that is to say I can write out the complete and correct lyrics even in the absence of the tune or words.
  2. Songs that I know all (or nearly all) of the lyrics in context – these are the songs that I can comfortably sing along to, matching the words as they are sang by the artist (or get the timing right on the rare occasions I sing karaoke), but I require at least the tune, and probably the words, to be present in some form.
  3. Songs that I know some of the lyrics in context (and maybe some without the context) – these are the songs where I can sing along to bits of them (usually the chorus or an early verse), and may even be able to reproduce those words without the tune if asked to say or write them, but there will be other parts of the song that I don’t know and will have to stop singing to.
  4. Songs that I don’t know the lyrics of at all – speaks for itself really!

For me, and I suspect for the vast majority of other people, this is a list which grows as you go through it. In fact, there is only one song that I can confidently place in section 1 (“Another Day in Paradise” by Phil Collins if you are interested). There are a few songs I can place in section 2. More than that in section 3. And plenty in section 4. Most of these last ones will be songs that I have never heard, or not heard often, whilst those in 2 and 3 will be songs that I hear regularly.

Now, I have no empirical evidence for this statement, but I suspect that the vast majority of people will recognise that their own experience of this is quite similar. There will be some; professional musicians, those with eidetic memories, and those who are much more heavily invested in music than I am; who will, of course, have a much larger list in section 1 than me. But for the everyday person who enjoys music without being devoted to it, I would be willing to bet that these four categories ring true. And the key difference, for me, between sections 1 and 2 or 3 can be summed up by one word: cue.

It is simple really. When I hear certain tunes, my brain is cued to retrieve the lyrics of the song along with the tune. In cognitive psychology terms, the desired memory (the lyrics) have been associated with an appropriate stimulus (the tune of the song) sufficiently during the encoding phase of my memory of the lyrics that when I hear the tune the memory of at least some of the lyrics comes to the fore. Even the (for me) one song that sits in section 1 is somewhat like this; although I can recall the lyrics perfectly I do so by singing the song in my head – the words and the tune (or at least the rhythm of the song) are inseparable in my memory.

I can offer some evidence of this from the popular TV show, “Never Mind the Buzzcocks”. The last round in this show (for those that haven’t seen it) is called “next lines”, where the host Greg Davies (in its most recent incarnation) reads out a line from a song and the panel of contestants have to provide the next line. It is very common for a panel to fail to do this, even when the person who sang the song is on the panel. The cues for them to recall it are simply not there. Many other times, the members of the panel have to sing the song to themselves to cue the memory of the next lines.

In teaching however, one of our big goals is for students to retrieve information given little or no cue, or in situations where the stimuli are not the same as they were during the encoding phase of the memory to be retrieved. Students learn a particular piece of knowledge in one context, but then are asked to recall and apply it in many different contexts – they are asked to reproduce the song lyrics without the tune. Even worse, they are asked to reproduce the song lyrics whilst a different tune is playing. I know there will be many people out there with experience of the difficulty here – trying to recall the words of a song but being unable to because a different song is playing in the background. But that is precisely what we are often aiming for in teaching; transference of the knowledge or skills we teach into different contexts or domains. And this is a big reason why students find it much more challenging to recall what we teach than to sing a long to their favourite songs.

This isn’t to say that there aren’t useful things we can take away from kids’ ability to sing along. If we can identify useful cues for memory retrieval that are, perhaps less context specific, and get our students to associate them with the knowledge to be retrieved, this can be helpful. To this day I can’t hear the word “trigonometry” without thinking of the phrase “Some Officers Have, Curly Auburn Hair, 'Til Old Age” (the mnemonic that I was taught nearly 30 years ago to help remember which trig function is used with each pair of sides in a right-triangle). Of course, this is only helpful when I recognise something as requiring trigonometry (which, fortunately, I can do quite well now) – it doesn’t help me if the context is not one where I would think to use trigonometry in the first place.

There are also things we can take away by reflecting on what helps those songs (or in my case, song) get into category 1. In my case, a key one I can identify is that the song tells a story. Much has been written about the power of story telling in learning, and I think that definitely plays a role in why the lyrics to “Another Day in Paradise” are so memorable to me. Another aspect of the song is the chunked nature (again, something written about extensively in education circles). Each verse is only 4 lines long, and each line is only between 3 and 10 words – most of them very short words. There are probably other properties of the song that make it more memorable that a musician could identify that I cannot, but I do think these two have a particularly strong impact.

So, the next time someone says “Why can kids learn the lyrics to so many songs, but can’t remember…” or maybe tries to use it to justify increasing the “fun” in a lesson, you can answer them with “It is probably only because they are only trying to retrieve that knowledge when they have strong cues to help them. We are trying to get them to retrieve and apply that knowledge across many contexts so it will always be more challenging for us.”

How to revise for GCSE Maths without all the panic

A friend and colleague of mine, Helen Osmond, has produced a wonderful new book to support pupils in revising for GCSE maths.


Titled "How to revise for GCSE Maths without all the panic" this short book is potentially the perfect companion for your child or children that are struggling to revise effectively for GCSE maths.

The book is short and split into three easy to manage chapters, examining how the brain forms knowledge and memories, and how revision needs to work with this process, in chapter 1. 

In chapter 2 it then moves on to how to set up your revision space, standard revision techniques - including (importantly) why they work, how to maximise their effectiveness, and common mistakes people make when using each strategy. There is also a note on the use of AI in revision, how to use it well and what not to do, as well as how strategies might be updated for neurodivergent pupils. 

Chapter 3 then focuses on the foibles of GCSE maths in particular, looking at the different command words used in exam papers, the common mistakes that cost pupils marks in exams, and the basics of interpreting a mark scheme. There is then guidance on which of the strategies in chapter 2 might suit certain maths content, as well as some advice for what to do when you are actually in the exam itself. 

The book also contains a useful link at the end of the book which provides access to for further free resources to support revision.

As soon as I finished reading my copy I immediately passed it on to my daughter, who is sitting GCSE exams in this year, and told her she should read it. As a pupil guide, I really like this book. It contains lots of great advice and information, but in a simple and easy to digest form. Helen writes as if she is addressing the child directly, no doubt drawing from her years of experience having similar conversations with the pupils she has tutored over the years. I am sure many of the messages in the book have been delivered piecemeal by schools and parents over pupils' GCSE years, but having them all together in a single reference guide could be, in my opinion, invaluable for children to refer to as they are deciding on and using revision techniques. I just wish I had a copy a couple of months ago so I could give it to my daughter then!

The role of 'problems' in learning mathematics.

Yesterday I came across this quote from a podcast interview between Professor Anna Stokke from the University of Winnipeg and Professor Emeritus John Sweller, best known for formulating Cognitive Load Theory.


This seemed too binary to me. That a problem can either be impossible or simple depending on the schema that a person has seems to belie the complexity of how learners develop mathematical knowledge. Surely, there must be points where the solving is difficult, but achievable, and this difficulty lessens over time.

One example given in the podcast is a pair of simultaneous equations: x + y = 5 and 2x - y = 8. Now, of course, to someone who knows lots about simultaneous equations, the path to finding the values of x and y here is relatively clear. As Anna said in the podcast, you would add the equations together to eliminate y, find the value of x and then substitute to find y.

Clearly one needs enough knowledge of algebra to even interpret the question. If I don't have some knowledge of the concept of x and y as unknowns here, I won't even understand what the equations themselves mean, never mind what asking me to 'solve the pair of equation' means.

However, there are many other ways to solve this pair of equations. The podcast mentioned trial and error. Although not efficient, trial and improvement is valid. Other numerical methods such as the Gauss-Seidel method are also valid. Alternatively, we could plot the two linear graphs and look for their point of intersection. We could employ matrix approaches involving the inverse matrix or reduction to row echelon form. If I know anything about any of these approaches, the problem is not impossible even though my schema may not contain any knowledge of solving pairs of equations using elimination.

What I think Professor's Stokke and Sweller mean by 'impossible' in this case is actually 'unreasonable to expect learners to do using the approach intended by the teacher'. This I have more sympathy with. If I offered that pair of equations to pupils with the intention of them 'discovering' elimination as an approach without having ever manipulated pairs of equations, I don't think many (if any) of them would work out the approach for themselves. I am not, nonetheless, in 100% agreement with Sweller and Stokke's point of view. I think that, if I did teach pupils more generally manipulating systems of equations - showing them how to add, subtract, multiply and the like single equations or pairs of equations - without the goal of 'solving' the pair, and then explained what it meant to solve a pair without modelling or exemplifying the approach, I think it much more likely that some pupils would then 'discover' the elimination approach. 

To be fair, having listened to the podcast, I don't think Sweller or Stokke really think it is impossible anyway. The point they seem to be making is that it is simply not a good approach to ask learners to employ if the goal is them learning to recognise and appropriately deploy that strategy. The much more useful approach that will support more learners in achieving this goal will be to have an expert exemplify and model the approach, and then have learners practice application of what they have seen/studied to an increasingly complex array of carefully chosen and structured problems to support development of increasing fluency. This I do 100% agree with, given that the goal is the learner getting to the point where they recognise and can effectively deploy the strategy.

The bit that I have more of an issue with is that this should not be the only goal of a mathematics education - to be taught lots of strategies, how to recognise when to deploy said strategies, and then to deploy them automatically. In my opinion, there does need to be space created for learners at all levels to be able to grapple with uncertainty, deal with competing constraints, and examine the pros and cons of different approaches. There is a phrase that I first encountered in Colin Foster's MT article that comes from Japanese "problem solving" lessons; 'the lesson begins when the problem is solved'. As teachers, part of our goals for a mathematics education must include opportunities for engaging with authentic problems, not simply questions which are very closely related to a single mathematical approach or result that has been recently taught. One can argue that questions like solving the pair of simultaneous equations given above, once pupils have been taught elimination as an approach, cease to be a 'problem' in a mathematical sense rather than simply questions that should cue a particular recognition and deployment. Indeed, at GCSE, such a question would be considered an AO1, "use and apply standard techniques" question, rather than an AO3, "solve problems within mathematics and in other contexts" question. 

Contrast that with this question taken from the Corbett Maths website:


This problem has multiple possible approaches. Yes they all revolve around having equal amounts to compare - either equal volumes of Cola or equal values of money. However, there are a number of different volumes or amounts of money that could be in consideration here, 6 litres, 100 ml, 1 ml, 1p, 10p, £1 or more are all feasible. This is entirely the sort of problem I can see featuring in a Japanese style lesson, with the teacher introducing the problem and providing any necessary input around scaling of volumes and pricing or the like, before allowing pupils to approach the decision in their own way and generating meaningful discussion about how different approaches that pupils might take compare to each other. 

Another issue to consider is the role that engaging with problems prior to learning an approach specifically tailored to the problem type might have on motivation. This is a complex issue. It might be that, in certain circumstances, having to consider problems for which the solution isn't obvious provides a motivation to learn the techniques that will make the problem easier to solve. Conversely, it may be that this negatively impacts pupil motivation if they feel the concept is too difficult to grasp due to early exposure with challenging problems. I also recall a phrase from Skemp here, 'well is the enemy of better'. If pupils are able to solve the initial problem using an inefficient but adequate strategy, they may be less motivated to move away from that strategy even if it is more efficient. 

I think a lot of this is likely to do with how invested pupils are in the initial problem - either because of a positive attitude to maths in general or due to some 'hook' in the problem itself that piques pupil interest. I do believe, however, that the use of problems to motivate a need (or at least usefulness) to engage with new mathematical learning is one that is worth examining more fully.

In my response to the first Maths Horizons report I shared this image which I think contributes to the role of problems in learning maths:


Rather than problems being something that learners only attempt once fluency has been achieved, which is the view in some places, I strongly believe that teachers of mathematics need to recognise the cyclical relationship between developing fluency, reasoning and problem solving, This was based on a sesson I had done at a LaSalle MathsConf a couple of months earlier, where I shared this anecdote:

A personal anecdote:

It may just be me, but it seems that the pupils I am teaching are less prepared to try and think about an idea.

They seem to be expecting me to do all of the hard work, and show them every little aspect of everything.

I wonder if they have formed an expectation about maths lessons that all they have to do is sit and listen to the teacher and then try and do what they (we) do.

I wonder if this is because of their experience of maths lessons to this point.


Learning is effortful, and mathematical learning should require the employment of reasoning, deduction, conjecture and the like. I think that exposure to problems where the solution isn't obvious may have a role to play in impressing upon pupils that they are expected to think in maths lessons, expected to bring their reasoning skills alongside their prior knowledge to the table, and that mathematics learning is not simply watching the teacher do something and then try and regurgitate it.

So, what is the role of 'problems' in learning mathematics. I don't think there is a simple answer. They both build and test fluency. They affect motivation and provide points for discussion. They communicate something of what we value in learning mathematics and in seeking a mathematical education. This is probably what makes their role so debated and difficult - 'problems' and how they are used have many roles in the teaching and learning of mathematics and how and when to use different problems and problem types will depend a lot on the pupils you have in front of you.




ResearchEd Birmingham session: Front-Loaded Feedback and the 'I do, We do, You do'

So today I attended the excellent ResearchEd Birmingham event and re-delivered the session I ran at the ResearchEd National conference in September 2025 about the idea of front-loading opportunities for feedback into questions and tasks so that pupils have the opportunity to get feedback directly from the task about their approaches, and to force them to make their thinking more visible - which has prompted me to create my latest website www.front-loaded-feedback.co.uk. The slides for this presentation are linked at the bottom of the blog.

During the session I had alluded to some thinking about the 'I do, We do, You do' model of instruction related to the question at the bottom of this slide:


I had planned to expand on this thinking towards the end of the session, but due to time constraints I was unable to  - so I thought for clarity and posterity I would include it here.

I tend to think about most maths learning episodes (a distinction I draw as the 'lesson' is clearly not a useful unit of time in relation to securing learning) as going through through roughly four phases as we attempt to move learners through a continuum from novice to developing greater expertise. These phases are summarised in a slide I used in other sessions:

In the first phase learners have little-to-no knowledge of the concept of process that we are introducing, and so they benefit most of exemplification and modelling - any practice attempted in this phase will likely be unsuccessful.

In the second phase learners have (the exemplification and modelling) gained some inflexible knowledge of the concept or process. Practice in this phase is likely to be error prone, particularly if it strays too far from what learners have seen during the exemplification/modelling. This is the phase where guided practice is required, with learners still requiring significant support and immediate feedback on their attempts. This is where I see front-loaded feedback questions and tasks as having real value.

The third phase is one in which knowledge is moving from inflexible towards flexible, but the goal of practice now to expand beyond the the initial modelling to open other broader knowledge of the concept or process in order to target key elements of the structure of the concept or process and push learners beyond the comfort of what was initially modelled or exemplified. This is the phase where procedural and conceptual variation in questions and tasks are likely to be most prevalent and useful (although, depending on what is being taught, these could feature in all phases of learning). In this phase of deliberate practice we are still likely to see errors, and so immediate or near-immediate feedback is still a clear requirement. 

However, for me, the reason for the errors and the feedback required as a result is subtly different to the guided practice phase. In guided practice, errors happen because learners are still on shaky ground with what was initially introduced, and so feedback needs to identify and focus on what learners haven't quite grasped from the initial instruction and correct that. In the deliberate practice phase, learners should already be confident in mimicry - being able to do what they were originally shown to do or identify what they were taught to identify - but the errors now come when trying to extend that thinking, going down wrong paths or under-/over-generalising certain properties. The feedback here needs to highlight where thinking needs to change for learners to be able to move forward. A simple example of the distinction here is learners being shown how to solve the simple linear equation 3x + 2 = 8, and then being asked to solve the equations 4x + 5 = 13 and 4x - 5 = 13. The first equation of these two is structurally identical to the one modelled and should be part of guided practice. Any errors arising from this can be tackled by directing attention back to the initial modelling or worked example, and either highlighting (or asking pupils to reflect on) where their approach has differed from the approach given. The second requires a deviation from what was modelled; it requires learners to adapt from the modelling (assuming the teacher hasn't modelled a specific example of this structure beforehand) and so might be considered useful as part of deliberate practice. 

Depending on the complexity of the concept or process, guided practice might be as small as one question or might be several questions. It might involve front-loaded feedback, the use of mini-whiteboards or multiple-choice questions with immediate feedback, and/or backwards faded examples like those on Dave Taylor's excellent website (this is, of course, not an exhaustive list).

Whilst the deliberate practice phase might include something like increasingly difficult questions (another of Dave's websites), completion tables or similar - still completed on mini-whiteboards or otherwise monitored carefully so feedback and support can be given when needed.

The independent application phase only then comes once pupils have gained that high success rate (a la Rosenshine) in the deliberate practice phase - once learner thinking around the concept or idea is more secure and misconceptions arising from this and the guided practice phase have been dealt with. It will further stretch pupils by bringing in contexts, interleaving and/or interweaving other concepts or processes with the current one, or in general asking for wider applications of the knowledge studied.

The issue I have with the 'I Do, We Do, You Do' instructional approach is that, if we accept that learners will need to go through these, or similar phases, on their journey towards developing expertise, then this model seems overly simplistic to capture the range and nuance of practice opportunities that pupils need to engage with to develop the necessary expertise. I see and hear about teachers treating the 'You Do' as part of the independent practice, where it is, at best, the first in what should be a series of questions in the deliberate practice phase and more likely still part of guided practice. Alternatively, I see and hear about teachers moving straight into questions that might be considered more suitable in the deliberate practice phase, during the 'We Do', requiring learners to engage with adaptations before they have even got to grips with the concept or process as exemplified/modelled. And this is not to mention that the very approach of 'I Do, We Do, You Do' can be seen to imply teaching of process (several stages of doing things), which is clearly not applicable when teaching concepts (where comparing examples and non-examples are generally more suitable) or facts like a full turn is made up of 360 degrees (where repetition and reframing through choral response or similar might be more beneficial).

To me, there is a real danger that focusing on a structure like this without the conceptual underpinning of what different stages of practice are trying to achieve means that the practice that learners are offered will not lead to them developing the expertise we want them to develop.

For those reading who were in my session - you can probably understand why I didn't feel I had time to get into this in the session! If you have read through all of this to get to the slides (or skipped to the end to get to them) then your patience is rewarded here.