Order of operations is one of my biggest issues when it
comes to mathematics teaching in this country. For me, it completely epitomises
the incorrect prioritisation of speed and accuracy using “tricks” and “hacks”
over developing true, deep understanding of mathematical concepts. I am
thoroughly tired of learners thinking that 9 – 6 + 3 is equal to 0 because “we
do the addition first, so this is 9 – 9”, whereas the result of this
calculation (as any good calculator will tell you) is 6.
This, of course, starts with acronyms like
BODMAS/BIDMAS/BEDMAS, PEMDAS or a more recent entrant into the market, GEMS
(standing for “Grouping symbols”, “Exponents”, “Multiplication and Division”,
“Subtraction and Addition”). Acronyms like these have been around for over 100
years, although grew to greater prominence in England around the time of the
introduction of the National Curriculum in 1988. They were designed to help
mathematics learners remember the order of precedence for operations in standard
algebraic notation, i.e. in an expression such as:

The precedence is for the calculation in the brackets to be
completed first, giving:

Followed by the exponent, giving:

This highlights the problem with strict acronyms like
BODMAS; a learner might now be tempted to try and calculate 8 ÷ 36 first (which
will lead to the correct result, but is a less useful path to the result for
most learners), and then multiply the result by 9. This is the same problem as
in the first paragraph where a learner felt the need to calculate the addition
before the subtraction. In fact, like addition and subtraction, multiplication
and division are of the same precedence, and so can be carried out left to
right. This gives:

GEMS overcomes this difficulty by showing multiplication and
division with the same precedence (as well as subtraction and addition), and in
the last decade many visualisations of BODMAS/PEMDAS have been produced to try
and highlight that there is no precedence between multiplication and division,
or between addition and subtraction, such as this one from the Oak National
Academy:

However, there are still issues with this. The first is
that, even presented like this (or as GEMS), that order of precedence is not
strictly true. Take the same starting calculation as before, I can evaluate
this is a number of different ways, such as:
In the first of these I simply did the multiplication before
the bracketed sum or applying the exponent, because I recognised that the
timing of the multiplication wouldn’t matter provided that I completely
evaluated my divisor (i.e. the
”) and my dividend (
, before attempting to
divide one by the other. In the second (which, granted, is not a technique to
calculation I would employ normally except under very specific circumstances),
once I had evaluated the bracketed sum, I “saw” the resulting division as the
product of two fractions (i.e.
) which would both simplify
and leave a relatively simple fraction multiplication (given that the 3s cancel
and the 4 cancels with the 2). This, at least, relegates the precedence of
operations from a “rule” which must be obeyed, to guidelines that can be
applied flexibly when you are fluent enough to recognise the allowed
flexibility. And, for me, this is entirely the fluency we should be aiming to
develop in our learners, not creating tricks to bypass. A similar example of
this comes in trying to calculate this result:

Without a calculator, and following the precedence of
operations, this is a difficult pair of multiplications that many learners of
mathematics would struggle to compute correctly, followed by a sum that is only
marginally easier. Certainly, even for those that would be successful, it is a
time-consuming endeavour. However, the calculation can be made much simpler, to
the point where most learners with basic place value knowledge would be able to
compute the result:



Here I am recognising that, because both multiplications
involve 9.2, the calculation can be simplified to 9.2 × a single number, in
this case 10. Because multiplication by 10 is a much easier calculation than by
6.3 or 3.7, this is a far superior strategy than actually following the
precedence of operations. However, notice that, in every sense that matters, I
have added before I multiplied. This becomes even more apparent when we
introduce numbers like surds, where the focus is not on evaluating the “exponents”
(in this case, expressed as roots) and the precedence seemingly disappears
entirely:




It is worth noting that, like the previous example, the
first line contains two multiplications to be added, which can be simplified as
both multiplications involve
, and then the resulting
multiplication is carried out after (using index laws), with the
exponents/roots never even evaluated.
At this point, these guidelines become less “can be applied
flexibly” and become more of a “if absolutely necessary”. So then the question
naturally arises, when are they definitely necessary. When must the precedence
of operation be followed, and under what conditions can we employ some
flexibility. And this links to a much deeper question, namely, why is there a
precedence at all?
Now, on the surface, this is not a deeper question. The
precedence is there to ensure no ambiguity in the algebraic notation we are
employing. For example, if calculating 5 + 3 × 4, the precedence tells me that
this should be interpreted as “the sum of 5 and the result of 3 × 4”, rather
than “the result of 5 + 3 multiplied by 4”. This is a case where the only
correct order is to calculate that 3 × 4 = 12, and so calculate the result of 5
+ 12. There are other notations where this potential ambiguity simply does not
arise. For example, in polish notation, the same calculation would appear as +
× 4 3 5. In this notation, the binary operation always precedes the numbers on
which the operation takes place, so this would be read as an instruction to
“sum the product of 4 and 3 with 5” and would proceed as:



whereas the alternative of adding the 5 and 3 before
multiplying by 4 (which in algebraic notation would appear as (5 + 3) × 4)
would be written as × 4 + 3 5 (read as “multiply 4 with the sum of 3 and 5).
What makes the question deeper is when we start to probe why
might the convention be what it is when it comes to our ordinary, algebraic
notation. Why have we decided to take 5 + 3 × 4 as the sum of 5 and the product
of 3 and 4, and (5 + 3) × 4 as the product of the sum of 5 and 3 with 4? I
think part of the answer lies in understanding important models for addition
and multiplication, along with a fundamental law of arithmetic – the
distributive law.
When it comes to addition, one of the key principles that
our learners need to understand is that addition operates on a consistent unit
or “like terms” to borrow from language generally used with algebra. When
learners first start studying addition, this is implicit as all the numbers
they work with are automatically in ones, e.g. 3 + 4 means 3 ones + 4 ones.
Where it should be made more explicit than it currently is when learners start
adding larger numbers in columns, for example 32 + 25. In columns we add the
ones (7), and then add the tens (5), but what is important here is that it is
this property of addition that allows this is that the ones are a consistent
unit, and the tens are a consistent unit – it is literally “2 ones plus 5 ones
is equal to 7 ones, and 3 tens plus 2 tens is equal to 5 tens”. Since the loss
of multi-base arithmetic from the curriculum, what we have lost at this point
is the opportunity to carry out similar sums outside of our normal place value
columns, for example in base 8 this same sum becomes “2 ones plus 5 ones is
equal to 7 ones, and 3 eights plus 2 eights is equal to 5 eights”. This doesn’t
mean of course, that we can’t offer learners opportunities to calculate in
multiple bases, or simply to simplify sums like this without evaluation, for
example:


Learners don’t need to be explicitly taught that this is
working in base 8, just what it means to simplify a calculation like this. This
approach has major benefits later on as it ties to other occurrences of the
same property of addition that learners tend to struggle with more; namely:
-
Why
can be added, but
cannot (at least until converted to a common
unit, probably 77ths).
-
Why
can be added, but
cannot (and this time there is no equivalent
common unit to convert either of these to, except approximately).
-
Why
can be added, but
cannot (unless you know something about the
relationship between
and
which would allow you to convert them to a
common unit).
Of course, it is this last pair of sums that really drives
this behaviour, as algebra (at least at school level) is just generalised
arithmetic and so it is the fact that this common unit (the like terms I
mentioned earlier) is required for addition in algebra that makes it equally
true for all types of numbers. However, this shouldn’t only be made explicit
once learners start to study algebra – if they haven’t had this property
highlighted to them through lots of different work with lots of different types
of numbers, then this generalised arithmetic comes out of the blue. This is
what leads to many learners’ early difficulties with algebra – they simply
haven’t worked explicitly with specific examples of the structures that we are
asking them to then generalise. The key message here is “what am I counting?”,
because if what you are counting is not the same type of thing, then these
can’t be added together unless they can be made the same type of thing.
Where multiplication then fits into this is that
multiplication creates/changes units;
or at least this is one model for making sense of multiplication that has many
useful properties. For example, when we write the calculation 3 × 5, we can
think of this as “5 things worth 3”, or, alternatively, “3 things worth 5”.
Money is an excellent example of those as it comes in different denominations
(and yes, that word links very closely to denominators). For example, you can
ask something like, “What do we have here?”

And learners might say “£53”. I would say “No” at this
point, and highlight that I didn’t ask “how much money do we have here?”, but
rather, “what do we have here?”. Learners will probably then respond correctly
with “Three £1 coins and five £10 notes”. As they do I write down 3 + 5 × 10.
Then explore how this arrives at £53 (and certainly not £80). Learners can then
picture what it would mean to have (3 + 5) × 10 in terms of money (three £10
notes in one hand and five £10 notes in the other). We can then go on to
explore other calculations like the ones below:
Can we write each of
the calculations below as a single multiplication?
a) 2 × 3 + 5 × 3
b) 2 × 7 + 5 × 7
c) 2 × 12 + 5 × 12
d) 2 × 1000 + 5 × 1000
e) 2 × 1 million + 5 × 1 million
f) 2 × 3 + 5 × 7
g) 2 × 3 + 5 × 9
h) 2 × 3 + 5 × 6
i) 32 + 5 × 6
j) 2 × 3 + 5 × 2
Can you come up with a
calculation of this type that can be written in (i) no other ways? (ii) one
other way? (iii) two other ways? (iv) more than two other ways? What can you
generalise from your calculations?
The last four are particularly interesting here, as 2 × 3 +
5 × 9 can be rewritten as 2 × 3 + 15 × 3 = 17 × 3; 2 × 3 + 5 × 6 can be
rewritten as 2 × 3 + 10 × 3 = 12 × 3 or as 6 + 5 × 6 = 6 × 6, and even when we
introduce indices in (i) we can still write this as 13 × 3, with the last one
there to show it doesn’t matter if the same unit (in this case 2) is the first
number in the multiplication or the second. This is laying the groundwork for
being able to add fractions by finding equivalent fractions (just like 2 × 3 +
5 × 9 can be written as 2 × 3 + 15 × 3, so
can be written with a common unit of ninths).
In general, the model again stretches into all areas of number and algebra,
including standard form, with
being seen as “6 of these things called 105”,
surds (6 of these things called
and algebraic expressions:
a)
2 × 3 + 5 × 3
b)
2 × n + 5 × n
c)
2n + 5n
d)
2n + 5m
e)
2(n + 1) + 5(n + 1)
f)
2(x + 1) + 5(x + 1)
g)
2(x + 1) +
5(1 + x)
h)
2(x + 1) + 5(x - 1)
i)
x(x + 1) + 5(x + 1)
This model also helps with something like
as we can see this as “2 of these things
called 32”, or alternatively “32 of these things called
2”. If we want to consider the first case, 32 is the unit, so we
need to know what this is worth in ones so we can see what we have two of. In
the second, we have 32 of these things called 2, and we need to know
actually how many 2s we have.
Whilst this doesn’t move the precedence of operations from
convention to required, it is suggestive of why it makes sense that we would
consider multiplication as a higher precedence than addition – multiplication
creates/changes the units so we need to look at these first and make sure that
the units are consistent before we can then add. If the units aren’t consistent
(for example in 3 + 5 × 10) then we carry out the multiplication in order to
change the units (3 + 50) so we can then add the values with consistent units
(3 ones plus 0 ones = 3 ones, and then 5 tens, so 53). As indices are, at least
initially, an extension of multiplication, then we need to look at these and
make sure they are also unit consistent (in the case of something like
), or that we know how much
the unit is actually worth (in the case of something like
).
So, what about the distributive law? What is this and where
does it fit in?
The distributive law is a fundamental axiom of mathematics,
that states (in our usual notation):

Learners normally see this when they are working with
algebraic expressions, particularly in expanding and factorising. However, it
is the same law that governs most of the properties that I have just talked
about. For example, why does addition require consistent “units”; because I
need a “factor” I can take out to enable the addition in ones. For example, why
does 2 × 3 + 5 × 3 = 7 × 3? Because:



Or why does
? Because
. The “unit” that
multiplication creates or changes is the factor, and without that common factor
across all of the terms to be added, the distributive law doesn’t apply. Our
use of brackets becomes crucial, then, to show what is being distributed over what.
And, what can be distributed over
what also ties very closely to our precedence. For example, we have seen that
multiplication (by definition) distributes over addition, but that the reverse
is not true (e.g. 3 + (4 × 5) is not equal to 3 + 4 × 3 + 5). Indices will then
distribute over multiplication, e.g. (2 × 3)2 = 22 × 32
(which is easy to demonstrate with small, positive integer powers). However,
indices do not distribute over addition, e.g. (2 + 3)2 ≠ 22
+ 32, because (2 + 3)2 actually means (2 + 3) × (2 + 3).
Similarly, multiplication does not distribute over indices, e.g.
. This gives us that:

This, of course, is our precedence of operations!
The distributive law is independent of notation – it is a
fundamental law of arithmetic. In Polish notation,
equivalent to saying
. Distributivity is true no
matter how you write it, but it perfectly explains why we have a precedence in
operations with indices and roots (which are just indices) at the top,
multiplication and division (which is just multiplication) on the next layer,
and then addition and subtraction (which is just addition) at the bottom. In
this sense, although we have to define a precedence to ensure our standard
algebraic notation can be interpreted incorrectly, the choice of this
precedence is far from arbitrary.
Hopefully this has shown that the way that the precedence of
operations is taught in many of our schools needs urgent revision. But what do
we do? What should this revision look like? For me, it has to include the
following:
●
Early teaching of the
laws of arithmetic, particularly distributivity and its usefulness in factoring
and re-factoring calculations.
●
Ensuring properties of
operations like “requires the same unit” for addition are explicitly
highlighted and linked to the laws of arithmetic. Linked to this, referring to
the ones column as “ones” rather than “units” is important; anything can be a
unit, but only ones can be ones.
●
Creating time in the
curriculum to explore integer calculations in depth with tasks that focus on
thinking, reasoning and re-writing rather than simply calculating results, e.g.
How many ways can you find to show that
?
●
Ensuring that, as
learners progress from calculations in integers to those in rationals and then
later irrationals, the same underlying laws and properties are highlighted and
linked back to earlier calculations, i.e. learners should recognise that the fact
that 2 + 3 = 5 links to 0.2 + 0.3 = 0.5,
,
,
,
and
.
●
Once security in base
10 integers and arithmetic has been achieved, extending this to other bases so
that learners can better understand what changes in a different base (the
representations of the numbers) but also what remains unchanged (the properties
and laws of arithmetic).
Some of these may be within the remit of individual
teachers, schools and trusts. However, some will require wholesale curriculum
change and support to create the necessary time to explore these things in
depth, and to ensure that they are taught in a “forward-facing” way as learners
progress through school-level mathematics so that teachers can have confidence
when attempting to build on earlier work that it has been done in such a way as
to provide the necessary firm foundations. We can only hope that when the
curriculum review currently under way begins to examine the curricula in
individual subjects, that they recognise that time must be created for things
like this to be explored much more deeply than they are at the minute, so that
later learning can proceed at a much faster rate and that more learners can
build that deep, connected, conceptual understanding of mathematics that I know
is the goal for many teachers of maths across the country.