No 1, 2004
Current Concerns
P.O. box 223
CH-8044 Zurich
+41-44-350 65 50
Current Concerns - The monthly journal for independent thought, ethical standards and moral responsibility - English Edition of Zeit-Fragen
No 1, 2004
31 Jul 2010, 12:20 AM
current issue
archive

OECD: The Trojan Horse Within

Short history of the OECD and its PISA activities

Fred Naylor, Educational Consultant, UK

The Organisation of Economic Cooperation and Development (OECD) was set up in Paris in 1960. Its main aims were:

  • to achieve the highest sustainable economic growth and employment and a rising standard of living in Member countries, while maintaining financial stability, and thus to contribute to the development of the world economy;
  • to contribute to sound economic expansion in Member as well as non-member countries in the process of economic development; and
  • to contribute to the expansion of world trade on a multi-lateral, non-discriminatory basis in accordance with international obligations.

There were 20 founding members from around the world; the number had increased to 30 by 2000. A notable feature of the OECD is its drive for expansion in membership. A significant and growing number of non-OECD countries already take part in the PISA studies.

Not surprisingly, the member countries turned their attention to educational issues during the first decade of the OECD's existence. A prime aim of a state-funded education system is to anticipate the kind of work force required by the nation's employers, and hence the kind of training that needs to be provided in schools and colleges. This, however, is only one of the aims of education. Traditionally, schools have existed to provide character training, and to ensure that morality underlies social activities.

Educational aim is equality of results

The problem of a group of nations agreeing on moral, social and indeed political matters grows as the number of nations within the group increases. Another factor that makes agreement difficult is the growing pluralism within individual member countries. This problem was a feature of the 1960s, and soon made its presence felt within the OECD - particularly in the educational context. A turning point was reached in the early 1970s. This was delineated by the Swedish professor and OECD consultant, Torsten Husen, in his ground-breaking OECD publication Social Influences on Educational Attainment, 1975. In it he announced that the OECD had replaced the educational aim of 'equality of opportunity' by 'equality of results'. The first he described as 'liberal'. Equality of results was depicted as 'the more "radical" or sociological (and sometimes Marxist)' concept. Since that time the political bias of the OECD has been firmly grounded. The conflict between equality of opportunity and equality of results (one cannot have both) underlies all major world issues today.

Another aspect of Marxism absorbed by the OECD is Internationalism. Tony Blair's guru, Anthony Giddens, is the author of The Third Way, 1998. It contains a chapter entitled 'Into the Global Age', in which Giddens speaks of the OECD as 'a possible bridgehead to a wider global order if it absorbed the European UnionÉ'. He has in mind a European Union, which has taken the form of a federation and rid its members of their national identities; and whose Parliament will act as a blueprint for that which will be eventually established within the United Nations.

The bias of the OECD is not widely recognised, particularly among many national political leaders. In 1980 I drew it to the attention of Britain's Prime Minister, Margaret Thatcher. She replied, thanking me 'for letting me know about these matters and for analysing the several documents so clearly'. The first was published around the time of the 1970 General Election (after which she became Education Minister). She stated that it had never been brought to her attention. Dealing with the Husen publication, she declared: 'I thought we had got rid of that patently ridiculous theory long ago.' Unfortunately, during her period as Prime Minister the OECD was not deflected from the course it had embarked upon. Doubly unfortunately, the patently ridiculous notion that we should strive for equality of results in education has become firmly rooted in British and some European government circles today.

The OECD PISA operations (Programme for International Student Assessment) were initiated in the late 1990s and the first results (PISA 2000) were published in 2001. These related to 28 of the OECD countries plus 4 non-OECD countries - Brazil, Latvia, Liechtenstein and the Russian Federation. Another 13 countries, including China, took part in the same survey later, and their results were published in July 2003.1 The Slovak Republic and Turkey are going to be included in the PISA 2003 survey.

PISA hailed as evidence against selective system ...

There have been mixed reactions to PISA 2000, which is not surprising considering the OECD's political agenda. It has been unjustifiably hailed by left-wing supporters and Press as providing evidence supporting their attack on selective systems of education. The most striking example of this is an article (31 January 2003) in the Times Educational Supplement (TES) by a former Editor, in which she stated that the percentages of pupils reaching the 5+ GCSE 'higher grades' standard in England was higher than that reaching the corresponding standard in Germany. Not only was there no mention of GCSE examinations in the PISA report, but there is no known way of relating the standards of public examinations in the different countries. It was all a misrepresentation of the finding that Germany's position in relation to England - in mathematics - in PISA 2000 was much lower than it had been in the IEA (TIMSS)2 survey carried out a year earlier. A similar example was provided by the Independent newspaper in reporting the PISA results of the 13 additional countries. It claimed that it had been shown that streaming had an adverse effect on educational achievement. An investigation, however, revealed that there was absolutely no reference to streaming in the report. The Independent had simply reported the opinion expressed by the OECD overall co-ordinator, Andreas Schleicher, who had chaired the Press Conference!

As a matter of supreme importance, in view of the opinions expressed by Schleicher, it should be noted that on page 212 of the PISA 2000 Report it was stated: 'PISA results suggest that there is no single factor that explains why some schools or some countries have better results than others. Successful performance is attributable to a constellation of factors, including school resources, school policy and practice, and classroom practice.'

... but criticized by eminent statisticians

Against these misinterpretations of the Report's findings for political reasons, PISA 2000 has come in for severe criticism from a number of eminent statisticians. Prof. Elart Von Collani of the University of Wurzburg wrote a paper entitled OECD PISA - An example of Stochastic Illiteracy?3 In it the professor listed a number of errors made by PISA in designing and carrying out the exercise. He concluded that most results could not be evaluated as the necessary information about sample size, model and methods had not been made available. Almost no absolute data had been given; it had been transformed and made relative. For example, he noted that the average score in each of the literacies tested was set at 500, with a standard deviation of 100. And he drew attention to the fact that the times allowed for the regular PISA tests varied from country to country.

Background factors not accounted for

The most striking conclusion to be drawn from this kind of statistical critique was that the PISA comparisons were never of a multivariate nature. To understand the meaning of this one should compare the PISA exercise with that undertaken by the Statistical Division of the English Department of Education and Science published in 1984 ('Statistical Bulletin 13/84'). In an attempt to discover the relationship between school standards and the socio-economic and other background factors of pupils in maintained schools, the performances of pupils in the nation's 16+ public examinations were compared across the 96 local education authorities. In all, 23 background factors were investigated. By a process known as multivariate regression analysis equations were established, which related a given level of performance to each of the background factors that were correlated with it. PISA, by contrast, whenever it did try to establish correlations, limited itself to comparing two variables at a time. In this way it failed to make allowance for background factors.

Statistical Bulletin 13/84 was notable for its finding that the percentage of pupils in grammar schools in local authorities was related to examination success in a positive away. This finding was played down in the Bulletin by a failure to describe the magnitude of the effects. With the support of the Education Minister, Sir Keith Joseph, I was able to arrange a meeting between the Department's statisticians and a group of colleagues - including Professor S.J. Prais. We reached agreement, which was published in Hansard4 as a result of the asking of a Parliamentary Question. The main points agreed were that the 13/84 analysis indicated that for three out of the five categories of examination attainments studied - covering pupils across the ability range - results were significantly higher in those local authorities that had retained a selective system of schooling. This was after due allowance had been made for background factors. Furthermore, the superiority shown by a selective authority was directly proportional to the degree of selectivity it exhibited. This kind of exercise has never been repeated.

For me, Von Collani's thoughts cast a new light on the PISA procedures There is no limit to the observations that could be made by a vast army of inspectors, dedicated to the task of observing and recording. Almost limitless sets of figures could be produced. For example, why not the number of times students switched off their attention from their teacher during a lesson? And using the questionnaire why not ask them to describe fully their smoking and drinking habits? The jungle of statistics could be dipped into and utilised by anyone who had a pre-conceived idea that needed substantiating. For example, the two most successful countries of all in PISA 2000 were Japan and Korea. The same two countries spent very much less per pupil by having the largest class sizes by a very wide margin. For example, the average secondary school class sizes of Japan and Korea were respectively 35 and 38. This was not something the OECD was interested in, so it was never remarked upon.

What they were interested in, given their commitment to equality of results, were countries in which differences between pupil performances were relatively low. They were conscious of the fact that there is a very strong argument that differences can only be minimised by handicapping the more able pupils, or by applying reverse discrimination to the less able. They were therefore always on the lookout for any instances where narrower differences were associated with higher than average performances (equality and quality, as they described it). They were careful, however, in trying to summarise their findings to leave them suitably vague. The most astounding omission from the PISA reports was any attempt to measure the innate abilities of the students. It was assumed that all differences were entirely due to the conditions under which the students worked and lived. Given the success of the current genome projects this is an assumption that will not be allowed to go unchallenged for much longer.

Another eminent statistician, who has criticised PISA 2000, is Professor S.J. Prais. He and the organisation for which he works, the National Institute of Economic and Social Research (NIESR), have been involved for some time in international comparative studies, particularly detailed comparisons between Switzerland and Britain. Their direct class observations confirmed that the previous international IEA (TIMSS) test-surveys of mathematic attainment were correct in reporting that pupils in Switzerland (and Germany) were well ahead of those in Britain. PISA studies, however, suggested that Britain might have caught up since the last TIMSS study a year previously (both countries had a PISA mean score of 529). The same considerations applied in the case of Germany, whose PISA mean score of 490 was taken by its critics as implying that it had crashed.

TIMMS survey on the same age cohort produced different findings

This led to Prof. Prais writing a paper which appeared in the Oxford Review of Education, June 2003, Cautions on OECD's Recent Educational Survey (PISA). His criticisms were purely of a statistical, rather than political, nature. He addressed himself to the differences between the IEA (TIMSS) survey published in 1999 and the PISA report 2000. Since the IEA survey was carried out on 14 year-olds, and the PISA survey was carried out on 15 year-olds, one year later, he pointed out that the same age cohort was involved in each. The differences between the two sets of findings were therefore even more unexpected. He identified four main differences between the objectives and methods adopted in the IEA and PISA studies, which invalidated the direct comparisons that were being made in respect of their findings.

Imprecise and confusing questions

(i) Questions in the IEA survey were directed to the mastery of the school syllabus by students in the relevant school Grade, whereas PISA was deliberately directed to so-called 'every day life' problems. It was not mathematics that PISA was concerned with, but rather 'mathematical literacy'. PISA had admitted that the students were provided with written information and were expected to interpret it and draw conclusions from it. I entirely agree with Prof. Prais. I found many of the PISA questions difficult to answer, not because they were difficult from the mathematical point of view, but because they were imprecise and confusing. Let me give an example:

A seal typically comes to the surface of the water to breathe; it then dives to the bottom and rests there for 3 minutes; then slowly floats to the surface in 8 minutes; takes a breath, and repeats the cycle. Where was the seal 60 minutes after having come to the top to breathe?

So much necessary information is missing that it must put a person with mathematical ability under stress. How long does it take the seal to dive to the bottom? How long does it take the seal to fill its lungs with air? What is the difference between 'slowly floating' and 'floating'? How slowly did it float to the surface? A much more mathematical and less literacy-dominated question would have asked the student to work out what was the minimum amount of time, devoted to diving and breathing in the cycle, that would have ensured that the seal was at the bottom rather than floating upwards after 60 minutes.

Warning carefully hidden

PISA, quite properly, gave a stark warning on p.26 of PISA 2000 that 'If one country's PISA scores are higher than those of another country, it cannot automatically be inferred that the schools in the former are more effective, since learning starts well before school and occurs in a range of institutional and out-of-school settings.' This warning, however, was very carefully hidden away, and was in fact ignored by the authors of the PISA report themselves, who elsewhere stated: 'PISA seeks to compare how well different school systems prepare students for life.' It added that the results would help governments to make policy decisions. Politicians such as Tony Blair have ignored the warning, and have wrongly taken the findings as an endorsement of the success of their national educational systems.

(ii) The IEA (TIMSS) survey was based on samples of whole classes, including late entrants or repeaters. The PISA study was based strictly on a 12 month period of birth, and excluded repeaters and some late entrants. The variability within a class could not therefore be examined in the PISA survey.

(iii) England's response rate for schools was 60 per cent, compared with 95 per cent in leading European countries.

(iv) England's response rate for pupils was lower than in any other country and lower than in the previous IEA (TIMSS) survey.

No valid conclusions

Prof. Prais drew attention to the fact that the PISA returns for Hamburg and Berlin were omitted from the figures for Germany, because of poor response rates; similar omissions were not made in the case of England, despite similarly low response rates.

He also mentioned the finding in the OECD Adult Literacy Study that the French were not capable of reading their country's newspapers!

In his summing up Prof. Prais stated that it was difficult to draw valid conclusions for Britain, and planned repeats of the studies should be postponed until methodological problems had been resolved. The English taxpayer should not continue to support PISA studies.

In an Annex to his paper he discussed the computational mystery of PISA's country-rankings. He found (to his astonishment!) that, for the results of the 31 mathematical questions that have more recently become available, Switzerland was ahead of Britain in answering correctly two-thirds of the number of questions (21/31).

To complete the story it should be said that the OECD has produced a 10 page Rejoinder to Prof. Prais' paper, and aimed to publish it in August 2003. It recognises that Prais, and it itself, may be using different criteria for judging model school systems. If it really believes its own statement that the figures should not be used to compare educational systems, then in saying that its results are sound it is not expressing any criticism of the professor. Furthermore, it admits that its PISA report had agreed (i) that the UK school response rate of 59% (prior to replacement) and 82% after replacement was a matter for concern, as was the student level response of 81%, and (ii) such low response rates brought a threat of bias that could not be formally addressed.

Irrational interpretations of the Left

The most recent publication by PISA (now in association with UNESCO!) in July 2003 incorporated the results of the additional 13 countries taking part in PISA 2000. It evoked the most amazing reactions from the left-wing organisations and press in England, who fully share the OECD bias and are ready to put their own interpretations on the vague and unsound observations made by the report's authors. The Times Educational Supplement's heading was 'Class Segregation Holds Britain Back'. The Guardian led with 'Finland boasts comprehensive system where standards soar'; the Times with 'Streaming pupils "worsens results" .' John Bangs of the National Union of Teachers (NUT) claimed: 'It is the final nail in the coffin of selection in any way, shape or form'. The Association of Teachers and Lecturers (ATL) said: 'It proved state run education systems can simultaneously achieve educational quality and equity, which will be music to teachers' ears'.

Of course they had all failed to pick up the PISA 2000 Report's point on page 212 that successful performance is attributable to no single factor, but to a constellation of factors, including school resources, school policy and practice, and classroom practice.

These irrational reactions matched the left-wing reaction to the lowly positions of Germany and Switzerland in PISA 2000 (published in 2001). They were wrongfully taken as an indication that the tripartite educational system, for which these countries are famous, was less successful than the comprehensive system. The German teachers of mathematics and science were livid at these misinterpretations, and revolted against the type of questions used. No attempt has been made by the PISA researchers to compare the performances of the comprehensive and selective systems in any of the countries in which they exist side by side. The result is that Germany is reacting by carrying out research on the performances of the individual Lander by supplementing the PISA 'literacy' tests with academic tests more suited to the school curriculum. The Swiss apparently have not reacted as sharply as Germany.

Selective system superior

In England the superiority of the selective system is supported by evidence especially suited for the purpose. Not only do we have the results of the impeccable 13/84 statistical analysis The statistics published annually for success in the GCSE have related only to the top 50 per cent of the population, but recent figures extracted from the Government by the Conservative Party in respect of higher ability pupils demonstrate the great superiority of schools in the selective system. The combined results of grammar and secondary modern schools are nearly double those in the comprehensives. For example, in wholly selective areas 15.1% of pupils achieved five or more GCSE grades A*-A in 2002. The corresponding figure in wholly comprehensive areas was 8.6%.

The idea that the relative strengths of the selective and comprehensive systems can be deduced by ignoring the evidence collected on this subject in the countries in which the two systems exist side by side, and instead using the mass of unrelated evidence collected from countries world wide, is bizarre. It is equally bizarre to expect such politically diverse countries as the USA, the Russian Federation and China to be making their policy decisions on the statistics gathered by PISA in the context of its own distinctive educational model. It is noteworthy that, in criticising Prof. Prais' criteria for citing Switzerland as a model educational system within the European context, the PISA Rejoinder defines the criteria on which it bases its own ideas of a model system. These are 'the level and distribution of educational outcomes.'

Our present Secretary of State for Education has made a plea that we should consider the performances of our different systems by the scientific evidence alone, uninfluenced by ideological considerations. This is an excellent recipe, and it is clear that Britain and its European counterparts should join forces to demonstrate scientifically that selective systems of education, based on equality of opportunity, produce better academic results than the comprehensive schools, based on the patently ridiculous notion that equality of results is the correct educational target. This is becoming a more urgent task day by day.



1 Literacy Skills for the World of Tomorrow, July 2003. OECD/UNESCO.

2 IEA - International Association for the Evaluation of Educational Attainment

TIMSS - Third International Maths and Science Study (1996).

3 Economic Quality Control, 2001

4 Hansard, 15 May 1987, Columns 458, 465 and 466.

OECD ADDENDUM

(5 January 2004)

On 3 December 2003 I attended a London conference on Statistical Perspectives on PISA ,held at The Royal Statistical Society. The purpose of the conference was to look critically at PISA 2000 and other related international studies. Prominent amongst the critics was Harvey Goldstein, of the Institute of Education at the University of London, who was one of the initiators of PISA. He raised methodological concerns about the analysis and interpretation of the results. He pointed to features that raised doubts about the adequacy of the data.

In particular, he spoke of the difficulty of comparing the results of culturally and linguistically different cultures, and complained that PISA relied on THE ITEM RESPONSE MODEL (IRM) to decide whether items fitted their unidimensional models. This ensured that the items chosen conformed to a given scale, and this resulted in 'dodgy items'. He also criticised PISA's na-ve approach to causation, stressing that it was essential to introduce a longitudinal component. In other words, it was necessary to examine the performances of individual countries over a number of years before any judgments could be made about their relative successes.

Other speakers were equally suspicious of PISA's findings. For example, Prof. Margaret Brown of King's College London, dealing with mathematics performances, stated that: 'The use, and sometimes even the collection, of these comparative data are often ideologically driven, either by governments trying to justify greater central control or by governments, groups or individuals arguing for changes in teaching methods or curriculum. In some of these cases, the references to the data have been selective and/or misleading.'

The speaker representing the Department for Education and Skills (DfES) defined what he termed the main weakness of PISA studies as "the failure to measure innate qualities'. He thereby confirmed what I had earlier described as 'the most astounding omission from the PISA reports'.

One of the Discussion Sessions, led by Professor Peter Tymms, University of Durham was entitled: 'The difficulty of interpreting international achievement data in the absence of prior cognitive measures'. It summed up the general feeling of the inadequacy of the PISA studies.

printer friendly version
Article published on 01-26-2004

© 2001-2004. All rights reserved.
No reproduction, copy or transmission of this publication may be made without written permission.

(mails to the webmaster)