<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1472-6920-8-21</ui>
   <ji>1472-6920</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>The educational background and qualifications of UK medical students from ethnic minorities</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>McManus</snm>
               <fnm>IC</fnm>
               <insr iid="I1"/>
               <email>i.mcmanus@ucl.ac.uk</email>
            </au>
            <au id="A2">
               <snm>Woolf</snm>
               <fnm>Katherine</fnm>
               <insr iid="I2"/>
               <email>kwoolf@medsch.ucl.ac.uk</email>
            </au>
            <au id="A3">
               <snm>Dacre</snm>
               <fnm>Jane</fnm>
               <insr iid="I2"/>
               <email>j.dacre@medsch.ucl.ac.uk</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Dept of Psychology, University College London, Gower Street, London WC1E 6BT, UK</p>
            </ins>
            <ins id="I2">
               <p>Academic Centre for Medical Education, Royal Free and University College Medical School, 4th Floor, Holborn Union Building, Whittington Campus, Highgate Hill, London N19 3LW, UK</p>
            </ins>
         </insg>
         <source>BMC Medical Education</source>
         <issn>1472-6920</issn>
         <pubdate>2008</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>21</fpage>
         <url>http://www.biomedcentral.com/1472-6920/8/21</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18416818</pubid>
               <pubid idtype="doi">10.1186/1472-6920-8-21</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>20</day>
               <month>7</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>16</day>
               <month>4</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>16</day>
               <month>4</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>McManus et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>UK medical students and doctors from ethnic minorities underperform in undergraduate and postgraduate examinations. Although it is assumed that white (W) and non-white (NW) students enter medical school with similar qualifications, neither the qualifications of NW students, nor their educational background have been looked at in detail. This study uses two large-scale databases to examine the educational attainment of W and NW students.</p>
            </sec>
            <sec>
               <st>
                  <p>Methods</p>
               </st>
               <p>Attainment at GCSE and A level, and selection for medical school in relation to ethnicity, were analysed in two separate databases. The 10<sup>th </sup>cohort of the Youth Cohort Study provided data on 13,698 students taking GCSEs in 1999 in England and Wales, and their subsequent progression to A level. UCAS provided data for 1,484,650 applicants applying for admission to UK universities and colleges in 2003, 2004 and 2005, of whom 52,557 applied to medical school, and 23,443 were accepted.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>NW students achieve lower grades at GCSE overall, although achievement at the highest grades was similar to that of W students. NW students have higher educational aspirations, being more likely to go on to take A levels, especially in science and particularly chemistry, despite relatively lower achievement at GCSE. As a result, NW students perform less well at A level than W students, and hence NW students applying to university also have lower A-level grades than W students, both generally, and for medical school applicants. NW medical school entrants have lower A level grades than W entrants, with an effect size of about -0.10.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The effect size for the difference between white and non-white medical school entrants is about B0.10, which would mean that for a typical medical school examination there might be about 5 NW failures for each 4 W failures. However, this effect can only explain a portion of the overall effect size found in undergraduate and postgraduate examinations of about -0.32.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>In the UK there is a concern that medical students and doctors from ethnic minorities underperform both in undergraduate and postgraduate examinations. In this paper we look firstly at a large population study of the educational achievement of all UK sixteen year-olds (the Youth Cohort Study), which reflects the pool from which all future UK medical students will be drawn, and we relate those results to a separate study based on data from UCAS (Universities and Colleges Admissions Service), looking at the educational qualifications of all university applicants, some of whom applied to medical school, and a proportion of whom were then accepted to study medicine. The primary interest throughout is whether, where, when and why ethnic minority students underperform relative to white students, and the extent to which any differences might explain the underperformance of medical students from ethnic minorities.</p>
         <p>Medical students in UK universities are ethnically diverse, so that in recent years about 30% of home students (i.e. those resident in the UK, with UK nationality) come from ethnic minorities. That proportion has risen <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, surveys by one of us finding that about 10% of 1981 entrants were non-white, a figure which rose to 14% for those entering in 1986, and 22% for those entering in 1991 <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. Official data from UCAS for 1996, 2001 and 2005, show that 30%, 33% and 30% of entrants for medicine and dentistry were non-white. (It should be noted that the statistics provided at the UCAS website <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> do not allow a separate analysis of medicine and dentistry in relation to ethnic group. Based on the datafile described below, the proportion of non-white entrants for medicine alone in 2005 was 31.2%, a value very similar to the UCAS figure for medicine 29.7% for medicine and dentistry combined. In the 2001 census, about 13% of those aged 15, 17% of those aged 16&#8211;19 and 19% of those aged 20&#8211;24 were from ethnic minorities, suggesting that ethnic minorities are somewhat over-represented amongst medical students compared with their proportion in the population as a whole.</p>
         <p>In 1995, controversy erupted in the UK because of a higher failure rate in clinical examinations of non-white students in the University of Manchester <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. As a result of that controversy, a re-analysis of data from the 1981 and 1986 cohorts found that amongst students taking University of London final examinations, the non-white students performed less well overall than did white students, the difference being found in all types of examination (multiple-choice questions, essay, oral and clinical), and in all the five main subjects (Medicine, Surgery, Obstetrics and Gynaecology, Pathology and Pharmacology) <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. The reasons for the differences were not clear, and could not be explained away in terms of a range of background measures, be they demographic, educational or psychometric.</p>
         <p>In the past ten years, further data have accumulated on the relatively poorer performance of UK non-white medical students and doctors in undergraduate and postgraduate examinations, non-whites performing less well in undergraduate clinical examinations <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>, as well as in the postgraduate examinations of the Royal Colleges of Physicians (the MRCP(UK) <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>), and the Royal College of General Practitioners (MRCGP <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>).</p>
         <p>The published studies comparing the undergraduate and postgraduate performance of white and non-white medical students and doctors can be compared by calculating <it>d</it>, a standard meta-analytic measure of effect size techniques (see Method, below, for details). Effect sizes are a standard statistical method for comparing disparate results across studies, by converting all results to a standard scale similar to <it>z </it>scores. When there are two groups, then the effect size is a measure of the distance apart of the two means measured in standard deviation units. A conventional interpretation of effect sizes, due to Cohen, is that effect sizes of .2, .5 and .8 can be described as small, medium and large <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. Considering the published studies described above, the effect sizes were -.097 and -.283 <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, -.237, -.291, -.580 and -.273 <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, -.665 <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, -.579 <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, -.492 <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, and -.277, -.344 and -.391 <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. We are currently preparing a formal meta-analysis of these and other unpublished data, but for present purposes it suffices to say that the mean effect size found in these studies is -.375, with a median of -.318. Viewing the data conservatively, for the present paper we will use the median effect size of -.32 as typical of published studies. Using Cohen's terminology, it is between medium and small.</p>
         <p>Although the problem of the underperformance of ethnic minorities in medical education has been much discussed, the underperformance is not unique to medical studies, Richardson in a recent analysis of HESA (Higher Education Statistics Authority) data for graduates of UK degree-giving bodies in 2005 <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, found a consistent under-performance of non-white students, which for the cohort aged 21&#8211;24 at graduation gave an odds ratio comparing the likelihood of NW students with W students gaining > good degrees = (i.e. I or II.i) of 0.518, equivalent to an effect size of B0.363, similar to the averaged effect reported in medicine. Similar results have also been reported by Connor et al <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, who have also emphasised that the participation rate of ethnic minorities in higher education is higher than for the white population, although there are differences between the various groups. Neither is underperformance of ethnic minorities restricted to university performance. Kirkup et al <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, in a study for the Sutton Trust, with what was probably not an entirely representative sample of UK students taking A-levels, found differences between white and non-white students, with estimated effect sizes of -0.159 for GCSEs, -0.239 for A-level grades, and -.514, -.141, -.571, -.588 and -.199 for the Critical reading, Mathematics, Writing, Writing-Multiple Choice, and Writing: essay assessments of the SAT test (giving a mean effect size of -.403 for the five SAT components); overall the seven measures had a mean effect size of -.345. The summary data from the Research Study of the Department for Education and Skills (DfES) <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> for pupils taking assessments in 2005 allow the calculation of effect sizes for differences between white and non-white pupils of -.215 at Key Stage 1 (age 7), -.211 at Key Stage 2 (age 11), -.178 at Key Stage 3 (age 14), and -.028 at Key Stage 4 (GCSEs) (effect sizes being calculated separately from the percentages of pupils attaining expected levels of achievement in the three separate components of each Key Stage, using the data provided in table 8 of the Research Report [<abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, p.42], with all non-white pupils (excluding 'Unclassified') being combined together, and weighted by the Ns provided for the data at the (former) DfES website <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>) In pre-school children, the second sweep of the Millennium Cohort Study of a large, representative group of UK three-year olds, found lower performance in ethnic minorities on two cognitive scales, with an effect size of -.28 for the Naming Vocabulary scale of the British Ability Scales, and an effect size of -.11 for the Bracken Basic School Readiness Scale <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>.</p>
         <p>A range of explanations have been put forward for the underperformance of ethnic minorities in medical school, often revolving around discrimination of some form or another (e.g. Wass <it>et al </it><abbrgrp><abbr bid="B12">12</abbr></abbrgrp>), although such explanations have problems in explaining underperformance in computer-marked multiple-choice examinations (e.g. at the undergraduate level in <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> and at postgraduate level in Part 1 and Part 2 of the MRCP(UK) <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>). A detailed analysis of a large number of candidates taking PACES, the MRCP(UK) clinical examination, also finds that there is little association between the performance of candidates in relation to their own ethnicity and that of their examiners <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
         <p>There are relatively few consistent predictors of educational outcome at university level and beyond, although one measure which does continue to make a statistical prediction, is A level examination results, the > Advanced level = examination taken by would-be university entrants in the UK, particularly in England, Wales and Northern Ireland (most applicants in Scotland taking Highers). A level results have been shown to predict performance at university, both in general <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp> and in medicine for both undergraduate <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr></abbrgrp> and post-graduate medical examinations <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr></abbrgrp>. If there were differences in performance of non-white entrants at A level examinations, then that might in part explain some of the differences found in performance of ethnic minorities at medical school.</p>
         <p>In the UK, school education is compulsory until the age of 16, when the GCSE (General Certificate of Secondary Education) exams are taken by almost all students in schools, the only exception being a small minority with severe educational, intellectual or behavioural problems (and for a detailed study of them and their characteristics see Cassen &amp; Kingdom <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>). At the age of 16, students can leave education, and many choose to do so. Those who do stay on, either at school (Sixth Form), or at colleges of further education, can take a range of qualifications, of which the main ones relevant to the question of medical school applications, are the A level examinations (or Highers in Scotland), with only a few medical school applicants taking instead the International Baccalaureate or the Welsh Baccalaureate, which is currently being piloted).</p>
         <p>Education after the age of sixteen is often called post-compulsory education, with the important implication that students <it>choose </it>to stay on, and also, in staying on, they <it>choose </it>which subjects to study and for what purposes. The element of choice in post-compulsory education complicates all statistical analyses of differences between groups, ethnic or otherwise, because one is not dealing with an entire population sample, but instead a sample that is self-selected, having chosen to be doing what it is doing, and which therefore, in statistical terms, is potentially biased. Any differences between groups may therefore result either from differences in true ability or from differences in the choices that have been made, for whatsoever reason.</p>
         <p>In this paper we wish to look carefully at academic qualifications in relation to ethnic origin, initially in the Youth Cohort Study (YCS), a representative, population sample of 16-year olds in England and Wales, in whom performance at GCSE is well-described, as also are many aspects of performance in post-compulsory education. Our principle interest will be in factors relevant to the pool of potential medical students, and we note that more general aspects of the attainment gap in minority ethnic pupils in school have been reviewed comprehensively elsewhere <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B31">31</abbr></abbrgrp>. Because medical students are a relatively small proportion of the total population, we will also compare the analysis of the representative sample of students from YCS with a parallel analysis of a separate database of all UK applicants to university, who have applied through UCAS (Universities and Colleges Admissions Service). The data in the latter are of course far less comprehensive in relation to the population as a whole, because many of the population do not apply to university, but they allow a analysis of applicants and acceptances to a range of university subjects, and in particular to medicine. A joint analysis of the YCS data and the UCAS data allows a more complete picture to appear.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>i) Youth Cohort Study (YCS)</p>
            </st>
            <p>The YCS has been carried out annually or biennially since 1985, and the current analysis is for Cohort 10, who were aged 16 or 17 when the first sweep of the survey was carried out in Spring 2000, mostly having taken GCSEs the summer before, in May 1999 <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>. Further sweeps were carried out in November 2000 (Sweep 2), and March 2002 (Sweep 3), the latter being of particular importance as A level results are available. The initial sample consisted of a representative sample of 25,000 pupils from all schools in England and Wales (excluding special schools and those with fewer than 20 pupils) who were of school-leaving age during the academic year 1998&#8211;1999, and reached the age of 16 by Aug 31<sup>st </sup>1999. Questionnaires for sweeps 2 and 3 were only sent to those replying to previous sweeps. Details of the survey are available in the survey codebook <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, which includes details of all three sweeps. The YCS provides weighting variables to take into account any biasses in responding in the surveys, and these have been applied in all analyses described here, so all values are representative of population values.</p>
         </sec>
         <sec>
            <st>
               <p>ii) UCAS dataset</p>
            </st>
            <p>Data were provided by UCAS for all applicants applying for admission to university in the autumn of 2003, 2004 and 2005. Information was available on the university subject for which the applicant had applied, as well as specific information on whether they had applied for medicine, and whether they had been accepted for medicine. Demographic information was also available on age, sex, ethnicity, place of residence, parental occupation, and parental education, and the educational data included the subject and grade of all A levels and AS levels taken, as well as the standard UCAS tariff score (see below).</p>
            <p>Statistical analysis used SPSS 11.5 for most analyses, with LISREL 8.54 being used for structural equation modelling. Effect sizes for continuous variables are calculated as <it>d </it>= (mean <sub>W</sub>-mean <sub>NW</sub>)/SD <sub>W</sub>. Odds ratios, calculated as <it>OR </it>= <it>p</it><sub><it>W </it></sub>&#215; (1-<it>p</it><sub><it>NW</it></sub>)/(<it>p</it><sub><it>NW </it></sub>&#215; (1-<it>p</it><sub><it>W</it></sub>)), are converted to effect sizes using the method of Chinn <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. Negative effect sizes should be interpreted as NW students performing less well or having a lower mean than W, or NW having a lower proportion or probability of an event than W.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Definition of ethnicity</p>
            </st>
            <p>The definition of ethnicity is complex, with different studies using different criteria and classifications. It is also probable that there are differences between varying groups of ethnic minorities. However for our present purpose we are primarily interested in those studies which, for convenience and other reasons, simply compare > white = (W) students with > non-white = (NW) students. In the medical school population the majority of non-white students are from the Indian sub-continent (primarily Indian, Pakistani, Bangladeshi and other), although there are students from other groups. Supplementary table 1 [see Additional file <supplr sid="S1">1</supplr>] provides a complete description of all individuals using the classificatory schemes of YCS and UCAS. Although so doing inevitably simplifies a complex process, for ease of explication, we will divide students into just two groups, in order to be able to see the bigger picture more straightforwardly. We acknowledge in advance that many further, more detailed, analyses could be carried out.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>Ethnicity breakdown in YCS and UCAS data.</p>
               </text>
               <file name="1472-6920-8-21-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>The Youth Cohort Study</p>
            </st>
            <sec>
               <st>
                  <p>Performance at GCSE</p>
               </st>
               <p>Almost all students took GCSEs, only 649/13698 (4.7%) not taking any GCSEs, and those individuals were excluded from further analysis. The simplest overall measure of GCSE performance is the total number of points attained (where A = 5, B = 4, C = 3, D = 2, E = 1, else = 0). It should be noted that although for a number of years GCSEs have awarded a grade of A*, which is higher than A, those A* results are unfortunately not distinguished from A in the YCS database. Inevitably any simple points total is influenced by the total number of GCSE examinations for which a student chooses to enter or is allowed to enter, and so we also provide the mean grade attained at GCSE. Overall there is only a very small absolute difference in the number of GCSEs taken by white and non-white candidates, which just reaches significance (see Table <tblr tid="T1">1</tblr>), but there is a highly significant difference in the number of points and the mean grade attained, p &lt; .001 in each case. Figure <figr fid="F1">1</figr> shows the distribution of points for W and NW students, and although it is clear that the NW students have a lower mean, which can be seen particularly in the left-hand tail, at the top end of the distribution the differences are far less marked, such that the number of A grades attained is more similar in W and NW students, and the proportion of students gaining 6 or more A grades B the pool of those with a realistic chance of subsequently applying for medical school B is not significantly different in the W and NW candidates, 10.7% and 9.8% respectively, the effect size being B.054. Table <tblr tid="T1">1</tblr> also shows the proportions gaining A grades in the main subjects taken, and for most subjects there is no significant difference between W and NW candidates.</p>
               <fig id="F1">
                  <title>
                     <p>Figure 1</p>
                  </title>
                  <caption>
                     <p>The distribution of GCSE points gained by W and NW students, shown in red and green respectively</p>
                  </caption>
                  <text>
                     <p>The distribution of GCSE points gained by W and NW students, shown in red and green respectively. GCSE points are group as 0&#8211;4, 5&#8211;9, 10&#8211;14, etc..</p>
                  </text>
                  <graphic file="1472-6920-8-21-1"/>
               </fig>
               <tbl id="T1">
                  <title>
                     <p>Table 1</p>
                  </title>
                  <caption>
                     <p>GCSE achievement of W and NW students (YCS).</p>
                  </caption>
                  <tblbdy cols="5">
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>White Mean (SD) N = 11273</p>
                        </c>
                        <c ca="center">
                           <p>Non-White Mean (SD) N = 1776</p>
                        </c>
                        <c ca="center">
                           <p>Significance</p>
                        </c>
                        <c ca="center">
                           <p>Effect size</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="5">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Number of GCSEs taken</p>
                        </c>
                        <c ca="center">
                           <p>8.65 (1.90)</p>
                        </c>
                        <c ca="center">
                           <p>8.55 (1.99)</p>
                        </c>
                        <c ca="center">
                           <p>t = 2.07,</p>
                           <p>p = .039</p>
                        </c>
                        <c ca="center">
                           <p>d = -.053</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Total number of GCSE points</p>
                        </c>
                        <c ca="center">
                           <p>24.12 (14.18)</p>
                        </c>
                        <c ca="center">
                           <p>22.01 (14.44)</p>
                        </c>
                        <c ca="center">
                           <p>t = 5.81,</p>
                           <p>p &lt; .001</p>
                        </c>
                        <c ca="center">
                           <p>d = -.148</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Mean GCSE points</p>
                        </c>
                        <c ca="center">
                           <p>2.65 (1.34)</p>
                        </c>
                        <c ca="center">
                           <p>2.44 (1.38)</p>
                        </c>
                        <c ca="center">
                           <p>t = 5.89,</p>
                           <p>p &lt; .001</p>
                        </c>
                        <c ca="center">
                           <p>d = -.156</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Number of GCSEs at A grade</p>
                        </c>
                        <c ca="center">
                           <p>1.44 (2.62)</p>
                        </c>
                        <c ca="center">
                           <p>1.32 (2.54)</p>
                        </c>
                        <c ca="center">
                           <p>t = 1.80,</p>
                           <p>p = .072</p>
                        </c>
                        <c ca="center">
                           <p>d = -.046</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Average difficulty of GCSEs taken</p>
                        </c>
                        <c ca="center">
                           <p>.773 (.029))</p>
                        </c>
                        <c ca="center">
                           <p>.771 (.029)</p>
                        </c>
                        <c ca="center">
                           <p>t = 1.93,</p>
                           <p>p = .054</p>
                        </c>
                        <c ca="center">
                           <p>d = -.048</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>White N (%)</p>
                        </c>
                        <c ca="center">
                           <p>Non-White N(%)</p>
                        </c>
                        <c ca="center">
                           <p>Chi-squared (p)</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Percent with 6 or more GCSEs at grade A</p>
                        </c>
                        <c ca="center">
                           <p>1206/11272 = 10.7%</p>
                        </c>
                        <c ca="center">
                           <p>174/1602 = 9.8%</p>
                        </c>
                        <c ca="center">
                           <p>&#967;<sup>2 </sup>= 1.32, 1df, p = .251</p>
                        </c>
                        <c ca="center">
                           <p>OR = 0.907 (CI .77 &#8211; 1.07) d = -.054</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="5">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c cspan="5" ca="center">
                           <p>
                              <it>Percent with A grade in particular subjects:</it>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="5">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>English language</p>
                        </c>
                        <c ca="center">
                           <p>1580/10798 = 14.6%</p>
                        </c>
                        <c ca="center">
                           <p>216/1666 = 13.0%</p>
                        </c>
                        <c ca="center">
                           <p>.190 (1) p = .190</p>
                        </c>
                        <c ca="center">
                           <p>-</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>English Literature</p>
                        </c>
                        <c ca="center">
                           <p>1561/9272 = 16.8%</p>
                        </c>
                        <c ca="center">
                           <p>196/1391 = 14.1%</p>
                        </c>
                        <c ca="center">
                           <p>6.62 (1) p = .010</p>
                        </c>
                        <c ca="center">
                           <p>-</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Maths</p>
                        </c>
                        <c ca="center">
                           <p>1321/9485 = 13.9%</p>
                        </c>
                        <c ca="center">
                           <p>208/1168 = 15.1%</p>
                        </c>
                        <c ca="center">
                           <p>1.41 (1) p = .236</p>
                        </c>
                        <c ca="center">
                           <p>-</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>French</p>
                        </c>
                        <c ca="center">
                           <p>1219/5798 = 21.0%</p>
                        </c>
                        <c ca="center">
                           <p>151/783 = 19.3%</p>
                        </c>
                        <c ca="center">
                           <p>1.27 (1) p = .260</p>
                        </c>
                        <c ca="center">
                           <p>-</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Geography</p>
                        </c>
                        <c ca="center">
                           <p>969/4678 = 20.7%</p>
                        </c>
                        <c ca="center">
                           <p>110/513 = 17.7%</p>
                        </c>
                        <c ca="center">
                           <p>3.17 (1) p = .075</p>
                        </c>
                        <c ca="center">
                           <p>-</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>History</p>
                        </c>
                        <c ca="center">
                           <p>916/3647 = 25.1%</p>
                        </c>
                        <c ca="center">
                           <p>122/369 = 24.8%</p>
                        </c>
                        <c ca="center">
                           <p>.017 (1) p = .897</p>
                        </c>
                        <c ca="center">
                           <p>-</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Art</p>
                        </c>
                        <c ca="center">
                           <p>799/3427 = 23.3%</p>
                        </c>
                        <c ca="center">
                           <p>123/560 = 22.0%</p>
                        </c>
                        <c ca="center">
                           <p>.494 (1) p = .482</p>
                        </c>
                        <c ca="center">
                           <p>-</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Craft</p>
                        </c>
                        <c ca="center">
                           <p>869/4508 = 16.2%</p>
                        </c>
                        <c ca="center">
                           <p>101/828 = 12.2%</p>
                        </c>
                        <c ca="center">
                           <p>8.54 (1) p = .003</p>
                        </c>
                        <c ca="center">
                           <p>-</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>German</p>
                        </c>
                        <c ca="center">
                           <p>521/2508 = 20.8%</p>
                        </c>
                        <c ca="center">
                           <p>55/303 = 18.2%</p>
                        </c>
                        <c ca="center">
                           <p>1.14 (1) p = .286</p>
                        </c>
                        <c ca="center">
                           <p>-</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Business Studies</p>
                        </c>
                        <c ca="center">
                           <p>251/1910 = 13.1%</p>
                        </c>
                        <c ca="center">
                           <p>50/346 = 14.5%</p>
                        </c>
                        <c ca="center">
                           <p>.43 (1) .510</p>
                        </c>
                        <c ca="center">
                           <p>-</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Double Science</p>
                        </c>
                        <c ca="center">
                           <p>1141/8125 = 14.0%</p>
                        </c>
                        <c ca="center">
                           <p>158/1182 = 13.4%</p>
                        </c>
                        <c ca="center">
                           <p>.39 (1) p = .531</p>
                        </c>
                        <c ca="center">
                           <p>-</p>
                        </c>
                     </r>
                  </tblbdy>
               </tbl>
            </sec>
            <sec>
               <st>
                  <p>Difficulty of GCSEs</p>
               </st>
               <p>Although almost all students take GCSE at age 16, there is an element of choice in the particular subjects chosen for study. There is also good evidence that not all subjects are equally difficult, some such as Chemistry, Physics, Biology and Latin being harder subjects in which to gain top grades than Art, Drama, and Sociology <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. We therefore calculated a measure of the average difficulty of all the GCSE subjects that had been taken by a student, based on Coe = s estimates of the Rasch difficulty for an A* grade. The choice of A* for this calculation has little impact overall, since the Rasch difficulties for different grades are, to a large extent, parallel <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, and A* has the advantage for present purposes that it is the grade which most medical students would be expected to achieve. Table <tblr tid="T1">1</tblr> shows that there is no significant difference between W and NW students, meaning that NW students take GCSEs of similar difficulty to W students.</p>
            </sec>
            <sec>
               <st>
                  <p>Demographic factors</p>
               </st>
               <p>W and NW students differ in a number of demographic factors, as can be seen in Table <tblr tid="T2">2</tblr>, the NW students coming from a somewhat lower socio-economic group, and their parents having somewhat less education, although interestingly there are no differences in the proportions attending private schools.</p>
               <tbl id="T2">
                  <title>
                     <p>Table 2</p>
                  </title>
                  <caption>
                     <p>Demographic measures for W and NW students (YCS).</p>
                  </caption>
                  <tblbdy cols="4">
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>White N = 11273</p>
                        </c>
                        <c ca="center">
                           <p>Non-White N = 1776</p>
                        </c>
                        <c ca="center">
                           <p>Significance</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="4">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>
                              <it>Sex</it>
                           </p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Male</p>
                        </c>
                        <c ca="center">
                           <p>5657 (50.2%)</p>
                        </c>
                        <c ca="center">
                           <p>898 (50.6%)</p>
                        </c>
                        <c ca="center">
                           <p>&#967;<sup>2 </sup>= .089 (1) p = .765</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Female</p>
                        </c>
                        <c ca="center">
                           <p>5616 (49.8%)</p>
                        </c>
                        <c ca="center">
                           <p>878 (49.4%)</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>
                              <it>Socio-economic group</it>
                           </p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>V (Unskilled)</p>
                        </c>
                        <c ca="center">
                           <p>482 (4.7%)</p>
                        </c>
                        <c ca="center">
                           <p>70 (5.6%)</p>
                        </c>
                        <c ca="center">
                           <p>&#967;<sup>2 </sup>= 25.71 (4) p &lt; .001</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>IV (Semi-skilled)</p>
                        </c>
                        <c ca="center">
                           <p>1181 (11.5%)</p>
                        </c>
                        <c ca="center">
                           <p>188 (15.1%)</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>III (Skilled manual)</p>
                        </c>
                        <c ca="center">
                           <p>3940 (38.3%)</p>
                        </c>
                        <c ca="center">
                           <p>495 (39.7%)</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>II (Other non-manual)</p>
                        </c>
                        <c ca="center">
                           <p>2218 (21.5%)</p>
                        </c>
                        <c ca="center">
                           <p>251 (20.1%)</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>I (Managerial/profssional)</p>
                        </c>
                        <c ca="center">
                           <p>2477 (24.1%)</p>
                        </c>
                        <c ca="center">
                           <p>243 (19.5%)</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>
                              <it>Parental education</it>
                           </p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Pre-A-level</p>
                        </c>
                        <c ca="center">
                           <p>6935 (61.5%)</p>
                        </c>
                        <c ca="center">
                           <p>1261 (71.0%)</p>
                        </c>
                        <c ca="center">
                           <p>&#967;<sup>2 </sup>= 75.72 (2) p &lt; .001</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>A levels</p>
                        </c>
                        <c ca="center">
                           <p>1790 (15.9%)</p>
                        </c>
                        <c ca="center">
                           <p>159 (9.0%)</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Degree</p>
                        </c>
                        <c ca="center">
                           <p>2547 (22.6%)</p>
                        </c>
                        <c ca="center">
                           <p>356 (20.0%)</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>
                              <it>School type</it>
                           </p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>State</p>
                        </c>
                        <c ca="center">
                           <p>10488 (93.0%)</p>
                        </c>
                        <c ca="center">
                           <p>1652 (93.0%)</p>
                        </c>
                        <c ca="center">
                           <p>&#967;<sup>2 </sup>= .015 (1) p = .093</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Private</p>
                        </c>
                        <c ca="center">
                           <p>784 (7.0%)</p>
                        </c>
                        <c ca="center">
                           <p>125 (7.0%)</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                  </tblbdy>
               </tbl>
            </sec>
            <sec>
               <st>
                  <p>Correlation and regression analyses</p>
               </st>
               <p>Table <tblr tid="T3">3</tblr> shows simple Pearson correlations of the measures of GCSE achievement with background variables. As in most complex social data there are correlations between very many of the measures. Of particular interest are that non-white ethnicity correlates with taking slightly fewer GCSEs and gaining fewer points at GCSE, as well as coming from a lower socio-economic background and having less parental education. However it is also the case that non-white ethnicity does not correlate with attending a private school, taking more difficult GCSEs, or being female. There are also many significant correlations between background variables, such as between social class and all of the other measures. A multiple regression of GCSE points on the seven background factors found that all were significant at p &lt; .001, higher GCSE points being predicted by taking more GCSEs (&#946; = .604), greater level of parental education (&#946; = .152), attending a private school (&#946; = .147), taking <it>more </it>difficult GCSEs (&#946; = .116), coming from a higher socio-economic group (&#946; = .110), being female (&#946; = .081), and being white (&#946; = .026), the variables together accounting for 55% of the variance, of which 43% was accounted for by number of GCSEs taken. A similar analysis using logistic regression and with the attainment of 6 or more A grades as the dependent variable, found highly significant effects of all the same variables at p &lt; .001, with the important exception that ethnicity was <it>not </it>significant. If ethnicity was forced into the logistic regression it had a significance level of .437.</p>
               <tbl id="T3">
                  <title>
                     <p>Table 3</p>
                  </title>
                  <caption>
                     <p>Pearson correlations between achievement at GCSE and background variables. Weighted N for cells varies between 11,544 and 13,049. Significance levels are shown as: *** p &lt; .001; ** p &lt; .01; * p &lt; .05.</p>
                  </caption>
                  <tblbdy cols="9">
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>Total points at GCSE</p>
                        </c>
                        <c ca="center">
                           <p>Number of GCSEs taken</p>
                        </c>
                        <c ca="center">
                           <p>Difficulty of GCSEs taken</p>
                        </c>
                        <c ca="center">
                           <p>Independent (private)</p>
                        </c>
                        <c ca="center">
                           <p>Female sex</p>
                        </c>
                        <c ca="center">
                           <p>Higher socio- economic</p>
                        </c>
                        <c ca="center">
                           <p>Greater parental</p>
                        </c>
                        <c ca="center">
                           <p>Non-white ethnicity</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="9">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="right">
                           <p>Total points at GCSE</p>
                        </c>
                        <c ca="center">
                           <p>1.000</p>
                        </c>
                        <c ca="center">
                           <p>0.658 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.187 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.265 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.115 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.295 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.338 ***</p>
                        </c>
                        <c ca="center">
                           <p>-0.051 ***</p>
                        </c>
                     </r>
                     <r>
                        <c ca="right">
                           <p>Number of GCSEs taken</p>
                        </c>
                        <c ca="center">
                           <p>0.658 ***</p>
                        </c>
                        <c ca="center">
                           <p>1.000</p>
                        </c>
                        <c ca="center">
                           <p>-0.006</p>
                        </c>
                        <c ca="center">
                           <p>0.057 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.058 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.159 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.167 ***</p>
                        </c>
                        <c ca="center">
                           <p>-0.018 *</p>
                        </c>
                     </r>
                     <r>
                        <c ca="right">
                           <p>Difficulty of GCSEs taken</p>
                        </c>
                        <c ca="center">
                           <p>0.187 ***</p>
                        </c>
                        <c ca="center">
                           <p>-0.006</p>
                        </c>
                        <c ca="center">
                           <p>1.000</p>
                        </c>
                        <c ca="center">
                           <p>0.286 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.000</p>
                        </c>
                        <c ca="center">
                           <p>0.098 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.140 ***</p>
                        </c>
                        <c ca="center">
                           <p>-0.017</p>
                        </c>
                     </r>
                     <r>
                        <c ca="right">
                           <p>Independent (private)</p>
                        </c>
                        <c ca="center">
                           <p>0.265 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.057 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.286 ***</p>
                        </c>
                        <c ca="center">
                           <p>1.000</p>
                        </c>
                        <c ca="center">
                           <p>0.002</p>
                        </c>
                        <c ca="center">
                           <p>0.153 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.228 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.001</p>
                        </c>
                     </r>
                     <r>
                        <c ca="right">
                           <p>Female sex</p>
                        </c>
                        <c ca="center">
                           <p>0.115 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.058 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.000</p>
                        </c>
                        <c ca="center">
                           <p>0.002</p>
                        </c>
                        <c ca="center">
                           <p>1.000</p>
                        </c>
                        <c ca="center">
                           <p>-0.025 **</p>
                        </c>
                        <c ca="center">
                           <p>0.004</p>
                        </c>
                        <c ca="center">
                           <p>-0.003</p>
                        </c>
                     </r>
                     <r>
                        <c ca="right">
                           <p>Higher socio- economic group<sup>+</sup></p>
                        </c>
                        <c ca="center">
                           <p>0.295 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.159 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.098 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.153 ***</p>
                        </c>
                        <c ca="center">
                           <p>-0.025 **</p>
                        </c>
                        <c ca="center">
                           <p>1.000</p>
                        </c>
                        <c ca="center">
                           <p>0.318 ***</p>
                        </c>
                        <c ca="center">
                           <p>-0.044 ***</p>
                        </c>
                     </r>
                     <r>
                        <c ca="right">
                           <p>Greater parental education</p>
                        </c>
                        <c ca="center">
                           <p>0.338 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.167 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.140 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.228 ***</p>
                        </c>
                        <c ca="center">
                           <p>0.004</p>
                        </c>
                        <c ca="center">
                           <p>0.318 ***</p>
                        </c>
                        <c ca="center">
                           <p>1.000</p>
                        </c>
                        <c ca="center">
                           <p>-0.050 ***</p>
                        </c>
                     </r>
                     <r>
                        <c ca="right">
                           <p>Non-white Ethnicity</p>
                        </c>
                        <c ca="center">
                           <p>-0.051 ***</p>
                        </c>
                        <c ca="center">
                           <p>-0.018 *</p>
                        </c>
                        <c ca="center">
                           <p>-0.017</p>
                        </c>
                        <c ca="center">
                           <p>0.001</p>
                        </c>
                        <c ca="center">
                           <p>-0.003</p>
                        </c>
                        <c ca="center">
                           <p>-0.044 ***</p>
                        </c>
                        <c ca="center">
                           <p>-0.050 ***</p>
                        </c>
                        <c ca="center">
                           <p>1.000</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p><sup>+ </sup>Note: For convenience of interpretation, socio-economic group here and in other analyses has been rescored so that <it>higher </it>groups have <it>higher </it>scores (i.e. I = 5, II = 4, III = 3, IV = 2 and V = 1).</p>
                  </tblfn>
               </tbl>
            </sec>
            <sec>
               <st>
                  <p>Path analysis</p>
               </st>
               <p>Achievement at GCSE depends on a number of variables, and as Table <tblr tid="T3">3</tblr> shows, those variables are themselves inter-related, both in correlational and causal terms. We therefore used LISREL to fit a path model, shown in Figure <figr fid="F2">2</figr>. The causal ordering was chosen on the basis that the number of GCSEs taken and the difficulties of the GCSEs taken were both immediately prior to the total points attained, and that type of schooling was prior to taking GCSEs. The sex of the student, being fixed throughout the student = s life, was prior to all educational variables. The remaining variables were related to the student = s families as much as to the student themselves, and hence were earlier in causal terms, with parental socio-economic group being likely to be secondary to parental education, and parental ethnicity prior to parental education. The logic is that the ethnicity of parents and offspring is generally the same, and fixed throughout the lifespan of parents and offspring, whereas education of both parents and offspring is less fixed, so that in any causal model, the ethnicity of individuals and their parents is to the left of measures of education. The number of individuals of mixed ethnicity (who are here classified as NW) is relatively small, and hence little affects the above argument, and it is still the case that an individual's ethnicity precedes their education causally.</p>
               <fig id="F2">
                  <title>
                     <p>Figure 2</p>
                  </title>
                  <caption>
                     <p>Path model of direct and indirect influences of background factors upon GCSE performance</p>
                  </caption>
                  <text>
                     <p>Path model of direct and indirect influences of background factors upon GCSE performance. Path coefficients and t-statistics are shown alongside paths, the thickness of paths is proportional to the path coefficient, and negative paths are shown as red, dashed lines.</p>
                  </text>
                  <graphic file="1472-6920-8-21-2"/>
               </fig>
               <p>The model was fitted by firstly using a saturated model in which all variables to the left of a variable could cause all variables to the right of the variable, through the BETA matrix in LSIREL. Paths which were non-significant at the 0.05 level were dropped sequentially from the model, with the least significant first, until all paths remaining were significant.</p>
               <p>The final fitted model is shown in Figure <figr fid="F2">2</figr>. The overall goodness of fit was excellent, with &#967;<sup>2 </sup>= 8.92, 8 df, p = .349, a goodness of fit index of 1.000, and an adjusted goodness of fit of 0.999. Figure <figr fid="F2">2</figr> shows a number of readily interpretable effects. Higher parental education results in a higher parental socio-economic group, and both factors result in a child being more likely to take part in private schooling. Private schools are more likely to put children in for more difficult GCSEs, and the children also gain more points at GCSE. Total GCSE points is particularly dependent on number of GCSEs taken, and the difficulty of the GCSEs taken, and there are also direct influences of private schooling, as well as effects of sex, parental social class, and parental education which are not explained by schooling. Finally it should be noticed that ethnicity has multiple effects, as a result of NW students having less parental education and coming from a lower socio-economic group, but after taking those into account, ethnic minorities are somewhat <it>more </it>likely to attend private schools.</p>
               <p>To summarise the results so far, NW students do achieve slightly less well at GCSE, but that is largely, but not entirely, mediated by a number of background variables such as parental education and type of schooling. In particular, the numbers of NW students attaining good grades at GCSE (6 or more As) is similar to that in W students, both in a simple analysis, and also after taking background factors into account. From the point of view of understanding the potential pool of medical school applicants B those performing particularly well at GCSE, in other words B W and NW students do not show significant differences in GCSE achievement.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Choosing to take A levels</p>
            </st>
            <p>A levels are taken as a part of post-compulsory education in the UK, so that students firstly must choose to take A levels, and then they must choose which particular A level subjects to take. Medical schools typically require science subjects to have been studied, at least in part, and a majority of schools require a good grade in A level Chemistry. A levels undoubtedly differ in their relative difficulty, and Chemistry is widely perceived as more difficult than most other subjects, a perception which it would seem is correct <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. To enter the pool of potential medical students a student must therefore in most cases take A level sciences in general, and A level Chemistry in particular.</p>
            <p>YCS asked students in Sweep 1, who were surveyed about six months after getting their GCSE results, whether they were continuing in education, and in particular whether they were taking A levels, and if so, in what subjects. We have looked at three simple summary statistics; whether a student was taking one or more A levels (excluding General Studies), whether a student was taking one or more science A levels, and whether a student was taking Chemistry at A level. Table <tblr tid="T4">4</tblr> shows that although a similar proportion of NW and W students go on to take A levels (despite NW students having somewhat lower GCSE grades overall), NW students are significantly <it>more </it>likely to take science A levels, and are nearly twice as likely to take A level chemistry in particular, one in nine of all NW students in the population taking A level chemistry in comparison with one in fifteen of all W students in the same age cohort. The fact that an equal proportion of W and NW students go on to take A levels is surprising, given the somewhat lower achievement of NW students at GCSE. Figure <figr fid="F3">3</figr> shows the proportion of students going on to A levels in relation to GCSE points, and it is clear that at almost all levels of achievement, NW students are more likely to take A levels than are W students. Logistic regression confirms, after taking GCSE points into account, that non-white students are 1.74 times (95% CI 1.48&#8211;2.04; effect size = .306) more likely to take at least one A level, 2.08 times (95% CI 1.80&#8211;2.42; effect size = .405) more likely to take at least one science A level, and 2.61 times (95% CI 2.15 &#8211; 3.16; effect size = .530) more likely to take chemistry A level. In so far as GCSEs are predictive of A level achievement, as will be shown below, it seems likely therefore that NW students will perform less well at A level than W students.</p>
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>A-levels in W and NW students (YCS).</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>White Mean (SD) N</p>
                     </c>
                     <c ca="center">
                        <p>Non-White Mean (SD) N</p>
                     </c>
                     <c ca="center">
                        <p>Significance</p>
                     </c>
                     <c ca="center">
                        <p>Effect size</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Total number of points achieved at A level</p>
                     </c>
                     <c ca="center">
                        <p>17.71 (9.17) N = 2376</p>
                     </c>
                     <c ca="center">
                        <p>16.42 (9.15) N = 355</p>
                     </c>
                     <c ca="center">
                        <p>t = 2.474 p = .013</p>
                     </c>
                     <c ca="center">
                        <p>-.141</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>White N (%)</p>
                     </c>
                     <c ca="center">
                        <p>Non-White N(%)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Odds Ratio (95% CI)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Taking one or more A-levels (excluding General Studies)</p>
                     </c>
                     <c ca="center">
                        <p>4666/11273 = 41.4%</p>
                     </c>
                     <c ca="center">
                        <p>738/1777 = 41.5%</p>
                     </c>
                     <c ca="center">
                        <p>&#967;<sup>2 </sup>= .012 (1) p = .911</p>
                     </c>
                     <c ca="center">
                        <p>OR = 1.01 (.91&#8211;1.11) d = .005</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Taking one or more science A-levels</p>
                     </c>
                     <c ca="center">
                        <p>2701/11273 = 24.0%</p>
                     </c>
                     <c ca="center">
                        <p>515/1777 = 29.0%</p>
                     </c>
                     <c ca="center">
                        <p>&#967;<sup>2 </sup>= 20.84 (1) p &lt; .001</p>
                     </c>
                     <c ca="center">
                        <p>1.30 (1.16&#8211;1.45) d = .145</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Taking A-level Chemistry</p>
                     </c>
                     <c ca="center">
                        <p>744/11273 = 6.6%</p>
                     </c>
                     <c ca="center">
                        <p>202/1776 = 11.4%</p>
                     </c>
                     <c ca="center">
                        <p>&#967;<sup>2 </sup>= 51.98 (1) p &lt; .001</p>
                     </c>
                     <c ca="center">
                        <p>1.82 (1.54&#8211;2.14) d = .331</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Proportion of W (Open points and solid lines) and NW students (solid points and dashed lines) taking at least one A-level (circles), at least one science A-level (triangles), and chemistry A-level (squares), in relation to points gained at GCSE</p>
               </caption>
               <text>
                  <p>Proportion of W (Open points and solid lines) and NW students (solid points and dashed lines) taking at least one A-level (circles), at least one science A-level (triangles), and chemistry A-level (squares), in relation to points gained at GCSE. Error bars show &#177; one standard error.</p>
               </text>
               <graphic file="1472-6920-8-21-3"/>
            </fig>
            <sec>
               <st>
                  <p>Regression analysis and path analysis</p>
               </st>
               <p>Ethnic differences in aspirations to take A levels were explored further using multiple regression and structural equation modelling, looking at taking of A-levels in relation to the eight variables shown in Figure <figr fid="F2">2</figr> that relate to GCSE attainment and background characteristics. Forward entry logistic regression with taking of one or more A levels as the dependent variable, and the eight background variables as predictors, found significant effects of all the measures except private schooling. Predictors of taking A levels, in order of entry into the regression (but with significance levels calculated after taking all other variables into account) were higher numbers of GCSE points (p &lt; .001), taking fewer GCSEs (p &lt; .001), being non-white (p &lt; .001), coming from a higher socio-economic group (p &lt; .001), being male (p &lt; .001), having greater parental education (p &lt; .001), and taking more difficult GCSEs (p = .027). NW students were 1.979 times more likely to take A levels than white students (effect size = .377), all other background factors being taken into account. Very similar results were obtained for taking at least one science A level, and for taking Chemistry A level, NW students being 2.370 times more likely to take a science A level (effect size = .477), and 2.735 times more likely to take Chemistry A level (effect size = .556). Figure <figr fid="F4">4</figr> shows a path model for taking A levels, and not only can the effects of ethnicity be seen, but it is also clear that NW students are more likely to take a science A level, even after taking into account the likelihood of taking A levels, and are more likely to take Chemistry A level, even after taking into account the fact that they are taking science A levels.</p>
               <fig id="F4">
                  <title>
                     <p>Figure 4</p>
                  </title>
                  <caption>
                     <p>Path model of influences of background factors and GCSE performance (variables in yellow) upon taking of A-levels, science A-levels and A-level chemistry (variables in blue)</p>
                  </caption>
                  <text>
                     <p>Path model of influences of background factors and GCSE performance (variables in yellow) upon taking of A-levels, science A-levels and A-level chemistry (variables in blue). Path coefficients and t-statistics are shown alongside paths, the thickness of paths is proportional to the path coefficient, and negative paths are shown as red, dashed lines. Paths previously shown in figure 2 are in pale grey without coefficients, to avoid undue confusion.</p>
                  </text>
                  <graphic file="1472-6920-8-21-4"/>
               </fig>
               <p>Although overall NW students have poorer achievement at GCSE, a previous analysis here had suggested that NW students are equally likely as W students to achieve very high GCSE grades (6 or more A grades or better). The logistic regressions described in the previous section were therefore repeated only for students with 6 or more A grades at GCSE. Overall there was now no effect of ethnicity on taking one or more A-levels, although that largely reflects the fact, seen at the right hand end of Figure <figr fid="F3">3</figr>, that 97% of such a high achieving group go on to take A levels. However, a similar analysis for the taking of science at A level and the taking of Chemistry showed that even among GCSE high-achievers, NW students were 2.85 times more likely to take a science A level (p &lt; .001; effect size = .579), and 2.45 times more likely to take Chemistry A level (p &lt; .001; effect size = .495).</p>
            </sec>
            <sec>
               <st>
                  <p>A level difficulty</p>
               </st>
               <p>As with GCSEs, not all A level subjects are equally difficult. We therefore used the data of the CEM Centre at the University of Durham <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> to calculate a difficulty score for the subjects being taken by each student at A level. Multiple regression of A level difficulty on the six background variables and two measures of GCSE attainment shown in Figure <figr fid="F2">2</figr> found that more difficult A levels were taken by students with more GCSE points (p &lt; .001), but who had taken more difficult (P &lt; .001) but somewhat fewer GCSEs (P &lt; .001), were female (p &lt; .001), and were non-white (p &lt; .001). However, since Chemistry is one of the most difficult of A levels, the latter effect is perhaps not unexpected. Repeating the analysis but including a variable indicating whether or not students were studying chemistry at A level meant that the effect of ethnicity was no longer significant (p = .207).</p>
               <p>In summary, despite having somewhat poorer overall attainment at GCSE than W students, NW students are more likely to go on to take A levels, and in particular science A levels and Chemistry A level, which tend to be more difficult. The same was also true of science A levels and Chemistry A level amongst the high achieving students with 6 or more A grades at GCSE.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Attainment at A level</p>
            </st>
            <p>YCS was carried out in several sweeps, with information on GCSE results and intentions to take A level obtained at sweep 1. Information on actual attainment at A level was obtained only at sweep 3, and inevitably the response rate was much poorer. Of the original 25,000 individuals in the sampling frame, 13,699 responded to sweep 1, and only these individuals were sent sweep 2, of whom 10,100 responded. Only these 10,100 individuals were sent sweep 3, of whom 7,971 responded to sweep 3 (and of course many of these students did not take A levels). Nevertheless, because the sampling frame is well characterised, YCS provides appropriate weighting factors to take response biases into account, and the weighting has been applied in the following analyses, so that estimates are likely to be applicable to the population as a whole. Information at sweep 3 is less comprehensive, and little apart from examination achievement is directly relevant to the study of medicine as such.</p>
            <sec>
               <st>
                  <p>A level grades in relation to GCSE grades</p>
               </st>
               <p>For present purposes we are only interested in A levels, and not AS levels, since most medical students have at least three A levels, and they are the main criterion for medical student selection. As is also the case in many medical schools, we have also not included General Studies as a > proper = A level subject. Figure <figr fid="F5">5</figr> shows a scatter plot of total points at A level (calculated as was conventionally done by UCAS as A = 10, B = 8, C = 6, D = 4, E = 2, otherwise 0), in relation to total points at GCSE, with a lowess line fitted separately for W and NW students. The overall correlation is 0.628 (N = 2456), and was 0.633 (N = 2131) in the W students, and 0.606 (N = 326) in the NW students. It is also clear, in Figure <figr fid="F5">5</figr>, that the fitted lowess regression lines are very similar. To a first approximation therefore, GCSE achievement predicts A level outcome equally well in W and NW students, although there is a trend towards NW students with high GCSE grades underachieving to some extent at A level. Regression of A level points on GCSE points showed that the linear component of GCSE points accounted for 39.5% of the variance in A level points. Inclusion of ethnicity did not significantly improve the fit of the model (R<sup>2 </sup>change &lt; 0.1%; p = .470), although inclusion of an ethnicity <it>x </it>GCSE points interaction was significant (F(1,2472) = 12.7, p &lt; .001) accounting for a further 0.3% of the variance. Fitting lines separately to W and NW groups, showed that the unstandardised regression coefficient was somewhat higher in W students (b = .660, SE = .017; beta = .633, R<sup>2 </sup>= 40.1%) than in non-White students (b = .515, SE = .038, beta = .606, R<sup>2 </sup>= 36.7%). GCSE grades therefore predict A levels slightly better in W than NW students, although the effect is small, and the broad picture is of similarity. Since NW students taking A levels have lower GCSE achievement than W students, it can be predicted that NW students will perform less well at A level than will W students.</p>
               <fig id="F5">
                  <title>
                     <p>Figure 5</p>
                  </title>
                  <caption>
                     <p>The relationship between points achieved at GCSE and points achieved at A-level, separately for W students (solid black points and solid black line) and NW students (open red points and dashed red line)</p>
                  </caption>
                  <text>
                     <p>The relationship between points achieved at GCSE and points achieved at A-level, separately for W students (solid black points and solid black line) and NW students (open red points and dashed red line). All data points have been jittered slightly so that they do not overlap. The fitted lines are loess regression lines.</p>
                  </text>
                  <graphic file="1472-6920-8-21-5"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Actual taking of A levels</p>
               </st>
               <p>Of the 13,049 individuals in sweep 1 of YCS, only 6692 (51.3%) responded on sweep 3 when A level results were asked for, and of these only 2731 had actually taken one or more A levels. (Note that these figures are slightly different from those cited at the beginning of the section on A level achievement as those figures are raw counts of number of respondents, whereas the figures quoted here, and subsequently, refer to weighted responses). Of the 3478 individuals at sweep 1 who said they did not intend to take A levels, only 74 (2.1%) actually did, with no difference in proportion between W and NW (2.1% and 2.4% respectively). However of the 3214 individuals at sweep 1 who said they intended to take A levels, 2657 (82.7%) actually did, and the proportion was higher in W students (2314/2780 = 83.2%) than in NW students (342/433 = 79.0%), a significant difference (chi-squared = 4.73, 1df, p = .030; odds ratio = 0.760, effect size = B.152), suggesting that more NW than W students had decided their aspirations were inappropriate or impractical, although the data cannot decide between those possibilities. Those not actually taking A levels who had said they would, had significantly lower GCSE point scores (mean = 29.1, SD = 10.24, N = 557) than those who did take A levels as intended (mean = 48.56, SD = 8.68, N = 2657, t = 22.76, p &lt; .001). Similarly, although fewer NW students actually took A levels than said they would, the NW students who actually took A levels had significantly lower GCSE points than the W students (W: mean = 38.62, SD 8.63, N = 2376; NW: mean = 36.32, SD = 10.39, N = 355; t = 4.56, p &lt; .001).</p>
            </sec>
            <sec>
               <st>
                  <p>A level achievement</p>
               </st>
               <p>Given the lower achievement at GCSE it is not surprising that NW students also had lower total point scores at A level (see Table <tblr tid="T4">4</tblr>). Numbers taking chemistry and other individual subjects were too small to make useful comparisons. The effect just described is a simple effect, without taking background factors into account. As before it is useful to summarise the data on the different variables determining A level performance by means of a path analysis, and Figure <figr fid="F6">6</figr> shows a LISREL model for the determination of A level points in terms of nine background variables. The most important determinant of A level points is the number of A levels taken, for fairly trivial reasons. GCSE points also determine A level points, both directly, and also indirectly via the number of A levels taken, better students at GCSE taking more A levels. Better students at GCSE also tend to take slightly more difficult A level subjects, and as a result do slightly less well than if they had taken easier subjects. Students taking more difficult subjects at GCSE also tend to take more difficult subjects at A level. Schooling, socio-economic group, and ethnicity do not have direct effects upon A level performance, although they have a range of indirect effects, whereas parental education level has direct effects, on both GCSE points and on A level points. Of particular interest is that there is no direct effect of ethnicity upon A-level performance, all effects being mediated indirectly via background factors and measures of performance at GCSE.</p>
               <fig id="F6">
                  <title>
                     <p>Figure 6</p>
                  </title>
                  <caption>
                     <p>Path model of influences of background factors and GCSE performance (variables in yellow), upon A-level difficulty, number of A levels taken, and A-level performance (variables in pale green)</p>
                  </caption>
                  <text>
                     <p>Path model of influences of background factors and GCSE performance (variables in yellow), upon A-level difficulty, number of A levels taken, and A-level performance (variables in pale green). Path coefficients and t-statistics are shown alongside paths, the thickness of paths is proportional to the path coefficient, and negative paths are shown as red, dashed lines. Paths previously shown in figure 2 are in pale grey without coefficients, to avoid undue confusion.</p>
                  </text>
                  <graphic file="1472-6920-8-21-6"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>The UCAS dataset on applicants and entrants to universities</p>
               </st>
               <p>The YCS is a useful database for looking at the entire population of individuals taking GCSEs and then A levels. However, the numbers involved are too small to be useful for looking at applicants and entrants for particular university subjects. The UCAS database is extremely large, the three years from 2003&#8211;2005 having a total of 1,484,650 applicants, 476,467 in 2003, 486,028 in 2004 and 522,155 in 2005, of whom 52,557 applied for medicine, and 23,443 were accepted at medical school. However the database is weak in terms of the richness of the background variables, having only A levels, and not GCSEs, for instance, and having only a limited number of demographic variables. And of course, of necessity, it can only look at those individuals who have applied to university. Nevertheless, the UCAS data complements and supports the YCS data.</p>
            </sec>
            <sec>
               <st>
                  <p>A level scores</p>
               </st>
               <p>Although most of UCAS = published analyses are in terms of its > tariff score = <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>, the tariff score has several disadvantages when thinking primarily about medical school applicants. The tariff combines different qualifications which are on different metrics, but that has the disadvantage that the origin of any particular tariff is far from clear. For instance, a candidate gaining B grades at three A levels will receive 3 @ 100 = 300 points, as also will a candidate gaining two As and a D (120+120+60), as also will a candidate taking six AS levels and gaining Bs in all of them (6 @ 50). The tariff also considers all such candidates equally, irrespective of the level or number of qualifications or the subjects in which they were taken. In particular, the tariff score includes General Studies A level on an equal footing with all other A levels, whereas many medical schools do not regard it as a full A level. For the present study we have therefore recalculated attainment scores <it>de novo</it>, and have considered only A levels, scoring them in the same way as for the YCS, with 10 = A, 8 = B, etc.. For most medical schools it is A levels that matter, and not combinations of A and AS levels. UCAS was unable to provide data on Scottish Highers, and therefore our analysis is restricted to those taking A levels, being typically candidates outside of Scotland.</p>
               <p>For convenience and ease of interpretation we have confined our analyses to applicants who were aged under 21 at the time of application (making them broadly equivalent to the YCS data), and who were resident in the UK, giving a total sample of 976,007. Some applicants had not taken A levels, for various reasons, in particular being resident in Scotland or Wales, and taking Highers or Baccalaureate, and we therefore considered only the 531,333 candidates who had results for at least three A level subjects (excluding General Studies). A very small number of candidates, 2642, had five or more A levels, and for practical reasons we excluded them also, leaving 528,691 applicants, 473,781 with three A levels and 54,910 with four A levels. For simplicity, for candidates with four A levels we have considered only the grades from their three best results, meaning that for all candidates the maximum number of points is 30, which is equivalent to AAA. Although this might seem to devalue the additional qualification obtained by those with four A levels, it is worth noting that while 10.5% of the applicants with three A levels had attained the maximum score of 30 points, that score was attained by 48.9% of the applicants with four A levels, showing that those taking four A levels tended to be amongst the very highest achieving of all applicants.</p>
            </sec>
            <sec>
               <st>
                  <p>A level results of UCAS applicants in relation to the YCS population</p>
               </st>
               <p>In order to compare UCAS applicants with all those taking A levels, the data from the YCS A levels were expressed in the same way as for the UCAS applicants, considering only students gaining three or four A levels at grade E or above, and for those taking four A levels, using the three best results. This score correlates highly with the total points score used in the earlier analyses (r = .957). Figure <figr fid="F7">7</figr> shows the possible scores for three A levels, from 6 (EEE) through to 30 (AAA). The YCS data were compared specifically with the 2003 UCAS applicants, since that cohort was the closest in time of the UCAS cohorts to the YCS cohort. Although the distributions are broadly similar, in particular being > censored= in a similar way at the top end, the mode in both cases being at AAA due to no higher grades being available, it is also clear that the YCS group (red bars) has somewhat more individuals at lower grades, and fewer individuals at the highest grades. UCAS applicants therefore have somewhat higher grades than the population as a whole, which would be expected as only the better A-level candidates will apply to UCAS, but otherwise have a broadly similar A level distribution to that found in YCS. .</p>
               <fig id="F7">
                  <title>
                     <p>Figure 7</p>
                  </title>
                  <caption>
                     <p>Distribution of points attained on best three A-levels (see text), for students in the YCS (red bars), and for all students applying to UCAS in 2003 (black bars)</p>
                  </caption>
                  <text>
                     <p>Distribution of points attained on best three A-levels (see text), for students in the YCS (red bars), and for all students applying to UCAS in 2003 (black bars). Grey bars in the background correspond to 6, 12, 18, 24 and 30 points, equating to EEE, DDD, CCC, BBB and AAA.</p>
                  </text>
                  <graphic file="1472-6920-8-21-7"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>A level results in UCAS applicants in relation to ethnicity</p>
               </st>
               <p>Figure <figr fid="F8">8</figr> shows the distribution of A level grades in the UCAS dataset in relation to ethnicity. The broad distributions of each group are similar to those in Figure <figr fid="F7">7</figr>, with a mode at 30 points due to censoring. A simple comparison of W and NW students finds that W students have a higher achievement than NW students (Table <tblr tid="T5">5</tblr>). Of interest is that the overall effect size comparing W with NW is B 0.140, which is similar to the effect size found in the YCS data of B 0.141 (see Table <tblr tid="T4">4</tblr>). An achievement of grades ABB or above (i.e. 26 points or equivalent, which can be regarded as a high A-level achievement), is reached by 35.7% of W applicants overall (i.e. for any subject), compared with 32.4% of NW applicants (Chi-squared = 308.8, 1df, p &lt; .001, odds ratio = 1.160, effect size = B.082).</p>
               <tbl id="T5">
                  <title>
                     <p>Table 5</p>
                  </title>
                  <caption>
                     <p>Comparison of A-level achievement in White and non-White applicants in the UCAS data, for all applicants, applicants taking Chemistry, applicants applying to medical school, and applicants accepted for medical school.</p>
                  </caption>
                  <tblbdy cols="5">
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>White Mean A-level scores (SD)</p>
                        </c>
                        <c ca="center">
                           <p>Non-White Mean A-level scores (SD)</p>
                        </c>
                        <c ca="center">
                           <p>Significance</p>
                        </c>
                        <c ca="center">
                           <p>Effect size</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="5">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>All applicants</p>
                        </c>
                        <c ca="center">
                           <p>21.92 (5.93) N = 443,038</p>
                        </c>
                        <c ca="center">
                           <p>21.09 (6.31) N = 74,262</p>
                        </c>
                        <c ca="center">
                           <p>t = 34.397 p &lt; .001</p>
                        </c>
                        <c ca="center">
                           <p>d = -.140</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>All applicants taking Chemistry</p>
                        </c>
                        <c ca="center">
                           <p>23.83 (6.03) N = 72,925</p>
                        </c>
                        <c ca="center">
                           <p>22.92 (6.35) N = 24,626</p>
                        </c>
                        <c ca="center">
                           <p>t = 20.26, p &lt; .001</p>
                        </c>
                        <c ca="center">
                           <p>d = -.151</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>All medical school applicants</p>
                        </c>
                        <c ca="center">
                           <p>27.29 (3.56) N = 15,073</p>
                        </c>
                        <c ca="center">
                           <p>26.29 (4.41) N = 8,174</p>
                        </c>
                        <c ca="center">
                           <p>t = 18.81 p &lt; .001</p>
                        </c>
                        <c ca="center">
                           <p>d = -.282</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>All medical school entrants</p>
                        </c>
                        <c ca="center">
                           <p>28.77 (1.97) N = 9,945</p>
                        </c>
                        <c ca="center">
                           <p>28.54 (2.38) N = 4,373</p>
                        </c>
                        <c ca="center">
                           <p>t = 5.89 p &lt; .001</p>
                        </c>
                        <c ca="center">
                           <p>d = -.114</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>White N (%)</p>
                        </c>
                        <c ca="center">
                           <p>Non-White N(%)</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>All applicants: percent gaining ABB or higher</p>
                        </c>
                        <c ca="center">
                           <p>158,373/443,038 = 35.7%</p>
                        </c>
                        <c ca="center">
                           <p>24,074/74,262 = 32.4%</p>
                        </c>
                        <c ca="center">
                           <p>&#967;<sup>2 </sup>= 308.8 (1) p &lt; .001</p>
                        </c>
                        <c ca="center">
                           <p>OR = .862 (CI .848 &#8211; .877) d = -.082</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>All applicants: proportion taking Chemistry</p>
                        </c>
                        <c ca="center">
                           <p>72,925/443,038 = 16.5%</p>
                        </c>
                        <c ca="center">
                           <p>24,626/74,262 33.2%</p>
                        </c>
                        <c ca="center">
                           <p>&#967;<sup>2 </sup>= 11,593 (1) p &lt; .001</p>
                        </c>
                        <c ca="center">
                           <p>OR = 2.52 (CI 2.48 &#8211; 2.56) d = .-511</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Medical school applicants: percent gaining ABB or higher</p>
                        </c>
                        <c ca="center">
                           <p>11,977/15,074 = 79.5%</p>
                        </c>
                        <c ca="center">
                           <p>5815/8174 = 71.1%</p>
                        </c>
                        <c ca="center">
                           <p>&#967;<sup>2 </sup>= 204.3 (1) p &lt; .001</p>
                        </c>
                        <c ca="center">
                           <p>OR = 0.634 (CI .60 &#8211; .68) d = -.252</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Medical school applicants: percent gaining AAA higher</p>
                        </c>
                        <c ca="center">
                           <p>6,893/15,074 = 45.7%</p>
                        </c>
                        <c ca="center">
                           <p>3098/8174 = 37.9%</p>
                        </c>
                        <c ca="center">
                           <p>&#967;<sup>2 </sup>= 132.6 (1) p &lt; .001</p>
                        </c>
                        <c ca="center">
                           <p>OR = .724 (CI .69&#8211;.77) d = -.178</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Medical school acceptances: percent gaining ABB or higher</p>
                        </c>
                        <c ca="center">
                           <p>9509/9945 = 95.6%</p>
                        </c>
                        <c ca="center">
                           <p>4109/4373 = 94.0%</p>
                        </c>
                        <c ca="center">
                           <p>&#967;<sup>2 </sup>= 17.9 (1) p &lt; .001</p>
                        </c>
                        <c ca="center">
                           <p>OR = .714 (CI .61 &#8211; .84) d = -.186</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Medical school acceptances: percent gaining AAA higher</p>
                        </c>
                        <c ca="center">
                           <p>6083/9945 = 61.2%</p>
                        </c>
                        <c ca="center">
                           <p>2586/4373 = 59.1%</p>
                        </c>
                        <c ca="center">
                           <p>&#967;<sup>2 </sup>= 5.24 (1) p = .022</p>
                        </c>
                        <c ca="center">
                           <p>OR = .919 (CI .854&#8211;.988) d = -.047</p>
                        </c>
                     </r>
                  </tblbdy>
               </tbl>
               <fig id="F8">
                  <title>
                     <p>Figure 8</p>
                  </title>
                  <caption>
                     <p>The distribution in all UCAS applicants of A level points gained by W students (red bars) and NW students (green bars)</p>
                  </caption>
                  <text>
                     <p>The distribution in all UCAS applicants of A level points gained by W students (red bars) and NW students (green bars).</p>
                  </text>
                  <graphic file="1472-6920-8-21-8"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Applicants taking A level chemistry</p>
               </st>
               <p>Although the UCAS database has 517,300 university applicants with three or more A levels, only 97,551 (18.9%) of these have taken A level chemistry, and hence can be considered as being in the > real pool = of possible entrants to medical school. Amongst those taking chemistry, 25.2% are non-white, compared with 11.8% of those not taking chemistry (odds ratio = 2.518, effect size = .510), again, a very similar figure to the odds ratio found in the YCS data of 2.735. The mean total A level points of NW students taking chemistry is significantly less than that for W students (NW: mean = 22.92, SD = 6.35, N = 24626; W: mean = 23.83, SD = 6.03, N = 72925; t = 20.26, p &lt; .001; effect size = B.151), as can be seen in Figure <figr fid="F9">9</figr>, where it is very clear that students taking A level chemistry are a high-achieving sub-group of students compared with the population in general (Figure <figr fid="F8">8</figr>).</p>
               <fig id="F9">
                  <title>
                     <p>Figure 9</p>
                  </title>
                  <caption>
                     <p>The distribution of A-level points for all UCAS applicants who have taken A-level chemistry, separately for W students (red bars) and NW students (green bars)</p>
                  </caption>
                  <text>
                     <p>The distribution of A-level points for all UCAS applicants who have taken A-level chemistry, separately for W students (red bars) and NW students (green bars).</p>
                  </text>
                  <graphic file="1472-6920-8-21-9"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Applicants to medical school</p>
               </st>
               <p>For the 23,247 applicants to medical school, of whom 8,174 (35.2%) were NW, the average number of A level points was significantly higher in the W applicants than the NW applicants, with an effect size of B.282 (see Table <tblr tid="T5">5</tblr>). Figure <figr fid="F10">10</figr> shows that the W applicants have a higher proportions of 30 points and 28 points, whereas NW applicants have a higher proportions of all other grades. In view of the non-normal distribution, a non-parametric test was used to confirm the difference in distributions (Mann-Whitney U = 54151179, z = 15.99, p &lt; .001). Table <tblr tid="T5">5</tblr> also shows that NW applicants were significantly less likely to achieve 26 points or more, which is equivalent to ABB, (odds ratio = 0.634, effect size = B.252), or achieve grades of AAA (odds ratio = .724, effect size = B.178).</p>
               <fig id="F10">
                  <title>
                     <p>Figure 10</p>
                  </title>
                  <caption>
                     <p>The distribution of A-level points for all medical school applicants, separately for W students (red bars) and NW students (green bars)</p>
                  </caption>
                  <text>
                     <p>The distribution of A-level points for all medical school applicants, separately for W students (red bars) and NW students (green bars).</p>
                  </text>
                  <graphic file="1472-6920-8-21-10"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Entrants to medical school</p>
               </st>
               <p>Of the 14,318 entrants to medical school, the average number of A level points was 28.77 (SD 1.97) in the 9,945 W entrants, and 28.54 (SD 2.38) in the 4,373 NW entrants, a highly significant difference (t = 5.89, 14316 df, p &lt; .001), with an effect size of B.114. Figure <figr fid="F11">11</figr> shows the distribution, with W candidates being more likely to have 30 and 28 points, and NW candidates to have 28 points or less. In view of the skewed distribution, a non-parametric test was used to confirm the difference in distributions (Mann-Whitney, U = 21057283, Z = 3.453, p &lt; .001). Table <tblr tid="T5">5</tblr> also shows that NW acceptances were significantly less likely than W acceptances to have achieved 26 points or more, equivalent to a grade of ABB or more (odds ratio &lt; .001; odds ratio = .721, effect size = -.186), or to achieve a grade of AAA (odds ratio = .919, effect size = B.047).</p>
               <fig id="F11">
                  <title>
                     <p>Figure 11</p>
                  </title>
                  <caption>
                     <p>The distribution of A-level points for all medical school entrants, separately for W students (red bars) and NW students (green bars)</p>
                  </caption>
                  <text>
                     <p>The distribution of A-level points for all medical school entrants, separately for W students (red bars) and NW students (green bars).</p>
                  </text>
                  <graphic file="1472-6920-8-21-11"/>
               </fig>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Exploring the censored distribution of A level attainment in W and NW students</p>
            </st>
            <p>The distribution of A level attainment in university applicants in general, shown in Figure <figr fid="F7">7</figr>, is an important one that is capable of further analysis. It can be seen that to a first approximation the distribution is normal, with a mode at about 20 points (i.e equivalent to about BCC) but with a wide distribution around that, and, in particular a clear second mode at the maximum of 30 points (AAA). It should also be noticed that the distribution tails away towards the left-hand end of the distribution, the very lowest points possible with three A levels being 6 (i.e. EEE).</p>
            <p>In interpreting this distribution, a clear distinction should be made between distributions that statisticians describe as <it>censored </it>and those described as <it>truncated</it>. In a clinical survival analysis, patients may be followed up for, say, five years. During that time, some will have died, and their precise survival will be known. Others, however, will have survived for the whole of the five years but will, inevitably, succumb at some future time, but all that can be said of them is that their survival is at least five years; these data are <it>censored</it>, the number of observations being known but their precise value being unknown. In contrast, a study may look only at a group of patients whose blood pressure is known to be at least 160 mmHg, and the distribution of blood pressure for those individuals is <it>truncated</it>, no individuals being included below the threshold level. The key difference is that for the censored data, all of those surviving for beyond the measured time are included in the final bin of the histogram, whereas in the case of truncation, those not included are not shown anywhere in the histogram.</p>
            <p>The A level data of Figures <figr fid="F7">7</figr> to <figr fid="F11">11</figr> are truncated at the lower bound, as individuals are only included who have gained a minimum of 6 points from 3 A levels. In contrast, at the upper end of the distribution, individuals who could have gained more than 30 points (AAA) if the marking scheme had allowed it, by, for instance, including hypothetical grades of A*, A** and A***, gaining 12, 14 and 16 points respectively, are forced into the highest bin of the histogram, AAA, because they cannot achieve any higher result given the marking scheme. The A level distribution is therefore censored at the top end (and that accounts for the clear second mode in Figures <figr fid="F7">7</figr> to <figr fid="F11">11</figr> at 30 points).</p>
            <p>When a distribution is censored, as in Figures <figr fid="F7">7</figr> to <figr fid="F11">11</figr>, then although a simple mean can be calculated in the traditional way, it makes more sense to consider the mean of the true or latent, underlying, distribution which had generated the censored distribution that was actually found. The latent distributions for all UCAS applicants were calculated separately for W and NW candidates, and the simple censored distribution was fitted (further details are described elsewhere <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>). For medical school applicants and acceptances, the parameters were estimated jointly for both applicants and acceptances, using a two-parameter logistic function to estimate the selection ratio at each level of achievement, with all parameters calculated separately for W and NW applicants. As an example of the fitting process, Figure <figr fid="F12">12</figr> shows the observed distribution of A level scores for all applicants, all applicants for medical school, and all acceptances for medical school. The wide black bars in Figure <figr fid="F12">12a</figr> show the percentage of all UCAS candidates gaining the various grades. The ceiling at 30 points (AAA) is clearly shown, and is gained by 14.5% of these applicants. The modelled, normal distribution, censored at 30 points, showed a reasonable fit to the data (yellow bars, mean = 22.1, SD = 6.60). The red bars in Figure <figr fid="F1">1a</figr> show the expected distribution of applicants gaining more than 30 points from 3 A levels, assuming similar scaling of the current A levels, and with A* = 12, A** = 14, etc.. Figures <figr fid="F12">12b</figr> and <figr fid="F12">12c</figr> show similar distributions for medical school applicants and acceptances, the true mean (SD) being estimated as 28.3 (5.5) for applicants and 30.1 (4.5) for acceptances.</p>
            <fig id="F12">
               <title>
                  <p>Figure 12</p>
               </title>
               <caption>
                  <p>The black bars show the observed distributions of A-level grades attained by a) all UCAS applicants, b) all medical school applicants, and c) all medical school entrants</p>
               </caption>
               <text>
                  <p>The black bars show the observed distributions of A-level grades attained by a) all UCAS applicants, b) all medical school applicants, and c) all medical school entrants. The yellow bars show the fitted distribution which has been estimated on the basis of a normal distribution which has been truncated at 6 points and censored at 30 points, with all individuals scoring more than 30 points being included in the 30 point bin. The orange bars show the proportions of individuals who would have been expected to gain 32, 34, 36 etc. points, where 36 points can be construed as three A* grades, 42 points three A** grades, etc..</p>
               </text>
               <graphic file="1472-6920-8-21-12"/>
            </fig>
            <p>Table <tblr tid="T6">6</tblr> shows the estimated parameters of the uncensored distributions for W and NW applicants, medical school applicants and medical school acceptances. The first four columns are relatively uncomplicated and show the fitted mean and standard deviation of A-level grades for all university applicants, the NW applicants having a lower mean and slightly higher standard deviation than the W applicants. The reminder of the table is best interpreted in conjunction with Figure <figr fid="F13">13</figr>, which shows the estimated distributions and selection functions for medical school applicants and acceptances. The red and green circles on Figure <figr fid="F13">13</figr> show the fitted distributions of W and NW medical school applicants respectively, the NW distribution being clearly shifted to the left by about 0.6 standard deviations (the precise parameters are shown in the last three columns of Table <tblr tid="T6">6</tblr>). Similarly red and green squares show the fitted A-level grade distributions of W and NW medical school acceptances, the parameters being shown at the bottom of Table <tblr tid="T6">6</tblr>. The selection functions, which show the proportions of applicants who are accepted, in relation to A-level points, are shown as separately for the W and NW applicants as the green and red dashed lines, and the parameters of the function are shown in Table <tblr tid="T6">6</tblr>.</p>
            <tbl id="T6">
               <title>
                  <p>Table 6</p>
               </title>
               <caption>
                  <p>Parameter estimates for the latent (uncensored) distributions in all applicants, and in medical school applicants and acceptances, separately for W and NW candidates.</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="3" ca="center">
                        <p>All applicants</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>Medical school applicants and acceptances</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>W</p>
                     </c>
                     <c ca="center">
                        <p>NW</p>
                     </c>
                     <c ca="center">
                        <p>Effect size</p>
                     </c>
                     <c ca="center">
                        <p>W</p>
                     </c>
                     <c ca="center">
                        <p>NW</p>
                     </c>
                     <c ca="center">
                        <p>Effect size</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Estimated true mean in applicants</p>
                     </c>
                     <c ca="center">
                        <p>22.29</p>
                     </c>
                     <c ca="center">
                        <p>21.45</p>
                     </c>
                     <c ca="center">
                        <p>-.129</p>
                     </c>
                     <c ca="center">
                        <p>28.63</p>
                     </c>
                     <c ca="center">
                        <p>25.63</p>
                     </c>
                     <c ca="center">
                        <p>-.592</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Estimated true SD in applicants</p>
                     </c>
                     <c ca="center">
                        <p>6.53</p>
                     </c>
                     <c ca="center">
                        <p>6.92</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>5.06</p>
                     </c>
                     <c ca="center">
                        <p>5.53</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Selection process: Slope</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>.225</p>
                     </c>
                     <c ca="center">
                        <p>.200</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Selection process: Intercept</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>26.00</p>
                     </c>
                     <c ca="center">
                        <p>29.9</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Estimated true mean in acceptances</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>30.80</p>
                     </c>
                     <c ca="center">
                        <p>30.51</p>
                     </c>
                     <c ca="center">
                        <p>-.068</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Estimated true SD in acceptances</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>4.23</p>
                     </c>
                     <c ca="center">
                        <p>4.36</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <fig id="F13">
               <title>
                  <p>Figure 13</p>
               </title>
               <caption>
                  <p>The estimated, underlying, uncensored distributions of A-level ability in medical school applicants and acceptances are shown for W candidates (red lines and open points) and NW candidates (green lines and solid points)</p>
               </caption>
               <text>
                  <p>The estimated, underlying, uncensored distributions of A-level ability in medical school applicants and acceptances are shown for W candidates (red lines and open points) and NW candidates (green lines and solid points). The leftmost two lines with circles joined by thick lines show the fitted distribution for the percentages of <it>applicants </it>with particular A-level points, whereas the rightmost two solid lines with square points show the fitted distribution for the percentages of <it>entrants </it>with particular numbers of A-level points; all four lines use the left-hand ordinate. The dashed, ascending lines use the right-hand ordinate and show the fitted proportions of candidates with particular number of A-level points who are accepted, red for W applicants and green for NW applicants.</p>
               </text>
               <graphic file="1472-6920-8-21-13"/>
            </fig>
            <p>The effect size of -.129 for the A level performance of all NW applicants is similar both to the simple, unadjusted, effect size of -.141 found for applicants in the UCAS data (Table <tblr tid="T5">5</tblr>), and the effect size of -.140 found for A level points in the YCS data (Table <tblr tid="T4">4</tblr>). In contrast the true effect size of -0.592 for medical school applicants shown in Table <tblr tid="T6">6</tblr> is considerably larger than the effect size calculated from the raw data that is shown in Table <tblr tid="T5">5</tblr> of -.282, showing that the censoring of the data has distorted the true picture. Similarly, the raw effect size of -.114 for medical school acceptances in Table <tblr tid="T5">5</tblr> is larger than the adjusted effect size of -.068 in Table <tblr tid="T6">6</tblr>. Censoring of the data by the artificial ceiling of AAA in A levels therefore distorts the estimates of effect size.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>This paper has analysed two very different databases, in order to answer questions about the academic background and qualifications of medical students from ethnic minorities. The questions it has asked have, inevitably, been simplified, particularly in so far as the studies have looked only at A level grades, in those students who have taken three or four A levels, and they have looked only at a division of the population into just two groups, White and non-White. No doubt far more subtle analyses could carried out, but those presented here are sufficient for a basic analysis of the issue.</p>
         <p>The starting point for the paper is that in a range of studies, medical students and doctors from ethnic minorities have underperformed in different types of examination, both undergraduate and postgraduate. Although it is inevitably tempting to attribute underperformance in some examinations, such as clinical assessments, to factors such as direct or indirect discrimination on the part of examiners, perhaps because of cultural insensitivity or other reasons, all such explanations have problems in explaining why candidates from ethnic minorities underperform in assessments of knowledge which use multiple-choice questions and are marked by a computer.</p>
         <p>A crucial but implicit assumption of many studies of underperformance by students from ethnic minorities is that W and NW students are initially equivalent in their academic ability. Since selection has taken place on the basis of an academic criterion, then it might se