<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>1743-422X-5-48</ui>
	<ji>1743-422X</ji>
	<fm>
		<dochead>Research</dochead>
		<bibl>
			<title>
				<p>Bioinformatic analysis suggests that the Orbivirus VP6 cistron encodes an overlapping gene</p>
			</title>
			<aug>
				<au id="A1" ca="yes">
					<snm>Firth</snm>
					<mi>E</mi>
					<fnm>Andrew</fnm>
					<insr iid="I1"/>
					<email>A.Firth@ucc.ie</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Department of Biochemistry, BioSciences Institute, University College Cork, Cork, Ireland</p>
				</ins>
			</insg>
			<source>Virology Journal</source>
			<issn>1743-422X</issn>
			<pubdate>2008</pubdate>
			<volume>5</volume>
			<issue>1</issue>
			<fpage>48</fpage>
			<url>http://www.virologyj.com/content/5/1/48</url>
			<xrefbib>
				<pubidlist><pubid idtype="pmpid">18489030</pubid><pubid idtype="doi">10.1186/1743-422X-5-48</pubid>
				</pubidlist></xrefbib>
		</bibl>
		<history>
			<rec>
				<date>
					<day>25</day>
					<month>3</month>
					<year>2008</year>
				</date>
			</rec>
			<acc>
				<date>
					<day>14</day>
					<month>4</month>
					<year>2008</year>
				</date>
			</acc>
			<pub>
				<date>
					<day>14</day>
					<month>4</month>
					<year>2008</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2008</year>
			<collab>Firth; licensee BioMed Central Ltd.</collab>
			<note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st>
					<p>The genus <it>Orbivirus </it>includes several species that infect livestock &#8211; including Bluetongue virus (BTV) and African horse sickness virus (AHSV). These viruses have linear dsRNA genomes divided into ten segments, all of which have previously been assumed to be monocistronic.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st>
					<p>Bioinformatic evidence is presented for a short overlapping coding sequence (CDS) in the <it>Orbivirus </it>genome segment 9, overlapping the VP6 cistron in the +1 reading frame. In BTV, a 77&#8211;79 codon AUG-initiated open reading frame (hereafter ORFX) is present in all 48 segment 9 sequences analysed. The pattern of base variations across the 48-sequence alignment indicates that ORFX is subject to functional constraints at the amino acid level (even when the constraints due to coding in the overlapping VP6 reading frame are taken into account; MLOGD software). In fact the translated ORFX shows greater amino acid conservation than the overlapping region of VP6. The ORFX AUG codon has a strong Kozak context in all 48 sequences. Each has only one or two upstream AUG codons, always in the VP6 reading frame, and (with a single exception) always with weak or medium Kozak context. Thus, in BTV, ORFX may be translated via leaky scanning. A long (83&#8211;169 codon) ORF is present in a corresponding location and reading frame in all other <it>Orbivirus </it>species analysed except Saint Croix River virus (SCRV; the most divergent). Again, the pattern of base variations across sequence alignments indicates multiple coding in the VP6 and ORFX reading frames.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusion</p>
					</st>
					<p>At ~9.5 kDa, the putative ORFX product in BTV is too small to appear on most published protein gels. Nonetheless, a review of past literature reveals a number of possible detections. We hope that presentation of this bioinformatic analysis will stimulate an attempt to experimentally verify the expression and functional role of ORFX, and hence lead to a greater understanding of the molecular biology of these important pathogens.</p>
				</sec>
			</sec>
		</abs>
	</fm>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p>The <it>Orbivirus </it>genus is one of &#8805;12 genera within the family <it>Reoviridae</it>. The <it>Reoviridae </it>have segmented linear dsRNA genomes. There are 9&#8211;12 segments <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> and these are usually, but not always, monocistronic. Subgenomic RNAs are unknown. <it>Orbivirus </it>genomes have 10 segments. Many species infect ruminants while some infect humans. Transmission is via arthropods &#8211; including midges, ticks and mosquitoes. The type species is Bluetongue virus (BTV), which causes severe and sometimes fatal disease, particularly in sheep. BTV is endemic in many tropical countries, but there have also been recent outbreaks in Europe <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. Another species is African horse sickness virus (AHSV), which causes a fatal disease of horses. AHSV is endemic in many parts of sub-Saharan Africa, but has made incursions into Europe <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. Recent outbreaks of BTV in Europe may be a consequence of climate change &#8211; allowing the midge vectors to expand their range <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>.</p>
			<p>The <it>Orbivirus </it>proteins, structure, assembly and replication have been reviewed in <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. The BTV core is composed of two major proteins (VP3 and VP7). Transcription complexes, composed of three minor proteins (VP1 &#8211; polymerase, VP4 &#8211; capping enzyme, and VP6 &#8211; helicase), are located inside the core. Transcription occurs within the intact core and full-length capped mRNAs from each of the genome segments are fed out into the cytoplasm for translation. An outer capsid (VP2 and VP5) surrounds the core, but is removed during cell entry. There are four non-structural proteins &#8211; NS1, NS2 and NS3/3A. VP6 is a hydrophilic, basic protein that binds dsRNA and other nucleic acids and functions as the viral helicase <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. In some, but not all, BTV serotypes, VP6 migrates as a closely-spaced doublet <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. This is apparently because, in these serotypes, the first VP6 AUG codon has weak Kozak context while a second in-frame AUG codon has medium context.</p>
			<p>The genomes of RNA viruses are under strong selective pressure to compress maximal coding and regulatory information into minimal sequence space. Thus overlapping CDSs are particularly common in such viruses. Such CDSs can be difficult to detect using conventional gene-finding software <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, especially when short. The software package MLOGD, however, was designed specifically for locating short overlapping CDSs in sequence alignments and overcomes many of the difficulties with alternative methods <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. MLOGD includes explicit models for sequence evolution in double-coding regions as well as models for single-coding and non-coding regions. It can be used to predict whether query ORFs are likely to be coding, via a likelihood ratio test, where the null model comprises any known CDSs and the alternative model comprises the known CDSs plus the query ORF. MLOGD has been tested extensively using thousands of known virus CDSs as a test set, and it has been shown that, for overlapping CDSs, a total of just 20 independent base variations are sufficient to detect a new CDS with ~90% confidence.</p>
			<p>Using MLOGD, we recently identified &#8211; and subsequently experimentally verified &#8211; a new short CDS in the <it>Potyviridae </it>that overlaps the polyprotein cistron but is translated in the +2 reading frame <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. When we applied MLOGD to the <it>Orbivirus </it>genome we also found evidence for a short CDS overlapping the VP6 cistron. Here we describe the bioinformatic analysis.</p>
		</sec>
		<sec>
			<st>
				<p>Results</p>
			</st>
			<sec>
				<st>
					<p>Identification in BTV using MLOGD</p>
				</st>
				<p>The putative new CDS, ORFX, was first identified in a BTV sequence alignment, using MLOGD. In the RefSeq [GenBank: <ext-link ext-link-type="gen" ext-link-id="NC_006008">NC_006008</ext-link>] (1049 nt), ORFX has coordinates 182..415 (77 codons) and is therefore completely contained within the VP6 cistron (16..1005), overlapping it in the +1 reading frame (Figure <figr fid="F1">1</figr>). When applied to an alignment of 48 BTV sequences (see Methods; pairwise divergences &#8804;0.21 base variations per nucleotide and total alignment divergence ~0.77 independent base variations per column in the ORFX region), MLOGD detected a strong coding signature for ORFX (Figures <figr fid="F2">2</figr>, <figr fid="F3">3</figr>). There are ~180 independent base variations across the alignment in the ORFX region, thus providing MLOGD with a robust signal. Formally, and within the MLOGD model, <it>p </it>&lt; 10<sup>-40</sup>. Indeed, Figure <figr fid="F2">2</figr> shows four non-overlapping &#8211; and hence completely independent &#8211; positively scoring windows in the ORFX region. Moreover, the MLOGD results showed that, within the ORFX region, ORFX is <it>more </it>conserved at the amino acid level than VP6 (Figure <figr fid="F2">2</figr>). Finally, inspection of the MLOGD output showed that the ORF is present in all of the 48 sequences (i.e. no premature termination codons; Figure <figr fid="F2">2</figr>).</p>
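				<p>As a quick arithmetic check on the coordinates quoted above, the following Python sketch recomputes the ORFX frame offset, length and codon count. The helper names (frame_offset, codon_count) are illustrative and are not part of MLOGD. The sketch assumes 1-based inclusive coordinates in which the quoted range includes the terminal stop codon; that assumption is what reconciles 234 nt with 77 codons.</p>

```python
def frame_offset(query_start: int, ref_start: int) -> int:
    """Reading frame of a query ORF relative to a reference CDS (0, 1 or 2)."""
    return (query_start - ref_start) % 3


def codon_count(start: int, end: int, includes_stop: bool = True) -> int:
    """Number of sense codons in a 1-based inclusive coordinate range."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("range is not a whole number of codons")
    # Assumption: the quoted range ends with the stop codon, which is not counted.
    return length // 3 - (1 if includes_stop else 0)


# Coordinates quoted in the text for NC_006008.
VP6 = (16, 1005)
ORFX = (182, 415)

orfx_frame = frame_offset(ORFX[0], VP6[0])
orfx_nt = ORFX[1] - ORFX[0] + 1
orfx_codons = codon_count(*ORFX)
vp6_span = range(VP6[0], VP6[1] + 1)
contained = ORFX[0] in vp6_span and ORFX[1] in vp6_span

print(f"ORFX frame relative to VP6: +{orfx_frame}")
print(f"ORFX length: {orfx_nt} nt, {orfx_codons} codons")
print(f"ORFX contained within VP6: {contained}")
```

				<p>Under these assumptions the sketch reports a +1 frame, 234 nt and 77 codons, in agreement with the values given in the text and in Table 2.</p>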
				<fig id="F1">
					<title>
						<p>Figure 1</p>
					</title>
					<caption>
						<p>Genome map for BTV</p>
					</caption>
					<text>
						<p><b>Genome map for BTV</b>. The putative new coding sequence &#8211; ORFX &#8211; is located on segment 9 (RNA9), in the +1 reading frame relative to the overlapping VP6 cistron. Molecular masses are based on the unmodified amino acid sequences.</p>
					</text>
					<graphic file="1743-422X-5-48-1"/>
				</fig>
				<fig id="F2">
					<title>
						<p>Figure 2</p>
					</title>
					<caption>
						<p>MLOGD statistics for the alignment of 48 BTV sequences</p>
					</caption>
					<text>
						<p><b>MLOGD statistics for the alignment of 48 BTV sequences</b>. The input alignment comprised a CLUSTALW [39] alignment of the VP6 amino acid sequences only, back-translated to nucleotide sequences. <b>(1) </b>The positions of alignment gaps in each of the 48 sequences. In fact most of the alignment is ungapped, though a few sequences are incomplete. <b>(2)&#8211;(4) </b>The positions of stop codons in each of the 48 sequences in each of the three forward reading frames. Note the conserved absence of stop codons in the +0 frame (i.e. the VP6 CDS) and in the +1 frame in the ORFX region. <b>(5)&#8211;(8) </b>MLOGD sliding-window plots. Window size = 20 codons. Step size = 10 codons. Each window is represented by a small circle (showing the likelihood ratio score for that window), and grey bars showing the width (ends) of the window. See [16] for further details of the MLOGD software. In <b>(5)&#8211;(6) </b>the null model, in each window, is that the sequence is non-coding, while the alternative model is that the sequence is coding in the window frame. Positive scores favour the alternative model. There is a strong coding signature in the +0 frame (5) throughout the VP6 CDS, except where the VP6 CDS overlaps ORFX. In this region there is a strong coding signature in the +1 frame (6) indicating that ORFX is subject to stronger functional constraints than the overlapping section of VP6. In <b>(7)&#8211;(8) </b>the null model, in each window, is that only the VP6 frame is coding, while the alternative model is that both the VP6 frame and the window frame are coding. Only the +1 (7) and +2 (8) frames are shown because the +0 frame is the VP6 frame which is included in the null model. Scores are generally negative with occasional random scatter into low positive scores, except for the ORFX region which has consecutive high-positively scoring windows (7). 
Note that there are four non-overlapping &#8211; and hence completely independent &#8211; positively scoring windows in the ORFX region (7). Formally, and within the MLOGD model, <it>p </it>&lt; 10<sup>-40</sup>. <b>(9) </b>Genome map for the reference sequence [GenBank: <ext-link ext-link-type="gen" ext-link-id="NC_006008">NC_006008</ext-link>]. <b>(10) </b>Phylogenetically summed sequence divergence (mean number of base variations per nucleotide) for the sequences that contribute to the statistics at each position in the alignment. In any particular column, some sequences may be omitted from the statistical calculations due to alignment gaps. Statistics in regions with lower summed divergence (i.e. partially gapped regions) have a lower signal-to-noise ratio.</p>
					</text>
					<graphic file="1743-422X-5-48-2"/>
				</fig>
				<fig id="F3">
					<title>
						<p>Figure 3</p>
					</title>
					<caption>
						<p>MLOGD statistics for BTV, AHSV, PALV and PHSV/YUOV alignments</p>
					</caption>
					<text>
						<p><b>MLOGD statistics for BTV, AHSV, PALV and PHSV/YUOV alignments</b>. Output plots from MLOGD used in the 'Test Query CDS' mode, applied to the ORFX region in BTV, AHSV, PALV and PHSV/YUOV sequence alignments. See [16] for full details of the MLOGD software. The null model comprises the VP6 CDS and the query CDS is ORFX. In each plot, the top panel displays the raw log(LR) statistics at each alignment position. There is a separate track for each reference &#8211; non-reference sequence pair (labelled at the right, together with the pairwise divergences; albeit not legible for the BTV alignment since it contains so many &#8211; i.e. 48 &#8211; sequences). Stop codons (of which there are none except 3' terminal ones) in each of the VP6 and ORFX reading frames, and alignment gaps for each sequence, are marked on the appropriate tracks. The second panel displays the &#931;<sub>tree </sub>log(LR) statistic at each alignment position, where 'tree' represents a phylogenetic tree &#8211; see [16]. The third and fourth panels display sliding window means of the statistics in the first and second panels, respectively. The fifth panel shows the locations of the null and alternative model CDSs (i.e. VP6 and ORFX, respectively). The sixth panel shows the summed mean sequence divergence (base variations per alignment nt column) for the sequence pairs that contribute to the &#931;<sub>tree </sub>log(LR) statistic at each alignment position. This is a measure of the information available at each alignment position (e.g. partially gapped regions have lower summed mean sequence divergence). The predominantly positive values in the fourth panel indicate that ORFX is subject to functional constraints, at the amino acid level, over the majority of its length.</p>
					</text>
					<graphic file="1743-422X-5-48-3"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Nucleotide sequence analysis in BTV</p>
				</st>
				<p>In the 48-sequence BTV alignment (not shown), one can observe the following:</p>
				<p>&#8226; The ORFX AUG initiation codon is present in all 48 sequences and is at the same location in the alignment. All have 'G' at +4; 46/48 have 'A' at -3 and 2/48 have 'G' at -3, giving the ORFX AUG codon a strong Kozak context <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>.</p>
				<p>&#8226; As far as amino acid constraints in the VP6 reading frame are concerned, there is no reason for the ORFX AUG codon to be conserved. In every sequence, the overlapping VP6-frame codons are gAU_Ggu. GAU codes for <it>Asp</it>, but <it>Asp </it>could also be encoded by GAC.</p>
				<p>&#8226; Many sequences contain ORFX-frame termination codons just two codons 5' of the AUG codon. Thus initiation of ORFX at an upstream non-AUG codon, or via other non-canonical mechanisms, appears unlikely.</p>
				<p>&#8226; ORFX is always in the +1 frame relative to the VP6 reading frame.</p>
				<p>&#8226; The length of ORFX is 77 aa in 44/48 sequences (UAG termination codon) and 79 aa in 4/48 sequences (UAA termination codon). The alignment is gap-free within ORFX.</p>
				<p>&#8226; All AUG codons upstream of the ORFX AUG codon are in the VP6 reading frame. There are a maximum of two upstream AUG codons in any given sequence, and the Kozak contexts of the upstream AUG codons are nearly always weak or medium (Table <tblr tid="T1">1</tblr>).</p>
				<tbl id="T1">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>Kozak contexts of VP6 AUG codons in BTV. Kozak contexts of AUG codons upstream of ORFX in BTV for the 34 segment 9 sequences which appear to contain the complete 5'UTR. Kozak contexts are assumed to be 'strong' if there is 'G' at +4 and an 'A' or 'G' at -3, 'medium' if only one of these is present, and 'weak' if neither is present.</p>
					</caption>
					<tblbdy cols="10">
						<r>
							<c cspan="4" ca="center">
								<p>One upstream AUG codon</p>
							</c>
							<c cspan="6" ca="center">
								<p>Two upstream AUG codons</p>
							</c>
						</r>
						<r>
							<c cspan="4">
								<hr/>
							</c>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c cspan="2" ca="center">
								<p>First</p>
							</c>
							<c ca="center">
								<p>Strength</p>
							</c>
							<c ca="center">
								<p>Number</p>
							</c>
							<c cspan="2" ca="center">
								<p>First</p>
							</c>
							<c cspan="2" ca="center">
								<p>Second</p>
							</c>
							<c ca="center">
								<p>Strength</p>
							</c>
							<c ca="center">
								<p>Number</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>-3</p>
							</c>
							<c ca="center">
								<p>+4</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>-3</p>
							</c>
							<c ca="center">
								<p>+4</p>
							</c>
							<c ca="center">
								<p>-3</p>
							</c>
							<c ca="center">
								<p>+4</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c cspan="4">
								<hr/>
							</c>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>G</p>
							</c>
							<c ca="center">
								<p>C</p>
							</c>
							<c ca="center">
								<p>medium</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
							<c ca="center">
								<p>C</p>
							</c>
							<c ca="center">
								<p>U</p>
							</c>
							<c ca="center">
								<p>U</p>
							</c>
							<c ca="center">
								<p>G</p>
							</c>
							<c ca="center">
								<p>weak-medium</p>
							</c>
							<c ca="center">
								<p>15</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>A</p>
							</c>
							<c ca="center">
								<p>A</p>
							</c>
							<c ca="center">
								<p>medium</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
							<c ca="center">
								<p>C</p>
							</c>
							<c ca="center">
								<p>U</p>
							</c>
							<c ca="center">
								<p>G</p>
							</c>
							<c ca="center">
								<p>C</p>
							</c>
							<c ca="center">
								<p>weak-medium</p>
							</c>
							<c ca="center">
								<p>9</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>C</p>
							</c>
							<c ca="center">
								<p>U</p>
							</c>
							<c ca="center">
								<p>weak</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
							<c ca="center">
								<p>C</p>
							</c>
							<c ca="center">
								<p>U</p>
							</c>
							<c ca="center">
								<p>A</p>
							</c>
							<c ca="center">
								<p>A</p>
							</c>
							<c ca="center">
								<p>weak-medium</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>C</p>
							</c>
							<c ca="center">
								<p>U</p>
							</c>
							<c ca="center">
								<p>A</p>
							</c>
							<c ca="center">
								<p>G</p>
							</c>
							<c ca="center">
								<p>weak-strong</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>C</p>
							</c>
							<c ca="center">
								<p>U</p>
							</c>
							<c ca="center">
								<p>C</p>
							</c>
							<c ca="center">
								<p>C</p>
							</c>
							<c ca="center">
								<p>weak-weak</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
				<p>&#8226; There is only a single AUG codon (in a single sequence) in the purine-rich ~70 nt region (Figure <figr fid="F4">4</figr>) directly upstream of the ORFX AUG codon.</p>
				<fig id="F4">
					<title>
						<p>Figure 4</p>
					</title>
					<caption>
						<p>Nucleotide frequencies for segment 9</p>
					</caption>
					<text>
						<p><b>Nucleotide frequencies for segment 9</b>. Nucleotide frequencies in 60 nt running windows along each <it>Orbivirus </it>segment 9 RefSeq. 'A' &#8211; red, 'C' &#8211; green, 'G' &#8211; blue, 'U' &#8211; purple. Horizontal black bars represent the locations of the VP6 CDS and ORFX (the grey bar represents ORFXb in SCRV). Except for SCRV, the sequences are A- or AG-rich, but they also have an A-rich peak just upstream of ORFX.</p>
					</text>
					<graphic file="1743-422X-5-48-4"/>
				</fig>
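				<p>The Kozak classification used in the observations above and in Table 1 can be written as a small function. The following Python sketch is illustrative only: the function name kozak_strength is hypothetical, and the rule encoded is the one stated in the Table 1 caption ('G' at +4 and 'A' or 'G' at -3). The rows are the (-3, +4) nucleotide pairs transcribed from Table 1, so the sketch simply confirms that the tabulated strengths follow from the stated rule.</p>

```python
def kozak_strength(minus3: str, plus4: str) -> str:
    """Classify an AUG context from the nucleotides at -3 and +4."""
    minus3_ok = minus3.upper() in ("A", "G")   # purine at -3
    plus4_ok = plus4.upper() == "G"            # G at +4
    return ("weak", "medium", "strong")[int(minus3_ok) + int(plus4_ok)]


# Sequences with one upstream AUG: (-3, +4, strength given in Table 1).
ONE_UPSTREAM = [("G", "C", "medium"), ("A", "A", "medium"), ("C", "U", "weak")]

# Sequences with two upstream AUGs: ((-3, +4) first, (-3, +4) second, strengths).
TWO_UPSTREAM = [
    (("C", "U"), ("U", "G"), "weak-medium"),
    (("C", "U"), ("G", "C"), "weak-medium"),
    (("C", "U"), ("A", "A"), "weak-medium"),
    (("C", "U"), ("A", "G"), "weak-strong"),
    (("C", "U"), ("C", "C"), "weak-weak"),
]

one_ok = all(kozak_strength(m3, p4) == s for m3, p4, s in ONE_UPSTREAM)
two_ok = all(
    f"{kozak_strength(*first)}-{kozak_strength(*second)}" == s
    for first, second, s in TWO_UPSTREAM
)

# ORFX AUG: 'G' at +4 in all sequences, 'A' or 'G' at -3.
orfx_contexts = {kozak_strength("A", "G"), kozak_strength("G", "G")}

print(one_ok, two_ok, orfx_contexts)
```

				<p>Both checks pass, and both observed ORFX contexts classify as strong, consistent with the statements above.</p>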
			</sec>
			<sec>
				<st>
					<p>Nucleotide sequence analysis in other Orbivirus RefSeqs</p>
				</st>
				<p>The five non-BTV <it>Orbivirus </it>GenBank RefSeqs (see Methods) were inspected for a long ORF in the same location and reading frame as ORFX relative to the annotated VP6 CDS. Such an ORF was found in all RefSeqs except SCRV (Figure <figr fid="F5">5</figr>). The ORFX lengths are 143, 111, 113 and 83 codons in, respectively, AHSV, PHSV, YUOV and PALV. We propose (see Discussion) that ORFX is not present in SCRV. The following AUG codons are (potentially) used in the various RefSeqs (Kozak contexts &#8211; in parentheses &#8211; are assumed to be 'strong' if there is 'G' at +4 and an 'A' or 'G' at -3, 'medium' if only one of these is present, and 'weak' if neither is present):</p>
				<fig id="F5">
					<title>
						<p>Figure 5</p>
					</title>
					<caption>
						<p>Segment 9 genome maps for six Orbivirus species</p>
					</caption>
					<text>
						<p><b>Segment 9 genome maps for six Orbivirus species</b>. Genome maps for segment 9 of the six <it>Orbivirus </it>RefSeqs in GenBank, showing the location of putative ORFX homologues. In SCRV, no long ORF was found in the right location and frame; the two ORFs indicated here are separated by a stop codon. A phylogenetic tree for the six <it>Orbivirus </it>VP6 amino acid sequences (columns with alignment gaps excluded; neighbour-joining tree; numbers indicate bootstrap support [out of 1000]; scale bar represents the number of substitutions per site; tree produced with CLUSTALX [39]) is given at left.</p>
					</text>
					<graphic file="1743-422X-5-48-5"/>
				</fig>
				<p>BTV: AUG1 (weak) and AUG2 (medium) in VP6 frame. AUG3 (strong) in ORFX frame. AUG[4-10] also in ORFX frame.</p>
				<p>AHSV: AUG1 (weak) in VP6 frame. AUG2 (strong) in ORFX frame. AUG[3-10] also in ORFX frame.</p>
				<p>PALV: AUG1 (weak) in VP6 frame. AUG2 (strong) in ORFX frame. AUG[3-7] also in ORFX frame.</p>
				<p>PHSV: AUG1 (weak) in VP6 frame. AUG2 (medium) in ORFX frame (1 codon ORF). AUG3 (medium) in +2 frame (10 codon ORF). AUG4 (weak) and AUG5 (strong) in ORFX frame. AUG[6-7] also in ORFX frame.</p>
				<p>YUOV: AUG1 (weak) in VP6 frame. AUG2 (medium) in ORFX frame (1 codon ORF). AUG3 (medium) in +2 frame (21 codon ORF; overlaps AUG4 [strong; +2 frame] and AUG5 [medium; VP6 frame]). AUG6 (medium), AUG7 (strong), AUG8 (strong) and AUG9 (medium) in ORFX frame.</p>
				<p>SCRV: AUG1 (medium) and AUG2 (strong) in VP6 frame. AUG3 (medium) in ORFX frame (1 codon ORF). AUG4 (medium), AUG5 (strong) and AUG6 (strong) in VP6 frame. AUG7 (weak) and AUG8 (strong) in ORFX frame (ORFXa; Figure <figr fid="F5">5</figr>). AUG9 (weak) and AUG10 (weak) in ORFX frame (ORFXb; Figure <figr fid="F5">5</figr>).</p>
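				<p>A listing of this kind can be produced by scanning the mRNA for AUG codons and recording, for each, its reading frame relative to the first AUG and its Kozak context. The following Python sketch illustrates the idea. The function names and the 20 nt toy sequence are hypothetical and are not taken from any <it>Orbivirus </it>RefSeq; the sketch also assumes that the first AUG defines the VP6 frame, which holds for the RefSeqs listed above.</p>

```python
def kozak_strength(minus3: str, plus4: str) -> str:
    """'strong' if G at +4 and A/G at -3, 'medium' if one, 'weak' if neither."""
    score = int(minus3 in ("A", "G")) + int(plus4 == "G")
    return ("weak", "medium", "strong")[score]


def scan_augs(mrna: str):
    """Return (1-based position, frame relative to first AUG, Kozak strength)."""
    mrna = mrna.upper().replace("T", "U")
    starts = [i for i in range(len(mrna) - 2) if mrna[i:i + 3] == "AUG"]
    hits = []
    for i in starts:
        # The -3 and +4 positions must both exist to classify the context.
        if i >= 3 and len(mrna) > i + 3:
            strength = kozak_strength(mrna[i - 3], mrna[i + 3])
        else:
            strength = "undetermined"
        hits.append((i + 1, (i - starts[0]) % 3, strength))
    return hits


# Hypothetical toy mRNA: a weak AUG followed by a strong AUG in the +1 frame.
TOY_MRNA = "GUUCUUAUGCAGGAUGGUAA"
toy_hits = scan_augs(TOY_MRNA)

for position, frame, strength in toy_hits:
    print(f"AUG at nt {position}: frame +{frame}, {strength} context")
```

				<p>For the toy sequence, the first AUG is in frame +0 with weak context and the second is in frame +1 with strong context, which is the arrangement that would favour leaky scanning.</p>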
			</sec>
			<sec>
				<st>
					<p>MLOGD analysis of ORFX coding potential</p>
				</st>
				<p>MLOGD cannot be used effectively on an alignment of the six RefSeqs because the pairwise divergences are too great. However, it can be used on other within-species alignments. Alignments were constructed for (a) the 48 BTV sequences, (b) the 3 AHSV sequences, (c) the 11 PALV sequences (183 nt, partial), and (d) the PHSV and YUOV RefSeqs (see Methods). PHSV and YUOV are the two most closely related of the six RefSeqs and are not too divergent for MLOGD. MLOGD results for ORFX are given in Table <tblr tid="T2">2</tblr> and Figure <figr fid="F3">3</figr>. ORFX is present in all the aligned sequences (no premature termination codons) and, in each alignment, MLOGD detects a strong coding signature for ORFX. ORFX is longest in the three AHSV sequences &#8211; the maximal lengths being 143 codons in [GenBank: <ext-link ext-link-type="gen" ext-link-id="NC_006019">NC_006019</ext-link>], 154 codons in [GenBank: <ext-link ext-link-type="gen" ext-link-id="AM883170">AM883170</ext-link>], and 169 codons in [GenBank: <ext-link ext-link-type="gen" ext-link-id="U19881">U19881</ext-link>].</p>
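				<p>The per-nucleotide columns of Table 2 can be related to the totals by simple arithmetic. The following Python sketch is a consistency check only, not a reimplementation of MLOGD. It assumes that ln(LR)/nt is the total ln(LR) divided by the ORFX length, and that the number of independent base variations is approximately var/nt multiplied by the length; for BTV the second relation matches the ~0.77 variations per column and ~180 variations quoted in the text. The values are transcribed from Table 2 and the function names are illustrative.</p>

```python
import math

# (alignment, length in nt, ln(LR), var/nt, ln(LR)/nt, N_var) as printed in Table 2.
TABLE2 = [
    ("BTV", 234, 101.8, 0.77, 0.44, 180),
    ("AHSV", 429, 15.8, 0.06, 0.04, 26),
    ("PALV", 180, 29.7, 0.23, 0.16, 41),
    ("PHSV/YUOV", 336, 33.0, 0.56, 0.10, 189),
]


def derived(length_nt: int, ln_lr: float, var_per_nt: float):
    """Return (ln(LR) per nt, approximate number of independent variations)."""
    return ln_lr / length_nt, var_per_nt * length_nt


def consistent(row) -> bool:
    """Check a row against the assumed relations, allowing for rounding."""
    _, length_nt, ln_lr, var_per_nt, lr_per_nt, n_var = row
    calc_lr_per_nt, calc_n_var = derived(length_nt, ln_lr, var_per_nt)
    lr_ok = math.isclose(calc_lr_per_nt, lr_per_nt, abs_tol=0.01)
    # var/nt is printed to two decimals, so allow the matching rounding slack.
    n_ok = math.isclose(calc_n_var, n_var, abs_tol=0.005 * length_nt + 0.5)
    return lr_ok and n_ok


all_consistent = all(consistent(row) for row in TABLE2)
# Threshold quoted in the Background: about 20 independent base variations.
all_above_threshold = all(row[5] >= 20 for row in TABLE2)

for name, length_nt, ln_lr, var_per_nt, _, _ in TABLE2:
    lr_per_nt, n_var = derived(length_nt, ln_lr, var_per_nt)
    print(f"{name}: ln(LR)/nt = {lr_per_nt:.3f}, N_var = {n_var:.1f}")
print(all_consistent, all_above_threshold)
```

				<p>Within rounding, all four rows satisfy both relations, and every alignment provides at least the ~20 independent base variations mentioned in the Background.</p>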
				<tbl id="T2">
					<title>
						<p>Table 2</p>
					</title>
					<caption>
						<p>ORFX MLOGD statistics. MLOGD statistics for ORFX in different <it>Orbivirus </it>alignments. These statistics were derived using MLOGD in the 'Test Query CDS' mode (Figure 3) &#8211; specifically testing the coding potential of the whole ORFX &#8211; rather than the 'Sliding Window' mode used for Figure 2.</p>
					</caption>
					<tblbdy cols="9">
						<r>
							<c ca="left">
								<p>Species</p>
							</c>
							<c ca="left">
								<p>Reference<sup>1</sup></p>
							</c>
							<c ca="center">
								<p>N<sub>seqs</sub></p>
							</c>
							<c ca="center">
								<p>Length</p>
							</c>
							<c ca="center">
								<p>ln(LR)<sup>2</sup></p>
							</c>
							<c ca="center">
								<p>var/nt<sup>3</sup></p>
							</c>
							<c ca="center">
								<p>ln(LR)/nt<sup>4</sup></p>
							</c>
							<c ca="center">
								<p>
									<inline-formula>
										<m:math name="1743-422X-5-48-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
											<m:semantics>
												<m:mrow>
													<m:msubsup>
														<m:mtext>N</m:mtext>
														<m:mrow>
															<m:mi>var</m:mi>
															<m:mo>&#8289;</m:mo>
														</m:mrow>
														<m:mn>5</m:mn>
													</m:msubsup>
												</m:mrow>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
							<c ca="center">
								<p>
									<inline-formula>
										<m:math name="1743-422X-5-48-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
											<m:semantics>
												<m:mrow>
													<m:msubsup>
														<m:mrow>
															<m:mtext>div</m:mtext>
														</m:mrow>
														<m:mrow>
															<m:mi>max</m:mi>
															<m:mo>&#8289;</m:mo>
														</m:mrow>
														<m:mn>6</m:mn>
													</m:msubsup>
												</m:mrow>
											</m:semantics>
										</m:math>
									</inline-formula>
								</p>
							</c>
						</r>
						<r>
							<c cspan="9">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>BTV</p>
							</c>
							<c ca="left">
								<p>NC_006008</p>
							</c>
							<c ca="center">
								<p>48</p>
							</c>
							<c ca="center">
								<p>234 nt</p>
							</c>
							<c ca="center">
								<p>101.8</p>
							</c>
							<c ca="center">
								<p>0.77</p>
							</c>
							<c ca="center">
								<p>0.44</p>
							</c>
							<c ca="center">
								<p>180</p>
							</c>
							<c ca="center">
								<p>0.21</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>AHSV</p>
							</c>
							<c ca="left">
								<p>NC_006019</p>
							</c>
							<c ca="center">
								<p>3</p>
							</c>
							<c ca="center">
								<p>429 nt</p>
							</c>
							<c ca="center">
								<p>15.8</p>
							</c>
							<c ca="center">
								<p>0.06</p>
							</c>
							<c ca="center">
								<p>0.04</p>
							</c>
							<c ca="center">
								<p>26</p>
							</c>
							<c ca="center">
								<p>0.05</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>PALV</p>
							</c>
							<c ca="left">
								<p>NC_005992</p>
							</c>
							<c ca="center">
								<p>11</p>
							</c>
							<c ca="center">
								<p>180<sup>7 </sup>nt</p>
							</c>
							<c ca="center">
								<p>29.7</p>
							</c>
							<c ca="center">
								<p>0.23</p>
							</c>
							<c ca="center">
								<p>0.16</p>
							</c>
							<c ca="center">
								<p>41</p>
							</c>
							<c ca="center">
								<p>0.12</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>PHSV/YUOV</p>
							</c>
							<c ca="left">
								<p>NC_007753</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>336 nt</p>
							</c>
							<c ca="center">
								<p>33.0</p>
							</c>
							<c ca="center">
								<p>0.56</p>
							</c>
							<c ca="center">
								<p>0.10</p>
							</c>
							<c ca="center">
								<p>189</p>
							</c>
							<c ca="center">
								<p>0.56</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>1. GenBank reference sequence used for MLOGD.</p>
						<p>2. Total MLOGD log likelihood score &#8211; positive values indicate that ORFX is likely to be coding. Formally, exp(ln(LR)) gives <inline-formula><m:math name="1743-422X-5-48-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mfrac><m:mrow><m:mtext>P</m:mtext><m:mo stretchy="false">(</m:mo><m:mtext>alignment</m:mtext><m:mo>|</m:mo><m:mtext>ORFX&#160;coding</m:mtext><m:mo stretchy="false">)</m:mo></m:mrow><m:mrow><m:mtext>P</m:mtext><m:mo stretchy="false">(</m:mo><m:mtext>alignment</m:mtext><m:mo>|</m:mo><m:mtext>ORFX&#160;noncoding</m:mtext><m:mo stretchy="false">)</m:mo></m:mrow></m:mfrac></m:mrow></m:semantics></m:math></inline-formula>, which may be equated to <inline-formula><m:math name="1743-422X-5-48-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mfrac><m:mrow><m:mtext>P</m:mtext><m:mo stretchy="false">(</m:mo><m:mtext>ORFX&#160;coding</m:mtext><m:mo stretchy="false">)</m:mo></m:mrow><m:mrow><m:mtext>P</m:mtext><m:mo stretchy="false">(</m:mo><m:mtext>ORFX&#160;noncoding</m:mtext><m:mo stretchy="false">)</m:mo></m:mrow></m:mfrac></m:mrow></m:semantics></m:math></inline-formula> if equal Bayesian priors are assumed. These probabilities are, however, subject to the assumptions of the MLOGD sequence evolution model [15]. Nonetheless, extensive tests with known single-coding and double-coding sequences indicate that 'N<sub>var </sub>&#8805; 20' and 'ln(LR)/nt &#8805; <inline-formula><m:math name="1743-422X-5-48-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mfrac><m:mn>1</m:mn><m:mn>6</m:mn></m:mfrac></m:mrow></m:semantics></m:math></inline-formula> &#215; var/nt' signals robust detection of an overlapping same-strand CDS [16] (and unpublished data).</p>
						<p>3. Alignment divergence per nucleotide &#8211; i.e. mean number of independent base variations per alignment column in the ORFX region.</p>
						<p>4. Log likelihood score per alignment column.</p>
						<p>5. Approximate total number of independent base variations in ORFX region.</p>
						<p>6. Maximum pairwise divergence from the chosen reference sequence.</p>
						<p>7. Alignment of PALV partial sequences &#8211; does not cover the entire ORFX region.</p>
					</tblfn>
				</tbl>
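				<p>As a worked check of footnote 2, the following Python sketch applies the two empirical criteria (N<sub>var </sub>&#8805; 20 and ln(LR)/nt &#8805; 1/6 &#215; var/nt) to the PALV and PHSV/YUOV rows of the table. The function names and dictionary keys are illustrative and are not part of MLOGD. The posterior probability shown is the equal-priors value LR/(1 + LR), and it remains subject to the assumptions of the MLOGD sequence evolution model.</p>

```python
import math

# Values copied from the PALV and PHSV/YUOV rows of the table above.
# The dictionary keys are illustrative names, not MLOGD output fields.
ROWS = {
    "PALV": {"length_nt": 180, "ln_lr": 29.7, "var_per_nt": 0.23, "n_var": 41},
    "PHSV/YUOV": {"length_nt": 336, "ln_lr": 33.0, "var_per_nt": 0.56, "n_var": 189},
}


def passes_criteria(row):
    """Apply the two empirical thresholds quoted in footnote 2."""
    ln_lr_per_nt = row["ln_lr"] / row["length_nt"]
    enough_variation = row["n_var"] >= 20
    strong_signal = ln_lr_per_nt >= row["var_per_nt"] / 6
    return enough_variation and strong_signal


def posterior_coding(ln_lr):
    """P(coding | alignment) under equal priors, i.e. LR / (1 + LR)."""
    # Written as a logistic function of ln(LR) to avoid overflow in exp().
    return 1.0 / (1.0 + math.exp(-ln_lr))


for name, row in ROWS.items():
    print(name, passes_criteria(row), round(row["ln_lr"] / row["length_nt"], 3))
```

				<p>Both rows satisfy the two criteria. The PHSV/YUOV alignment does so by a narrow margin on the second criterion because of its high divergence per nucleotide.</p>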
			</sec>
			<sec>
				<st>
					<p>Analysis of the ORFX peptide sequence</p>
				</st>
				<p>Application of blastp <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> to the ORFX peptide sequences for the six RefSeqs revealed no similar amino acid sequences in GenBank (14 Mar 2008), while tblastn identified only the ORFX region in other <it>Orbivirus </it>sequences (as expected). Application of InterProScan <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> to the six sequences returned no hits (protein motifs, domains, etc.).</p>
				<p>The ORFX amino acid sequence appears to be more highly conserved than the amino acid sequence encoded by the overlapping region of the VP6 CDS (e.g. Figure <figr fid="F2">2</figr>). When [GenBank:<ext-link ext-link-type="gen" ext-link-id="NC_006008">NC_006008</ext-link>] was compared with three divergent BTV sequences ([GenBank:<ext-link ext-link-type="gen" ext-link-id="DQ289044">DQ289044</ext-link>], [GenBank:<ext-link ext-link-type="gen" ext-link-id="D10905">D10905</ext-link>] and [GenBank:<ext-link ext-link-type="gen" ext-link-id="DQ825671">DQ825671</ext-link>]), all three showed greater amino acid conservation (relative to <ext-link ext-link-type="gen" ext-link-id="NC_006008">NC_006008</ext-link>) in the ORFX frame than in the VP6 frame within the ORFX region. Specifically, amino acid identity was 87%, 78% and 100%, respectively, in the ORFX frame, but only 58%, 73% and 83% in the VP6 frame. Similarly, in a comparison of [GenBank:<ext-link ext-link-type="gen" ext-link-id="NC_007753">NC_007753</ext-link>] (PHSV) with [GenBank:<ext-link ext-link-type="gen" ext-link-id="NC_007664">NC_007664</ext-link>] (YUOV), there were 32 amino acid identities in ORFX but only 22 in the corresponding region of VP6.</p>
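				<p>The identity figures above are simple column counts over a pairwise alignment. A minimal Python sketch of such a calculation is given below. It assumes that gapped columns are skipped, which may differ from the convention used for the figures quoted here, and the peptide strings are invented for illustration only.</p>

```python
def percent_identity(ref, other):
    """Percentage of aligned positions with identical residues.

    Both strings must come from the same alignment (equal length);
    columns with a gap ('-') in either sequence are skipped.
    """
    if len(ref) != len(other):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(a, b) for a, b in zip(ref, other) if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for a, b in pairs if a == b)
    return 100.0 * matches / len(pairs)


# Toy peptides, invented for illustration only (not real ORFX or VP6 sequences).
orfx_ref, orfx_other = "MSGKRLLQEW", "MSGKRLIQEW"
vp6_ref, vp6_other = "DEWKTAPSGE", "DDWRTAPAGE"

print(percent_identity(orfx_ref, orfx_other))  # ORFX frame
print(percent_identity(vp6_ref, vp6_other))    # VP6 frame
```

				<p>Running the same function on the translations of both reading frames over the same nucleotide region gives the kind of frame-by-frame comparison reported above.</p>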
			</sec>
		</sec>
		<sec>
			<st>
				<p>Discussion</p>
			</st>
			<p>Due to the segmented nature of their genomes, the <it>Reoviridae </it>may escape a fundamental problem that many other eukaryotic viruses face &#8211; how to circumvent the host cell's general rule of 'one functional protein per mRNA'. Nonetheless, of the 352 <it>Reoviridae </it>RefSeqs in GenBank (10 Mar 2008; 33 species &#215; 9&#8211;12 segments per species), ~5% are multicistronic. Among these are a few examples of fully overlapping genes apparently translated via leaky scanning, for example in <it>Phytoreovirus </it>segment S12 or S9 <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> and mammalian <it>Orthoreovirus </it>segment S1 <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>.</p>
			<p>For optimal leaky scanning <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, one would expect the VP6 CDS to initiate at AUG1 with weak context and ORFX to initiate at AUG2 with strong context. This is indeed the situation in the AHSV and PALV RefSeqs. Although there are two upstream VP6-frame AUG codons in many BTV serotypes, leaky scanning still appears to be a fairly straightforward translational mechanism for ORFX in this virus (though ORFX product would potentially be much less abundant than VP6). In the YUOV and PHSV RefSeqs, leaky scanning may be possible, but would require ribosomes either to scan through two upstream short ORFs or to translate them and then reinitiate. It is interesting, and possibly relevant, that in another <it>Reoviridae </it>species &#8211; Avian reovirus &#8211; a novel, as yet not fully understood, scanning-independent ribosome migration mechanism is used to bypass two upstream CDSs in order to translate the 3'-proximal CDS on the tricistronic S1 mRNA <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>.</p>
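			<p>The leaky scanning argument depends on the position, reading frame and context of each AUG codon encountered from the 5' end. The following Python sketch illustrates that bookkeeping. It assumes a commonly used Kozak convention (strong: purine at &#8722;3 and G at +4; weak: neither; intermediate: one of the two), which may differ in detail from the criteria applied in this analysis, and the sequence is a hypothetical 5' end rather than a real segment 9 sequence.</p>

```python
def kozak_context(mrna, aug_index):
    """Classify the context of the AUG starting at aug_index (0-based).

    Assumed convention: 'strong' if there is a purine at -3 and a G at +4,
    'weak' if neither is present, 'intermediate' otherwise.
    """
    minus3 = mrna[aug_index - 3:aug_index - 2] if aug_index >= 3 else ""
    plus4 = mrna[aug_index + 3:aug_index + 4]
    score = int(minus3 in ("A", "G")) + int(plus4 == "G")
    return ("weak", "intermediate", "strong")[score]


def list_augs(mrna):
    """Return (position, frame, context) for every AUG, 5' to 3'."""
    hits = []
    start = mrna.find("AUG")
    while start != -1:
        hits.append((start, start % 3, kozak_context(mrna, start)))
        start = mrna.find("AUG", start + 1)
    return hits


# Hypothetical 5' end, invented for illustration (not a real segment 9 sequence).
toy = "GUUAUUUAUGCCAAAAGAUGGCU"
for position, frame, context in list_augs(toy):
    print(position, frame, context)
```

			<p>In this toy sequence the first AUG has weak context and the second, in the +1 frame relative to the first, has strong context, which is the arrangement described above as optimal for leaky scanning.</p>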
			<p>IRESs have not been reported in the <it>Reoviridae </it>and, at this genomic location, use of an IRES would seem unlikely. However, it has been shown that a variety of poly-purine A-rich sequences &#8211; such as (GAAA)<sub>16 </sub>&#8211; can serve as efficient IRESs without the requirement for a complex RNA secondary structure such as in the <it>Picornaviridae </it>IRESs <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, so it is interesting to note that there is an A-rich poly-purine tract just upstream of ORFX in all species except SCRV (Figure <figr fid="F4">4</figr>). In the BTV RefSeq, for example, the 68 nt immediately preceding ORFX comprise 32 A, 7 C, 25 G and 4 U nucleotides. In fact, the entire segment 9 sequences (except that of SCRV) are A- or AG-rich (Table <tblr tid="T3">3</tblr>). Nonetheless, the region just upstream of ORFX shows a peak in A-richness (Figure <figr fid="F4">4</figr>). Admittedly, this could be due to many other reasons (e.g. simply amino acid coding constraints in VP6) and there is no strong reason to suspect an IRES here.</p>
			<tbl id="T3">
				<title>
					<p>Table 3</p>
				</title>
				<caption>
					<p>Nucleotide frequencies for segment 9. Mean nucleotide frequencies for the six <it>Orbivirus </it>segment 9 RefSeqs in GenBank.</p>
				</caption>
				<tblbdy cols="6">
					<r>
						<c ca="left">
							<p>RefSeq</p>
						</c>
						<c ca="left">
							<p>Species</p>
						</c>
						<c ca="center">
							<p>A%</p>
						</c>
						<c ca="center">
							<p>C%</p>
						</c>
						<c ca="center">
							<p>G%</p>
						</c>
						<c ca="center">
							<p>U%</p>
						</c>
					</r>
					<r>
						<c cspan="6">
							<hr/>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>NC_006008</p>
						</c>
						<c ca="left">
							<p>BTV</p>
						</c>
						<c ca="center">
							<p>32</p>
						</c>
						<c ca="center">
							<p>16</p>
						</c>
						<c ca="center">
							<p>33</p>
						</c>
						<c ca="center">
							<p>19</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>NC_006019</p>
						</c>
						<c ca="left">
							<p>AHSV</p>
						</c>
						<c ca="center">
							<p>32</p>
						</c>
						<c ca="center">
							<p>16</p>
						</c>
						<c ca="center">
							<p>32</p>
						</c>
						<c ca="center">
							<p>20</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>NC_005992</p>
						</c>
						<c ca="left">
							<p>PALV</p>
						</c>
						<c ca="center">
							<p>36</p>
						</c>
						<c ca="center">
							<p>16</p>
						</c>
						<c ca="center">
							<p>26</p>
						</c>
						<c ca="center">
							<p>23</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>NC_007753</p>
						</c>
						<c ca="left">
							<p>PHSV</p>
						</c>
						<c ca="center">
							<p>41</p>
						</c>
						<c ca="center">
							<p>13</p>
						</c>
						<c ca="center">
							<p>24</p>
						</c>
						<c ca="center">
							<p>22</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>NC_007664</p>
						</c>
						<c ca="left">
							<p>YUOV</p>
						</c>
						<c ca="center">
							<p>36</p>
						</c>
						<c ca="center">
							<p>18</p>
						</c>
						<c ca="center">
							<p>25</p>
						</c>
						<c ca="center">
							<p>20</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>NC_006005</p>
						</c>
						<c ca="left">
							<p>SCRV</p>
						</c>
						<c ca="center">
							<p>25</p>
						</c>
						<c ca="center">
							<p>27</p>
						</c>
						<c ca="center">
							<p>24</p>
						</c>
						<c ca="center">
							<p>25</p>
						</c>
					</r>
				</tblbdy>
			</tbl>
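			<p>To put the counts quoted above for the 68 nt preceding ORFX in the BTV RefSeq on the same scale as Table 3, the following Python sketch converts them to percentages. It also includes a simple sliding-window A-content function of the kind that could be used to look for local peaks in A-richness; the window size is a free parameter here and is not taken from the analysis behind Figure 4.</p>

```python
from collections import Counter


def composition_percent(counts):
    """Convert nucleotide counts to percentages of the total."""
    total = sum(counts.values())
    return {base: 100.0 * n / total for base, n in counts.items()}


def a_fraction_windows(seq, window):
    """Fraction of A in each full-length sliding window along seq."""
    return [
        Counter(seq[i:i + window])["A"] / window
        for i in range(len(seq) - window + 1)
    ]


# Counts for the 68 nt preceding ORFX in the BTV RefSeq, as quoted in the text.
upstream = {"A": 32, "C": 7, "G": 25, "U": 4}
pct = composition_percent(upstream)
print(round(pct["A"]), round(pct["A"] + pct["G"]))
```

			<p>The tract is about 47% A and about 84% purine, compared with 32% A and 65% purine for BTV segment 9 as a whole (Table 3).</p>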
			<p>SCRV lacks a long ORF in the correct reading frame and location for an ORFX homologue. The number (six) and contexts (three are strong) of upstream AUG codons make conventional leaky scanning to 'ORFXa' (38 codons; Figure <figr fid="F5">5</figr>) extremely unlikely. It is quite possible, therefore, that no ORFX homologue is present in SCRV. This is not too surprising &#8211; SCRV segment 9 is the most divergent, and the shortest, of the six RefSeqs (Figure <figr fid="F5">5</figr>) <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. SCRV is also the only species of the six that is tick-borne instead of insect-borne (BTV, AHSV and PALV are transmitted by midges; YUOV by mosquitoes).</p>
			<p>At ~9.5 kDa, the putative ORFX product in BTV is too small to appear on most published protein gels. Nonetheless there are unidentified low molecular mass bands in a number of reported gels <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>, often running near the dye front, that <it>may </it>represent ORFX product. Furthermore, ref. <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> (<it>in vitro </it>translation of the individual segments) noted, with reference to excluded data, that segment 9 may encode a low molecular weight protein in addition to VP6.</p>
			<p>The ORFX product is largest in AHSV (~17 kDa in [GenBank:<ext-link ext-link-type="gen" ext-link-id="NC_006019">NC_006019</ext-link>] and ~20 kDa in [GenBank:<ext-link ext-link-type="gen" ext-link-id="U19881">U19881</ext-link>]). Ref. <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> (<it>in vitro </it>translation of the individual AHSV segments, and comparison with proteins extracted from infected cell lysate) clearly identified an additional non-structural protein translated from segment 9 &#8211; termed 'NS3' &#8211; migrating ~1.5 kDa behind the 'NS4/4A' proteins (equivalent to NS3/3A in our notation) translated from segment 10. 'NS3' is a good candidate for ORFX product migrating a little slower than expected, possibly as a result of post-translational modification. The protein labelled 'VP6' in ref. <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> appears to be a truncated version of VP5 (translated from the same segment as VP5, and both were shown to have similar partial protease digestion products). Interestingly the VP6 protein (our notation) is not visible as a product of segment 9 translation in Fig. 6 of ref. <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, but may be visible in Fig. 7 of ref. <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> (migrating next to NS2), unless this is cross-contamination. An additional segment 9 product (~20 kDa), migrating ahead of 'NS4/4A', is also visible (albeit fainter) in Fig. 7 of ref. <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. If the 'NS3' band is post-translationally modified ORFX product, then this band could be unmodified ORFX product.</p>
			<p>Ref. <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> also identified a number of low molecular mass proteins in AHSV-infected cells &#8211; in particular P23, P20 and P21. Ref. <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> equated two of these (P20 and P21) to the segment 10 products NS3/3A (~24/~22 kDa in AHSV). The third protein may be ORFX product.</p>
			<p>In addition to its small size, the fact that ORFX product has not been widely reported suggests that it may be present only in low abundance and/or only expressed at certain stages (e.g. only in the insect vector) or cellular locations.</p>
		</sec>
		<sec>
			<st>
				<p>Conclusion</p>
			</st>
			<p>We have identified a conserved ORF (ORFX) overlapping the <it>Orbivirus </it>VP6 CDS in the +1 reading frame. ORFX ranges from 77 to 169 codons in length, depending on species, and is present in all <it>Orbivirus </it>segment 9 sequences analysed except for the highly divergent species SCRV. The software package MLOGD &#8211; designed specifically for identifying and analysing overlapping CDSs &#8211; finds a strong coding signature for ORFX when applied to BTV, AHSV, PALV and PHSV/YUOV sequence alignments. The locations and Kozak contexts of the VP6 and ORFX initiation codons are generally consistent with a leaky scanning model for ORFX translation. ORFX product has no detectable homology to known proteins.</p>
			<p>We hope that presentation of this bioinformatic analysis will stimulate an attempt to experimentally verify the expression and functional role of ORFX product. Initial verification could be by means of immunoblotting with ORFX-specific antibodies or gel purification of ORFX product from virus-infected cell protein extracts, followed by mass spectrometry.</p>
		</sec>
		<sec>
			<st>
				<p>Methods</p>
			</st>
			<p>In GenBank, there are whole-genome RefSeqs for six <it>Orbivirus </it>species: Bluetongue virus (BTV), African horse sickness virus (AHSV), Peruvian horse sickness virus (PHSV), Yunnan orbivirus (YUOV), Palyam virus (PALV) and Saint Croix river virus (SCRV). All six genomes comprise 10 segments. The segments homologous to BTV segment 9 (encoding VP6) were identified by finding the best blastp-match, among the 10 BTV translated segments, for the longest ORF in each of the 50 non-BTV segments. The identifications were verified, where possible, by information in the GenBank-file headers and in the literature (AHSV <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>; YUOV <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>; PALV <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>; SCRV <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>).</p>
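			<p>The first step of the segment identification described above is to find the longest ORF in each segment. A minimal Python sketch of such a search is given below. It assumes AUG-to-stop ORFs on the given strand only, which may not match the exact ORF definition used here, and the subsequent blastp comparison against the BTV translated segments is not shown.</p>

```python
STOP_CODONS = {"UAA", "UAG", "UGA"}


def longest_orf(seq):
    """Return (start, end, frame) of the longest AUG-to-stop ORF, or None.

    Only the given strand is searched. Coordinates are 0-based and
    end-exclusive, and the ORF includes its stop codon.
    """
    # GenBank records use the DNA alphabet, so convert T to U first.
    rna = seq.upper().replace("T", "U")
    best = None
    for frame in range(3):
        start = None
        for i in range(frame, len(rna) - 2, 3):
            codon = rna[i:i + 3]
            if start is None and codon == "AUG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if best is None or (i + 3 - start) > (best[1] - best[0]):
                    best = (start, i + 3, frame)
                start = None
    return best


# Toy sequence, invented for illustration only.
print(longest_orf("GCAUGGCUAAAUGAGG"))
```

			<p>The translated longest ORF from each segment would then be used as the blastp query.</p>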
			<p>As of 11 May 2007, there were 1273 <it>Orbivirus </it>sequences in GenBank (i.e. including partial sequences); however, most of these are not segment 9. Incidentally, none of these sequences has more than one CDS annotated. Segment 9 sequences were extracted (a) using the GenBank-file DEFINITION headers, and (b) by finding the best blastp-match for the longest ORF in each sequence among the 10 BTV translated segments. These were supplemented with all GenBank (16 Mar 2008) tblastn matches to the ORFX peptide sequences from the six RefSeqs (providing one additional recent sequence). After removing duplicate sequences, the following segment 9 sequences were found: (1) the 6 RefSeqs for BTV, AHSV, PHSV, YUOV, PALV and SCRV (all complete); (2) 47 other BTV sequences (mostly complete VP6 CDS; all cover ORFX completely; ~34 contain the full 5' UTR); (3) 2 other AHSV sequences (full genome); and (4) 10 PALV partial sequences (183 nt, completely contained in the ORFX region).</p>
			<p>The GenBank accession numbers are as follows: BTV &#8211; <ext-link ext-link-type="gen" ext-link-id="NC_006008">NC_006008</ext-link>, <ext-link ext-link-type="gen" ext-link-id="A22393">A22393</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AF403418">AF403418</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AF403419">AF403419</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AF403420">AF403420</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AF403421">AF403421</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AF403423">AF403423</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AY124373">AY124373</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AY493691">AY493691</ext-link>, <ext-link ext-link-type="gen" ext-link-id="D10905">D10905</ext-link>, <ext-link ext-link-type="gen" ext-link-id="DQ289041">DQ289041</ext-link>, <ext-link ext-link-type="gen" ext-link-id="DQ289042">DQ289042</ext-link>, <ext-link ext-link-type="gen" ext-link-id="DQ289043">DQ289043</ext-link>, <ext-link ext-link-type="gen" ext-link-id="DQ289044">DQ289044</ext-link>, <ext-link ext-link-type="gen" ext-link-id="DQ289045">DQ289045</ext-link>, <ext-link ext-link-type="gen" ext-link-id="DQ289046">DQ289046</ext-link>, <ext-link ext-link-type="gen" ext-link-id="DQ289047">DQ289047</ext-link>, <ext-link ext-link-type="gen" ext-link-id="DQ289048">DQ289048</ext-link>, <ext-link ext-link-type="gen" ext-link-id="DQ289050">DQ289050</ext-link>, <ext-link ext-link-type="gen" ext-link-id="DQ825668">DQ825668</ext-link>, <ext-link ext-link-type="gen" ext-link-id="DQ825669">DQ825669</ext-link>, <ext-link ext-link-type="gen" ext-link-id="DQ825671">DQ825671</ext-link>, <ext-link ext-link-type="gen" ext-link-id="DQ832170">DQ832170</ext-link>, <ext-link ext-link-type="gen" ext-link-id="L08668">L08668</ext-link>, <ext-link ext-link-type="gen" ext-link-id="L08669">L08669</ext-link>, <ext-link ext-link-type="gen" ext-link-id="L08670">L08670</ext-link>, <ext-link ext-link-type="gen" 
ext-link-id="L08671">L08671</ext-link>, <ext-link ext-link-type="gen" ext-link-id="L08672">L08672</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55778">U55778</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55779">U55779</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55780">U55780</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55781">U55781</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55782">U55782</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55784">U55784</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55785">U55785</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55786">U55786</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55787">U55787</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55788">U55788</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55790">U55790</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55792">U55792</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55793">U55793</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55794">U55794</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55795">U55795</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55796">U55796</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55797">U55797</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55799">U55799</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55800">U55800</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U55801">U55801</ext-link>; AHSV &#8211; <ext-link ext-link-type="gen" ext-link-id="NC_006019">NC_006019</ext-link>, <ext-link ext-link-type="gen" ext-link-id="U19881">U19881</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AM883170">AM883170</ext-link>; PHSV &#8211; <ext-link ext-link-type="gen" ext-link-id="NC_007753">NC_007753</ext-link>; YUOV &#8211; <ext-link ext-link-type="gen" ext-link-id="NC_007664">NC_007664</ext-link>; PALV &#8211; <ext-link ext-link-type="gen" 
ext-link-id="NC_005992">NC_005992</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AB034675">AB034675</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AB034676">AB034676</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AB034677">AB034677</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AB034678">AB034678</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AB034679">AB034679</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AB034680">AB034680</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AB034681">AB034681</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AB034682">AB034682</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AB034683">AB034683</ext-link>, <ext-link ext-link-type="gen" ext-link-id="AB034684">AB034684</ext-link>; SCRV &#8211; <ext-link ext-link-type="gen" ext-link-id="NC_006005">NC_006005</ext-link>.</p>
		</sec>
		<sec>
			<st>
				<p>Competing interests</p>
			</st>
			<p>The author declares that he has no competing interests.</p>
		</sec>
		<sec>
			<st>
				<p>Authors' contributions</p>
			</st>
			<p>AEF carried out the bioinformatics analyses and wrote the manuscript.</p>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>We thank John F Atkins for providing encouragement and facilities. This work was supported by an award from Science Foundation Ireland to John F Atkins.</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>Expansion of family Reoviridae to include nine-segmented dsRNA viruses: isolation and characterization of a new virus designated Aedes pseudoscutellaris reovirus assigned to a proposed genus (Dinovernavirus)</p>
				</title>
				<aug>
					<au>
						<snm>Attoui</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Mohd Jaafar</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Belhouchet</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Biagini</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Cantaloube</snm>
						<fnm>JF</fnm>
					</au>
					<au>
						<snm>de Micco</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>de Lamballerie</snm>
						<fnm>X</fnm>
					</au>
				</aug>
				<source>Virology</source>
				<pubdate>2005</pubdate>
				<volume>343</volume>
				<fpage>212</fpage>
				<lpage>223</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.virol.2005.08.028</pubid>
						<pubid idtype="pmpid" link="fulltext">16171838</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Emerging infectious diseases. During a hot summer, bluetongue virus invades northern Europe</p>
				</title>
				<aug>
					<au>
						<snm>Enserink</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2006</pubdate>
				<volume>313</volume>
				<fpage>1218</fpage>
				<lpage>1219</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.313.5791.1218a</pubid>
						<pubid idtype="pmpid" link="fulltext">16946042</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Bluetongue outbreak in the UK</p>
				</title>
				<aug>
					<au>
						<snm>Landeg</snm>
						<fnm>F</fnm>
					</au>
				</aug>
				<source>Vet Rec</source>
				<pubdate>2007</pubdate>
				<volume>161</volume>
				<fpage>534</fpage>
				<lpage>535</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">17938415</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>African horse sickness</p>
				</title>
				<aug>
					<au>
						<snm>Mellor</snm>
						<fnm>PS</fnm>
					</au>
					<au>
						<snm>Hamblin</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Vet Res</source>
				<pubdate>2004</pubdate>
				<volume>35</volume>
				<fpage>445</fpage>
				<lpage>466</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1051/vetres:2004021</pubid>
						<pubid idtype="pmpid" link="fulltext">15236676</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>Climate change and the recent emergence of bluetongue in Europe</p>
				</title>
				<aug>
					<au>
						<snm>Purse</snm>
						<fnm>BV</fnm>
					</au>
					<au>
						<snm>Mellor</snm>
						<fnm>PS</fnm>
					</au>
					<au>
						<snm>Rogers</snm>
						<fnm>DJ</fnm>
					</au>
					<au>
						<snm>Samuel</snm>
						<fnm>AR</fnm>
					</au>
					<au>
						<snm>Mertens</snm>
						<fnm>PP</fnm>
					</au>
					<au>
						<snm>Baylis</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Nat Rev Microbiol</source>
				<pubdate>2005</pubdate>
				<volume>3</volume>
				<fpage>171</fpage>
				<lpage>181</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nrmicro1090</pubid>
						<pubid idtype="pmpid" link="fulltext">15685226</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Bluetongue virus proteins</p>
				</title>
				<aug>
					<au>
						<snm>Roy</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>J Gen Virol</source>
				<pubdate>1992</pubdate>
				<volume>73</volume>
				<fpage>3051</fpage>
				<lpage>3064</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">1335020</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>Functional mapping of Bluetongue virus proteins and their interactions with host proteins during virus replication</p>
				</title>
				<aug>
					<au>
						<snm>Roy</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Cell Biochem Biophys</source>
				<pubdate>2008</pubdate>
				<volume>50</volume>
				<fpage>143</fpage>
				<lpage>157</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">18299997</pubid>
						<pubid idtype="doi">10.1007/s12013-008-9009-4</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>The bluetongue virus core: a nano-scale transcription machine</p>
				</title>
				<aug>
					<au>
						<snm>Mertens</snm>
						<fnm>PP</fnm>
					</au>
					<au>
						<snm>Diprose</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Virus Res</source>
				<pubdate>2004</pubdate>
				<volume>101</volume>
				<fpage>29</fpage>
				<lpage>43</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.virusres.2003.12.004</pubid>
						<pubid idtype="pmpid" link="fulltext">15010215</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>Identification of bluetongue virus VP6 protein as a nucleic acid-binding protein and the localization of VP6 in virus-infected vertebrate cells</p>
				</title>
				<aug>
					<au>
						<snm>Roy</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Adachi</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Urakawa</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Booth</snm>
						<fnm>TF</fnm>
					</au>
					<au>
						<snm>Thomas</snm>
						<fnm>CP</fnm>
					</au>
				</aug>
				<source>J Virol</source>
				<pubdate>1990</pubdate>
				<volume>64</volume>
				<fpage>1</fpage>
				<lpage>8</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">249028</pubid>
						<pubid idtype="pmpid" link="fulltext">2152806</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Mapping and characterization of antigenic epitopes and the nucleic acid-binding domains of the VP6 protein of bluetongue viruses</p>
				</title>
				<aug>
					<au>
						<snm>Hayama</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>JK</fnm>
					</au>
				</aug>
				<source>J Virol</source>
				<pubdate>1994</pubdate>
				<volume>68</volume>
				<fpage>3604</fpage>
				<lpage>3611</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">236864</pubid>
						<pubid idtype="pmpid" link="fulltext">7514678</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>Bluetongue virus VP6 protein binds ATP and exhibits an RNA-dependent ATPase function and a helicase activity that catalyze the unwinding of double-stranded RNA substrates</p>
				</title>
				<aug>
					<au>
						<snm>St&#228;uber</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Martinez-Costas</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Sutton</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Monastyrskaya</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Roy</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>J Virol</source>
				<pubdate>1997</pubdate>
				<volume>71</volume>
				<fpage>7220</fpage>
				<lpage>7226</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">192062</pubid>
						<pubid idtype="pmpid" link="fulltext">9311795</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>Defining the structure-function relationships of bluetongue virus helicase protein VP6</p>
				</title>
				<aug>
					<au>
						<snm>Kar</snm>
						<fnm>AK</fnm>
					</au>
					<au>
						<snm>Roy</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>J Virol</source>
				<pubdate>2003</pubdate>
				<volume>77</volume>
				<fpage>11347</fpage>
				<lpage>11356</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">229362</pubid>
						<pubid idtype="pmpid" link="fulltext">14557620</pubid>
						<pubid idtype="doi">10.1128/JVI.77.21.11347-11356.2003</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>Characterization of the nucleic acid binding activity of inner core protein VP6 of African horse sickness virus</p>
				</title>
				<aug>
					<au>
						<snm>de Waal</snm>
						<fnm>PJ</fnm>
					</au>
					<au>
						<snm>Huismans</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Arch Virol</source>
				<pubdate>2005</pubdate>
				<volume>150</volume>
				<fpage>2037</fpage>
				<lpage>2050</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/s00705-005-0547-4</pubid>
						<pubid idtype="pmpid" link="fulltext">15986179</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B14">
				<title>
					<p>Sequence of genome segment 9 of bluetongue virus (serotype 1, South Africa) and expression analysis demonstrating that different forms of VP6 are derived from initiation of protein synthesis at two distinct sites</p>
				</title>
				<aug>
					<au>
						<snm>Wade-Evans</snm>
						<fnm>AM</fnm>
					</au>
					<au>
						<snm>Mertens</snm>
						<fnm>PP</fnm>
					</au>
					<au>
						<snm>Belsham</snm>
						<fnm>GJ</fnm>
					</au>
				</aug>
				<source>J Gen Virol</source>
				<pubdate>1992</pubdate>
				<volume>73</volume>
				<fpage>3023</fpage>
				<lpage>3026</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">1331303</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B15">
				<title>
					<p>Detecting overlapping coding sequences with pairwise alignments</p>
				</title>
				<aug>
					<au>
						<snm>Firth</snm>
						<fnm>AE</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>CM</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<fpage>282</fpage>
				<lpage>292</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bti007</pubid>
						<pubid idtype="pmpid" link="fulltext">15347574</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B16">
				<title>
					<p>Detecting overlapping coding sequences in virus genomes</p>
				</title>
				<aug>
					<au>
						<snm>Firth</snm>
						<fnm>AE</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>CM</fnm>
					</au>
				</aug>
				<source>BMC Bioinformatics</source>
				<pubdate>2006</pubdate>
				<volume>7</volume>
				<fpage>75</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1395342</pubid>
						<pubid idtype="pmpid" link="fulltext">16483358</pubid>
						<pubid idtype="doi">10.1186/1471-2105-7-75</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<title>
					<p>An overlapping essential gene in the Potyviridae</p>
				</title>
				<aug>
					<au>
						<snm>Chung</snm>
						<fnm>BYW</fnm>
					</au>
					<au>
						<snm>Miller</snm>
						<fnm>WA</fnm>
					</au>
					<au>
						<snm>Atkins</snm>
						<fnm>JF</fnm>
					</au>
					<au>
						<snm>Firth</snm>
						<fnm>AE</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci U S A</source>
				<pubdate>2008</pubdate>
				<volume>105</volume>
				<fpage>5897</fpage>
				<lpage>5902</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1073/pnas.0800468105</pubid>
						<pubid idtype="pmpid" link="fulltext">18408156</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B18">
				<title>
					<p>An analysis of 5'-noncoding sequences from 699 vertebrate messenger RNAs</p>
				</title>
				<aug>
					<au>
						<snm>Kozak</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>1987</pubdate>
				<volume>15</volume>
				<fpage>8125</fpage>
				<lpage>8148</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">306349</pubid>
						<pubid idtype="pmpid" link="fulltext">3313277</pubid>
						<pubid idtype="doi">10.1093/nar/15.20.8125</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<title>
					<p>Basic local alignment search tool</p>
				</title>
				<aug>
					<au>
						<snm>Altschul</snm>
						<fnm>SF</fnm>
					</au>
					<au>
						<snm>Gish</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Miller</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Myers</snm>
						<fnm>EW</fnm>
					</au>
					<au>
						<snm>Lipman</snm>
						<fnm>DJ</fnm>
					</au>
				</aug>
				<source>J Mol Biol</source>
				<pubdate>1990</pubdate>
				<volume>215</volume>
				<fpage>403</fpage>
				<lpage>410</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">2231712</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B20">
				<title>
					<p>InterProScan &#8211; an integration platform for the signature-recognition methods in InterPro</p>
				</title>
				<aug>
					<au>
						<snm>Zdobnov</snm>
						<fnm>EM</fnm>
					</au>
					<au>
						<snm>Apweiler</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2001</pubdate>
				<volume>17</volume>
				<fpage>847</fpage>
				<lpage>848</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/17.9.847</pubid>
						<pubid idtype="pmpid" link="fulltext">11590104</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p>Polycistronic (tri- or bicistronic) phytoreoviral segments translatable in both plant and insect cells</p>
				</title>
				<aug>
					<au>
						<snm>Suzuki</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Sugawara</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Nuss</snm>
						<fnm>DL</fnm>
					</au>
					<au>
						<snm>Matsuura</snm>
						<fnm>Y</fnm>
					</au>
				</aug>
				<source>J Virol</source>
				<pubdate>1996</pubdate>
				<volume>70</volume>
				<fpage>8155</fpage>
				<lpage>8159</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">190894</pubid>
						<pubid idtype="pmpid" link="fulltext">8892945</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B22">
				<title>
					<p>Biosynthesis of reovirus-specified polypeptides. The s1 mRNA synthesized in vivo is structurally and functionally indistinguishable from in vitro-synthesized s1 mRNA and encodes two polypeptides, sigma 1a and sigma 1bNS</p>
				</title>
				<aug>
					<au>
						<snm>Jacobs</snm>
						<fnm>BL</fnm>
					</au>
					<au>
						<snm>Atwater</snm>
						<fnm>JA</fnm>
					</au>
					<au>
						<snm>Munemitsu</snm>
						<fnm>SM</fnm>
					</au>
					<au>
						<snm>Samuel</snm>
						<fnm>CE</fnm>
					</au>
				</aug>
				<source>Virology</source>
				<pubdate>1985</pubdate>
				<volume>147</volume>
				<fpage>9</fpage>
				<lpage>18</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/0042-6822(85)90222-3</pubid>
						<pubid idtype="pmpid">2998074</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B23">
				<title>
					<p>Biosynthesis of reovirus-specified polypeptides. Analysis of ribosome pausing during translation of reovirus S1 and S4 mRNAs in virus-infected and vector-transfected cells</p>
				</title>
				<aug>
					<au>
						<snm>Doohan</snm>
						<fnm>JP</fnm>
					</au>
					<au>
						<snm>Samuel</snm>
						<fnm>CE</fnm>
					</au>
				</aug>
				<source>J Biol Chem</source>
				<pubdate>1993</pubdate>
				<volume>268</volume>
				<fpage>18313</fpage>
				<lpage>18320</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">8349706</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<title>
					<p>Pushing the limits of the scanning mechanism for initiation of translation</p>
				</title>
				<aug>
					<au>
						<snm>Kozak</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Gene</source>
				<pubdate>2002</pubdate>
				<volume>299</volume>
				<fpage>1</fpage>
				<lpage>34</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0378-1119(02)01056-9</pubid>
						<pubid idtype="pmpid" link="fulltext">12459250</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B25">
				<title>
					<p>Sequential partially overlapping gene arrangement in the tricistronic S1 genome segments of avian reovirus and Nelson Bay reovirus: implications for translation initiation</p>
				</title>
				<aug>
					<au>
						<snm>Shmulevitz</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Yameen</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Dawe</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Shou</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>O'Hara</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Holmes</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Duncan</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>J Virol</source>
				<pubdate>2002</pubdate>
				<volume>76</volume>
				<fpage>609</fpage>
				<lpage>618</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">136829</pubid>
						<pubid idtype="pmpid" link="fulltext">11752152</pubid>
						<pubid idtype="doi">10.1128/JVI.76.2.609-618.2002</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B26">
				<title>
					<p>Leaky scanning and scanning-independent ribosome migration on the tricistronic S1 mRNA of avian reovirus</p>
				</title>
				<aug>
					<au>
						<snm>Racine</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Barry</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Roy</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Dawe</snm>
						<fnm>SJ</fnm>
					</au>
					<au>
						<snm>Shmulevitz</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Duncan</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>J Biol Chem</source>
				<pubdate>2007</pubdate>
				<volume>282</volume>
				<fpage>25613</fpage>
				<lpage>25622</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1074/jbc.M703708200</pubid>
						<pubid idtype="pmpid" link="fulltext">17604272</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B27">
				<title>
					<p>Polypurine (A)-rich sequences promote cross-kingdom conservation of internal ribosome entry</p>
				</title>
				<aug>
					<au>
						<snm>Dorokhov</snm>
						<fnm>YL</fnm>
					</au>
					<au>
						<snm>Skulachev</snm>
						<fnm>MV</fnm>
					</au>
					<au>
						<snm>Ivanov</snm>
						<fnm>PA</fnm>
					</au>
					<au>
						<snm>Zvereva</snm>
						<fnm>SD</fnm>
					</au>
					<au>
						<snm>Tjulkina</snm>
						<fnm>LG</fnm>
					</au>
					<au>
						<snm>Merits</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Gleba</snm>
						<fnm>YY</fnm>
					</au>
					<au>
						<snm>Hohn</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Atabekov</snm>
						<fnm>JG</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci U S A</source>
				<pubdate>2002</pubdate>
				<volume>99</volume>
				<fpage>5301</fpage>
				<lpage>5306</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">122764</pubid>
						<pubid idtype="pmpid" link="fulltext">11959981</pubid>
						<pubid idtype="doi">10.1073/pnas.082107599</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B28">
				<title>
					<p>Complete sequence characterization of the genome of the St Croix River virus, a new orbivirus isolated from cells of <it>Ixodes scapularis</it></p>
				</title>
				<aug>
					<au>
						<snm>Attoui</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Stirling</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Munderloh</snm>
						<fnm>UG</fnm>
					</au>
					<au>
						<snm>Billoir</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Brookes</snm>
						<fnm>SM</fnm>
					</au>
					<au>
						<snm>Burroughs</snm>
						<fnm>JN</fnm>
					</au>
					<au>
						<snm>de Micco</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Mertens</snm>
						<fnm>PP</fnm>
					</au>
					<au>
						<snm>de Lamballerie</snm>
						<fnm>X</fnm>
					</au>
				</aug>
				<source>J Gen Virol</source>
				<pubdate>2001</pubdate>
				<volume>82</volume>
				<fpage>795</fpage>
				<lpage>804</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">11257184</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B29">
				<title>
					<p>Comparison of bluetongue type 20 with certain viruses of the bluetongue and Eubenangee serological groups of orbiviruses</p>
				</title>
				<aug>
					<au>
						<snm>Gorman</snm>
						<fnm>BM</fnm>
					</au>
					<au>
						<snm>Taylor</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Walker</snm>
						<fnm>PJ</fnm>
					</au>
					<au>
						<snm>Davidson</snm>
						<fnm>WL</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>F</fnm>
					</au>
				</aug>
				<source>J Gen Virol</source>
				<pubdate>1981</pubdate>
				<volume>57</volume>
				<fpage>251</fpage>
				<lpage>261</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">6275025</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B30">
				<title>
					<p>Assignment of the genome segments of bluetongue virus type 1 to the proteins which they encode</p>
				</title>
				<aug>
					<au>
						<snm>Mertens</snm>
						<fnm>PP</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Sangar</snm>
						<fnm>DV</fnm>
					</au>
				</aug>
				<source>Virology</source>
				<pubdate>1984</pubdate>
				<volume>135</volume>
				<fpage>207</fpage>
				<lpage>217</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/0042-6822(84)90131-4</pubid>
						<pubid idtype="pmpid">6328750</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B31">
				<title>
					<p>Correlation of serotype specificity and protein structure of the five U.S. serotypes of bluetongue virus</p>
				</title>
				<aug>
					<au>
						<snm>Mecham</snm>
						<fnm>JO</fnm>
					</au>
					<au>
						<snm>Dean</snm>
						<fnm>VC</fnm>
					</au>
					<au>
						<snm>Jochim</snm>
						<fnm>MM</fnm>
					</au>
				</aug>
				<source>J Gen Virol</source>
				<pubdate>1986</pubdate>
				<volume>67</volume>
				<fpage>2617</fpage>
				<lpage>2624</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">2432162</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B32">
				<title>
					<p>Expression of two related nonstructural proteins of bluetongue virus (BTV) type 10 in insect cells by a recombinant baculovirus: production of polyclonal ascitic fluid and characterization of the gene product in BTV-infected BHK cells</p>
				</title>
				<aug>
					<au>
						<snm>French</snm>
						<fnm>TJ</fnm>
					</au>
					<au>
						<snm>Inumaru</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Roy</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>J Virol</source>
				<pubdate>1989</pubdate>
				<volume>63</volume>
				<fpage>3270</fpage>
				<lpage>3278</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">250898</pubid>
						<pubid idtype="pmpid" link="fulltext">2545903</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B33">
				<title>
					<p>Identification of bluetongue virus type 17 genome segments coding for polypeptides associated with virus neutralization and intergroup reactivity</p>
				</title>
				<aug>
					<au>
						<snm>Grubman</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Appleton</snm>
						<fnm>JA</fnm>
					</au>
					<au>
						<snm>Letchworth</snm>
						<fnm>G</fnm>
						<suf>Jr</suf>
					</au>
				</aug>
				<source>Virology</source>
				<pubdate>1983</pubdate>
				<volume>131</volume>
				<fpage>355</fpage>
				<lpage>366</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/0042-6822(83)90503-2</pubid>
						<pubid idtype="pmpid">6318436</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B34">
				<title>
					<p>Identification and characterization of the structural and nonstructural proteins of African horsesickness virus and determination of the genome coding assignments</p>
				</title>
				<aug>
					<au>
						<snm>Grubman</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Lewis</snm>
						<fnm>SA</fnm>
					</au>
				</aug>
				<source>Virology</source>
				<pubdate>1992</pubdate>
				<volume>186</volume>
				<fpage>444</fpage>
				<lpage>451</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/0042-6822(92)90009-E</pubid>
						<pubid idtype="pmpid">1531096</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B35">
				<title>
					<p>Characterization of African horsesickness virus serotype 4-induced polypeptides in Vero cells and their reactivity in Western immunoblotting</p>
				</title>
				<aug>
					<au>
						<snm>Laviada</snm>
						<fnm>MD</fnm>
					</au>
					<au>
						<snm>Arias</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>S&#225;nchez-Vizca&#237;no</snm>
						<fnm>JM</fnm>
					</au>
				</aug>
				<source>J Gen Virol</source>
				<pubdate>1993</pubdate>
				<volume>74</volume>
				<fpage>81</fpage>
				<lpage>87</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">8423451</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B36">
				<title>
					<p>Characterization of the gene encoding core protein VP6 of two African horsesickness virus serotypes</p>
				</title>
				<aug>
					<au>
						<snm>Turnbull</snm>
						<fnm>PJ</fnm>
					</au>
					<au>
						<snm>Cormack</snm>
						<fnm>SB</fnm>
					</au>
					<au>
						<snm>Huismans</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>J Gen Virol</source>
				<pubdate>1996</pubdate>
				<volume>77</volume>
				<fpage>1421</fpage>
				<lpage>1423</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">8757982</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B37">
				<title>
					<p>Yunnan orbivirus, a new orbivirus species isolated from <it>Culex tritaeniorhynchus</it> mosquitoes in China</p>
				</title>
				<aug>
					<au>
						<snm>Attoui</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Mohd Jaafar</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Belhouchet</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Aldrovandi</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Tao</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Chen</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Liang</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Tesh</snm>
						<fnm>RB</fnm>
					</au>
					<au>
						<snm>de Micco</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>de Lamballerie</snm>
						<fnm>X</fnm>
					</au>
				</aug>
				<source>J Gen Virol</source>
				<pubdate>2005</pubdate>
				<volume>86</volume>
				<fpage>3409</fpage>
				<lpage>3417</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1099/vir.0.81258-0</pubid>
						<pubid idtype="pmpid" link="fulltext">16298988</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B38">
				<title>
					<p>Molecular analysis of the genome of Chuzan virus, a member of the Palyam serogroup viruses, and its phylogenetic relationships to other orbiviruses</p>
				</title>
				<aug>
					<au>
						<snm>Yamakawa</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Kubo</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Furuuchi</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>J Gen Virol</source>
				<pubdate>1999</pubdate>
				<volume>80</volume>
				<fpage>937</fpage>
				<lpage>941</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">10211963</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B39">
				<title>
					<p>Clustal W and Clustal X version 2.0</p>
				</title>
				<aug>
					<au>
						<snm>Larkin</snm>
						<fnm>MA</fnm>
					</au>
					<au>
						<snm>Blackshields</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>NP</fnm>
					</au>
					<au>
						<snm>Chenna</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>McGettigan</snm>
						<fnm>PA</fnm>
					</au>
					<au>
						<snm>McWilliam</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Valentin</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Wallace</snm>
						<fnm>IM</fnm>
					</au>
					<au>
						<snm>Wilm</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Lopez</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Thompson</snm>
						<fnm>JD</fnm>
					</au>
					<au>
						<snm>Gibson</snm>
						<fnm>TJ</fnm>
					</au>
					<au>
						<snm>Higgins</snm>
						<fnm>DG</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2007</pubdate>
				<volume>23</volume>
				<fpage>2947</fpage>
				<lpage>2948</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/btm404</pubid>
						<pubid idtype="pmpid" link="fulltext">17846036</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
		</refgrp>
	</bm>
</art>
