<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>gb-2005-6-7-r58</ui>
	<ji>GBJ</ji>
	<fm>
		<dochead>Research</dochead>
		<bibl>
			<title>
				<p>Creation and disruption of protein features by alternative splicing - a novel mechanism to modulate function</p>
			</title>
			<aug>
				<au id="A1">
					<snm>Hiller</snm>
					<fnm>Michael</fnm>
					<insr iid="I1"/>
					<email>hiller@inf.uni-jena.de</email>
				</au>
				<au id="A2">
					<snm>Huse</snm>
					<fnm>Klaus</fnm>
					<insr iid="I2"/>
				</au>
				<au id="A3">
					<snm>Platzer</snm>
					<fnm>Matthias</fnm>
					<insr iid="I2"/>
				</au>
				<au id="A4" ca="yes">
					<snm>Backofen</snm>
					<fnm>Rolf</fnm>
					<insr iid="I1"/>
					<email>backofen@inf.uni-jena.de</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Institute of Computer Science, Friedrich-Schiller-University Jena, Chair for Bioinformatics, Ernst-Abbe-Platz 2, 07743 Jena, Germany</p>
				</ins>
				<ins id="I2">
					<p>Genome Analysis, Institute of Molecular Biotechnology, Beutenbergstrasse 11, 07745 Jena, Germany</p>
				</ins>
			</insg>
			<source>Genome Biology</source>
			<issn>1465-6906</issn>
			<pubdate>2005</pubdate>
			<volume>6</volume>
			<issue>7</issue>
			<fpage>R58</fpage>
			<url>http://genomebiology.com/2005/6/7/R58</url>
			<xrefbib>
				<pubidlist><pubid idtype="pmpid">15998447</pubid><pubid idtype="doi">10.1186/gb-2005-6-7-r58</pubid>
				</pubidlist></xrefbib>
		</bibl>
		<history>
			<rec>
				<date>
					<day>25</day>
					<month>2</month>
					<year>2005</year>
				</date>
			</rec>
			<revrec>
				<date>
					<day>19</day>
					<month>4</month>
					<year>2005</year>
				</date>
			</revrec>
			<acc>
				<date>
					<day>9</day>
					<month>5</month>
					<year>2005</year>
				</date>
			</acc>
			<pub>
				<date>
					<day>22</day>
					<month>6</month>
					<year>2005</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2005</year>
			<collab>Hiller et al.; licensee BioMed Central Ltd</collab>
			<note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<shorttitle>
			<p>Creation and disruption of protein features by alternative splicing</p>
		</shorttitle>
		<shortabs>
			<p>A new mechanism of alternative splicing is proposed which creates a protein feature by putting together two non-consecutive exons and destroys a feature by inserting an exon in its body. Evidence for this rare mechanism is provided by a genome-wide search with four specific protein features.</p>
		</shortabs>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st>
					<p>Alternative splicing often occurs in the coding sequence and alters protein structure and function. It is mainly carried out in two ways: by skipping exons that encode a certain protein feature and by introducing a frameshift that changes the downstream protein sequence. These mechanisms are widespread and well investigated.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st>
					<p>Here, we propose an additional mechanism of alternative splicing to modulate protein function. This mechanism creates a protein feature by putting together two non-consecutive exons or destroys a feature by inserting an exon in its body. In contrast to other mechanisms, the individual parts of the feature are present in both splice variants but the feature is only functional in the splice form where both parts are merged. We provide evidence for this mechanism by performing a genome-wide search with four protein features: transmembrane helices, phosphorylation and glycosylation sites, and Pfam domains.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusion</p>
					</st>
					<p>We describe a novel type of event that creates or removes a protein feature by alternative splicing. Current data suggest that these events are rare. Besides the four features investigated here, this mechanism is conceivable for many other protein features, especially for small linear protein motifs. It is important for the characterization of functional differences of two splice forms and should be considered in genome-wide annotation efforts. Furthermore, it offers a novel strategy for <it>ab initio </it>prediction of alternative splice events.</p>
				</sec>
			</sec>
		</abs>
	</fm>
	<meta>
		<classifications>
			<classification type="BMC" subtype="man_spc_id" id="30010016">Molecular biology</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
		</classifications>
	</meta>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p>Alternative splicing is an important post-transcriptional process and a major contributor to the complexity of the transcriptome and proteome <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. It often produces two or more functionally distinct proteins from one gene <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> but can also downregulate the overall protein level by producing targets for nonsense-mediated mRNA decay <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, which is used, for example, in the autoregulation of splicing factors <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Furthermore, defects in splicing are the basis for a number of diseases <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>.</p>
			<p>One major mechanism of alternative splicing to alter protein function is the insertion/deletion of functional units such as protein domains, transmembrane (TM) helices, signal peptides, or coiled-coil regions. Alternative splicing tends to insert/delete complete functional units instead of affecting parts of a unit <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Moreover, several protein domains have a tendency to be spliced out in some transcripts <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. Many proteins occur in a soluble as well as a membrane-bound form. When encoded by a single gene, the soluble form can be produced by post-translational ectodomain shedding <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> or alternative splicing of exons that encode the TM helices. Indeed, 40-50% of the alternatively spliced, single-pass TM proteins have a splice form that specifically removes the TM domain <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. Furthermore, protein forms can differ in their affinity to bind ligands <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp> or in their subcellular location <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>.</p>
			<p>In this paper, we present a novel mechanism to modulate function and/or subcellular localization of a protein by alternative splicing. Assuming a protein feature is encoded in two parts by two non-consecutive exons, for example, exons 2 and 4, inclusion of exon 3 results in a protein lacking this feature since its two parts are disconnected at the sequence level. In contrast, the skipping of exon 3 leads to a protein with this feature. We provide evidence for this mechanism by considering four protein features: TM helices, phosphorylation and glycosylation sites, and Pfam domains. In general, this mechanism is conceivable for many other protein features and provides a novel strategy for <it>ab initio </it>prediction of alternative splice events.</p>
		</sec>
		<sec>
			<st>
				<p>Results and discussion</p>
			</st>
			<p>In order to find genes that encode a protein feature by two non-consecutive exons, we searched all human RefSeq transcripts for annotated features that span an exon boundary. For these exon pairs, we searched dbEST to find alternative splice events that insert a sequence between them. Thus, we only selected pairs of exons if they had expressed sequence tag (EST)-confirmed, alternative exons between them that are skipped in the given RefSeq. Apart from alternative exons, intron retention or an alternative donor/acceptor site located in the intron can lead to such an insert. We only selected inserts that preserved the open reading frame. Then we evaluated whether the longer transcript (with the insert) still encodes the feature or not. We only considered two exons for small features like TM helices and post-translational modification contexts since it is unlikely that more than two exons encode the feature. For more complex features like Pfam domains, we allowed for the domain to be encoded by more than two exons.</p>
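			<p>The selection step described above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline used for the study: the function names, the 0-based half-open transcript coordinates and the reduction of open reading frame preservation to an insert length divisible by three are assumptions made here. A full check would also have to exclude in-frame stop codons within the insert.</p>

```python
def spans_boundary(feature_start: int, feature_end: int, boundary: int) -> bool:
    """True if the feature [feature_start, feature_end) lies on both sides of an exon boundary.

    All values are 0-based nucleotide positions in the transcript (an assumed
    convention); boundary is the first nucleotide of the downstream exon.
    """
    return boundary > feature_start and feature_end > boundary


def preserves_frame(insert_length: int) -> bool:
    """True if an insert of this length (in nucleotides) keeps the downstream frame."""
    return insert_length > 0 and insert_length % 3 == 0


def is_candidate(feature_start: int, feature_end: int,
                 boundary: int, insert_length: int) -> bool:
    """A feature split by a boundary, combined with a frame-preserving insert there."""
    return (spans_boundary(feature_start, feature_end, boundary)
            and preserves_frame(insert_length))
```

			<p>Under these assumptions, an insert of 48 nucleotides passes the frame check and adds 16 codons, whereas an insert of 47 nucleotides would be rejected.</p>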
			<p>The first protein feature we considered was the TM domain. We annotated TM helices in all RefSeq transcripts with the TMHMM program <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. We found 1,807 TM domains (14% of all TM domains) that are encoded by two exons (Additional data file 1). For ten cases, we found EST evidence for an insert due to alternative splicing. As TM domains are short stretches of hydrophobic amino acids, an insert with polar residues will result in the destruction of the TM helix. Indeed, the evaluation of these ten longer transcripts with TMHMM showed that six clearly lacked the TM domain which, in three cases, leads to a soluble protein (Table <tblr tid="T1">1</tblr>). An example of the disruption of the single TM domain is depicted for <it>DIABLO </it>in Figure <figr fid="F1">1a</figr>. A more complex example is the Rhesus blood group antigen gene (<it>RHCE</it>), where the inclusion of two exons resulted in the loss of one TM domain as well as the gain of three others (Figure <figr fid="F1">1b</figr>). The massive reconstruction of TM domains in the respective protein isoforms can have considerable consequences for the orientation of the proteins within the cellular membrane and for their interaction with other membrane components.</p>
			<tbl id="T1">
				<title>
					<p>Table 1</p>
				</title>
				<caption>
					<p>RefSeq transcripts where, due to alternative splicing, sequence insertion destroys a TM helix</p>
				</caption>
				<tblbdy cols="6">
					<r>
						<c ca="left">
							<p>Gene symbol</p>
						</c>
						<c ca="left">
							<p>Gene name</p>
						</c>
						<c ca="left">
							<p>RefSeq with TM*</p>
						</c>
						<c ca="left">
							<p>RefSeq/EST without TM<sup>&#8224;</sup></p>
						</c>
						<c ca="left">
							<p>Alternative splice event<sup>&#8225;</sup></p>
						</c>
						<c ca="left">
							<p>Impact</p>
						</c>
					</r>
					<r>
						<c cspan="6">
							<hr/>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>
								<it>DIABLO</it>
							</p>
						</c>
						<c ca="left">
							<p>Diablo homolog (<it>Drosophila</it>)</p>
						</c>
						<c ca="left">
							<p>NM_138929</p>
						</c>
						<c ca="left">
							<p>NM_019887</p>
						</c>
						<c ca="left">
							<p>Exon between exon 2 and 3</p>
						</c>
						<c ca="left">
							<p>Disruption of the single TM domain, soluble protein</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>
								<it>DPP8</it>
							</p>
						</c>
						<c ca="left">
							<p>Dipeptidylpeptidase 8</p>
						</c>
						<c ca="left">
							<p>NM_017743</p>
						</c>
						<c ca="left">
							<p>NM_197961</p>
						</c>
						<c ca="left">
							<p>Exon between exon 15 and 16</p>
						</c>
						<c ca="left">
							<p>Disruption of the single TM domain, soluble protein</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>
								<it>COX7A2</it>
							</p>
						</c>
						<c ca="left">
							<p>Cytochrome c oxidase subunit VIIa polypeptide 2 (liver)</p>
						</c>
						<c ca="left">
							<p>NM_001865</p>
						</c>
						<c ca="left">
							<p>BU570379</p>
						</c>
						<c ca="left">
							<p>Donor downstream of exon 3</p>
						</c>
						<c ca="left">
							<p>Disruption of the single TM domain, soluble protein</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>
								<it>RHCE</it>
							</p>
						</c>
						<c ca="left">
							<p>Rhesus blood group, CcEe antigens</p>
						</c>
						<c ca="left">
							<p>NM_138617</p>
						</c>
						<c ca="left">
							<p>NM_138618</p>
						</c>
						<c ca="left">
							<p>Two exons between exon 3 and 4</p>
						</c>
						<c ca="left">
							<p>Disruption of the fifth TM domain, insert contains three new TM domains</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>na</p>
						</c>
						<c ca="left">
							<p>na</p>
						</c>
						<c ca="left">
							<p>NM_014738</p>
						</c>
						<c ca="left">
							<p>BM693684</p>
						</c>
						<c ca="left">
							<p>Intron between exon 30 and 31</p>
						</c>
						<c ca="left">
							<p>Disruption of the eighth TM domain</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>na</p>
						</c>
						<c ca="left">
							<p>na</p>
						</c>
						<c ca="left">
							<p>NM_152672</p>
						</c>
						<c ca="left">
							<p>CF147426</p>
						</c>
						<c ca="left">
							<p>Acceptor upstream of exon 4</p>
						</c>
						<c ca="left">
							<p>Disruption of the second TM domain</p>
						</c>
					</r>
				</tblbdy>
				<tblfn>
					<p>*RefSeq transcript without the insert (shorter variant) that encodes a TM domain. <sup>&#8224;</sup>Transcript with the insert (longer variant) that destroys a TM helix. <sup>&#8225;</sup>Exon numbers refer to the RefSeq transcript with the TM helix. na, not approved; TM, transmembrane.</p>
				</tblfn>
			</tbl>
			<fig id="F1">
				<title>
					<p>Figure 1</p>
				</title>
				<caption>
					<p>TM domain destruction by exon insertion</p>
				</caption>
				<text>
					<p>TM domain destruction by exon insertion. <b>(a) </b>Exons 2 and 3 of NM_138929 of <it>DIABLO </it>encode a TM domain (shown as blue boxes). This TM domain is destroyed in another transcript (NM_019887) that includes an additional exon. The inserted exon (shown in red) encodes many polar amino acids. <b>(b) </b>Exons 3 and 4 of NM_138617 of <it>RHCE </it>encode a TM domain that is destroyed in NM_138618 by the inclusion of two exons. Interestingly, the two included exons encode three new TM domains. Thus, skipping exons 4 and 5 of NM_138618 results in a protein that has two, rather than three, fewer TM domains. Exon numbers refer to the respective transcript. TM, transmembrane.</p>
				</text>
				<graphic file="gb-2005-6-7-r58-1"/>
			</fig>
			<p>To find further cases of feature disruption by sequence insertion, we applied the procedure to experimentally verified post-translational modification sites. Post-translational modification of proteins plays a role in various important processes. For example, phosphorylation of splicing factors can influence splicing decisions <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> and glycosylation is associated with a modulation of proteolytic resistance and ligand binding <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. The residue to be modified must be located in a favorable sequence context to be recognized by the enzyme. If this residue is close to an exon boundary, an alternative splice event can change the context to an unfavorable one with the consequence that the modification cannot take place anymore. We inspected the O-GlycBase <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> and Phospho.ELM <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, and found 435 modified residues that are close to 213 different exon-exon junctions. Among them, four exon junctions showed an insert due to alternative splicing. <it>CCL14 </it>has a glycosylated serine at position 26, which is the last residue encoded by exon 1. We found two ESTs (AA612866, Z70293) with an included 48-nucleotide exon between exon 1 and 2. The NetOGlyc <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> score for the serine in the new sequence context dropped from 0.97 to 0.35 (threshold 0.5). Thus, the new context might prevent glycosylation of this residue. For <it>CDK5</it>, an alternative acceptor (BU529114) that inserts nine amino acids upstream of exon 8 alters the context of the phosphorylated serine at position 159 of the protein. The NetPhos <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> scores of both contexts differ (0.93 vs 0.43, threshold 0.5), which indicates that only one context allows recognition by the kinase and, thus, the phosphorylation of the serine. 
Additionally, we found two examples (<it>MGP </it>and <it>CDK2</it>) where an included exon alters the context of a phosphorylated residue; however, the scores for the new contexts dropped only marginally.</p>
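			<p>The score comparison used for the modification sites can be written as a simple threshold test. The sketch below is illustrative only: the function name is hypothetical, and treating a score equal to the threshold as a pass is an assumption, since the text gives only the threshold value of 0.5. The scores are the NetOGlyc and NetPhos values quoted above.</p>

```python
def only_one_context_passes(score_a: float, score_b: float,
                            threshold: float = 0.5) -> bool:
    """True if exactly one of the two sequence contexts reaches the threshold."""
    return (score_a >= threshold) != (score_b >= threshold)


# Predictor scores for the two sequence contexts of each modified residue.
reported_scores = {
    "CCL14": (0.97, 0.35),  # NetOGlyc, glycosylated serine at position 26
    "CDK5": (0.93, 0.43),   # NetPhos, phosphorylated serine at position 159
}

for gene, (score_a, score_b) in reported_scores.items():
    print(gene, only_one_context_passes(score_a, score_b))
```

			<p>Both reported pairs fall on opposite sides of the threshold, which is the pattern expected when the insert switches the modification off.</p>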
			<p>For the fourth feature, we considered functional protein domains using the Pfam database <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. We found 473 inserts into a Pfam domain and nine of those resulted in a disruption of the Pfam (Table <tblr tid="T2">2</tblr>). Additionally, using the algorithm described in <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, we found three cases where the skipping of a RefSeq exon creates a new Pfam (Table <tblr tid="T2">2</tblr>). For example, skipping exon 4 of NM_024565 created the cyclin N-terminal domain (Figure <figr fid="F2">2a</figr>). Since exons 5 to 7 of this transcript encode the cyclin C-terminal domain (PF02984), only the exon skipping variant might perform the function of a cyclin. Moreover, skipping exon 2 of NM_139174 resulted in a new double-stranded RNA binding domain (Figure <figr fid="F2">2b</figr>). Downstream of this domain, the transcript encodes an adenosine-deaminase (editase) domain. Thus, the loss of the RNA binding property might act as a negative regulation of the editase activity. Most Pfam domains fold into three-dimensional structures and we cannot rule out that these 12 domains also adopt the correct folding with the insert. However, using standard cut-off scores, these Pfam domains cannot be found in the longer transcripts since the scores for both individual parts are always below the threshold.</p>
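			<p>The criterion in the last sentence, that neither partial domain reaches the cut-off while the joined domain does, can be checked directly against the scores listed in Table 2. The following Python sketch is a hedged illustration: the class and function names are hypothetical, and only three of the twelve rows are reproduced, keyed by Pfam ID.</p>

```python
from typing import NamedTuple


class PfamScores(NamedTuple):
    cutoff: float      # per-domain gathering cut-off
    upstream: float    # score of the part encoded by the upstream exon
    downstream: float  # score of the part encoded by the downstream exon
    combined: float    # score of the domain in the splice form without the insert


def domain_needs_joined_exons(s: PfamScores) -> bool:
    """True if both parts score below the cut-off but the joined domain reaches it."""
    parts_fail = s.cutoff > s.upstream and s.cutoff > s.downstream
    return parts_fail and s.combined >= s.cutoff


# Values copied from Table 2.
rows = {
    "PF00642": PfamScores(17.5, -1.2, 9.4, 23.6),
    "PF00170": PfamScores(23.2, 16.1, -4.6, 31.3),
    "PF00134": PfamScores(17.0, 0.3, 9.6, 52.9),
}

for pfam_id, scores in rows.items():
    print(pfam_id, domain_needs_joined_exons(scores))
```

			<p>A domain whose upstream part alone already exceeded the cut-off would return False here, because the insert would then not remove the domain call.</p>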
			<tbl id="T2">
				<title>
					<p>Table 2</p>
				</title>
				<caption>
					<p>RefSeq transcripts with an exon skipping splice form that puts together a new Pfam domain</p>
				</caption>
				<tblbdy cols="11">
					<r>
						<c ca="left">
							<p>Gene symbol</p>
						</c>
						<c ca="left">
							<p>Gene name</p>
						</c>
						<c ca="left">
							<p>RefSeq/EST with Pfam*</p>
						</c>
						<c ca="left">
							<p>RefSeq/EST without Pfam<sup>&#8224;</sup></p>
						</c>
						<c ca="left">
							<p>Pfam ID</p>
						</c>
						<c ca="left">
							<p>Pfam description</p>
						</c>
						<c ca="left">
							<p>Alternative splice event<sup>&#8225;</sup></p>
						</c>
						<c ca="left">
							<p>Pfam cutoff score<sup>&#167;</sup></p>
						</c>
						<c ca="left">
							<p>Score upstream<sup>&#182;</sup></p>
						</c>
						<c ca="left">
							<p>Score downstream<sup>&#165;</sup></p>
						</c>
						<c ca="left">
							<p>Score combined<sup>#</sup></p>
						</c>
					</r>
					<r>
						<c cspan="11">
							<hr/>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>na</p>
						</c>
						<c ca="left">
							<p>na</p>
						</c>
						<c ca="left">
							<p>NM_144604</p>
						</c>
						<c ca="left">
							<p>AK056632</p>
						</c>
						<c ca="left">
							<p>PF00642</p>
						</c>
						<c ca="left">
							<p>Zinc finger C-x8-C-x5-C-x3-H type (and similar)</p>
						</c>
						<c ca="left">
							<p>Exon between exon 3 and 4</p>
						</c>
						<c ca="left">
							<p>17.5</p>
						</c>
						<c ca="left">
							<p>-1.2</p>
						</c>
						<c ca="left">
							<p>9.4</p>
						</c>
						<c ca="left">
							<p>23.6</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>
								<it>PRSS25</it>
							</p>
						</c>
						<c ca="left">
							<p>protease, serine, 25</p>
						</c>
						<c ca="left">
							<p>NM_145074</p>
						</c>
						<c ca="left">
							<p>AF141306</p>
						</c>
						<c ca="left">
							<p>PF00089</p>
						</c>
						<c ca="left">
							<p>Trypsin</p>
						</c>
						<c ca="left">
							<p>Acceptor upstream of exon 4</p>
						</c>
						<c ca="left">
							<p>23.4</p>
						</c>
						<c ca="left">
							<p>3.0</p>
						</c>
						<c ca="left">
							<p>1.1</p>
						</c>
						<c ca="left">
							<p>30.8</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>
								<it>FOSL2</it>
							</p>
						</c>
						<c ca="left">
							<p>FOS-like antigen 2</p>
						</c>
						<c ca="left">
							<p>NM_005253</p>
						</c>
						<c ca="left">
							<p>BX647822</p>
						</c>
						<c ca="left">
							<p>PF00170</p>
						</c>
						<c ca="left">
							<p>bZIP transcription factor</p>
						</c>
						<c ca="left">
							<p>Acceptor upstream of exon 4</p>
						</c>
						<c ca="left">
							<p>23.2</p>
						</c>
						<c ca="left">
							<p>16.1</p>
						</c>
						<c ca="left">
							<p>-4.6</p>
						</c>
						<c ca="left">
							<p>31.3</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>na</p>
						</c>
						<c ca="left">
							<p>na</p>
						</c>
						<c ca="left">
							<p>NM_003622</p>
						</c>
						<c ca="left">
							<p>AB033056</p>
						</c>
						<c ca="left">
							<p>PF02920</p>
						</c>
						<c ca="left">
							<p>Integrase_DNA</p>
						</c>
						<c ca="left">
							<p>Exon between exon 8 and 9</p>
						</c>
						<c ca="left">
							<p>18.0</p>
						</c>
						<c ca="left">
							<p>13.4</p>
						</c>
						<c ca="left">
							<p>-5.0</p>
						</c>
						<c ca="left">
							<p>21.9</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>na</p>
						</c>
						<c ca="left">
							<p>na</p>
						</c>
						<c ca="left">
							<p>NM_006832</p>
						</c>
						<c ca="left">
							<p>AK091532</p>
						</c>
						<c ca="left">
							<p>PF00373</p>
						</c>
						<c ca="left">
							<p>FERM domain (Band 4.1 family)</p>
						</c>
						<c ca="left">
							<p>Exon between exon 12 and 13</p>
						</c>
						<c ca="left">
							<p>14.0</p>
						</c>
						<c ca="left">
							<p>-15.9</p>
						</c>
						<c ca="left">
							<p>10.3</p>
						</c>
						<c ca="left">
							<p>15.6</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>
								<it>PQBP1</it>
							</p>
						</c>
						<c ca="left">
							<p>Polyglutamine binding protein 1</p>
						</c>
						<c ca="left">
							<p>NM_144494</p>
						</c>
						<c ca="left">
							<p>BM692479</p>
						</c>
						<c ca="left">
							<p>PF00397</p>
						</c>
						<c ca="left">
							<p>WW domain</p>
						</c>
						<c ca="left">
							<p>Acceptor upstream of exon 3</p>
						</c>
						<c ca="left">
							<p>17.0</p>
						</c>
						<c ca="left">
							<p>5.0</p>
						</c>
						<c ca="left">
							<p>9.7</p>
						</c>
						<c ca="left">
							<p>32.5</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>
								<it>MRPL27</it>
							</p>
						</c>
						<c ca="left">
							<p>Mitochondrial ribosomal protein L27</p>
						</c>
						<c ca="left">
							<p>NM_148570</p>
						</c>
						<c ca="left">
							<p>BQ028639</p>
						</c>
						<c ca="left">
							<p>PF01016</p>
						</c>
						<c ca="left">
							<p>Ribosomal L27 protein</p>
						</c>
						<c ca="left">
							<p>Acceptor upstream of exon 4</p>
						</c>
						<c ca="left">
							<p>25.0</p>
						</c>
						<c ca="left">
							<p>2.1</p>
						</c>
						<c ca="left">
							<p>8.2</p>
						</c>
						<c ca="left">
							<p>34.0</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>
								<it>PLEKHB1</it>
							</p>
						</c>
						<c ca="left">
							<p>Pleckstrin homology domain containing, family B (evectins) member 1</p>
						</c>
						<c ca="left">
							<p>NM_021200</p>
						</c>
						<c ca="left">
							<p>BE703269</p>
						</c>
						<c ca="left">
							<p>PF00169</p>
						</c>
						<c ca="left">
							<p>PH domain</p>
						</c>
						<c ca="left">
							<p>Acceptor upstream of exon 3</p>
						</c>
						<c ca="left">
							<p>22.8</p>
						</c>
						<c ca="left">
							<p>-3.3</p>
						</c>
						<c ca="left">
							<p>11.4</p>
						</c>
						<c ca="left">
							<p>29.7</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>na</p>
						</c>
						<c ca="left">
							<p>na</p>
						</c>
						<c ca="left">
							<p>NM_020679</p>
						</c>
						<c ca="left">
							<p>BP265352</p>
						</c>
						<c ca="left">
							<p>PF02854</p>
						</c>
						<c ca="left">
							<p>MIF4G domain</p>
						</c>
						<c ca="left">
							<p>Donor downstream of exon 6</p>
						</c>
						<c ca="left">
							<p>14.0</p>
						</c>
						<c ca="left">
							<p>1.1</p>
						</c>
						<c ca="left">
							<p>0.2</p>
						</c>
						<c ca="left">
							<p>17.2</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>
								<it>TRUB2</it>
							</p>
						</c>
						<c ca="left">
							<p>TruB pseudouridine (psi) synthase homolog 2 (<it>E. coli</it>)</p>
						</c>
						<c ca="left">
							<p>BE793897</p>
						</c>
						<c ca="left">
							<p>NM_015679</p>
						</c>
						<c ca="left">
							<p>PF00849</p>
						</c>
						<c ca="left">
							<p>RNA pseudouridylate synthase</p>
						</c>
						<c ca="left">
							<p>Skip exon 2</p>
						</c>
						<c ca="left">
							<p>14.0</p>
						</c>
						<c ca="left">
							<p>-2.1</p>
						</c>
						<c ca="left">
							<p>-1.3</p>
						</c>
						<c ca="left">
							<p>14.7</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>na</p>
						</c>
						<c ca="left">
							<p>na</p>
						</c>
						<c ca="left">
							<p>BM903757</p>
						</c>
						<c ca="left">
							<p>NM_024565</p>
						</c>
						<c ca="left">
							<p>PF00134</p>
						</c>
						<c ca="left">
							<p>Cyclin, N-terminal domain</p>
						</c>
						<c ca="left">
							<p>Skip exon 4</p>
						</c>
						<c ca="left">
							<p>17.0</p>
						</c>
						<c ca="left">
							<p>0.3</p>
						</c>
						<c ca="left">
							<p>9.6</p>
						</c>
						<c ca="left">
							<p>52.9</p>
						</c>
					</r>
					<r>
						<c ca="left">
							<p>na</p>
						</c>
						<c ca="left">
							<p>na</p>
						</c>
						<c ca="left">
							<p>BC033491</p>
						</c>
						<c ca="left">
							<p>NM_139174</p>
						</c>
						<c ca="left">
							<p>PF00035</p>
						</c>
						<c ca="left">
							<p>Double-stranded RNA binding motif</p>
						</c>
						<c ca="left">
							<p>Skip exon 2</p>
						</c>
						<c ca="left">
							<p>17.0</p>
						</c>
						<c ca="left">
							<p>-5.2</p>
						</c>
						<c ca="left">
							<p>13.5</p>
						</c>
						<c ca="left">
							<p>21.7</p>
						</c>
					</r>
				</tblbdy>
				<tblfn>
					<p>*Transcript without the insert (shorter variant) that encodes a Pfam domain. <sup>&#8224;</sup>Transcript with the insert (longer variant) that does not encode a Pfam domain. <sup>&#8225;</sup>Exon numbers refer to the RefSeq transcript. <sup>&#167;</sup>Per-domain 'gathering cut-offs' as given in the Pfam database. <sup>&#182;</sup>,<sup>&#165;</sup>Pfam score for the partial domain encoded by the upstream and downstream exon, respectively. <sup>#</sup>Pfam score for the domain that is encoded by the splice form without the insert. na, not approved.</p>
				</tblfn>
			</tbl>
			<fig id="F2">
				<title>
					<p>Figure 2</p>
				</title>
				<caption>
					<p>Pfam creation by exon skipping</p>
				</caption>
				<text>
					<p>Pfam creation by exon skipping. The alternative exon is shown in red. The two partial Pfam alignments for the RefSeq transcript and the complete alignment for the exon-skipping variant are shown above and below the partial gene structure, respectively. Dashed lines indicate parts of the exon for which a Pfam alignment has been found. <b>(a) </b>NM_024565 has a splice form that skips exon 4 (shown in red), which results in the creation of a new domain. The Pfam scores for the separated parts are far below the threshold score of 17 and, thus, the Pfam is not found for the longer transcript. <b>(b) </b>Skipping exon 2 of NM_139174 results in a new double-stranded RNA binding Pfam.</p>
				</text>
				<graphic file="gb-2005-6-7-r58-2"/>
			</fig>
			<p>In general, any EST-based approach is hampered by the bias of publicly available EST databases towards cancer-related tissues or cell lines that may exhibit aberrant splicing <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>. Furthermore, a splice form that is only represented by a single EST may be a rare error by the spliceosome. Therefore, we determined the number and tissue source of the ESTs that match both splice variants for the described examples (Additional data file 2). For seven of the 20 examples, only one splice form is represented by a single EST or by cancer-related ESTs. However, the remaining examples are supported by several ESTs as well as ESTs from normal tissue, and in four cases both splice variants are contained in the RefSeq database. Thus, we conclude that the majority of the described examples are real splice variants and not artifacts or aberrant splice events.</p>
			<p>Besides the four features investigated here, there are many others that can only function if they are contiguous at the sequence level. Such functional sites or motifs often have a linear structure and comprise, for example, signal peptides, post-translational cleavage sites and subcellular localization signals as well as sites for protein-protein interaction. Many of these motifs are collected in the Eukaryotic Linear Motif (ELM) database <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. Such features can lose their function if an insert separates their parts at the sequence level. For example, splicing at an alternative donor site of protein kinase C delta leads to an insert of 26 amino acids into a caspase-3 cleavage site and to an isoform that is caspase-insensitive <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. We have not investigated such features here since only a fraction of them have been experimentally verified and computational prediction yields a high number of false positives. With further efforts in verifying and characterizing these features, we expect an increasing number of examples for the proposed mechanism of modulating protein function by alternative splicing. Interestingly, the same principle was recently used to experimentally characterize exonic splicing silencers (ESSs) <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. In that study, ESS candidates were inserted into the middle exon of a three-exon minigene. If a candidate ESS acts as a silencer, the middle exon is skipped, and only in this case is a functional green fluorescent protein encoded. Furthermore, this mechanism is not restricted to protein features; it is also conceivable for sequence and structural features at the mRNA level. For example, some of the variable first exons of <it>NOS1</it>, together with exon 2, form a hairpin structure that is involved in translational regulation, whereas other alternative first exons do not allow hairpin formation <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>.</p>
			<p>From an evolutionary viewpoint, this mechanism can be explained in two ways, depending on whether the protein feature is ancestral or not. If the feature is ancestral, it was initially encoded by two neighboring exons and the inserted part must have appeared later in the intronic sequence <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>. In this case, the insert simply has the function of a spacer. If the feature is not ancestral, the longer splice form is evolutionarily older and, therefore, the alternative exon or splice site must have been converted from a constitutive to an alternative one. This can happen, for example, by the weakening of splice sites or the creation of ESSs <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. Complex features with a high sequence specificity, such as Pfam domains, are likely to be ancestral. In contrast, small features with a loose sequence motif, such as the context of a post-translational modification site, can arise just by chance and can therefore be evolutionarily younger.</p>
			<p>Not all alternative splice events are represented in EST databases and, thus, the development of non-EST-based methods for <it>ab initio </it>prediction of splice events is a necessary but challenging task. Currently, there is only one such method, which mainly uses genomic conservation of exons and flanking introns to discriminate between alternative and constitutive exons <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. Although alternative splicing often deletes functional units, it is very hard to predict such events at the protein level without ESTs. However, a search for protein features that are put together by exon skipping would provide a new way to predict alternative splice events. For that purpose, it has to be assumed that the split feature is unlikely to be encoded by two non-consecutive exons just by chance. Since Pfam domains usually have a high sequence specificity, we tested this assumption for Pfam domains by skipping 10,962 constitutive exons. We found only four cases (0.036%) where skipping of a constitutive exon results in an additional Pfam domain (Additional data file 3). In contrast, nine of the 473 (1.9%) alternatively spliced inserts into Pfam domains resulted in a loss of the Pfam domain. The odds ratio of 53 indicates that Pfam domains are unlikely to be encoded by non-consecutive exons just by chance.</p>
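			<p>As a rough check, the odds ratio quoted above can be recomputed from the two counts given in the text (9 of 473 alternatively spliced inserts; 4 of 10,962 skipped constitutive exons). The following Python sketch assumes a plain 2x2 odds ratio; the function name <b>odds_ratio</b> is illustrative and is not taken from the original analysis.</p>
			<p><![CDATA[
```python
def odds_ratio(hits_a, total_a, hits_b, total_b):
    """Odds ratio of group A versus group B from hit counts and group sizes."""
    odds_a = hits_a / (total_a - hits_a)  # hits versus non-hits in group A
    odds_b = hits_b / (total_b - hits_b)  # hits versus non-hits in group B
    return odds_a / odds_b


# Counts as stated in the text: alternative inserts versus constitutive exons.
ratio = odds_ratio(9, 473, 4, 10962)
print(round(ratio))  # prints 53
```
]]></p>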
		</sec>
		<sec>
			<st>
				<p>Conclusion</p>
			</st>
			<p>Alternative splicing frequently modulates protein function by insertion or deletion of functional units. In these cases, the functional difference is directly associated with the sequence of the inserted or deleted part. Here, we provide evidence for an additional mechanism that acts by putting together a feature from two parts encoded by non-consecutive exons. Thus, the functional difference is not related to a specific insert, and the two parts of the feature are present in both the long and the short splice form. The general idea is shown in Figure <figr fid="F3">3</figr>.</p>
			<fig id="F3">
				<title>
					<p>Figure 3</p>
				</title>
				<caption>
					<p>General mechanisms to alter linear protein features by alternative splicing</p>
				</caption>
				<text>
					<p>General mechanisms to alter linear protein features by alternative splicing. <b>(a) </b>A widespread mechanism is to skip or include an alternative exon (red box) that encodes a functional unit (indicated by the light bulb). The longer splice form with the alternative exon encodes a protein with this feature, the shorter splice form encodes a protein without this feature. <b>(b) </b>The novel mechanism involves a functional unit that is encoded by two non-consecutive exons (the two parts of the light bulb). In contrast to the mechanism mentioned above, the longer splice form encodes a protein without the functional unit although both parts are present on the protein sequence. The disruption of the unit results in a loss of function. The shorter splice form encodes a protein that puts together both parts of the unit which results in a gain of function (complete light bulb).</p>
				</text>
				<graphic file="gb-2005-6-7-r58-3"/>
			</fig>
			<p>Recent alternative splicing databases include the annotation of the functional differences between two protein forms <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. For this purpose, the novel mechanism described here has to be taken into account since it is obviously not sufficient to inspect the alternative exons in the context of the splice form that includes these exons. The functional difference of the examples shown here can only be found if the complete shorter splice form is investigated simultaneously.</p>
		</sec>
		<sec>
			<st>
				<p>Materials and methods</p>
			</st>
			<sec>
				<st>
					<p>General procedure</p>
				</st>
				<p>All transcripts were taken from the RefSeq annotations in the UCSC Genome Browser (assembly hg16 with annotation March 2004) <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. For exon pairs that together encode a protein feature, we extracted a 40-nucleotide context (20 nucleotides from the upstream and 20 nucleotides from the downstream exon) and used it to search the human fraction of dbEST (August 2004) with BLAST <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. We only kept EST hits with two separate HSPs (high-scoring segment pairs). We discarded splice events that resulted in a frameshift and/or introduced a premature termination codon (PTC), since a frameshift leads to a new protein sequence downstream of the alternative splice site and transcripts with PTCs are frequently degraded by nonsense-mediated mRNA decay. Intron retention events were only included if the EST had a spliced intron up- or downstream. For the insertions, we checked for the presence of AG-GT splice sites. All splice forms were translated with the insertion, and we checked whether the insert destroyed the feature.</p>
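				<p>Two of the steps above can be sketched in Python: building the 40-nucleotide junction context and rejecting inserts that shift the frame or add a premature stop codon. This is a minimal sketch, not the original pipeline. The function names are hypothetical, sequences are assumed to be upper-case DNA, and the coding sequence is assumed to start at the start codon and end with the normal stop codon.</p>
				<p><![CDATA[
```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def junction_context(upstream_exon, downstream_exon, flank=20):
    """Join the last `flank` nt of the upstream exon to the first `flank` nt
    of the downstream exon (40 nt in total for flank=20)."""
    return upstream_exon[-flank:] + downstream_exon[:flank]


def insert_is_acceptable(cds_with_insert, insert_length):
    """True if the insert keeps the reading frame and no stop codon occurs
    before the final codon of the coding sequence."""
    if insert_length % 3 != 0:
        return False  # frameshift
    # All in-frame codons except the last one (the regular stop codon).
    codons = [cds_with_insert[i:i + 3]
              for i in range(0, len(cds_with_insert) - 3, 3)]
    return not any(codon in STOP_CODONS for codon in codons)
```
]]></p>
				<p>The resulting context string would then serve as the BLAST query; the BLAST search itself and the HSP filtering are not shown here.</p>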
			</sec>
			<sec>
				<st>
					<p>TM domains</p>
				</st>
				<p>We predicted TM helices with TMHMM for all translated transcripts, since TMHMM has been found to be the best-performing TM prediction program <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. The TM domain location was mapped to the exon structure, and we considered a TM helix to be encoded by two exons if each exon encoded at least 25% of the domain.</p>
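				<p>The 25% criterion can be illustrated with a short Python sketch. It assumes that the helix and the exons are given as inclusive intervals in protein coordinates and that each residue is assigned to exactly one exon; both the representation and the function names are assumptions made for illustration.</p>
				<p><![CDATA[
```python
def overlap(a_start, a_end, b_start, b_end):
    """Number of positions shared by two inclusive intervals."""
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)


def encoded_by_two_exons(helix, exons, min_fraction=0.25):
    """True if exactly two exons overlap the helix and each of them
    covers at least `min_fraction` of its residues."""
    h_start, h_end = helix
    length = h_end - h_start + 1
    shares = [overlap(h_start, h_end, start, end) / length
              for start, end in exons]
    covering = [share for share in shares if share > 0]
    return len(covering) == 2 and all(s >= min_fraction for s in covering)
```
]]></p>
				<p>For example, a 20-residue helix at positions 101 to 120 that is split evenly by an exon boundary after residue 110 would pass, whereas a boundary after residue 103 would leave only 15% in the upstream exon and fail.</p>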
			</sec>
			<sec>
				<st>
					<p>Glycosylation and phosphorylation contexts</p>
				</st>
				<p>We used Phospho.ELM version 2.0 and O-GlycBase v6.00. The SwissProt IDs were converted to RefSeq IDs with the table from the HUGO gene nomenclature committee website <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. The location of the modified residues was mapped to the exon structure and we retained those close to an exon boundary (&lt;10 amino acid distance for glycosylated and &lt;5 amino acid distance for phosphorylated residues). To compute the scores for the glycosylated serine, we used NetOGlyc 2.0 because the latest version (3.1) is not able to recognize the serine in the annotated context.</p>
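				<p>The distance filter described above can be sketched as follows. The cutoffs are those given in the text; how the distance to an exon boundary is measured is an assumption here (absolute difference between the residue position and the boundary position, both in protein coordinates), and the names are hypothetical.</p>
				<p><![CDATA[
```python
# Cutoffs from the text: strictly fewer than 10 residues for glycosylated
# and strictly fewer than 5 residues for phosphorylated positions.
MAX_DISTANCE = {"glycosylation": 10, "phosphorylation": 5}


def near_exon_boundary(residue_pos, boundary_positions, modification):
    """True if the residue lies closer than the cutoff to any exon boundary."""
    cutoff = MAX_DISTANCE[modification]
    return any(abs(residue_pos - boundary) < cutoff
               for boundary in boundary_positions)
```
]]></p>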
			</sec>
			<sec>
				<st>
					<p>Pfam domains</p>
				</st>
				<p>Pfam domains were found with hmmpfam using the 'gathering cutoff' scores as given in the Pfam database (version 14). We considered domains with fewer than 200 residues that are encoded by two or more exons, with each exon encoding at least two residues of the domain. Additionally, we used the algorithm described in <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> to find cases where the RefSeq transcript is the longer splice form and a shorter exon skipping variant exists that encodes a new Pfam domain. To confirm such candidate splice forms, we searched dbEST with BLAST and the 40-nucleotide context from the up- and downstream exon.</p>
			</sec>
			<sec>
				<st>
					<p>Test of Pfam domain creation by chance</p>
				</st>
				<p>We compiled a set of 10,962 internal coding exons with a size divisible by three that had at least six ESTs showing their inclusion but no EST indicating their skipping. Those exons were considered to be constitutive. We produced the full-length protein and the shorter protein that corresponds to the hypothetical splice form without such an exon. Then, we used hmmpfam with the gathering cut-offs to search the Pfam database and compared the Pfam family hits for the full-length and the shorter protein.</p>
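				<p>The hypothetical skipping of a constitutive exon and the comparison of domain hits can be sketched in Python. This is only an outline under stated assumptions: exons are given as a list of coding sequences, the hmmpfam search is not shown, and the domain hits are passed in as plain collections of family names. All function names and the placeholder family names in the usage lines are illustrative.</p>
				<p><![CDATA[
```python
def skip_exon(exons, index):
    """Return the coding sequence with the internal exon at `index` removed."""
    if index <= 0 or index >= len(exons) - 1:
        raise ValueError("only internal exons can be skipped")
    if len(exons[index]) % 3 != 0:
        raise ValueError("exon length must be divisible by three")
    return "".join(exons[:index] + exons[index + 1:])


def gained_domains(hits_full, hits_short):
    """Families reported only for the shorter (exon-skipped) protein."""
    return set(hits_short) - set(hits_full)


# Usage with made-up sequences and placeholder family names.
short_cds = skip_exon(["ATGGCT", "GCTGCT", "GCTTAA"], 1)
new_hits = gained_domains({"family_A"}, {"family_A", "family_B"})
```
]]></p>
				<p>A non-empty result from <b>gained_domains</b> would correspond to a case in which skipping the exon creates an additional domain hit.</p>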
			</sec>
		</sec>
		<sec>
			<st>
				<p>Additional data files</p>
			</st>
			<p>The following additional data are available with the online version of this paper. Additional data file <supplr sid="s1">1</supplr> is a table listing the TM domains that are encoded by two exons. Additional data file <supplr sid="s2">2</supplr> contains the number of ESTs/RefSeqs and information about the tissues or libraries for both splice variants of the examples. Additional data file <supplr sid="s3">3</supplr> contains the four cases where skipping of a constitutive exon results in a new Pfam domain.</p>
			<suppl id="s1">
				<title>
					<p>Additional File 1</p>
				</title>
				<caption>
					<p>TM domains that are encoded by two exons</p>
				</caption>
				<text>
					<p>TM domains that are encoded by two exons</p>
				</text>
				<file name="gb-2005-6-7-r58-s1.xls">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="s2">
				<title>
					<p>Additional File 2</p>
				</title>
				<caption>
					<p>ESTs/RefSeqs and if available their tissue/library source for the described examples</p>
				</caption>
				<text>
					<p>ESTs/RefSeqs and if available their tissue/library source for the described examples</p>
				</text>
				<file name="gb-2005-6-7-r58-s2.txt">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="s3">
				<title>
					<p>Additional File 3</p>
				</title>
				<caption>
					<p>Pfam creation events by skipping of a constitutive exon</p>
				</caption>
				<text>
					<p>Pfam creation events by skipping of a constitutive exon</p>
				</text>
				<file name="gb-2005-6-7-r58-s3.txt">
					<p>Click here for file</p>
				</file>
			</suppl>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>We thank Anke Busch for helpful comments on the manuscript.</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>Alternative splicing: increasing diversity in the proteomic world.</p>
				</title>
				<aug>
					<au>
						<snm>Graveley</snm>
						<fnm>BR</fnm>
					</au>
				</aug>
				<source>Trends Genet</source>
				<pubdate>2001</pubdate>
				<volume>17</volume>
				<fpage>100</fpage>
				<lpage>107</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0168-9525(00)02176-4</pubid>
						<pubid idtype="pmpid" link="fulltext">11173120</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Alternative splicing: combinatorial output from the genome.</p>
				</title>
				<aug>
					<au>
						<snm>Roberts</snm>
						<fnm>GC</fnm>
					</au>
					<au>
						<snm>Smith</snm>
						<fnm>CWJ</fnm>
					</au>
				</aug>
				<source>Curr Opin Chem Biol</source>
				<pubdate>2002</pubdate>
				<volume>6</volume>
				<fpage>375</fpage>
				<lpage>383</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S1367-5931(02)00320-4</pubid>
						<pubid idtype="pmpid" link="fulltext">12023119</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Widespread occurrence of alternative splicing at NAGNAG acceptors contributes to proteome plasticity.</p>
				</title>
				<aug>
					<au>
						<snm>Hiller</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Huse</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Szafranski</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Jahn</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Hampe</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Schreiber</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Backofen</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Platzer</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2004</pubdate>
				<volume>36</volume>
				<fpage>1255</fpage>
				<lpage>1257</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/ng1469</pubid>
						<pubid idtype="pmpid" link="fulltext">15516930</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Function of alternative splicing.</p>
				</title>
				<aug>
					<au>
						<snm>Stamm</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Ben-Ari</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Rafalska</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Tang</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Toiber</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Thanaraj</snm>
						<fnm>TA</fnm>
					</au>
					<au>
						<snm>Soreq</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Gene</source>
				<pubdate>2005</pubdate>
				<volume>344</volume>
				<fpage>1</fpage>
				<lpage>20</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.gene.2004.10.022</pubid>
						<pubid idtype="pmpid" link="fulltext">15656968</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>Evidence for the widespread coupling of alternative splicing and nonsense-mediated mRNA decay in humans.</p>
				</title>
				<aug>
					<au>
						<snm>Lewis</snm>
						<fnm>BP</fnm>
					</au>
					<au>
						<snm>Green</snm>
						<fnm>RE</fnm>
					</au>
					<au>
						<snm>Brenner</snm>
						<fnm>SE</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2003</pubdate>
				<volume>100</volume>
				<fpage>189</fpage>
				<lpage>192</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">140922</pubid>
						<pubid idtype="pmpid" link="fulltext">12502788</pubid>
						<pubid idtype="doi">10.1073/pnas.0136770100</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Autoregulation of polypyrimidine tract binding protein by alternative splicing leading to nonsense-mediated decay.</p>
				</title>
				<aug>
					<au>
						<snm>Wollerton</snm>
						<fnm>MC</fnm>
					</au>
					<au>
						<snm>Gooding</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Wagner</snm>
						<fnm>EJ</fnm>
					</au>
					<au>
						<snm>Garcia-Blanco</snm>
						<fnm>MA</fnm>
					</au>
					<au>
						<snm>Smith</snm>
						<fnm>CWJ</fnm>
					</au>
				</aug>
				<source>Mol Cell</source>
				<pubdate>2004</pubdate>
				<volume>13</volume>
				<fpage>91</fpage>
				<lpage>100</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S1097-2765(03)00502-1</pubid>
						<pubid idtype="pmpid" link="fulltext">14731397</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>Alternative splicing in disease and therapy.</p>
				</title>
				<aug>
					<au>
						<snm>Garcia-Blanco</snm>
						<fnm>MA</fnm>
					</au>
					<au>
						<snm>Baraniak</snm>
						<fnm>AP</fnm>
					</au>
					<au>
						<snm>Lasda</snm>
						<fnm>EL</fnm>
					</au>
				</aug>
				<source>Nat Biotechnol</source>
				<pubdate>2004</pubdate>
				<volume>22</volume>
				<fpage>535</fpage>
				<lpage>546</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nbt964</pubid>
						<pubid idtype="pmpid" link="fulltext">15122293</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>Increase of functional diversity by alternative splicing.</p>
				</title>
				<aug>
					<au>
						<snm>Kriventseva</snm>
						<fnm>EV</fnm>
					</au>
					<au>
						<snm>Koch</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Apweiler</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Vingron</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Bork</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Gelfand</snm>
						<fnm>MS</fnm>
					</au>
					<au>
						<snm>Sunyaev</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Trends Genet</source>
				<pubdate>2003</pubdate>
				<volume>19</volume>
				<fpage>124</fpage>
				<lpage>128</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0168-9525(03)00023-4</pubid>
						<pubid idtype="pmpid" link="fulltext">12615003</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>Large scale study of protein domain distribution in the context of alternative splicing.</p>
				</title>
				<aug>
					<au>
						<snm>Liu</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Altman</snm>
						<fnm>RB</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2003</pubdate>
				<volume>31</volume>
				<fpage>4828</fpage>
				<lpage>4835</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">169920</pubid>
						<pubid idtype="pmpid" link="fulltext">12907725</pubid>
						<pubid idtype="doi">10.1093/nar/gkg668</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Assessing the impact of alternative splicing on domain interactions in the human proteome.</p>
				</title>
				<aug>
					<au>
						<snm>Resch</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Xing</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Modrek</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Gorlick</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Riley</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Lee</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>J Proteome Res</source>
				<pubdate>2004</pubdate>
				<volume>3</volume>
				<fpage>76</fpage>
				<lpage>83</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1021/pr034064v</pubid>
						<pubid idtype="pmpid">14998166</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>Membrane protein secretases.</p>
				</title>
				<aug>
					<au>
						<snm>Hooper</snm>
						<fnm>NM</fnm>
					</au>
					<au>
						<snm>Karran</snm>
						<fnm>EH</fnm>
					</au>
					<au>
						<snm>Turner</snm>
						<fnm>AJ</fnm>
					</au>
				</aug>
				<source>Biochem J</source>
				<pubdate>1997</pubdate>
				<volume>321</volume>
				<fpage>265</fpage>
				<lpage>279</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">9020855</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>Widespread production of novel soluble protein isoforms by alternative splicing removal of transmembrane anchoring domains.</p>
				</title>
				<aug>
					<au>
						<snm>Xing</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Xu</snm>
						<fnm>Q</fnm>
					</au>
					<au>
						<snm>Lee</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>FEBS Lett</source>
				<pubdate>2003</pubdate>
				<volume>555</volume>
				<fpage>572</fpage>
				<lpage>578</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0014-5793(03)01354-1</pubid>
						<pubid idtype="pmpid" link="fulltext">14675776</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>The effects of alternative splicing on transmembrane proteins in the mouse genome.</p>
				</title>
				<aug>
					<au>
						<snm>Cline</snm>
						<fnm>MS</fnm>
					</au>
					<au>
						<snm>Shigeta</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Wheeler</snm>
						<fnm>RL</fnm>
					</au>
					<au>
						<snm>Siani-Rose</snm>
						<fnm>MA</fnm>
					</au>
					<au>
						<snm>Kulp</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Loraine</snm>
						<fnm>AE</fnm>
					</au>
				</aug>
				<source>Pacific Symposium on Biocomputing: January 6-10 2004; Hawaii</source>
				<pubdate>2004</pubdate>
				<fpage>17</fpage>
				<lpage>28</lpage>
			</bibl>
			<bibl id="B14">
				<title>
					<p>Splice variants of G protein-coupled receptors.</p>
				</title>
				<aug>
					<au>
						<snm>Minneman</snm>
						<fnm>KP</fnm>
					</au>
				</aug>
				<source>Mol Interv</source>
				<pubdate>2001</pubdate>
				<volume>1</volume>
				<fpage>108</fpage>
				<lpage>116</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">14993330</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B15">
				<title>
					<p>A conformational switch in the Piccolo C(2)A domain regulated by alternative splicing.</p>
				</title>
				<aug>
					<au>
						<snm>Garcia</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Gerber</snm>
						<fnm>SH</fnm>
					</au>
					<au>
						<snm>Sugita</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Sudhof</snm>
						<fnm>TC</fnm>
					</au>
					<au>
						<snm>Rizo</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Nat Struct Mol Biol</source>
				<pubdate>2004</pubdate>
				<volume>11</volume>
				<fpage>45</fpage>
				<lpage>53</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nsmb707</pubid>
						<pubid idtype="pmpid" link="fulltext">14718922</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B16">
				<title>
					<p>Two splice variants of a tyrosine phosphatase differ in substrate specificity, DNA binding, and subcellular location.</p>
				</title>
				<aug>
					<au>
						<snm>Kamatkar</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Radha</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Nambirajan</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Reddy</snm>
						<fnm>RS</fnm>
					</au>
					<au>
						<snm>Swarup</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>J Biol Chem</source>
				<pubdate>1996</pubdate>
				<volume>271</volume>
				<fpage>26755</fpage>
				<lpage>26761</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1074/jbc.271.43.26755</pubid>
						<pubid idtype="pmpid" link="fulltext">8900155</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<title>
					<p>Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes.</p>
				</title>
				<aug>
					<au>
						<snm>Krogh</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Larsson</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>von Heijne</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Sonnhammer</snm>
						<fnm>EL</fnm>
					</au>
				</aug>
				<source>J Mol Biol</source>
				<pubdate>2001</pubdate>
				<volume>305</volume>
				<fpage>567</fpage>
				<lpage>580</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/jmbi.2000.4315</pubid>
						<pubid idtype="pmpid" link="fulltext">11152613</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B18">
				<title>
					<p>Signals and their transduction pathways regulating alternative splicing: a new dimension of the human genome.</p>
				</title>
				<aug>
					<au>
						<snm>Stamm</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Hum Mol Genet</source>
				<pubdate>2002</pubdate>
				<volume>11</volume>
				<fpage>2409</fpage>
				<lpage>2416</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/hmg/11.20.2409</pubid>
						<pubid idtype="pmpid" link="fulltext">12351576</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<title>
					<p>O-GLYCBASE version 4.0: a revised database of O-glycosylated proteins.</p>
				</title>
				<aug>
					<au>
						<snm>Gupta</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Birch</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Rapacki</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Brunak</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Hansen</snm>
						<fnm>JE</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>1999</pubdate>
				<volume>27</volume>
				<fpage>370</fpage>
				<lpage>372</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">148187</pubid>
						<pubid idtype="pmpid" link="fulltext">9847232</pubid>
						<pubid idtype="doi">10.1093/nar/27.1.370</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B20">
				<title>
					<p>Phospho.ELM: a database of experimentally verified phosphorylation sites in eukaryotic proteins.</p>
				</title>
				<aug>
					<au>
						<snm>Diella</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Cameron</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Gemund</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Linding</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Via</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Kuster</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Sicheritz-Ponten</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Blom</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Gibson</snm>
						<fnm>TJ</fnm>
					</au>
				</aug>
				<source>BMC Bioinformatics</source>
				<pubdate>2004</pubdate>
				<volume>5</volume>
				<fpage>79</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">449700</pubid>
						<pubid idtype="pmpid" link="fulltext">15212693</pubid>
						<pubid idtype="doi">10.1186/1471-2105-5-79</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p>NetOglyc: prediction of mucin type O-glycosylation sites based on sequence context and surface accessibility.</p>
				</title>
				<aug>
					<au>
						<snm>Hansen</snm>
						<fnm>JE</fnm>
					</au>
					<au>
						<snm>Lund</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Tolstrup</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Gooley</snm>
						<fnm>AA</fnm>
					</au>
					<au>
						<snm>Williams</snm>
						<fnm>KL</fnm>
					</au>
					<au>
						<snm>Brunak</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Glycoconj J</source>
				<pubdate>1998</pubdate>
				<volume>15</volume>
				<fpage>115</fpage>
				<lpage>130</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1023/A:1006960004440</pubid>
						<pubid idtype="pmpid" link="fulltext">9557871</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B22">
				<title>
					<p>Sequence and structure-based prediction of eukaryotic protein phosphorylation sites.</p>
				</title>
				<aug>
					<au>
						<snm>Blom</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Gammeltoft</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Brunak</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>J Mol Biol</source>
				<pubdate>1999</pubdate>
				<volume>294</volume>
				<fpage>1351</fpage>
				<lpage>1362</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/jmbi.1999.3310</pubid>
						<pubid idtype="pmpid" link="fulltext">10600390</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B23">
				<title>
					<p>The Pfam protein families database.</p>
				</title>
				<aug>
					<au>
						<snm>Bateman</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Coin</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Durbin</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Finn</snm>
						<fnm>RD</fnm>
					</au>
					<au>
						<snm>Hollich</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Griffiths-Jones</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Khanna</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Marshall</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Moxon</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Sonnhammer</snm>
						<fnm>ELL</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2004</pubdate>
				<volume>32</volume>
				<issue>Database issue</issue>
				<fpage>D138</fpage>
				<lpage>D141</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">308855</pubid>
						<pubid idtype="pmpid" link="fulltext">14681378</pubid>
						<pubid idtype="doi">10.1093/nar/gkh121</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<title>
					<p>Efficient prediction of alternative splice forms using protein domain homology.</p>
				</title>
				<aug>
					<au>
						<snm>Hiller</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Backofen</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Heymann</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Busch</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Glaesser</snm>
						<fnm>TM</fnm>
					</au>
					<au>
						<snm>Freytag</snm>
						<fnm>J-C</fnm>
					</au>
				</aug>
				<source>In Silico Biol</source>
				<pubdate>2004</pubdate>
				<volume>4</volume>
				<fpage>195</fpage>
				<lpage>208</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid">15107023</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B25">
				<title>
					<p>How prevalent is functional alternative splicing in the human genome?</p>
				</title>
				<aug>
					<au>
						<snm>Sorek</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Shamir</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Ast</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Trends Genet</source>
				<pubdate>2004</pubdate>
				<volume>20</volume>
				<fpage>68</fpage>
				<lpage>71</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.tig.2003.12.004</pubid>
						<pubid idtype="pmpid" link="fulltext">14746986</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B26">
				<title>
					<p>Discovery of novel splice forms and functional analysis of cancer-specific alternative splicing in human expressed sequences.</p>
				</title>
				<aug>
					<au>
						<snm>Xu</snm>
						<fnm>Q</fnm>
					</au>
					<au>
						<snm>Lee</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2003</pubdate>
				<volume>31</volume>
				<fpage>5635</fpage>
				<lpage>5643</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">206480</pubid>
						<pubid idtype="pmpid" link="fulltext">14500827</pubid>
						<pubid idtype="doi">10.1093/nar/gkg786</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B27">
				<title>
					<p>ELM server: a new resource for investigating short functional sites in modular eukaryotic proteins.</p>
				</title>
				<aug>
					<au>
						<snm>Puntervoll</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Linding</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Gemund</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Chabanis-Davidson</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Mattingsdal</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Cameron</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Martin</snm>
						<fnm>DMA</fnm>
					</au>
					<au>
						<snm>Ausiello</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Brannetti</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Costantini</snm>
						<fnm>A</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2003</pubdate>
				<volume>31</volume>
				<fpage>3625</fpage>
				<lpage>3630</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">168952</pubid>
						<pubid idtype="pmpid" link="fulltext">12824381</pubid>
						<pubid idtype="doi">10.1093/nar/gkg545</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B28">
				<title>
					<p>Novel protein kinase C delta isoform insensitive to caspase-3.</p>
				</title>
				<aug>
					<au>
						<snm>Sakurai</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Onishi</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Tanimoto</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Kizaki</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Biol Pharm Bull</source>
				<pubdate>2001</pubdate>
				<volume>24</volume>
				<fpage>973</fpage>
				<lpage>977</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1248/bpb.24.973</pubid>
						<pubid idtype="pmpid" link="fulltext">11558579</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B29">
				<title>
					<p>Systematic identification and analysis of exonic splicing silencers.</p>
				</title>
				<aug>
					<au>
						<snm>Wang</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Rolish</snm>
						<fnm>ME</fnm>
					</au>
					<au>
						<snm>Yeo</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Tung</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Mawson</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Burge</snm>
						<fnm>CB</fnm>
					</au>
				</aug>
				<source>Cell</source>
				<pubdate>2004</pubdate>
				<volume>119</volume>
				<fpage>831</fpage>
				<lpage>845</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.cell.2004.11.010</pubid>
						<pubid idtype="pmpid" link="fulltext">15607979</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B30">
				<title>
					<p>Translational regulation of human neuronal nitric-oxide synthase by an alternatively spliced 5'-untranslated region leader exon.</p>
				</title>
				<aug>
					<au>
						<snm>Newton</snm>
						<fnm>DC</fnm>
					</au>
					<au>
						<snm>Bevan</snm>
						<fnm>SC</fnm>
					</au>
					<au>
						<snm>Choi</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Robb</snm>
						<fnm>GB</fnm>
					</au>
					<au>
						<snm>Millar</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Marsden</snm>
						<fnm>PA</fnm>
					</au>
				</aug>
				<source>J Biol Chem</source>
				<pubdate>2003</pubdate>
				<volume>278</volume>
				<fpage>636</fpage>
				<lpage>644</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1074/jbc.M209988200</pubid>
						<pubid idtype="pmpid" link="fulltext">12403769</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B31">
				<title>
					<p>Alu-containing exons are alternatively spliced.</p>
				</title>
				<aug>
					<au>
						<snm>Sorek</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Ast</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Graur</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2002</pubdate>
				<volume>12</volume>
				<fpage>1060</fpage>
				<lpage>1067</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">186627</pubid>
						<pubid idtype="pmpid" link="fulltext">12097342</pubid>
						<pubid idtype="doi">10.1101/gr.229302</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B32">
				<title>
					<p>Evolution of alternative splicing: deletions, insertions and origin of functional parts of proteins from intron sequences.</p>
				</title>
				<aug>
					<au>
						<snm>Kondrashov</snm>
						<fnm>FA</fnm>
					</au>
					<au>
						<snm>Koonin</snm>
						<fnm>EV</fnm>
					</au>
				</aug>
				<source>Trends Genet</source>
				<pubdate>2003</pubdate>
				<volume>19</volume>
				<fpage>115</fpage>
				<lpage>119</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0168-9525(02)00029-X</pubid>
						<pubid idtype="pmpid" link="fulltext">12615001</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B33">
				<title>
					<p>Alternative splicing in the human, mouse and rat genomes is associated with an increased frequency of exon creation and/or loss.</p>
				</title>
				<aug>
					<au>
						<snm>Modrek</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Lee</snm>
						<fnm>CJ</fnm>
					</au>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2003</pubdate>
				<volume>34</volume>
				<fpage>177</fpage>
				<lpage>180</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/ng1159</pubid>
						<pubid idtype="pmpid" link="fulltext">12730695</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B34">
				<title>
					<p>How did alternative splicing evolve?</p>
				</title>
				<aug>
					<au>
						<snm>Ast</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Nat Rev Genet</source>
				<pubdate>2004</pubdate>
				<volume>5</volume>
				<fpage>773</fpage>
				<lpage>782</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nrg1451</pubid>
						<pubid idtype="pmpid" link="fulltext">15510168</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B35">
				<title>
					<p>A non-EST-based method for exon-skipping prediction.</p>
				</title>
				<aug>
					<au>
						<snm>Sorek</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Shemesh</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Cohen</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Basechess</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Ast</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Shamir</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2004</pubdate>
				<volume>14</volume>
				<fpage>1617</fpage>
				<lpage>1623</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">509271</pubid>
						<pubid idtype="pmpid" link="fulltext">15289480</pubid>
						<pubid idtype="doi">10.1101/gr.2572604</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B36">
				<title>
					<p>SpliceInfo: an information repository for mRNA alternative splicing in human genome.</p>
				</title>
				<aug>
					<au>
						<snm>Huang</snm>
						<fnm>H-D</fnm>
					</au>
					<au>
						<snm>Horng</snm>
						<fnm>J-T</fnm>
					</au>
					<au>
						<snm>Lin</snm>
						<fnm>F-M</fnm>
					</au>
					<au>
						<snm>Chang</snm>
						<fnm>Y-C</fnm>
					</au>
					<au>
						<snm>Huang</snm>
						<fnm>C-C</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2005</pubdate>
				<volume>33</volume>
				<issue>Database Issue</issue>
				<fpage>D80</fpage>
				<lpage>D85</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">540083</pubid>
						<pubid idtype="pmpid" link="fulltext">15608290</pubid>
						<pubid idtype="doi">10.1093/nar/gki129</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B37">
				<title>
					<p>Human RefSeq Database</p>
				</title>
				<url>http://hgdownload.cse.ucsc.edu/goldenPath/hg16/database/refGene.txt.gz</url>
			</bibl>
			<bibl id="B38">
				<title>
					<p>Human Fraction of dbEST</p>
				</title>
				<url>ftp://ftp.ncbi.nlm.nih.gov/blast/db/FASTA/est_human.gz</url>
			</bibl>
			<bibl id="B39">
				<title>
					<p>Evaluation of methods for the prediction of membrane spanning regions.</p>
				</title>
				<aug>
					<au>
						<snm>Moller</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Croning</snm>
						<fnm>MD</fnm>
					</au>
					<au>
						<snm>Apweiler</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2001</pubdate>
				<volume>17</volume>
				<fpage>646</fpage>
				<lpage>653</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/17.7.646</pubid>
						<pubid idtype="pmpid" link="fulltext">11448883</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B40">
				<title>
					<p>SwissProt and RefSeq IDs</p>
				</title>
				<pubdate>2001</pubdate>
				<url>http://www.gene.ucl.ac.uk/public-files/nomen/ens1.txt</url>
			</bibl>
		</refgrp>
	</bm>
</art>
