<?xml version = '1.0' encoding = 'UTF-8'?>
<?xml-stylesheet href="/rss/styledrssBMC.css" type="text/css"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns="http://purl.org/rss/1.0/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:extra="http://www.biomedcentral.com/xml/schemas/extra/" xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/" xmlns:cc="http://web.resource.org/cc/">
	<channel rdf:about="http://www.biomedcentral.com/rss">
		<extra:info rdf:parseType="Literal">
			<html:div xmlns:html="http://www.w3.org/1999/xhtml" style="font:14px Verdana, Geneva, Arial, Helvetica, sans-serif">
				<html:span style="font-weight:bold">This is an RSS newsfeed from BioMed Central</html:span>
				<html:br/>
				<html:span style="font-size: 12px;">It is intended to be used with an RSS reader. For more information about RSS newsfeeds from BioMed Central, visit <html:br/><html:a href="http://www.biomedcentral.com/info/about/rss/" style="color:#3333CC; font-size:12px;">http://www.biomedcentral.com/info/about/rss/</html:a><html:br/>
				</html:span>
			</html:div>
		</extra:info>
		<title>BMC Bioinformatics - Most viewed articles</title>
		<link>http://www.biomedcentral.com/bmcbioinformatics/mostviewed/</link>
		<description>Most viewed articles in last 30 days from BMC Bioinformatics (ISSN 1471-2105) published by 
				
				BioMed Central
		</description>
        <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
         <items>
            <rdf:Seq>
            
				    <rdf:li rdf:resource="http://www.biomedcentral.com/1471-2105/9/353"/>			    
            
				    <rdf:li rdf:resource="http://www.biomedcentral.com/1471-2105/8/S2"/>			    
            
				    <rdf:li rdf:resource="http://www.biomedcentral.com/1471-2105/9/284"/>			    
            
				    <rdf:li rdf:resource="http://www.biomedcentral.com/1471-2105/9/342"/>			    
            
				    <rdf:li rdf:resource="http://www.biomedcentral.com/1471-2105/9/337"/>			    
            
				    <rdf:li rdf:resource="http://www.biomedcentral.com/1471-2105/9/350"/>			    
            
				    <rdf:li rdf:resource="http://www.biomedcentral.com/1471-2105/9/292"/>			    
            
				    <rdf:li rdf:resource="http://www.biomedcentral.com/1471-2105/9/348"/>			    
            
				    <rdf:li rdf:resource="http://www.biomedcentral.com/1471-2105/9/355"/>			    
            
				    <rdf:li rdf:resource="http://www.biomedcentral.com/1471-2105/9/351"/>			    
            
            </rdf:Seq>
        </items>
    </channel>
    
		<item rdf:about="http://www.biomedcentral.com/1471-2105/9/353">
            
            <title>Identification and correction of abnormal, incomplete and mispredicted proteins in public databases</title>
			<description>Background:
Despite significant improvements in computational annotation of genomes, sequences of abnormal, incomplete or incorrectly predicted genes and proteins remain abundant in public databases. Since the majority of incomplete, abnormal or mispredicted entries are not annotated as such, these errors seriously affect the reliability of these databases. Here we describe the MisPred approach that may provide an efficient means for the quality control of databases. The current version of the MisPred approach uses five distinct routines for identifying abnormal, incomplete or mispredicted entries based on the principle that a sequence is likely to be incorrect if some of its features conflict with our current knowledge about protein-coding genes and proteins: (i) conflict between the predicted subcellular localization of proteins and the absence of the corresponding sequence signals; (ii) presence of extracellular and cytoplasmic domains and the absence of transmembrane segments; (iii) co-occurrence of extracellular and nuclear domains; (iv) violation of domain integrity; (v) chimeras encoded by two or more genes located on different chromosomes.
Results:
Analyses of predicted EnsEMBL protein sequences of nine deuterostome (Homo sapiens, Mus musculus, Rattus norvegicus, Monodelphis domestica, Gallus gallus, Xenopus tropicalis, Fugu rubripes, Danio rerio and Ciona intestinalis) and two protostome species (Caenorhabditis elegans and Drosophila melanogaster) have revealed that the absence of expected signal peptides and violation of domain integrity account for the majority of mispredictions. Analyses of sequences predicted by NCBI's GNOMON annotation pipeline show that the rates of mispredictions are comparable to those of EnsEMBL. Interestingly, even the manually curated UniProtKB/Swiss-Prot dataset is contaminated with mispredicted or abnormal proteins, although to a much lesser extent than UniProtKB/TrEMBL or the EnsEMBL or GNOMON-predicted entries.
Conclusions:
MisPred works efficiently in identifying errors in predictions generated by the most reliable gene prediction tools such as the EnsEMBL and NCBI's GNOMON pipelines and also guides the correction of errors. We suggest that application of the MisPred approach will significantly improve the quality of gene predictions and the associated databases.</description>
			<link>http://www.biomedcentral.com/1471-2105/9/353</link>		
			<dc:creator>Alinda Nagy, Hedi Hegyi, Krisztina Farkas, Hedvig Tordai, Evelin Kozma, Laszlo Banyai and Laszlo Patthy</dc:creator>
			<dc:source>BMC Bioinformatics 2008, 9:353</dc:source>
			<dc:subject>Number of accesses: 1546</dc:subject>
			<dc:date>2008-08-27</dc:date>
			<dc:identifier>doi:10.1186/1471-2105-9-353</dc:identifier>
			
			
							
					<prism:publicationName>BMC Bioinformatics</prism:publicationName>
					
			
							
					<prism:issn>1471-2105</prism:issn>
					
			
							
					<prism:volume>9</prism:volume>
					
			
							
					<prism:startingPage>353</prism:startingPage>
					
			
							
					<prism:publicationDate>2008-08-27</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
	
		<item rdf:about="http://www.biomedcentral.com/1471-2105/8/S2">
            
            <title>Advancing translational research with the Semantic Web</title>
			<description>Background:
A fundamental goal of the U.S. National Institute of Health (NIH) "Roadmap" is to strengthen Translational Research, defined as the movement of discoveries in basic research to application at the clinical level. A significant barrier to translational research is the lack of uniformly structured data across related biomedical domains. The Semantic Web is an extension of the current Web that enables navigation and meaningful use of digital resources by automatic processes. It is based on common formats that support aggregation and integration of data drawn from diverse sources. A variety of technologies have been built on this foundation that, together, support identifying, representing, and reasoning across a wide range of biomedical data. The Semantic Web Health Care and Life Sciences Interest Group (HCLSIG), set up within the framework of the World Wide Web Consortium, was launched to explore the application of these technologies in a variety of areas. Subgroups focus on making biomedical data available in RDF, working with biomedical ontologies, prototyping clinical decision support systems, working on drug safety and efficacy communication, and supporting disease researchers navigating and annotating the large amount of potentially relevant literature.
Results:
We present a scenario that shows the value of the information environment the Semantic Web can support for aiding neuroscience researchers. We then report on several projects by members of the HCLSIG, in the process illustrating the range of Semantic Web technologies that have applications in areas of biomedicine.
Conclusion:
Semantic Web technologies present both promise and challenges. Current tools and standards are already adequate to implement components of the bench-to-bedside vision. On the other hand, these technologies are young. Gaps in standards and implementations still exist and adoption is limited by typical problems with early technology, such as the need for a critical mass of practitioners and installed base, and growing pains as the technology is scaled up. Still, the potential of interoperable knowledge sources for biomedicine, at the scale of the World Wide Web, merits continued work.</description>
			<link>http://www.biomedcentral.com/1471-2105/8/S2</link>		
			<dc:creator>Alan Ruttenberg, Tim Clark, William Bug, Matthias Samwald, Olivier Bodenreider, Helen Chen, Donald Doherty, Kerstin Forsberg, Yong Gao, Vipul Kashyap, June Kinoshita, Joanne Luciano, M Scott Marshall, Chimezie Ogbuji, Jonathan Rees, Susie Stephens, Gwendolyn T Wong, Elizabeth Wu, Davide Zaccagnini, Tonya Hongsermeier, Eric Neumann, Ivan Herman and Kei-Hoi Cheung</dc:creator>
			<dc:source>BMC Bioinformatics 2007, 8:S2</dc:source>
			<dc:subject>Number of accesses: 1411</dc:subject>
			<dc:date>2007-05-09</dc:date>
			<dc:identifier>doi:10.1186/1471-2105-8-S3-S2</dc:identifier>
			
			
							
					<prism:publicationName>BMC Bioinformatics</prism:publicationName>
					
			
							
					<prism:issn>1471-2105</prism:issn>
					
			
							
					<prism:volume>8</prism:volume>
					
			
							
					<prism:startingPage>S2</prism:startingPage>
					
			
							
					<prism:publicationDate>2007-05-09</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
	
		<item rdf:about="http://www.biomedcentral.com/1471-2105/9/284">
            
            <title>Methods for evaluating gene expression from Affymetrix microarray datasets</title>
			<description>Background:
Affymetrix high density oligonucleotide expression arrays are widely used across all fields of biological research for measuring genome-wide gene expression. An important step in processing oligonucleotide microarray data is to produce a single value for the gene expression level of an RNA transcript using one of a growing number of statistical methods. The challenge for the researcher is to decide on the most appropriate method to use to address a specific biological question with a given dataset. Although several research efforts have focused on assessing performance of a few methods in evaluating gene expression from RNA hybridization experiments with different datasets, the relative merits of the methods currently available in the literature for evaluating genome-wide gene expression from Affymetrix microarray data collected from real biological experiments remain actively debated.
Results:
The present study reports a comprehensive survey of the performance of all seven commonly used methods in evaluating genome-wide gene expression from a well-designed experiment using Affymetrix microarrays. The experiment profiled eight genetically divergent barley cultivars each with three biological replicates. The dataset so obtained confers a balanced and idealized structure for the present analysis. The methods were evaluated on their sensitivity for detecting differentially expressed genes, reproducibility of expression values across replicates, and consistency in calling differentially expressed genes. The number of genes detected as differentially expressed among methods differed by a factor of two or more at a given false discovery rate (FDR) level. Moreover, we propose the use of genes containing single feature polymorphisms (SFPs) as an empirical test for comparison among methods for the ability to detect true differential gene expression on the basis that SFPs largely correspond to cis-acting expression regulators. The PDNN method demonstrated superiority over all other methods in every comparison, whilst the default Affymetrix MAS5.0 method was clearly inferior.
Conclusion:
A comprehensive assessment of seven commonly used data extraction methods based on an extensive barley Affymetrix gene expression dataset has shown that the PDNN method has superior performance for the detection of differentially expressed genes.</description>
			<link>http://www.biomedcentral.com/1471-2105/9/284</link>		
			<dc:creator>Ning Jiang, Lindsey J Leach, Xiaohua Hu, Elena Potokina, Tianye Jia, Arnis Druka, Robbie Waugh, Michael J Kearsey and Zewei W Luo</dc:creator>
			<dc:source>BMC Bioinformatics 2008, 9:284</dc:source>
			<dc:subject>Number of accesses: 921</dc:subject>
			<dc:date>2008-06-17</dc:date>
			<dc:identifier>doi:10.1186/1471-2105-9-284</dc:identifier>
			
			
							
					<prism:publicationName>BMC Bioinformatics</prism:publicationName>
					
			
							
					<prism:issn>1471-2105</prism:issn>
					
			
							
					<prism:volume>9</prism:volume>
					
			
							
					<prism:startingPage>284</prism:startingPage>
					
			
							
					<prism:publicationDate>2008-06-17</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
	
		<item rdf:about="http://www.biomedcentral.com/1471-2105/9/342">
            
            <title>SBML-SAT: a systems biology markup language (SBML) based sensitivity analysis tool</title>
			<description>Background:
It has long been recognized that sensitivity analysis plays a key role in modeling and analyzing cellular and biochemical processes. Systems biology markup language (SBML) has become a well-known platform for coding and sharing mathematical models of such processes. However, current SBML compatible software tools are limited in their ability to perform global sensitivity analyses of these models.
Results:
This work introduces a freely downloadable, software package, SBML-SAT, which implements algorithms for simulation, steady state analysis, robustness analysis and local and global sensitivity analysis for SBML models. This software tool extends current capabilities through its execution of global sensitivity analyses using multi-parametric sensitivity analysis, partial rank correlation coefficient, SOBOL's method, and weighted average of local sensitivity analyses in addition to its ability to handle systems with discontinuous events and intuitive graphical user interface.
Conclusion:
SBML-SAT provides the community of systems biologists a new tool for the analysis of their SBML models of biochemical and cellular processes.</description>
			<link>http://www.biomedcentral.com/1471-2105/9/342</link>		
			<dc:creator>Zhike Zi, Yanan Zheng, Ann E Rundell and Edda Klipp</dc:creator>
			<dc:source>BMC Bioinformatics 2008, 9:342</dc:source>
			<dc:subject>Number of accesses: 883</dc:subject>
			<dc:date>2008-08-15</dc:date>
			<dc:identifier>doi:10.1186/1471-2105-9-342</dc:identifier>
			
			
							
					<prism:publicationName>BMC Bioinformatics</prism:publicationName>
					
			
							
					<prism:issn>1471-2105</prism:issn>
					
			
							
					<prism:volume>9</prism:volume>
					
			
							
					<prism:startingPage>342</prism:startingPage>
					
			
							
					<prism:publicationDate>2008-08-15</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
	
		<item rdf:about="http://www.biomedcentral.com/1471-2105/9/337">
            
            <title>Methylation Linear Discriminant Analysis (MLDA) for identifying differentially methylated CpG islands</title>
			<description>Background:
Hypermethylation of promoter CpG islands is strongly correlated to transcriptional gene silencing and epigenetic maintenance of the silenced state. As well as its role in tumor development, CpG island methylation contributes to the acquisition of resistance to chemotherapy. Differential Methylation Hybridisation (DMH) is one technique used for genome-wide DNA methylation analysis. The study of such microarray data sets should ideally account for the specific biological features of DNA methylation and the non-symmetrical distribution of the ratios of unmethylated and methylated sequences hybridised on the array. We have therefore developed a novel algorithm tailored to this type of data, Methylation Linear Discriminant Analysis (MLDA).
Results:
MLDA was programmed in R (version 2.7.0) and the package is available at CRAN 1. This approach utilizes linear regression models of non-normalised hybridisation data to define methylation status. Log-transformed signal intensities of unmethylated controls on the microarray are used as a reference. The signal intensities of DNA samples digested with methylation sensitive restriction enzymes and mock digested are then transformed to the likelihood of a locus being methylated using this reference. We tested the ability of MLDA to identify loci differentially methylated as analysed by DMH between cisplatin sensitive and resistant ovarian cancer cell lines. MLDA identified 115 differentially methylated loci and 23 out of 26 of these loci have been independently validated by Methylation Specific PCR and/or bisulphite pyrosequencing.
Conclusion:
MLDA has advantages for analyzing methylation data from CpG island microarrays, since there is a clear rational for the definition of methylation status, it uses DMH data without between-group normalisation and is less influenced by cross-hybridisation of loci. The MLDA algorithm successfully identified differentially methylated loci between two classes of samples analysed by DMH using CpG island microarrays.</description>
			<link>http://www.biomedcentral.com/1471-2105/9/337</link>		
			<dc:creator>Wei Dai, Jens M Teodoridis, Janet Graham, Constanze Zeller, Tim HM Huang, Pearlly Yan, J Keith Vass, Robert Brown and Jim Paul</dc:creator>
			<dc:source>BMC Bioinformatics 2008, 9:337</dc:source>
			<dc:subject>Number of accesses: 865</dc:subject>
			<dc:date>2008-08-08</dc:date>
			<dc:identifier>doi:10.1186/1471-2105-9-337</dc:identifier>
			
			
							
					<prism:publicationName>BMC Bioinformatics</prism:publicationName>
					
			
							
					<prism:issn>1471-2105</prism:issn>
					
			
							
					<prism:volume>9</prism:volume>
					
			
							
					<prism:startingPage>337</prism:startingPage>
					
			
							
					<prism:publicationDate>2008-08-08</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
	
		<item rdf:about="http://www.biomedcentral.com/1471-2105/9/350">
            
            <title>Integration of relational and hierarchical network         information for protein function prediction</title>
			<description>Background:
In the current climate of high-throughput computational biology, the inference of a protein's function from related measurements, such as protein-protein interaction relations, has become a canonical task.  Most existing technologies pursue this task as a classification problem, on a term-by-term basis, for each term in a database, such as the Gene Ontology (GO) database, a popular rigorous vocabulary for biological functions.  However, ontology structures are essentially hierarchies, with certain top to bottom annotation rules which protein function predictions should in principle follow.  Currently, the most common approach to imposing these hierarchical constraints on network-based classifiers is through the use of ad hoc post-processing rules.
Results:
We propose a probabilistic framework to integrate information in relational data, in the form of a protein-protein interaction network, and a hierarchically structured database of terms, in the form of the GO database, for the purpose of protein function prediction.  At the heart of our framework is a factorization of local neighborhood information in the protein-protein interaction network across successive ancestral terms in the GO hierarchy. We introduce a classifier within this framework, with computationally efficient implementation, that produces GO-term predictions that naturally obey a hierarchical `true-path' consistency from root to leaves, without the need for further post-processing.
Conclusion:
A cross-validation study, using data from the yeast Saccharomyces cerevisiae, shows our method offers substantial improvements over both standard `guilt-by-association' (i.e., Nearest-Neighbor) and more refined Markov random field methods, whether in their original form or when post-processed to artificially impose `true-path' consistency.  Further analysis of the results indicates that these improvements are associated with increased predictive capabilities (i.e., increased positive predictive value), and that this increase is consistent uniformly with GO-term depth.  Additional in silico validation on a collection of new annotations recently added to GO confirms the advantages suggested by the cross-validation study. Taken as a whole, our results show that a hierarchical approach to network-based protein function prediction, that exploits the ontological structure of protein annotation databases in a principled manner, can offer substantial advantages over the successive application of `flat' network-based methods.</description>
			<link>http://www.biomedcentral.com/1471-2105/9/350</link>		
			<dc:creator>Xiaoyu Jiang, Naoki Nariai, Martin Steffen, Simon Kasif and Eric D Kolaczyk</dc:creator>
			<dc:source>BMC Bioinformatics 2008, 9:350</dc:source>
			<dc:subject>Number of accesses: 860</dc:subject>
			<dc:date>2008-08-22</dc:date>
			<dc:identifier>doi:10.1186/1471-2105-9-350</dc:identifier>
			
			
							
					<prism:publicationName>BMC Bioinformatics</prism:publicationName>
					
			
							
					<prism:issn>1471-2105</prism:issn>
					
			
							
					<prism:volume>9</prism:volume>
					
			
							
					<prism:startingPage>350</prism:startingPage>
					
			
							
					<prism:publicationDate>2008-08-22</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
	
		<item rdf:about="http://www.biomedcentral.com/1471-2105/9/292">
            
            <title>Estimation and testing for the effect of a genetic pathway on a disease outcome using logistic kernel machine regression via logistic mixed models</title>
			<description>Background:
Growing interest on biological pathways has called for new statistical methods for modeling and testing a genetic pathway effect on a health outcome. The fact that genes within a pathway tend to interact with each other and relate to the outcome in a complicated way makes nonparametric methods more desirable. The kernel machine method provides a convenient, powerful and unified method for multi-dimensional parametric and nonparametric modeling of the pathway effect.
Results:
In this paper we propose a logistic kernel machine regression model for binary outcomes. This model relates the disease risk to covariates parametrically, and to genes within a genetic pathway parametrically or nonparametrically using kernel machines. The nonparametric genetic pathway effect allows for possible interactions among the genes within the same pathway and a complicated relationship of the genetic pathway and the outcome. We show that kernel machine estimation of the model components can be formulated using a logistic mixed model. Estimation hence can proceed within a mixed model framework using standard statistical software. A score test based on a Gaussian process approximation is developed to test for the genetic pathway effect. The methods are illustrated using a prostate cancer data set and evaluated using simulations. An extension to continuous and discrete outcomes using generalized kernel machine models and its connection with generalized linear mixed models is discussed.
Conclusion:
Logistic kernel machine regression and its extension generalized kernel machine regression provide a novel and flexible statistical tool for modeling pathway effects on discrete and continuous outcomes. Their close connection to mixed models and attractive performance make them have promising wide applications in bioinformatics and other biomedical areas.</description>
			<link>http://www.biomedcentral.com/1471-2105/9/292</link>		
			<dc:creator>Dawei Liu, Debashis Ghosh and Xihong Lin</dc:creator>
			<dc:source>BMC Bioinformatics 2008, 9:292</dc:source>
			<dc:subject>Number of accesses: 832</dc:subject>
			<dc:date>2008-06-24</dc:date>
			<dc:identifier>doi:10.1186/1471-2105-9-292</dc:identifier>
			
			
							
					<prism:publicationName>BMC Bioinformatics</prism:publicationName>
					
			
							
					<prism:issn>1471-2105</prism:issn>
					
			
							
					<prism:volume>9</prism:volume>
					
			
							
					<prism:startingPage>292</prism:startingPage>
					
			
							
					<prism:publicationDate>2008-06-24</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
	
		<item rdf:about="http://www.biomedcentral.com/1471-2105/9/348">
            
            <title>Gene Vector Analysis (Geneva): A unified method to detect differentially-regulated gene sets and similar microarray experiments</title>
			<description>Background:
Microarray experiments measure changes in the expression of thousands of genes.  The resulting lists of genes with changes in expression are then searched for biologically related sets using several divergent methods such as the Fisher Exact Test (as used in multiple GO enrichment tools), Parametric Analysis of Gene Expression (PAGE), Gene Set Enrichment Analysis (GSEA), and the connectivity map.  
Results:
We describe an analytical method (Geneva: Gene Vector Analysis) to relate genes to biological properties and to other similar experiments in a uniform way.  This new method works on both gene sets and on gene lists/vectors as input queries, and can effectively query databases consisting of sets of biologically related sets, or of results from other microarray experiments.  We also present an improvement to the null model estimate by using the empirical background distribution drawn from previous experiments.  We validated Geneva by rediscovering a number of previous findings, and by finding significant relationships within microarrays in the GEO repository.  
Conclusions:
Provided a reasonable corpus of previous experiments is available, this method is more accurate than the class label permutation model, especially for data sets with limited number of replicates.  Geneva is, moreover, computationally faster because the background distributions can be precomputed.  We also provide a standard evaluation data set based on 5 pairs of related experiments that should share similar functional relationships and 28 pairs of unrelated experiments from GEO.   Discovering relationships amongst GEO data sets has implications for drug repositioning, and understanding relationships between diseases and drugs. </description>
			<link>http://www.biomedcentral.com/1471-2105/9/348</link>		
			<dc:creator>Stephen W Tanner and Pankaj Agarwal</dc:creator>
			<dc:source>BMC Bioinformatics 2008, 9:348</dc:source>
			<dc:subject>Number of accesses: 811</dc:subject>
			<dc:date>2008-08-22</dc:date>
			<dc:identifier>doi:10.1186/1471-2105-9-348</dc:identifier>
			
			
							
					<prism:publicationName>BMC Bioinformatics</prism:publicationName>
					
			
							
					<prism:issn>1471-2105</prism:issn>
					
			
							
					<prism:volume>9</prism:volume>
					
			
							
					<prism:startingPage>348</prism:startingPage>
					
			
							
					<prism:publicationDate>2008-08-22</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
	
		<item rdf:about="http://www.biomedcentral.com/1471-2105/9/355">
            
            <title>NITPICK: peak identifcation for mass spectrometry data</title>
			<description>Background:
The reliable extraction of features from mass spectra is a fundamental step in the automated analysis of proteomic mass spectrometry (MS) experiments.
Results:
This contribution proposes a sparse template regression approach to peak picking called NITPICK. NITPICK is a Non-greedy, Iterative Template-based peak PICKer that deconvolves complex overlapping isotope distributions in multicomponent mass spectra. NITPICK is based on fractional averagine, a novel extension to Senko's well-known averagine model, and on a modified version of sparse, non-negative least angle regression, for which a suitable, statistically motivated early stopping criterion has been derived. The strength of NITPICK is the deconvolution of overlapping mixture mass spectra.
Conclusions:
Extensive comparative evaluation has been carried out and results are provided for simulated and real-world data sets. NITPICK outperforms pepex, to date the only alternate, publicly available, non-greedy feature extraction routine. NITPICK is available as software package for the R programming language and can be downloaded from http://hci.iwr.uni-heidelberg.de/mip/proteomics/.</description>
			<link>http://www.biomedcentral.com/1471-2105/9/355</link>		
			<dc:creator>Bernhard Y Renard, Marc Kirchner, Hanno Steen, Judith A J Steen and Fred A Hamprecht</dc:creator>
			<dc:source>BMC Bioinformatics 2008, 9:355</dc:source>
			<dc:subject>Number of accesses: 792</dc:subject>
			<dc:date>2008-08-28</dc:date>
			<dc:identifier>doi:10.1186/1471-2105-9-355</dc:identifier>
			
			
							
					<prism:publicationName>BMC Bioinformatics</prism:publicationName>
					
			
							
					<prism:issn>1471-2105</prism:issn>
					
			
							
					<prism:volume>9</prism:volume>
					
			
							
					<prism:startingPage>355</prism:startingPage>
					
			
							
					<prism:publicationDate>2008-08-28</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
	
		<item rdf:about="http://www.biomedcentral.com/1471-2105/9/351">
            
            <title>SynBlast: assisting the analysis of conserved Synteny information</title>
			<description>Motivation:
In the last years more than 20 vertebrate genomes have been sequenced, and the rate at which genomic DNA information becomes available is rapidly accelerating. Gene duplication and gene loss events inherently limit the accuracy of orthology detection based on sequence similarity alone. Fully automated methods for orthology annotation do exist but often fail to identify individual members in cases of large gene families, or to distinguish missing data from traceable gene losses. This situation can be improved in many cases by including conserved synteny information. 
Results:
Here we present the SynBlast pipeline that is designed to construct and evaluate local synteny information. SynBlast uses the genomic region around a focal reference gene to retrieve candidates for homologous regions from a collection of target genomes and ranks them in accord with the available evidence for homology.  The pipeline is intended as a tool to aid high quality manual annotation in particular in those cases where automatic procedures fail. We demonstrate how SynBlast is applied to retrieving orthologous and paralogous clusters using the vertebrate Hox and ParaHox clusters as examples.
Software:
The SynBlast package written in Perl is available under the GNU General Public License at http://www.bioinf.uni-leipzig.de/Software/SynBlast/.</description>
			<link>http://www.biomedcentral.com/1471-2105/9/351</link>		
			<dc:creator>Jorg Lehmann, Peter F Stadler and Sonja J Prohaska</dc:creator>
			<dc:source>BMC Bioinformatics 2008, 9:351</dc:source>
			<dc:subject>Number of accesses: 771</dc:subject>
			<dc:date>2008-08-24</dc:date>
			<dc:identifier>doi:10.1186/1471-2105-9-351</dc:identifier>
			
			
							
					<prism:publicationName>BMC Bioinformatics</prism:publicationName>
					
			
							
					<prism:issn>1471-2105</prism:issn>
					
			
							
					<prism:volume>9</prism:volume>
					
			
							
					<prism:startingPage>351</prism:startingPage>
					
			
							
					<prism:publicationDate>2008-08-24</prism:publicationDate>
					

            <cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/>
        </item>
		
	<cc:License rdf:about="http://creativecommons.org/licenses/by/2.0/">
         <cc:permits rdf:resource="http://creativecommons.org/ns#Reproduction"/>
         <cc:permits rdf:resource="http://creativecommons.org/ns#Distribution"/>
         <cc:permits rdf:resource="http://creativecommons.org/ns#DerivativeWorks"/>
	</cc:License>
</rdf:RDF>
