MOIRAI: a compact workflow system for CAGE analysis
1 RIKEN Center for Life Science Technologies (CLST), Riken Yokohama Institute, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 230-0045 Kanagawa, Japan
2 Preventive Medicine and Diagnosis Innovation Program (PMI), Riken Yokohama Institute, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 230-0045 Kanagawa, Japan
3 Department of Biosciences and Nutrition, Karolinska Institutet, SE-171 77 Stockholm, Sweden
BMC Bioinformatics 2014, 15:144 doi:10.1186/1471-2105-15-144Published: 16 May 2014
Cap analysis of gene expression (CAGE) is a sequencing based technology to capture the 5’ ends of RNAs in a biological sample. After mapping, a CAGE peak on the genome indicates the position of an active transcriptional start site (TSS) and the number of reads correspond to its expression level. CAGE is prominently used in both the FANTOM and ENCODE project but presently there is no software package to perform the essential data processing steps.
Here we describe MOIRAI, a compact yet flexible workflow system designed to carry out the main steps in data processing and analysis of CAGE data. MOIRAI has a graphical interface allowing wet-lab researchers to create, modify and run analysis workflows. Embedded within the workflows are graphical quality control indicators allowing users assess data quality and to quickly spot potential problems. We will describe three main workflows allowing users to map, annotate and perform an expression analysis over multiple samples.
Due to the many built in quality control features MOIRAI is especially suitable to support the development of new sequencing based protocols.