Skip to main content

Proceedings of the 2012 MidSouth computational biology and bioinformatics society (MCBIOS) conference

Introduction

The ninth annual conference of the MidSouth Computational Biology and Bioinformatics Society (MCBIOS 2012), "Making Sense of the Omics Data Deluge", took place in Oxford, Mississippi February 17-8 2012. This year's Conference Chairs were Dr. Dawn Wilkins, of the University of Mississippi and Dr. Doris Kupfer, also the current MCBIOS President (2011-2), from the Federal Aviation Administration. There were 170 registrants and a total of 106 abstracts (34 oral presentations and 72 poster session abstracts).

Keynote speakers for 2012 were Dr. Michael Gribskov, Purdue University, who gave the opening address, "After the Deluge: Bioinformatics meets big data"; Dr. David J. States, OncProTech LLC, who gave his presentation remotely via WebEx entitled "Data Intensive Proteomics"; and Sultan Meghji, Appistry Inc. presenting the Saturday morning address, entitled "Simple, Fast and Affordable - Turning the myriad of data into action - technologies to support personalized medicine" and invited speaker, Dr. William Slikker, Director of the Food and Drug Administration's, National Center for Toxicological Research, presented a talk entitled "Regulatory Science: Challenges and Progress" outlining the role research at the FDA plays in their regulatory responsibilities.

Participants also had the opportunity to attend hands-on workshops on NCBI tools, presented by Dr. Peter Cooper, NCBI/NLM/NIH staff scientist, and a collaboration workshop focused on the timber rattlesnake genome, facilitated by Dr. Ed Perkins, Army Corp of Engineers.

The winners of conference awards were:

Best Oral Presentations (students):

1st Place: Shana Stoddard, University of Mississippi

2nd Place: Neal Platt, Mississippi State

3rd Place: Aleksandra Markovets, UALR

Best Oral Presentations (Post-Doctoral fellows):

1st Place: Mikhail Dozmorov, OMRF

2nd Place: Zhichao Liu, NCTR

Best Poster (Computation):

1st Place: Shraddha Thakkar, UAMS

2nd Place: Xingyan Kuang, University of Missouri

3rd Place: Sule Dogan, Mississippi State

Best Poster (Biology):

1st Place: Tamer Aldwairi, Mississippi State

2nd Place: Dilip Gautam, Mississippi State

3rd Place: Bin Pang, University of Missouri

Proceedings summary

This year, there were 13 papers accepted for publication in the conference proceedings [1–13] out of a total of 20 submitted (65%), which was the lowest number of papers published since the first MCBIOS conference in 2003 which also accepted 13 papers. It was the second lowest number of papers submitted to the proceedings (17 were submitted in 2004). This was a substantial drop from the 21 papers published in last year's Proceedings [14–34]. All papers were peer-reviewed by 2 or more reviewers. Our goal is to be inclusive, yet rigorous in the peer-review process such that accepted papers are both high quality and reflective of the work presented at the conference. Papers generally fell into five categories:

Genomic analysis

Ptitsyn et al report an algorithm for analysis of whole genomes in terms of the genes that they share. It provides an important way to quantify gain and loss of genes across phyla, and the authors have identified core genes that are common to each phylum [6].

Verma and Melcher [8] describe a Support Vector Machine (SVM) model for distinguishing peptides originating from plant host proteins or from proteobacterial plant pathogen proteins. A feature set consisting of a combination of both single amino acid compostions and dipeptide compositions exhibited the highest accuracy.

Yao et al. [9] present a detailed phylogenetic and transcriptome analysis of three classes of the secondary-wall-associated NAC domain transcription factors across 19 higher plant species. In addition, computational modeling is used to predict the genes regulated or co-regulated by these transcription factors. The study reveals coordinative functioning of several NAC genes and a number of novel genes and pathways that can potentially be involved in biosynthesis of cell walls.

Systems biology/pathways

Abundance of different databases literally creates an "omics data deluge". Hui Huang et al [3] addressed this issue by creating PAGED database http://bio.informatics.iupui.edu/PAGED, that include data from OMIM, GAD, MSigDb, miRecords and other databases as a one stop solution for exploratory science.

Zhang and Drabier [10] used different aspects of data integration by compiling an Integrated Pathway Analysis Database (IPAD, http://bioinfo.hsc.unt.edu/ipad). This database defines associations between genes, proteins, pathways, diseases, drugs and organs, essential in understanding the relationships between these entities. These relationships can be quantified by running enrichment analysis with flexible threshold options.

Zhang and Berleant [12] developed a Java application BirdsEyeView http://metnetdb.org/MetNet_BirdsEyeView.htm for visualizing gene lists and expression data in context of cellular localization, pathways, and gene ontology annotations. Developed for plant research, this customizable tool provides flexible and intuitive understanding of the processes and pathways affected by the genes of interest.

RNA-seq

Reddy et al. used RNA-Seq to develop an expression based data analysis workflow using freely available software to validate and expand the existing annotation of the cattle pathogen, Mannheimia haemolytica PHL213 [7]. Using the pipeline, the study confirmed existing M. haemolytica annotation as well as identified potential novel genes and operon structures, demonstrating and validating the use of this elegant, simple, and easily implemented bioinformatics pipeline.

Proteomics

Zhang et al compare the effects of organelle enrichment on sensitivity of protein identification by high-throughput mass-spec from aerials of A. thaliana, and further compare and contrast the biological effects of two hormone treatments, Zeatin and brassinosteroid, on protein expression levels in mitochondria and chloroplasts. Their results suggest that physical enrichment of organelles increases the sensitivity of the assay to identify organelle specific proteins. In addition, they find that the two hormones affect different biological pathways to achieve a similar physiological effect, an increase in biomass for bioenergy production [13].

Zhang and Su analyzed the flexibility of protein structures using different structures of identical proteins based on structural comparison, secondary structure and sequence alignment, and report that proteins have several stable conformations, and that structures for the identical sequences may significantly differ from one another [11]. This will be helpful in evaluating the accuracy of protein structure prediction methods, e.g. one may need to employ molecular dynamic simulation to construct a structure set as criteria for such studies.

Pechan and Gwaltney investigated the relationship between tandem mass spectral fragment ion intensities and the distribution of in vacuo protonation states that can be modeled from peptide sequences [5]. Their work suggests that it is possible to calculate the ion intensities in the mass spectra of peptides, based solely on the protein's amino acid sequence.

Miscellaneous

Halil Bisgin et al use topic modeling to analyze pharmacological similarity and evaluate their system in terms of its potential to reposition drugs - that is, to find additional uses for them. Doing so is important because new drug development is extremely expensive and time-consuming [1].

Zhifa Liu et al evaluated four different Bayesian network scoring functions, Minimum Description Length (MDL), Akaike's Information Criterion (AIC), Bayesian Dirichlet equivalence score (BDeu) and factorized Normalized Maximum Likelihood (fNML), and analyzed their performance in terms of success rate on recovering 'true' gold standard networks [4]. They report that MDL outperforms other scoring functions. This study would provide useful information when analyzing biological networks, such as the gene regulatory networks (GRN).

Fu et al [2] applied multiple instance learning via embedded selection (MILES) for the construction of quantitative structure-activity relationship (QSAR) between 3D shapes of a bioactive compounds with their targets. The authors demonstrated that their method, built solidly on previous research, allows better drug activity prediction without overfitting.

Future meetings

The Stoney Creek Inn & Conference Center in Columbia, Missouri will be the site of MCBIOS 2013 to be held April 5-6. This will be the tenth anniversary of the MCBIOS conference and will be entitled "The 10th Anniversary in a Decade of Change: Discovery in a Sea of Data". The 2012-2013 MCBIOS President is Ed Perkins, US Army Engineer Research and Development Center and Andy Perkins, Mississippi State University, is now the President-elect. MCBIOS is a regional affiliate of the International Society for Computational Biology http://www.ISCB.org. For information regarding MCBIOS and our future meetings, see http://www.MCBIOS.org.

References

  1. Bisgin H, Liu Z, Kelly R, Fang H, Xu X, Tong W: Investigating drug repositioning opportunities in FDA drug labels through topic modeling. BMC Bioinformatics 2012, 13(Suppl 15):S6. 10.1186/1471-2105-13-S15-S6

    Article  PubMed Central  PubMed  Google Scholar 

  2. Fu G, Nan X, Liu H, Patel RY, Daga PR, Chen Y, Wilkins D, Doerksen RJ: Implementation of Multiple-Instance Learning in Drug Activity Prediction. BMC Bioinformatics 2012, 13(Suppl 15):S3. 10.1186/1471-2105-13-S15-S3

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  3. Huang H, Wu X, Sonachalam M, Mandape S, Pandey R, MacDorman K, Wan P, Chen JY: PAGED: A Pathway and Gene-set Enrichment Database to Enable Molecular Phenotype Discoveries. BMC Bioinformatics 2012, 13(Suppl 15):S2. 10.1186/1471-2105-13-S15-S2

    Article  CAS  Google Scholar 

  4. Liu Z, Malone B, Yuan C: Empirical Evaluation of Scoring Functions for Bayesian Network Model Selection. BMC Bioinformatics 2012, 13(Suppl 15):S14. 10.1186/1471-2105-13-S15-S14

    Article  PubMed Central  PubMed  Google Scholar 

  5. Pechan T, Gwaltney SR: Calculations of relative intensities of fragment ions in the MSMS spectra of a doubly charged penta-peptide. BMC Bioinformatics 2012, 13(Suppl 15):S13.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  6. Ptitsyn A, Moroz LL: Computational Workflow for Analysis of Gain and Loss of Genes in Distantly Related Genomes. BMC Bioinformatics 2012, 13(Suppl 15):S5. 10.1186/1471-2105-13-S15-S5

    Article  PubMed Central  PubMed  Google Scholar 

  7. Reddy JS, Kumar R, Watt JM, Lawrence ML, Burgess SC, Nanduri B: Transcriptome profile of a bovine respiratory disease pathogen: Mannheimia haemolytica PHL213. BMC Bioinformatics 2012, 13(Suppl 15):S4.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  8. Verma R, Melcher U: A Support Vector Machine based method to distinguish proteobacterial proteins from eukaryotic plant proteins. BMC bioinformatics 2012, 13(Suppl 14):S9. 10.1186/1471-2105-13-S14-S9

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  9. Yao D, Wei Q, Xu W, Syrenne RD, Yuan JS, Su Z: Comparative Genomic Analysis of NAC Transcriptional Factors to Dissect the Regulatory Mechanisms for Cell Wall Biosynthesis. BMC Bioinformatics 2012, 13(Suppl 15):S10. 10.1186/1471-2105-13-S15-S10

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  10. Zhang F, Drabier R: IPAD: the Integrated Pathway Analysis Database for Systematic Enrichment Analysis. BMC Bioinformatics 2012, 13(Suppl 15):S7. 10.1186/1471-2105-13-S15-S7

    Article  CAS  Google Scholar 

  11. Zhang G, Su Z: Inferences from structure comparison: flexibility, secondary structure wobble and sequence alignment optimization. BMC Bioinformatics 2012, 13(Suppl 15):S12.

    CAS  Google Scholar 

  12. Zhang L, Berleant D, Wang Y, Li L, Cook D, Wurtele ES: BirdsEyeView (BEV): graphical overviews of experimental data. BMC Bioinformatics 2012, 13(Suppl 15):S11.

    PubMed Central  PubMed  Google Scholar 

  13. Zhang Y, Liu S, Dai SY, Yuan JS: Integration of shot-gun proteomics and bioinformatics analysis to explore plant hormone responses. BMC Bioinformatics 2012, 13(Suppl 15):S8. 10.1186/1471-2105-13-S15-S8

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  14. Diedrich KT, Roberts JA, Schmidt RH, Kang CK, Cho ZH, Parker DL: Validation of an arterial tortuosity measure with application to hypertension collection of clinical hypertensive patients. BMC Bioinformatics 2011, 12(Suppl 10):S15. 10.1186/1471-2105-12-S10-S15

    Article  PubMed Central  PubMed  Google Scholar 

  15. Chen M, Shi L, Kelly R, Perkins R, Fang H, Tong W: Selecting a single model or combining multiple models for microarray-based classifier development?--a comparative analysis based on large and diverse datasets generated from the MAQC-II project. BMC Bioinformatics 2011, 12(Suppl 10):S3. 10.1186/1471-2105-12-S10-S3

    Article  CAS  Google Scholar 

  16. Dozmorov MG, Giles CB, Wren JD: Predicting gene ontology from a global meta-analysis of 1-color microarray experiments. BMC Bioinformatics 2011, 12(Suppl 10):S14. 10.1186/1471-2105-12-S10-S14

    Article  PubMed Central  PubMed  Google Scholar 

  17. Nan X, Fu G, Zhao Z, Liu S, Patel RY, Liu H, Daga PR, Doerksen RJ, Dang X, Chen Y, et al.: Leveraging domain information to restructure biological prediction. BMC Bioinformatics 2011, 12(Suppl 10):S22. 10.1186/1471-2105-12-S10-S22

    Article  PubMed Central  PubMed  Google Scholar 

  18. Achuthan S, Chung BJ, Ghosh P, Rangachari V, Vaidya A: A modified Stokes-Einstein equation for Abeta aggregation. BMC Bioinformatics 2011, 12(Suppl 10):S13. 10.1186/1471-2105-12-S10-S13

    Article  PubMed Central  PubMed  Google Scholar 

  19. Winters-Hilt S, Horton-Chao E, Morales E: The NTD Nanoscope: potential applications and implementations. BMC Bioinformatics 2011, 12(Suppl 10):S21. 10.1186/1471-2105-12-S10-S21

    Article  PubMed Central  PubMed  Google Scholar 

  20. Suer S, Kockara S, Mete M: An improved border detection in dermoscopy images for density based clustering. BMC Bioinformatics 2011, 12(Suppl 10):S12. 10.1186/1471-2105-12-S10-S12

    Article  PubMed Central  PubMed  Google Scholar 

  21. Li Y, Gong P, Perkins EJ, Zhang C, Wang N: RefNetBuilder: a platform for construction of integrated reference gene regulatory networks from expressed sequence tags. BMC Bioinformatics 2011, 12(Suppl 10):S20. 10.1186/1471-2105-12-S10-S20

    Article  PubMed Central  PubMed  Google Scholar 

  22. Esfahani MS, Yoon BJ, Dougherty ER: Probabilistic reconstruction of the tumor progression process in gene regulatory networks in the presence of uncertainty. BMC Bioinformatics 2011, 12(Suppl 10):S9. 10.1186/1471-2105-12-S10-S9

    Article  PubMed Central  PubMed  Google Scholar 

  23. Bisgin H, Liu Z, Fang H, Xu X, Tong W: Mining FDA drug labels using an unsupervised learning technique--topic modeling. BMC Bioinformatics 2011, 12(Suppl 10):S11. 10.1186/1471-2105-12-S10-S11

    Article  PubMed Central  PubMed  Google Scholar 

  24. Dozmorov MG, Wren JD: High-throughput processing and normalization of one-color microarrays for transcriptional meta-analyses. BMC Bioinformatics 2011, 12(Suppl 10):S2. 10.1186/1471-2105-12-S10-S2

    Article  PubMed Central  PubMed  Google Scholar 

  25. Markovets AA, Herman D: Analysis of cancer metabolism with high-throughput technologies. BMC Bioinformatics 2011, 12(Suppl 10):S8. 10.1186/1471-2105-12-S10-S8

    Article  PubMed Central  PubMed  Google Scholar 

  26. Ghaffari N, Ivanov I, Qian X, Dougherty ER: A CoD-based stationary control policy for intervening in large gene regulatory networks. BMC Bioinformatics 2011, 12(Suppl 10):S10. 10.1186/1471-2105-12-S10-S10

    Article  PubMed Central  PubMed  Google Scholar 

  27. Roy S, Heinrich K, Phan V, Berry MW, Homayouni R: Latent Semantic Indexing of PubMed abstracts for identification of transcription factor candidates from microarray derived gene sets. BMC Bioinformatics 2011, 12(Suppl 10):S19. 10.1186/1471-2105-12-S10-S19

    Article  PubMed Central  PubMed  Google Scholar 

  28. Martha VS, Liu Z, Guo L, Su Z, Ye Y, Fang H, Ding D, Tong W, Xu X: Constructing a robust protein-protein interaction network by integrating multiple public databases. BMC Bioinformatics 2011, 12(Suppl 10):S7. 10.1186/1471-2105-12-S10-S7

    Article  PubMed Central  PubMed  Google Scholar 

  29. Mayo ML, Perkins EJ, Ghosh P: First-passage time analysis of a one-dimensional diffusion-reaction model: application to protein transport along DNA. BMC Bioinformatics 2011, 12(Suppl 10):S18. 10.1186/1471-2105-12-S10-S18

    Article  PubMed Central  PubMed  Google Scholar 

  30. Qian X, Sahraeian SM, Yoon BJ: Enhancing the accuracy of HMM-based conserved pathway prediction using global correspondence scores. BMC Bioinformatics 2011, 12(Suppl 10):S6. 10.1186/1471-2105-12-S10-S6

    Article  PubMed Central  PubMed  Google Scholar 

  31. Griffith SD, Quest DJ, Brettin TS, Cottingham RW: Scenario driven data modelling: a method for integrating diverse sources of data and data streams. BMC Bioinformatics 2011, 12(Suppl 10):S17. 10.1186/1471-2105-12-S10-S17

    Article  PubMed Central  PubMed  Google Scholar 

  32. Wang Y, Ghaffari N, Johnson CD, Braga-Neto UM, Wang H, Chen R, Zhou H: Evaluation of the coverage and depth of transcriptome by RNA-Seq in chickens. BMC Bioinformatics 2011, 12(Suppl 10):S5. 10.1186/1471-2105-12-S10-S5

    Article  CAS  Google Scholar 

  33. Matthews SJ, Williams TL: An efficient and extensible approach for compressing phylogenetic trees. BMC Bioinformatics 2011, 12(Suppl 10):S16. 10.1186/1471-2105-12-S10-S16

    Article  PubMed Central  PubMed  Google Scholar 

  34. Zhang F, Chen JY: HOMER: a human organ-specific molecular electronic repository. BMC Bioinformatics 2011, 12(Suppl 10):S4. 10.1186/1471-2105-12-S10-S4

    Article  PubMed Central  PubMed  Google Scholar 

Download references

Acknowledgements

We would like to thank the many anonymous peer reviewers who helped to ensure the quality of these Proceedings.

This article has been published as part of BMC Bioinformatics Volume 13 Supplement 15, 2012: Proceedings of the Ninth Annual MCBIOS Conference. Dealing with the Omics Data Deluge. The full contents of the supplement are available online at http://www.biomedcentral.com/bmcbioinformatics/supplements/13/S15

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jonathan D Wren.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

All authors served as editors for these proceedings, with JDW serving as Senior Editor. All authors helped write this editorial.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Wren, J.D., Dozmorov, M.G., Burian, D. et al. Proceedings of the 2012 MidSouth computational biology and bioinformatics society (MCBIOS) conference. BMC Bioinformatics 13 (Suppl 15), S1 (2012). https://doi.org/10.1186/1471-2105-13-S15-S1

Download citation

  • Published:

  • DOI: https://doi.org/10.1186/1471-2105-13-S15-S1

Keywords