EasyModeller: A graphical interface to MODELLER

Kuntal, Bhusan K; Aparoy, Polamarasetty; Reddanna, Pallu

doi:10.1186/1756-0500-3-226

Technical Note
Open access
Published: 16 August 2010

EasyModeller: A graphical interface to MODELLER

Bhusan K Kuntal¹,
Polamarasetty Aparoy¹ &
Pallu Reddanna¹

BMC Research Notes volume 3, Article number: 226 (2010) Cite this article

22k Accesses
156 Citations
8 Altmetric
Metrics details

Abstract

Background

MODELLER is a program for automated protein Homology Modeling. It is one of the most widely used tool for homology or comparative modeling of protein three-dimensional structures, but most users find it a bit difficult to start with MODELLER as it is command line based and requires knowledge of basic Python scripting to use it efficiently.

Findings

The study was designed with an aim to develop of "EasyModeller" tool as a frontend graphical interface to MODELLER using Perl/Tk, which can be used as a standalone tool in windows platform with MODELLER and Python preinstalled. It helps inexperienced users to perform modeling, assessment, visualization, and optimization of protein models in a simple and straightforward way.

Conclusion

EasyModeller provides a graphical straight forward interface and functions as a stand-alone tool which can be used in a standard personal computer with Microsoft Windows as the operating system.

Findings

Structural information of biological macromolecules is readily available in the Protein Data Bank (PDB) [1], http://www.pdb.org. By Sep 2009, the PDB contained ~ 60,713 experimental protein structures that can be grouped into ~ 3500 families [2]. Considering that the number of non redundant amino acid sequence entries is around 408,000 http://www.expasy.org/sprot/, there is a huge gap between known annotated sequences and available 3 D structures. Developments in genomics have also spurred the developments in X-ray crystallography and NMR techniques to solve the new protein structure, which in turn has widened their use in drug discovery [3]. However, these efforts are no where near in solving the 3 D structures of all the known proteins in any system. In the absence of experimental structures, computational methods are used to predict 3 D protein models to provide insight into the structure and function of these proteins. The steps involved in this process are [4]: (1) identification of homolog that can be used as template(s) for modeling; (2) alignment of the target sequence to the template(s); (3) backbone generation; (4) loop modeling; (5) side-chain modeling; (6) model optimization; and (7) validation of the model. Repositories like The SWISS-MODEL http://swissmodel.expasy.org/SWISS-MODEL.html, Protein Model Portal http://proteinmodelportal.org[5] and Modbase http://modbase.compbio.ucsf.edu[6], contain protein models generated using various auto-mated methods. However, without human intervention, errors as a result of inaccurate sequence alignment, and inability to identify and correctly model domains, such as loop and ligand-binding regions, are magnified, which results in the generation of low-accuracy models and thus limiting their applicability to drug discovery projects [7, 8]. In this context, the development of various user friendly and accurate tools for homology modeling is an active area of research such as new recent tools like HHPRED and Modeller at http://toolkit.tuebingen.mpg.de/sections/tertstruct toolkit, GeneSilico https://genesilico.pl/toolkit/[9–15]. MODELLER is one of the most widely used tools for homology or comparative modeling of protein three-dimensional structures. MODELLER stands apart from other packages due to its free availability, powerful features and reliable results. But most users find a bit difficult to start with MODELLER as it is command line based. Hence a freely available GUI for MODELLER would thus be very helpful to exploit the powers and advantages of this package more effectively. EasyModeller is a graphical user interface to MODELLER program.

EasyModeller is a standalone tool with a very intuitive interface which clearly defines the different steps of homology modeling (Additional file 1). The screenshot of the tool is shown in Fig. 1 which shows six steps required for building a homology model with the help of EasyModeller. User is required to follow the numbered steps one by one, which is guided by associated help information. A blue clickable panel called the "Help panel" can be used to view the help tips associated with each step. EasyModeller follows a very simple color coding consisting of green and red buttons. The features (buttons) marked red are the minimum compulsory steps to get a model while those in green are the optional ones.

The first step involves specifying the working directory, which is the folder location where the output files will be generated. This working directory will also help to keep a track of all the generated files. The second and most basic step is entering the amino acid sequence information as the input parameter. The third step is providing the template information to the program. The user can load the template structure(s) in standard formats like (.pdb, .ent, etc) acceptable in MODELLER by using the Load template(s) feature. The basic template information like its name, chains, heteroatoms, etc are shown in the display area and the CHAIN is automatically set default to the first chain as set in the PDB file. To use a different chain for the template the text box containing the chain information can be edited and the desired chain id can be entered. The text box is automatically kept blank if no chain information is found. To do multi template based modeling users can load all the template structure files one by one in order with a maximum of six templates as in Fig 2. The next step of homology modeling is aligning the query sequence with the template which is achieved in step four. The "Perform Alignment" feature aligns the query sequence with the template(s) using the align2 d function of MODELLER and displays the output alignment in the text display window of the tool. Although the display area is large enough, sometimes the output display might seem not to fit in it. So the display is made scrollable, users just need to double click on it to activate the feature and scroll the mouse wheel down to view the entire display contents. A beneficial feature of the tool is the possibility to view and manually improve the query alignment via the feature 'Edit Alignment'. Although the tool provides a preliminary option for alignment editing, for users who would like to use advanced visual alignment editing can install BioEdit [16] and manually open the appropriate alignment (.ali) files from the current working directory with BioEdit (by one time associating .ali files with the tool during installation), do the editing on the go and save them in the same location. The fifth step is generating the homology model by using the information generated so far. The "Generate Model" feature is used to achieve this by using the appropriate MODELLER function as required. As soon as the model is generated, the best model is displayed in the users default PDB viewer like Rasmol [17] as in Fig. 3. Further the generated model can be improved upon by loop modeling. MODELLER has several loop optimization methods, which all rely on scoring functions and optimization protocols adapted for loop modeling [10]. The sixth and the final step is model optimization which can be achieved by using the advanced optimization options Fig 4. The various parameters for optimization and dynamics like temperature and number of iterations can be changed by editing the default value in the corresponding text boxes. The minimized models are generated and are saved in the working directory inside a new folder called optimized models. The dynamics output binary trajectory files are also saved in the same folder which can be read in by visualization software such as CHIMERA [18] or VMD [19]. Further the model profile plot can be generated by selecting the "Plot profile of a model" option which calculates DOPE energy of the loaded model using the assess_dope function [20] and displays it as in Fig. 5.

The GUI eliminates the requirement of prior knowledge in the backend applications, thereby increasing the number of users of MODELLER and assists them to exploit the unique features of this great package more effectively. EasyModeller uses default parameters for most commands during software execution to make the process as simple as possible. User can change the parameters manually by editing the associated python script file (*.py) generated in the working directory.

EasyModeller will be updated by adding features like comparison and manual combination of multiple template structure and manual definition of spatial restraints into a more powerful GUI to MODELLER which could simultaneously display both alignment and structure windows, and have them interact with each other.

Availability and requirements

Project name: EasyModeller

Project homepage: http://www.uohyd.ernet.in/modellergui/

Operating system: Microsoft Windows (any)

Programming language: Perl (using Perl/Tk)

Other requirements: The system must have MODELLER (any version will work but preferably the latest version to get the best results) and Python (2.5 or 2.6 and not 3.11) preinstalled in the default installation directory (C://Program files/..). If the operating system is Windows Vista or Windows 7 then please run the executable file (.exe) of EasyModeller with administrative privilege (by right clicking it and selecting "Run as administrator"). Since EasyModeller uses the Microsoft Excel plot function to plot the profile graph, it is necessary to have Microsoft Excel installed in the system. A PDB viewer like Rasmol is required to visualize the generated model.

License: Free to use

Any restrictions to use by non-academics: None

References

Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE: The Protein Data Bank. Nucleic Acids Res. 2000, 28 (1): 235-242. 10.1093/nar/28.1.235.
Article CAS PubMed Central PubMed Google Scholar
Cavasotto CN, Phatak SS: Homology modeling in drug discovery: current trends and applications. Drug Discov Today. 2009, 14 (13-14): 676-683. 10.1016/j.drudis.2009.04.006.
Article CAS PubMed Google Scholar
Manjasetty BA, Turnbull AP, Panjikar S, Bussow K, Chance MR: Automated technologies and novel techniques to accelerate protein crystallography for structural genomics. Proteomics. 2008, 8 (4): 612-625. 10.1002/pmic.200700687.
Article CAS PubMed Google Scholar
Krieger E, Nabuurs SB, Vriend G: Homology modeling. Methods Biochem Anal. 2003, 44: 509-523.
CAS PubMed Google Scholar
Arnold K, Kiefer F, Kopp J, Battey JN, Podvinec M, Westbrook JD, Berman HM, Bordoli L, Schwede T: The Protein Model Portal. J Struct Funct Genomics. 2009, 10 (1): 1-8. 10.1007/s10969-008-9048-5.
Article CAS PubMed Central PubMed Google Scholar
Pieper U, Eswar N, Davis FP, Braberg H, Madhusudhan MS, Rossi A, Marti-Renom M, Karchin R, Webb BM, Eramian D: MODBASE: a database of annotated comparative protein structure models and associated resources. Nucleic Acids Res. 2006, D291-295. 10.1093/nar/gkj059. 34 Database
Dalton JA, Jackson RM: An evaluation of automated homology modelling methods at low target template sequence similarity. Bioinformatics. 2007, 23 (15): 1901-1908. 10.1093/bioinformatics/btm262.
Article CAS PubMed Google Scholar
Venclovas C, Margelevicius M: The use of automatic tools and human expertise in template-based modeling of CASP8 target proteins. Proteins. 2009, 77 (Suppl 9): 81-88. 10.1002/prot.22515.
Article CAS PubMed Google Scholar
Sali A, Blundell TL: Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol. 1993, 234 (3): 779-815. 10.1006/jmbi.1993.1626.
Article CAS PubMed Google Scholar
Fiser A, Do RK, Sali A: Modeling of loops in protein structures. Protein Sci. 2000, 9 (9): 1753-1773. 10.1110/ps.9.9.1753.
Article CAS PubMed Central PubMed Google Scholar
Hildebrand A, Remmert M, Biegert A, Soding J: Fast and accurate automatic structure prediction with HHpred. Proteins. 2009, 77: 128-132. 10.1002/prot.22499.
Article CAS PubMed Google Scholar
Soding J, Biegert A, Lupas AN: The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res. 2005, W244-248. 10.1093/nar/gki408. 33 Web Server
Battey JN, Kopp J, Bordoli L, Read RJ, Clarke ND, Schwede T: Automated server predictions in CASP7. Proteins. 2007, 69 (Suppl 8): 68-82. 10.1002/prot.21761.
Article CAS PubMed Google Scholar
Dunbrack RL: Sequence comparison and protein structure prediction. Curr Opin Struct Biol. 2006, 16 (3): 374-384. 10.1016/j.sbi.2006.05.006.
Article CAS PubMed Google Scholar
Fiser A, Sali A: Modeller: generation and refinement of homology-based protein structure models. Methods Enzymol. 2003, 374: 461-491. full_text.
Article CAS PubMed Google Scholar
Hall TA: Bioedit: a user-friendly biological sequence alignment editor and analysis program for windows 95/98/nt. Nucleic Acids Symposium Series. 1999, 41: 95-98.
CAS Google Scholar
Sayle RA, Milner-White EJ: RASMOL: biomolecular graphics for all. Trends Biochem Sci. 1995, 20 (9): 374-10.1016/S0968-0004(00)89080-5.
Article CAS PubMed Google Scholar
Pettersen EF, Goddard TD, Huang CC, Couch GS, Greenblatt DM, Meng EC, Ferrin TE: UCSF Chimera--a visualization system for exploratory research and analysis. J Comput Chem. 2004, 25 (13): 1605-1612. 10.1002/jcc.20084.
Article CAS PubMed Google Scholar
Humphrey W, Dalke A, Schulten K: VMD: visual molecular dynamics. J Mol Graph. 1996, 14 (1): 33-38. 10.1016/0263-7855(96)00018-5. 27-38
Article CAS PubMed Google Scholar
Shen MY, Sali A: Statistical potential for assessment and prediction of protein structures. Protein Sci. 2006, 15 (11): 2507-2524. 10.1110/ps.062416606.
Article CAS PubMed Central PubMed Google Scholar

Download references

Acknowledgements

We thank Centre for Modelling, Simulation and Design (CMSD), University of Hyderabad for the computational facilities. We duly acknowledge Council of Scientific and Industrial Research (CSIR), Govt. of India for providing senior research fellowship to PA and UGC for providing GATE fellowship to BKK.

Author information

Authors and Affiliations

School of Life Sciences, University of Hyderabad, Hyderabad, 500 046, India
Bhusan K Kuntal, Polamarasetty Aparoy & Pallu Reddanna

Authors

Bhusan K Kuntal
View author publications
You can also search for this author in PubMed Google Scholar
Polamarasetty Aparoy
View author publications
You can also search for this author in PubMed Google Scholar
Pallu Reddanna
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Pallu Reddanna.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

BKK carried out the planning and development of the GUI, PA contributed in the enhancement of GUI and in manuscript preparation and PR coordinated the whole work.

Electronic supplementary material

Additional file 1:This file contains the EasyModeller tool executable file and the details of the tool. (ZIP 2 MB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Kuntal, B.K., Aparoy, P. & Reddanna, P. EasyModeller: A graphical interface to MODELLER. BMC Res Notes 3, 226 (2010). https://doi.org/10.1186/1756-0500-3-226

EasyModeller: A graphical interface to MODELLER