International Journal of ISSN: 2470-9980IJVV

Vaccines & Vaccination
Mini Review
Volume 3 Issue 3 - 2016
The Role of Protein Engineering in the Design and Production of Recombinant Proteins
Seyed Hossein Khaleghinejad1, Mohammad Dehghan Niri2, Azam Fazilati2, Sadegh Shabani3 and Seyed Hossein Shahcheraghi4,5*
1National Institute of Genetic Engineering and Biotechnology, Tehran, Iran
2Shahid Sadoughi Hospital, Shahid Sadoughi University of Medical Sciences, Yazd, Iran
3Department of Biology, University of Zabol, Zabol, Iran
4Infectious Diseases Research Center, University of Medical Sciences Shahid Sadoughi, Yazd, Iran
5Department of Modern Sciences & Technologies, School of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
Received: October 25, 2015| Published: December 30, 2016
*Corresponding author: Seyed Hossein Shahcheraghi, Infectious Diseases Research Center, Shahid Sadoughi University of Medical Sciences, Yazd, Iran, Department of Modern Sciences & Technologies, School of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran. Tel: +98-913-2531389, Email:
Citation: Khaleghinejad SH, Niri MD, Fazilati A, Shabani S, Shahcheraghi SH (2016) The Role of Protein Engineering in the Design and Production of Recombinant Proteins. Int J Vaccines Vaccin 3(3): 00067. DOI: 10.15406/ijvv.2016.03.00067


Proteins possess many structural and functional characteristics that these features are not seen in any other biomolecules. So many biologists have been persuaded that using protein engineering design and build their desired proteins. These engineered proteins can act as new molecular tools for scientific, medical, industrial, etc. applications, so they can satisfy many human needs that are not met by natural proteins. Protein engineering based on calculations Strategy produces and screens the protein sequences with “in silico” (i.e. before its synthesis in the laboratory). Structure-based protein engineering calculations similarly use the calculations to discover the protein sequences. However, calculation-based protein engineering also emphasizes the new and useful protein engineering and investigating the relationship between structure and function of them. This study investigates the protein engineering tools in producing the recombinant and engineered proteins. For this purpose, bioinformatics studies, how to predict the second and third buildings based on homology and adaptive modeling, predicting protein performance for the production of recombinant proteins are investigated.

Keywords: Protein engineering; Recombinant protein; Emphasizes; In silico; Biomolecules


Due to the versatility of microbial recombinant proteins, practical features and easy production, they are considered an important group of products with biotechnology value [1-7]. This group of proteins, according to their careful design, possesses specific characteristics that are very attractive for various applications [5]. In the design and production of recombinant proteins, we must take care that between the complexity of a specific protein and complexity and capabilities of an expression system there exist a direct relationship so that single protein subunits are easily capable of producing in bacterial hosts, while proteins that require mammalian proper glycosylation or the presence of multiple disulfide bonds are forced to express in higher eukaryotic host [6].

Given the complexity of engineering proteins they need a combination of computational and experimental approaches. ¬¬protein engineering strategies based on the variant changes identify the desired protein from among a large number of mutant variants. Therefore their success depends on a number of mutations required to apply appropriate methods for studying these mutations. On the other hand, protein engineering strategies based on calculations, produce and screen the protein sequences “in silico”, (i.e. before its synthesis in the laboratory). Structure-based protein engineering calculations similarly uses the calculations to discover the desired protein sequences. However, calculation-based protein engineering also emphasize the new and useful protein engineering and investigate the relationship between structure and function of them [8-12]. Random mutant, recombination, and diversification directed at the three main groups are methods used for building libraries in protein engineering [13].

Protein engineering purposes, especially in enzymes, is enlarging the active position, changes activity (change specific activity, change the characteristics of the substrate), sustainability (change thermal stability, protease stability and oxidation stability) and resistance to surfactants and detergents [14-16]. Schmidt and colleagues also used random mutation by error-prone, created a set of mutations for target enzyme to change anantio selectioiti [13].

 Mesophiltipic enzyme from B. subtilis LipA with the purpose of directed mutagenesis was put in place, one of the nine created mutations substantially increased the melting point of 15 degrees and 20 degrees in optimum temperature for mutant lipase activity compared with wild-type [1]. In 2009 the Factory et al. used Site-directed Mutagenesis method in Bacillus lipase enzyme termokatlonatous to reduce the space interference in active enzyme site for better access to the substrate, they replaced amino acid phenylalanine 181 and 182 with Alanine which leads to an increase in the enzymatic activity [9]. Protein engineering describes the process of altering the structure of an existing protein to improve its properties. It is an important technology that increases our basic understanding of how enzymes function and have evolved, and it is the key method of improving enzyme properties for applications in pharmaceuticals, green chemistry and biofuels. In the following sections, tools and databases in protein engineering and genetic engineering which are used in the design and production of recombinant proteins are discussed.

Protein Sequence Alignment

We can find similar protein sequence alignment from this NCBI website ( According to this similarity finding, we can find the most similar protein to the expected protein in terms of sequence and structure. We can also predict the protected sequence, enzyme family, amino acids in the active site, oxyanion cavity etc. in target protein through BLAST in this database. To predict the presence of Disulfide bond, we can use DISULFIND datacenter ( Also to determine the presence of signal peptide we can use Signal P 4.1 ( For a signal peptide, D must score higher than 0.45, as (Figure 1) predicted score for protein (lipase) equal to 116/0. So in this forecast does not predict a signal peptide.

Figure 1: Predicting Signal Peptide.

Choosing the Expression Host

To select the host for protein expression we can use GenScript ( In this center, codon sequence of the desired protein is compared with codon usage of expression microorganisms such as yeast and E. coli. As the number CAI (Codon Adaptation Index) is closer to number 1, the expressed desired protein is higher.

Software Study of Protein Secondary Structure

To study the effect of mutations on recombinant protein secondary structure, the bioinformatics methods were used. For this purpose, the protein secondary structure prediction software is used in PSSPRED and YASPIN servers (3). The results of recombinant protein secondary structure prediction based on the scores of the natural amino acid changes resulting from mutations shows recombinant protein secondary structure. In this method the status of each amino acid is predicted according to its place in the type of secondary structure and the certainty of this prediction is displayed with numbers from zero to nine. (Figure 2) shows an example of the second structure of a recombinant protein.

Figure 2: Recombinant Protein Secondary Structure Prediction.

Software Study of Protein Tertiary Structure

In determining the tertiary structure of recombinant proteins, first to determine the pattern, blasts against pdb from phyre server is done which at this website, the structure with the greatest homology is selected as a template. Then, the recombinant protein structure and the pattern is predicted and optimized using modeler, Easy MODELLER and PYTHON software and is displayed using the VMD software and thus the third structure is made [15].

Assessment of Ppredicted Recombinant Protein Structures

Study of normal and mutant lipase structures using adaptive method

To study the three-dimensional structure of recombinant proteins, first, the structure of these proteins has been predicted and optimized using Easy MODELLER software. Then the recombinant protein is matched on a natural protein. The RMSD (Root Mean Squar deviation) between atoms in two molecules is measured and calculated according to Å. This number is between zero and one, and as the amount of RMSD is closer to zero, it indicates that the mutation has not changed the overall structure of recombinant protein and these proteins are matched on each other [8].

Evaluation of the structure using Ramachandran diagram

Second structures of proteins are created by Sai and Phi angles and Ramachandran diagram shows the authorized status of each angle for protein structures. In fact, this chart is a way to visualize the angle φ against ψ backbone of amino acid residues in the protein structure [4].

Evaluation of protein structure through energy profile

Server proSA ( determines the quality of predicted structure based on Z-Score points and also determines the energy profile. As Absolute magnitude of Z-Score is closer to 10, the structure has a higher quality and the more negative energy profile shows a higher structure quality [2].

Function prediction of normal and mutant Lipase proteins

For predicting the function of recombinant protein, the interaction of various ligands with the recombinant and natural protein is investigated using MVD (Molegro Virtual Docker) software [11]. As a result of this ligand-protein connection by providing the MolDock score, interaction is estimated. MolDock, Escore is defined by the following energy expressions:

Escore = Einter + Eintra. In this regard Einter is the protein-ligand interaction energy and Eintra shows the ligand internal energy. The internal energy of ligand here is the same for each ligand and the effective energy in this equation depends on the energy of ligand-protein interactions and as this energy is less (more negative) the stability of the substrate at the active site is done better and substrate proteins are more stable. In studying the recombinant protein interactions with various ligands, it reveals that according to predictions done, how much is the required energy for recombinant and natural protein. Thus, we can predict the recombinant proteins function comparing to natural proteins (Especially in the case of enzymes).


Today, knowledge of enzymology has created profound changes in biotechnology industries. Until the 60s, the income of industrial enzymes was only a few thousand dollars a year, but by the growth of this industry in recent years, this income has increased. Today, most of enzymes are prepared by fermentation of bio-based materials. Protein engineering techniques are highly efficient in producing industrial enzymes.

Enzymes derived from microbial sources using protein engineering and molecular techniques are very useful because they can be produced at low cost and they show improved stability. So the production of recombinant proteins should be economically feasible and gene technology must be able to provide tools to compete with traditional sources, and produce technical enzymes and food additives. Thus, the use of molecular techniques and a tool called protein engineering lead to production of efficient systems and inexpensive components for cultivation in various processes.


  1. Ahmad S, Kamal MZ, Sankaranarayanan R, Rao NM (2008) Thermostable Bacillus subtilis lipases: in vitro evolution and structural insight. J Mol Biol 381(2): 324-340.
  2. Buchan DW (2013) Scalable web services for the PSIPRED Protein Analysis Workbench. Nucleic acids res 41: W349-W357.
  3. Krinsky N, Kaduri M, Roitman SJ, Goldfeder M, Ivanir E, et al. (2016) A Simple and Rapid Method for Preparing a Cell-Free Bacterial Lysate for Protein Synthesis. PLoS One 11(10): e0165137.
  4. Frishman D, Patrick A (1995) Knowledge‐based protein secondary structure assignment. Proteins 23(4): 566-579.
  5. Gandhi NN (1997) Applications of lipase. Journal of the American Oil Chemists’ Society 74(6): 621-634.
  6. Gellissen G, Kunze G, Gaillardin C, Cregg JM, Berardi E, et al. (2005) New yeast expression platforms based on methylotrophic Hansenula polymorpha and Pichia pastoris and on dimorphic Arxula adeninivorans and Yarrowia lipolytica – a comparison. FEMS Yeast Res 5(11): 1079-1096.
  7. Houde A, Kademi A, Leblanc D (2004) Lipases and their industrial applications. Applied biochemistry and biotechnology 118(1): 155-170.
  8. Humphrey W, Dalke A, Schulten K (1996) Visual molecular dynamics. Journal of Molecular Graphics 14(1): 33-38.
  9. Karkhane AA, Yakhchali B, Rastgar Jazii F, Bambai B (2009) The effect of substitution of Phe181 and Phe182 with Ala on activity, substrate specificity and stabilization of substrate at the active site of Bacillus thermocatenulatuslipase. Journal of Molecular Catalysis B: Enzymatic 61(3-4): 162-167.
  10. Park SJ, Cochran JR (2009) Protein Engineering and Design (1st edn), CRC Press, USA.
  11. Parulekar RS (2013) Homology modeling, molecular docking and DNA binding studies of nucleotide excision repair UvrC protein from M. tuberculosis. Protein J 32(6): 467-476.
  12. Rashid MA, Khatib K, Abdul S (2015) Protein preliminaries and structure prediction fundamentals for computer scientists. Computational Engineering.
  13. Rúa ML, Dannert SC, Wahl S, Sprauer A, Schmid RD (1997) Thermoalkalophilic lipase of Bacillus thermocatenulatus large-scale production, purification and properties: aggregation behavior and its effect on activity. J Biotechnol 56(2): 89-102.
  14. Sekhon BS (2012) Designer Proteins. Journal of Pharmaceutical Education and Research 3(1): 40.
  15. Shen MY, Sali A (2006) Statistical potential for assessment and prediction of protein structures. Protein Sci 15(11): 2507-2524.
  16. Danino H, Naor RP, Fogel C, Harosh BY, Kadir R, et al. (2016) PPARγ regulates exocrine pancreas lipase. Biochim Biophys Acta 1861(12): 1921-1928.
© 2014-2016 MedCrave Group, All rights reserved. No part of this content may be reproduced or transmitted in any form or by any means as per the standard guidelines of fair use.
Creative Commons License Open Access by MedCrave Group is licensed under a Creative Commons Attribution 4.0 International License.
Based on a work at
Best viewed in Mozilla Firefox | Google Chrome | Above IE 7.0 version | Opera |Privacy Policy