Table of Contents Prev article Next article
March , 2019 Vol 1 issues 1
Computer Aided Drug Design: An Introduction

Ibezim Akachukwu*, Nwodo N. Ngozi, Mbah C. Mbah

Department of Pharmaceutical and Medicinal Chemistry, University of Nigeria, Nsukka.


Corresponding Author:

Akachukwu Ibezim


Tel: +234-803-803-2676


Received 29 May 2018

Revised 31 October 2018

Accepted 5 November 2018


This review covers the basic principles of chemistry used in molecular modeling as they apply to medicinal chemistry. This is necessitated from the fact that the use of computers in drug design and development has become a common practice.

1 Introduction

Traditional drug discovery generally involves some trial and error processes that include experimental screening of new chemical entities which are obtained by chemical synthesis or isolated from natural origin, until the desired pharmacological properties have been developed [1]. This traditional approach cost at least 1 billion US dollar and a period of at least 10 to 15 years to discover a successful drug [2]. Experts in the field of drug discovery and development have made several efforts to surmount the earlier mentioned challenges by coming up with the following methods: combinatorial chemistry methods, high throughput screening method, Proteomic and genomic projects, etc. Combinatorial chemistry has increased the number of compounds synthesized per given time, thereby populating the number of potential drug candidates to be screened. High throughput screening technique provided the opportunity to determine the biological potency of large chemical entities simultaneously. Advances in human genome and proteome have resulted in identification of large number of human proteins which serve as drug targets. All the contributions made by these techniques mainly caused an increase in expenses, number of leads and protein targets without corresponding increase in the number of successful new drug. Therefore, there is an urgent need for methods that will cut down cost and time for the drug discovery process. The use of computers and computer programs has emerged as an answer to this need in drug discovery, and is now known as computer aided drug design (CADD) [3].

In CADD, computational methods, mainly computer programs/algorithm, are employed to calculate structures and properties of molecules. These computational methods are broadly divided into two categories: molecular mechanics and quantum mechanics.

2 Computers in Medicinal Chemistry

2.1 Quantum Mechanics and Molecular Mechanics

Before molecular mechanics and quantum mechanics calculations and operations are carried out on molecules, chemical structure of the molecule in question has to be generated and displayed on the computer screen. Molecular modeling programs often include chemical drawing and graphics display packages capable of generating both two and three dimensional chemical structures. The chemical structures can be displayed in different formats as shown in Fig 1. There are several software packages available, such as ChemDraw, Alchemy, Sybyl, Hyperchem, Discovery Studio Pro, Spartan, CAChe, etc. [4].

Fig 1. Different representations for visualizing a molecule (histamine) a: 2D structure, b: line, c: stick, d: ball and line, e: ball and stick, f: space filling. b-f are 3D structures
Fig 1. Different representations for visualizing a molecule (histamine)
a: 2D structure, b: line, c: stick, d: ball and line, e: ball and stick,
f: space filling. b-f are 3D structures
Click to view

2.2 Quantum Mechanics

Quantum mechanics uses the principles of quantum physics to calculate/describe the properties of a molecule by considering the interactions between the electrons and nuclei of the molecule. Electrons of a drug molecule are considered as the most important atomic particle in QM because the chemical behavior of a molecule (drug molecule phenomena) is governed by the probability of finding an electron in a particular location in the molecule and by the energy of that electron [5-6].

QM is based on molecular orbital theory (MOT) which enables the calculations/descriptions of electron location probabilities and energies following Schrodinger equation (1) on the potential energy surface.

Quantum Mechanics

Where H is the Hamiltonian operator for the system, E is energy (including the potential and kinetic energy components), ψ is the wave function which describes the distribution (3D coordinates) of an electron in a molecular orbital.

Solution to Schrodinger’s equation is only obtainable for the simplest molecule – hydrogen. Simplifications resulting in the following assumptions have to be made to the equation in order to make it tractable in the case of large molecules:

  • Nuclei are regarded as motionless while electrons move around it. This assumption enables computation of electronic and nuclear energy to be made separately.
  • The electrons move independent of each other, so the influence of other electrons and nuclei is taken as an average.

QM is used to calculate: molecular orbital energies and coefficient, heat of formation for specific conformations, partial atomic charges calculated from molecular orbital coefficients, electrostatic potentials. QM calculations are subdivided into three categories namely; ab initio, density functional theory and semi-empirical methods.

2.2.1 Ab Initio Methods

Ab initio is a Latin phrase meaning ‘from the beginning’. In this method, variation theory is used to compute energy of a molecule based on wave function. Full Schrodinger’s equation is used to treat all the electrons of a molecule without attempt to calibrate them against experimental data. Hartree-Fock model is an example of ab initio calculations.

Hartree and Fock combined electrons into an average field which simplified calculation by allowing Hamiltonian to be calculated for each electron independently using a new term for its interaction with the overall electron cloud.

Ab initio method implies the following:

  • All the electrons have been considered simultaneously.
  • The exact non-relativistic Hamiltonian (with fixed nuclei) is used.

 Ab Initio Methods

Where the indices i, j and a, b refer, respectively, to the electrons and to the nuclei with nuclear charges Za and Zb. H is an “operator”, a mathematical construction that operates on the molecular orbital, ψ, to determine the energy.

  • Lastly, an effort would have been made to evaluate all integrals rigorously.

Generally, ab initio method is satisfactory only for small molecules containing about tens of atoms. However, it has been applied to more computational work due to high computer horsepower. The method is used to determine properties of atoms of a molecule such as dipole moments, magnetic susceptibility, chemical shielding, spin-spin coupling constants, electron affinity, etc. GAMESS and GAMESS-UK are widely free academic software for carrying out ab initio calculations.

2.2.2 Density Functional Theory

Density Functional Theory (DFT) is QM method that calculates energy (electronic structure) of atoms of a molecule (especially the ground state) based on the electron density. Although DFT has been used to perform calculations in solid-state physics since the 1970s, it was only considered accurate enough for computations in quantum chemistry in the 1990s when the approximations in DFT were greatly refined to better model the exchange and correlation interactions. This placed DFT as a leading method used in describing electronic structure. However, the theory has not satisfactorily described intermolecular interactions like van dar Waals forces (dispersion), charge transfer excitations, global potential energy etc.

DFT method has been applied in different aspect of drug design processes such as: calculation of anion-binding properties of 2,6-diamidopyridin dipyrromethane hybrid macrocycles, analyzing the β2-adrenergic G protein-coupled receptor and predicting drug resistance of HIV-1 reverse transcriptase to nevirapine through mutations.

2.2.3 Semi Empirical Method

Semi empirical calculations were born out of necessity to solve the limitation of ab initio method i.e. computer intensity. Principally, semi empirical method is a simple Hartree Fock-Linear combination of atomic orbitals (HF-LCAO) based model that avoid all the difficult integrals, involved in ab initio method that makes it computationally intensive, which is usually calibrated against experiment. Semi empirical calculations are faster than ab initio method and uses perturbation theory to compute electronic properties such as electronic distribution and partial charges. This method is suitable for molecular systems containing hundreds of atoms. Molecular Orbital PACkage (MOPAC) and AM1 (Austin Model 1) are popular programs used in QM semi empirical calculations.

2.2.4 Quantum Mechanics in Drug Design (Electronic Charge)

Earlier, we have mentioned that MOT is the bedrock of QM methods.  Molecular orbital calculations give numeric indices that show the electronic structure (probable position of an electron in a molecular orbital and its energy) which in turn governs the biological behavior/activity of a drug molecule.  So, changes in numeric indices bring about change in electronic structure and invariably change in how a drug molecule behaves in vivo.  Two examples that show the usefulness of MO calculations results (numerical indices) in interpretation of a drug’s mechanism of action and design of new drug molecule with improved properties are in the calculation of electronic charges.

The calculation of electronic charges reveals that electron charge density of atoms in a molecule is not evenly distributed. Valence electrons of atoms of a molecule are not localized on a particular atom rather they move around the entire molecule but spend more time nearer to electronegative atoms than electropositive ones. This results in some parts of the molecule being partially negative (due to excess electron) and other parts being partially positive (due to electron deficiency). Calculated electronic charges of molecules have been found useful in predicting structure-activity relationship of drugs as shown in the following.

Consider the three inhalation anesthetic gases shown in Fig 2 alongside their calculated excess or deficient electronic charges per atom of the molecule (not the absolute values). The electronic charge values enable drug scientist to suggest/propose the metabolism of these gases as thus: enzymatic ether bond cleavage by attack of an electrophilic oxygen atom at the methyl carbon in methoxyflurane is really much feasible than other two molecules because the methyl carbon is much less positive than the methyl carbons in others. The results of predictions like these are a better understanding of metabolism and a rationale for the design of new agents with improved properties.

Second example is gotten from the charge distribution of histamine and histamine ion (Fig 3) [7-8]. Charges are thought to be localized on a particular atom as the terminal nitrogen atom of histamine ion bears the positive charge. However, calculation of partial charges shows that some of the positive charge is localized on the hydrogen attached to the terminal nitrogen.  This has important consequences in the way we think of ionic interactions between a drug and its binding site. It implies that charges areas in the binding site and the drug are more diffuse than one might think. This in turn, suggests that we have wider scope in designing novel drugs.

Fig 2. Charge densities for anesthetic gases: methoxyflurane (I), enflurane (II), isoflurane (III)
Fig 2. Charge densities for anesthetic gases:
methoxyflurane (I), enflurane (II), isoflurane (III)
Click to view



Fig 3. Charge distribution on the histamine (I) and histamine ion (II)
Fig 3. Charge distribution on the histamine (I) and
histamine ion (II)
Click to view

2.3 Molecular Mechanics (MM)

In molecular mechanics, molecules are considered as a series of spheres (the atoms) connected by springs (the bonds). A large molecule consists of the same features we know about in small molecules, but combined in different ways. Equations, used in MM follow the laws of classical physics and are applied to nuclei without consideration of the electrons. The internal energies in MM are simply based on the Newtonian laws of classical mechanics. Equations derived from classical mechanics, are used to calculate the different interactions and energies (force fields) resulting from bond stretching, angle bending, non-bonded interactions, and torsional energies [9-10].

Etot = Estr + Ebend + Etor + Evdw + Eelec + …   - - - - (3)

Where Etot is the total energy of the molecule, Estr is the bond-stretching energy term, Ebend is the angle-bending energy term, Etor is the torsional energy term, Evdw is the van dar Waal energy term, Eelec is the electrostatic energy term.

Values of bond length, bond angle, bond stretch and so on, at equilibrium are also known as force constants. These are used in the potential energy functions defined in the force field which describes a set known as force field parameters. The total energy of a molecule increases at any deviation from these equilibrium values. Each conformation of a molecule has its own total energy and a difference between the total energy of two different conformations of the same molecule is used to determine molecular stability.

2.3.1 Bond-stretching

If we consider histamine (Fig 4) we can identify a variety of bond types including C(sp2)-C(sp2), C(sp2)-C(sp3), and so on [11].

Fig 4. 2D chemical structure of histamine
Fig 4. 2D chemical structure of histamine
Click to view

There is usually an energy (interatomic force) change when the bonds stretch and contract from their ideal unstrained length. This bond-stretch energy term is described by the equation given below.

Bond Stretching

Where Kb is the bond-stretching force constant, bo is the unstrained bond length, and b is the actual bond length.

2.3.2 Bond-bending/angle

Next, we have to consider the angle-bending vibrations. It is usual to write these as harmonic ones, typically for the connected atoms A-B-C.

Bond bending angle


KABC  (K is the angle-bending force constant, θe,ABC is the equilibrium value for the bond angle θ, and θABC is the actual value for θ.

2.3.3 Torsion angle

Torsional energies are associated with atoms that are separated from each other by three bonds. The relative orientation of these atoms is defined by the dihedral or torsion angle. The torsional angle ABCD between the four bonded atoms A, B, C and D is shown in Fig 5 [10].

Dihedral angle
Fig 5. Dihedral angle
Click to view

If we use χ to denote the angle between the four atoms, then a popular dihedral potential energy term is a cosine series given by

dihedral potential energy

Where Kj is the torsional barrier, X is the actual torsional angle,  the periodicity parameter, which would be 3 for a methyl group. Xe is the equilibrium (reference) torsional angle.

2.3.4 Non bonded interactions

MM force fields have to be transferable from molecule to molecule, therefore, the necessity of non-bonded interactions. These are usually subdivided into two types; Lennard-Jones and Coulomb’s interactions [5, 12-13].

2.3.5 Lennard-Jones interaction

The Lennard-Jones potential as shown in Fig 6, describes the interactions of two neutral particles using a relatively simple mathematical model. Two neutral molecules feel both attractive and repulsive forces based on their relative proximity and polarizability. The sum of these forces gives rise to the van dar Waal interactions usually represented as Lenard-Jones potential (V(R) or E), as seen below:

Lenard-Jones potential

Where ε is the potential well depth, σ is the distance where the potential equals zero (also double the van-der-Waals radius of the atom), and Rmin is the distance where the potential reaches a minimum, i.e. the equilibrium position of the two particles.

The resulting curve from this equation looks very similar to the potential energy curve of a bond.

Fig 6. The Lennard-Jones 6-12 potential (E) approximates
the intermolecular interactions of two atoms due to Pauli
repulsion and London dispersion attraction. The potential is
defined in terms of the well-depth (ɛ) and the intercept (σ).
Other formulations use the radius where the minimum
occurs, Rmin, instead of σ.
Click to view

2.3.6 Coulomb’s interaction

Coulomb interactions or electrostatic forces are involved in attraction or repulsion of particles or objects because of their net electric charge [5]. Coulomb noticed that forces acted along the line joining the centers of two charged bodies Qa and Qb, and that the forces were either attractive or repulsive depending on whether the charges were different or of the same type. The sign of the product of the charges therefore determines the direction of the forces.

Coulomb’s interaction

Where Qa and  Qb  are the atomic charges of the interacting atoms, r is the distance between the charged bodies and is a proportionality constant taken to be 1/(4πϵ0), where the permittivity of free space ϵ0 is an experimental determined quantity with the approximate value ϵ0 -8.854 x 10-12 C2N-1m-2.

The non-bonded term comprising Eelec and Evdw are a function of the distance between atom pair rij (non-bond cutoff distance).

To summarize the concept of MM, it is worthy to note the following;

  • The energies calculated by MM are of no meaning as absolute quantity. They are only of relevance when compared to the energies of another conformation of the same molecule.
  • MM calculations make use of data or parameters stored in tables within the program and that describe interactions between different sets of atoms.
  • MM is used to calculate: energies of a molecule’s conformation, energy minimization and energies of a molecular trajectory/motion.
  • MM is fast and less intensive on computer time relative to quantum mechanics.
  • Lastly, because MM does not consider electrons, it cannot calculate electronic properties.

2.3.7 Example of MM in Drug Design (Energy Minimization)

Molecular mechanics calculations are applied in several aspects of drug design including energy minimization, docking, molecular dynamics, etc. Energy minimization will only be considered.

Energy minimization (EM) is a process by which stable, low-energy conformations of a molecule are calculated using the MM program/approach. EM is often performed to avoid atomic clashes and locate most stable conformation of molecules (Fig 7) [514]. There is a possibility of unfavorable bonded and non-bonded interactions existing in a newly generated chemical structure. MM program calculates the energy of the new molecule then varies the bond lengths, bond angles, and torsion angles (this changes the geometry of the structure) and calculates the energy of the later structure. Comparison of the two energies of the first and later structure will show if a slight alteration in bond length or bond angle has effect on the overall energy of the molecule.  The MM program will perform more geometrical changes and eventually stop at a structure in which geometrical variation result in only slight changes in energy – an energy minimum.

Fig 7. Energy minimization profile. L represents points of local minima and G is the global minimum.
Fig 7. Energy minimization profile. L represents points of local
minima and G is the global minimum.
Click to view

2.4 Structure-Based Drug Design

Structure-based drug design (SBDD), also known as receptor-based drug design, is used when receptor (mainly enzymes/proteins) and ligand (small-molecule or drug) structures are both known [15]. The structure of the receptor can be determined by experimental methods such as X-ray crystallography or NMR. Alternatively, computational techniques such as threading or homology modeling can also be applied in obtaining structure of proteins whose structures are unavailable [16]. Then the binding site of the receptor is determined by either experimentally or computationally.  In most cases, the protein is co-crystallized with a ligand. So the pocket where the co-crystallized ligand is located is considered the binding site of the protein [17]. A well characterized protein binding site, such as the interaction between the amino acids at the binding site of the protein and the ligand, can give information vital in designing of novel ligands or docking of putative ligand molecules.

The selective binding which occurs between small molecule ligand and a specific protein target is the basis of physiological activity and pharmacological actions of the ligand. Structural and energetic factors govern this binding event and are respectively captured in computational techniques by the prediction of binding mode/conformation (docking) and scoring of protein-ligand complexes.

2.4.1 Predicting Binding Mode - Docking

Docking is a computational technique used to predict, with a substantial degree of accuracy, the conformation (binding mode/pose) of ligands within the appropriate target-protein binding site [18]. It constitutes, therefore, a major technique employed in virtual screening by structure-based drug design [19]. Docking searches for different energetically permitted binding poses of a ligand at the protein active site by performing a number of trials. At the end of the trial searches, a pose is retained based on the calculated receptor-ligand interaction energy (score) of that pose. Conventionally, several poses of a molecule is generated by docking method and the score of each pose calculated using a scoring function usually affinity scoring function (Fig 8).  The dock-score of each pose of the molecule is only of importance to medicinal chemist in comparison to other, that is, when prioritized. Placing the score in decreasing order is termed ranking. The pose with the lowest theoretical binding affinity is considered the best [20].

2.4.2 Scoring and Scoring Function

Almost all available scoring functions can be grouped into two categories: knowledge-based scoring functions and energy component methods [21]. In knowledge-based scoring function, statistical tools are used to compute the interatomic contact frequencies and/or distances in a database of crystal structures of protein-ligand complexes. Ligand binding affinity towards the target protein is assumed to be favored by molecular interactions that are close to the frequency maxima of the interactions in the database and vice versa. The observed frequency distributions are converted to what is called mean force or knowledge-based potentials. PMF, DrugScore, SmoG and Bleep are examples of knowledge-based potentials that predict binding affinity [22-24].

Outline of the molecular docking system
Fig 8. Outline of the molecular docking process.
(A) Three-dimensional structure of the ligand; (B) Three-dimensional
structure of the receptor; (C) The ligand is docked into the binding
cavity of the receptor and the putative conformations are explored;
(D) The most likely binding conformation and the corresponding
 intermolecular interactions are identified. The protein backbone is
represented as a cartoon. The ligand (carbon in magenta) and active
site residues (carbon in blue) are shown in stick representation.
Water is shown as a white sphere and hydrogen bonds
are indicated as dashed lines.
Click to view

They mainly differ in the type of molecular interaction that were considered and in the size of the training database used.

Scoring functions based on the energy component methods assumes that the change in free energy upon binding of a ligand to its target can be decomposed into a sum of individual contributions:

Where ΔGint is the specific ligand-receptor interactions, ΔGsolv is the interactions of ligand and receptor with solvent, ΔGconf  is the conformational changes in the ligand and receptor and ΔGmotion is the protein and the ligand motion during the complex formation

Several applications using these methods to predict binding affinity have been developed such as LUDI, ChemScore, Validate, GOLD score, PLP, FlexX score, ScreenScore, AutoDock3 and so on [24].

2.4.3 Effect of Water-Solvation Energy

The biological system is made up of 70 % of water molecules. These water molecules play essential roles in the formation of protein-ligand complex in number of ways: They can mediate the contact between protein and a small molecule ligand by providing additional hydrogen bonds to the ligand. They can promote adaptability by allowing for promiscuous/ off-target ligand binding due to steric constraint. Consequently, the displacement of these water molecules by appropriate ligand functional moieties may be favourable to protein-ligand complex formation. Therefore, docking method which recognizes the explicit effects of structural water molecules and water-mediated interactions is highly desirable. Examples of such docking software are FlexX and SLIDE, etc [25].

Another method of drug design based on the knowledge of a biological target (structure-based drug design) is de novo method [26]. De novo design techniques are used when receptor structure is known and ligand structures are unknown. In this method, novel pharmaceutical active agents capable of interacting with a given receptor are computationally generated based entirely on the knowledge of the protein binding site. The proposed de novo model can be used to search large databases to identify compound fragments that can interact with specific sites in the receptor. GROW and LEGEND are examples of programs used in these techniques.

2.5 Molecular Dynamics

Target proteins are generally kept static while ligands are allowed to move about in docking. This is known as fixed-target-flexible-ligand docking and it is the type characterized in almost all virtual screening [27]. However, we know that biomolecules are dynamic in nature; therefore, docking is insufficient tool in computational predictions [28]. Structural dynamism of targets is accounted for through generations of multiple conformations of target by molecular dynamic techniques, Monte Carlo sampling, simulated annealing and even NMR [29].

In reality, atoms of molecules are never stable. Therefore, it is paramount to account for structural dynamism in attempt to describe biochemical systems. Dynamism of atoms of molecules in biological processes is recognized/computated by performing conformational sampling [30]. In molecular dynamic calculation, Newton second-order equation of motion (10) is solved for atom i with mass mi typically within a system of interacting forces and subjected to a net force Fi.

Molecular Dynamics

is the second derivative of the positional vector calculated with respect to time. MD simulation is a deterministic method. Exact solutions for state properties can be derived from MD simulation at a given time and after specifying the initial set of system conditions. Once the starting atomic positions have been specified (typically obtained from x-ray crystallography and NMR spectroscopy), velocities are assigned according to a Maxwellian distribution as given in equation 11.

Where P(v) is the probability, m and v are respectively the atomic mass and velocity, while k is the Boltzmann constant. According to the equipartition theorem, the system temperature T is related to the velocities (12) and (13),

with the system kinetic energy, represented by Ekin and a representing the xyz coordinates. In principle, by correctly assigning the temperature T, according to the Maxwellian distribution the system under study becomes capable of dynamically evolving in a fashion similar to real life systems undergoing thermal motion [31].

Nowadays, one of the many applications of MD simulation technique in drug design and development is in investigating both structural and temporal stability of drug-receptor complexes under modeled experimental conditions such as solvent system, ionic concentration, temperature, and pressure. This becomes increasingly important in investigating stability of ligand-receptor complex as predicted by docking because the mere occurrence of binding may not always indicate the survival of such interaction on a time scale that is sufficient for altering physiological responses [32].The predicted drug-receptor complex from docking calculation is considered stable if MD-generated drug conformations do not deviate by more than a given root mean standard deviation (rmsd), usually 2-3 Å. Additionally, MD has been applied to sample potential conformational states for a molecular target that has no suitable available crystallographic structures (structures with inaccessible or poorly defined binding sites). These samples conformations of the target (with accessible and well defined binding site cavities) can then be selected for molecular docking. Lastly, when a drug binds to a receptor, complex structure (drug-receptor complex) is formed which is more stable and causes equilibrium to shift towards the minimum energy complex structure. MD technique can be used to alternatively produce conformational states corresponding to these ligand-induced structures.

2.6 Ligand-Based Drug Design

When a receptor for a disease is unknown or the 3D structure is unavailable but a single active molecule is known then similarity searching is carried out in what is known as ligand-based virtual screening. In a situation where several actives are available then it may be possible to identify a common 3D pharmacophore, followed by a 3D database search [33]. If a reasonable number of active and inactive structures are known they can be used to train a machine learning technique such as a neural network which can then be used for virtual screening.

2.7 Pharmacophoric Screening

Ehrlich first defined pharmacophore as “a molecular framework that carries the essential features responsible for a drug’s biological activity” [34]. In pharmacophore-based screening, a typical of ligand-based drug design, a pharmacophore model is built which consist of how the positions of key amino acids will be in the active site of a target protein, feature type and direction of an active ligand. For instance, a key amino acid that acts as hydrogen bond donor should be in the location of a hydrogen bond acceptor feature in the pharmacophore model. A pharmacophore modeling cuts across ligand-based and structure-based. If the model is developed based on the knowledge of the ligand i.e. where several different known active molecules are used to identify the common important features, or from the target protein structure. However, pharmacophore model is categorized as structure-based when it is built based on the knowledge of the protein target structure [35].

2.8 SAR, QSAR and 3D-QSAR

Structure-Activity Relationship (SAR) approach attempts to explain the biological activity of a drug molecule as dependent on its molecular structure. Whereas Quantitative Structure-Activity Relationship (QSAR) moves further to examine the actual structure, characterize and quantify their physicochemical properties in numeric indices as parallel to their biological activity [36]. Crum-Brown and Fraser [37] were the first to publish equation in the field of QSAR, which set forth the idea that the biological activity of a compound Φ is a function of its structural properties C.

Φ = f (C) -  -  -  -   (14)

To account for the effect of 3D molecular shape to biological activity, 3D-QSAR was developed and added to QSAR models [38]. All of these efforts by SAR, QSAR and 3D-QSAR enable scientists involved in drug research to suggest mechanism of drug action and make predictions of more profitable areas for drug synthesis. This means that QSAR models allow the calculation of biological properties of novel analogues in advance, so that only the ones with improved potency get to be synthesized. Also, if an analogue is found which defies the model, it suggests that some other factors are important and that provides a lead for further studies [39].

Several physicochemical parameters can be calculated in developing a QSAR model. However, the most common parameters used are hydrophobic, electronic and steric properties [40].  Due to the complexity that could emerge in calculating all these properties simultaneously and relating them to biological activity, each of the properties is varied one at a time while the rest is kept constant.  In a simple case, several compounds with varying physicochemical properties (e.g. log P) are prepared and tested to investigate how these affect the biological activity (log 1/C). A graph of log 1/C vs log P is plotted and with the help of statistical tool (usually linear regression analysis by least square method), QSAR equations/models are developed. The regression or correlation coefficient (r) is a measure of how well the physicochemical parameter present in the equation explains the observed variance in biological activity. r values ≥ 0.9 are considered good fit while for regression coefficient quoted as r2, r2 ≥ 0.8 is taken as a good fit. Other statistical measures calculated to ensure goodness of fit include standard deviation, F-tests, etc.

2.8.1 Hydrophobicity

Drugs have to cross lipid-soluble regions such as cell membranes and fatty tissues, compete with metabolism process, which are fast for lipophilic drugs and with excretion process which are fast for water-soluble drugs before getting to their receptor [41]. Therefore, hydrophobic/lipophobic character of drugs play vital role to their biological effects, hence, it is necessary to predict this quantitatively. Hydrophobic property of a drug is determined experimentally by testing the drug’s relative distribution in an n-octanol/water mixture vis-à-vis its biological activity.

Where P is the relative distribution also known as partition coefficient

The biological activity is generally expressed as 1/C, where C is the concentration of drug required to achieve a defined level of biological activity. The reciprocal of the concentration (1/C) is used because more active drugs will achieve a defined biological activity at lower concentration.

A plot of log (1/C) vs log P gives a straight line (Fig 9a) for cases with small range of log P, i.e. log P = 1-4, give equation 16:

Where K1 and K2 are proportionality constants.

Typical model relating biological to log P in small range (a) and large range (b)
Fig 9. Typical model relating biological to log P in small
range (a) and large range (b)
Click to view

Parabolic curve is obtained when biological activity is plotted against log P when P is large (Fig 9b) [42]. If the partition coefficient is the only factor influencing biological activity, the parabolic curve can be expressed by the equation (17):

Take for example QSAR equation derived when a set of large nonspecific, nonionic substances comprising of alcohols, ethers, and amides were tested for a narcotic effect on tadpoles.

The equation shows that fat solubility determines the accumulation of molecules in the nerve tissues of the tadpole which in turn influences their anesthetic potency.  The r value indicates that the line resulting from the equation is a good fit. All the substances examined have their log P values on the upward slope of the parabolic curve. It would seem as though continuous increase in the log P will bring about an unending increase in biological effect. This is not so rather an optimum value of log P given in Fig 9b as log P0 is observed. In fact, any further increase in log P after log P0 results in declining biological effect.

Consider the anesthetic action of ethers whose mechanism of action do not involve drug-receptor interaction but solely on their ability to cross lipid membranes (cell membranes). The QSAR model/equation (19) for the ethers was generated as given below:


This equation can then be used to predict the anesthetic property of novel compounds. None the less, it should be noted that each QSAR model only applies to a series of compounds which have the same general structure.  In the example, the model is derived solely for anesthetic ethers and therefore not applicable to other structural type of anesthetics.

The current progress in this area now makes it possible to calculate log P by computing the contributions that various substituents make to hydrophobicity. This helps to save resources and time because only substituents which make positive contribution to log P is synthesized.

2.8.2 Electronic Effect

The ionization and polarity of a drug molecule are influenced by the electronic state (electron withdrawing and donating property) of its various substituents. This in turn could affect drug’s transport to the receptor neighborhood and the drug-receptor interaction. For aromatic rings, the measure of electronic withdrawing and donating ability of a substituent is given by σ, known as Hammett substituent constant. The σx for a particular substituent (x) is defined by the equation:


Where KH is the equilibrium or dissociation constant; subscript H signifies that there are no substituents on the aromatic ring. Kx is dissociation constant of the analogue bearing the x substituent. Note that Kx can either be smaller than KH which happens when the substituent is electron donating group and vice versa. This leaves σx either as negative or positive values respectively.

Due to the fact that σx takes resonance and inductive effects into account, its values depends on the position of the substituent on the parent aromatic compound.  σp and  σm symbolize substituents at para and meta positions respectively. For example; the electron withdrawing power of nitro group in meta-nitrobenzene is due to inductive effect only (σm = 0.71). But in para-nitrobenzene, both inductive and resonance influence participate (σp = 0.78). The same account for the discrepancies in Hammett substituent constant observed for hydroxyl group at meta (σ

ISSN: 2659 - 1472


  • Original Reseach Articles ( 1 )
  • Reviews ( 1 )
  • Mini Reviews ( 0 )

Special features of

  • Accepting papers in areas of Pharmaceutical and Life Sciences related to Pharmaceutical Development and Manufacturing
  • Provides an online platform for sharing research information among our authors. Digital Pharm Research Network
  • Information on Conferences, Fellowships, Scholarships, Travel Grants