Virtual screening is a computational technique used in drug discovery to evaluate large libraries of small molecules or compounds to identify potential drug candidates that can bind to a biological target, such as a protein. Virtual screening can complement traditional high-throughput screening in the quest for new therapeutics.
Virtual screening approaches include structure-based (using a known 3D conformation of the target protein) and ligand-based virtual screening (using known active molecules as templates).
Structure-based virtual screening (SBVS) is a technique used in computational chemistry when the 3D structure of the target protein is known, often obtained from experimental techniques like x-ray crystallography or cryogenic electron microscopy (cryo-EM). The primary method used in SBVS is molecular docking, or molecular ligand docking, where virtual screening software simulates the binding of small molecules (ligands) to a target protein substructure, such as a kinase active site. The goal is to predict the best orientation, interactions, and fit between the ligand and the protein. Scoring functions are then applied to evaluate the strength of these interactions, helping prioritize compounds with the highest predicted binding affinity. The principle here is to identify compounds that can effectively bind to and act as an inhibitor or modulator of the target's bioactivity, which may be helpful as potential drugs.
In cases where the 3D structure of the target protein is unavailable, researchers use a cheminformatics approach called ligand-based virtual screening (LBVS). This approach identifies compounds similar to known active ligands based on their chemical structure or properties. The principle here is that compounds structurally similar to known active compounds are likely to exhibit similar biological activity in protein-ligand interactions. Two key methods used in LBVS include quantitative structure-activity relationship (QSAR) modeling, which correlates molecular properties with biological activity, and pharmacophore modeling, which identifies essential chemical features responsible for biological activity. These features are then used to screen compound libraries for molecules with similar characteristics.
Using virtual screening, researchers can significantly reduce the time and cost associated with traditional wet-lab high-throughput screening (HTS) assays. Instead of testing millions of compounds in the lab, virtual screening allows for the rapid evaluation of thousands or millions of compounds in silico, focusing only on the most promising candidates for experimental validation.
A few key design aspects guide the identification of potential drug candidates in virtual screening:
These steps allow researchers to efficiently filter and evaluate large molecular datasets, streamlining the discovery of viable drug candidates.
The hit rate of virtual screening refers to the percentage of compounds identified as hits out of the total screened. The hit rate can vary widely depending on the quality of the compound library, the target protein, and the screening methods used. Generally, hit rates for virtual screening are low, often ranging between 0.1% and 5%. However, the hit rate can improve with higher-quality target structures and more sophisticated molecule scoring algorithms, among other techniques.
Generative virtual screening is a computational method that uses generative AI to design and optimize chemical structures with desirable properties. It offers several advantages over traditional virtual screening in terms of efficiency, flexibility, and accuracy.
Traditional HTS methods may yield more hits but are more resource-intensive. Virtual screening, on the other hand, efficiently narrows down large datasets, making the process faster and more cost-effective.
Here’s how traditional and generative virtual screenings compare.