ChEMBL Database Skill - Querying Bioactive Molecules and Drug Discovery Data

ChEMBL Database Skill Details

Skill Overview

The ChEMBL Database skill allows you to query the ChEMBL bioactivity molecule database maintained by the European Bioinformatics Institute (EBI) via a Python client, accessing information on over 2 million compounds, 19 million bioactivity measurements, and 13,000+ drug targets for drug discovery and medicinal chemistry research.

Use Cases

1. Lead Compound Discovery and Screening

When you need to find potential inhibitors or active compounds for a specific drug target, you can query biological activity data such as IC50 and Ki by target to screen for high‑activity lead compounds. Filtering by properties such as molecular weight and LogP is supported to quickly locate candidate molecules that meet drug‑like criteria.

2. Structure-Activity Relationship (SAR) Studies

When you need to analyze how structural changes affect biological activity, you can find related compounds via similarity or substructure searches and retrieve their activity data for comparative analysis. Flexible queries based on SMILES structures are supported.

3. Drug Information Lookup and Mechanism Exploration

When you need to understand the mechanism of action, indications, and target information of approved or clinical candidate drugs, you can query the drug database to obtain mechanisms, bioactivity data, and clinical application information, supporting drug repurposing research.

Core Features

1. Multidimensional Compound Search

Supports exact or fuzzy searches by ChEMBL ID, compound name, and molecular properties (molecular weight, LogP, number of hydrogen bond donors/acceptors, etc.), enabling access to detailed structural information and property data for over 2 million compounds.

2. Bioactivity Data Retrieval

Provides query functions for standard activity types such as IC50, Ki, and EC50; supports filtering by target, compound, and activity thresholds. Results include standardized activity units and pChEMBL values for easy analysis.

3. Structure-Based Search

Supports SMILES‑based similarity searches (with adjustable similarity thresholds) and substructure searches to quickly find structurally related compounds for analog design and SAR analysis.

Frequently Asked Questions

What is the ChEMBL database?

ChEMBL is a manually curated bioactive molecule database maintained by the European Bioinformatics Institute (EBI), containing over 2 million compounds, 19 million bioactivity measurements, and 13,000+ drug targets. It covers approved drugs, clinical candidates, and bioactive compounds and is an important global resource for drug discovery research.

How do I query inhibitors for a specific target?

First search by target name to obtain the target’s ChEMBL ID, then use the activity endpoint to filter activity data for that target. You can set the standard activity type (e.g., IC50) and activity thresholds (e.g., ≤100 nM) to select highly active inhibitors, and finally retrieve compound IDs to get molecular details.

Is ChEMBL data free to use?

The ChEMBL database is completely open and free; it can be accessed via the Python client or REST API without registration. The Python client automatically caches results (default 24 hours) to improve performance and reduce API calls, making it suitable for academic research and industrial drug development.

chembl-database

Author

Category

Install