gene-database

Query NCBI Gene via E-utilities/Datasets API. Search by symbol/ID, retrieve gene info (RefSeqs, GO, locations, phenotypes), batch lookups, for gene annotation and functional analysis.

Category

Other Tools

Install

Hot:12

Download and extract to your skills directory

Copy command and send to OpenClaw for auto-install:

Download and install this skill https://openskills.cc/api/download?slug=k-dense-ai-scientific-skills-gene-database&locale=en&source=copy

Gene Database - NCBI Gene Information Query Tool

Overview


Gene Database is a convenient NCBI Gene database query tool that supports retrieving gene information via E-utilities and the Datasets API, including gene symbol lookup, sequence retrieval, functional annotation, and batch processing capabilities.

Use Cases

1. Gene Annotation and Functional Analysis


Researchers working on genomics projects need to quickly obtain detailed gene annotation information. This tool can retrieve complete information such as gene names, aliases, reference sequences (RefSeqs), Gene Ontology (GO) annotations, and chromosomal locations. It is suitable for gene function studies, pathway analysis, and variant interpretation.

2. Batch Gene Data Processing


When a project involves a large number of genes, querying them one by one is very time-consuming. This tool provides batch query functionality that can process hundreds of gene symbols or Gene IDs at once, automatically handling rate limits. It is suitable for gene panel validation, building annotation tables, and cross-referencing gene identifiers.

3. Sequence Data Retrieval


Whether you need transcript sequences, protein sequences, or genomic sequences, this tool allows you to quickly obtain them. It supports output in multiple formats such as JSON, XML, and FASTA, meeting the needs of downstream sequence analysis, alignment, and visualization.

Core Features

1. Flexible Gene Search


Supports searching by gene symbol, Gene ID, disease association, GO terms, chromosomal location, and other methods. Species scope can be specified to avoid gene symbol ambiguity. Provides both E-utilities and Datasets API query options to accommodate queries from simple to complex.

2. Comprehensive Gene Information Retrieval


Retrieves comprehensive information including gene names, aliases, reference sequences (mRNA and protein), chromosomal localization, GO annotations, and related literature. The Datasets API can return metadata and sequences in a single request, simplifying data processing workflows.

3. Efficient Batch Processing


Includes built-in batch query scripts that support reading gene lists from files or accepting ID lists directly. Automatically handles API rate limits (up to 10 requests/sec when using an API key), ensuring stable performance for large-scale queries. Suitable for validating gene lists and bulk retrieval of annotation information.

Frequently Asked Questions

How do I obtain an NCBI API key?


After registering an NCBI account, visit https://www.ncbi.nlm.nih.gov/account/ to obtain a free API key. Using an API key can increase the query rate from 3 requests/sec to 10 requests/sec, greatly improving batch processing efficiency.

What is the difference between E-utilities and the Datasets API?


E-utilities is NCBI’s traditional API, supporting complex queries across all Entrez databases and offering greater flexibility. The Datasets API is a newer interface optimized for gene data and can return complete metadata and sequence data in a single request, making it simpler to use. For simple queries, the Datasets API is recommended; for complex cross-database queries, use E-utilities.

Which data output formats are supported?


The tool supports multiple formats including JSON, XML, GenBank, FASTA, and plain text. JSON is suitable for programmatic processing, XML preserves full metadata hierarchy, and FASTA is optimized for sequence analysis. Choose the output format that best fits your downstream tools.