clinvar-database

Query NCBI ClinVar for variant clinical significance. Search by gene/position, interpret pathogenicity classifications, access via E-utilities API or FTP, annotate VCFs, for genomic medicine.

Category

Other Tools

Install

Hot:13

Download and extract to your skills directory

Copy command and send to OpenClaw for auto-install:

Download and install this skill https://openskills.cc/api/download?slug=k-dense-ai-scientific-skills-clinvar-database&locale=en&source=copy

ClinVar Database Skill - Querying and Interpreting Clinical Significance of Genetic Variants

Skill Overview

The ClinVar Database skill provides capabilities to query, interpret, and process data from the NCBI ClinVar database, helping users retrieve clinical significance classifications of genetic variants, access the E-utilities API, download bulk data, and annotate VCF files.

Applicable Scenarios

  • Querying clinical significance of genetic variants
  • When you need to verify whether a variant in a genetic test report has clinical significance, this skill lets you quickly search ClinVar for variant records in specific genes (e.g., BRCA1, TP53) to obtain pathogenicity classifications (Pathogenic, Likely Pathogenic, VUS, Benign, etc.) and review status star ratings.

  • Bulk data download and local database construction
  • Suitable for building a local ClinVar database or integrating ClinVar into bioinformatics pipelines. Download full datasets via FTP in XML, VCF, or tabular formats (updated monthly). Both GRCh37 and GRCh38 genome builds are supported, enabling offline processing of millions of variant records.

  • Clinical annotation of VCF files
  • In variant-calling analyses, use ClinVar VCF files downloaded with this skill and annotate your variant calls with clinical significance using bcftools, filtering for pathogenic variants for further interpretation.

    Core Features

  • Multiple ways to query ClinVar
  • Supports querying variant information via the NCBI web interface, the E-utilities API, or local data files. You can search by gene name, clinical significance, disease name, chromosome position, and other combined dimensions. curl and Biopython examples are provided.

  • Interpretation of clinical significance
  • Provides explanations of the ACMG/AMP five-tier pathogenicity classification (Pathogenic, Likely Pathogenic, Variant of Uncertain Significance, Likely Benign, Benign) and ClinVar’s star-based review status system, helping users understand the reliability of variant data at different evidence levels.

  • Multi-format data handling
  • Supports three major data formats: XML (most comprehensive), VCF (suitable for pipeline integration), and tab-delimited tables (suitable for quick analyses). Offers code examples using Python, bcftools, pandas, and other tools covering data parsing, filtering, annotation, and summary statistics.

    Frequently Asked Questions

    What is ClinVar?

    ClinVar is a free public database maintained by the National Center for Biotechnology Information (NCBI) that collects reports linking human genetic variants to phenotypes/diseases and provides standardized clinical significance classifications. It is widely used in clinical genetics and genomics research.

    How should I interpret ClinVar’s pathogenicity classifications?

    ClinVar adopts the ACMG/AMP guideline five-tier system: Pathogenic indicates the variant causes disease (approximately 99% probability); Likely Pathogenic indicates it is very likely to be disease-causing (approximately 90% probability); VUS (Variant of Uncertain Significance) means current evidence is insufficient to make a determination; Likely Benign and Benign indicate the variant is unlikely or definitively not disease-causing. Also pay attention to the star ratings (★ to ★★★★); higher stars indicate more rigorous review and more reliable results.

    Can ClinVar data be used directly for clinical diagnosis?

    No. ClinVar provides reference information on variants and cannot replace the judgment of qualified genetic counselors or clinical geneticists. For decisions affecting patient health, you must consult qualified genetics professionals. Additionally, consider the review status (prefer ratings of three stars or higher), check for conflicting interpretations, and confirm that the genome build (GRCh37 or GRCh38) matches your data.