biorxiv-database

An efficient database search tool for the bioRxiv preprint server. Use this skill to search for life sciences preprints by keyword, author, date range, or category, to retrieve paper metadata, to download PDFs, or to conduct literature reviews.

Category

Other Tools

Install

Download and extract to your skills directory

Copy command and send to OpenClaw for auto-install:

Download and install this skill https://openskills.cc/api/download?slug=k-dense-ai-scientific-skills-biorxiv-database&locale=en&source=copy

bioRxiv Database - Life Sciences Preprint Search Tool

Overview of Capabilities


bioRxiv Database is an efficient Python-based search tool for retrieving and downloading life sciences papers from the bioRxiv preprint server, supporting searches by keywords, authors, date ranges, and subject categories.

Applicable Scenarios

  • Literature Reviews and Research Surveys

    Quickly search recent preprints in specific research areas (e.g., CRISPR, neuroscience, genomics), obtain metadata and abstracts of papers that have not yet been formally published, and assist systematic literature reviews.

  • Author Publication Tracking

    Search for all preprints an author has posted on bioRxiv by name, with support for filtering by time range, making it easy to track a researcher's latest work.

  • Research Trend Analysis

    Retrieve papers by date range and subject category to analyze research hotspots and publication trends in specific fields, and obtain structured JSON data for subsequent data processing.
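As an illustration of the trend analysis described above, here is a minimal sketch that counts results per subject category and per month. It assumes each result is a dictionary with `category` and `date` (ISO `YYYY-MM-DD`) fields; those field names and the helper functions are assumptions for illustration, not the skill's confirmed output schema, and the sample records are placeholders rather than real preprints.

```python
from collections import Counter


def count_by(records, key):
    """Count records per value of `key`, skipping records that lack it."""
    return Counter(r[key] for r in records if r.get(key))


def count_by_month(records, date_key="date"):
    """Count records per YYYY-MM, assuming ISO dates such as '2024-03-15'."""
    return Counter(r[date_key][:7] for r in records if r.get(date_key))


# Hypothetical records shaped like search results (placeholder DOIs).
sample = [
    {"doi": "10.1101/0000.00.00.000001", "category": "genomics", "date": "2024-03-15"},
    {"doi": "10.1101/0000.00.00.000002", "category": "genomics", "date": "2024-03-20"},
    {"doi": "10.1101/0000.00.00.000003", "category": "neuroscience", "date": "2024-04-02"},
]

by_category = count_by(sample, "category")
by_month = count_by_month(sample)

print(by_category.most_common())   # categories ordered by paper count
print(sorted(by_month.items()))    # chronological monthly counts
```

The same counters can feed a plot or a table once real search results replace the sample list.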

Core Features

  • Multi-dimensional Search

    Supports keyword searches (title, abstract, author), author searches, exact DOI queries, and date range searches. Results can be filtered by the 27 bioRxiv subject categories and include complete metadata: DOI, title, authors, abstract, PDF link, and more.

  • Bulk PDF Download

    Supports single or bulk downloading of preprint PDF files and provides a Python API that can be integrated into automated workflows for offline reading and full-text analysis.

  • Structured Data Output

    All search results are returned in standard JSON format, making them easy to integrate with Pandas, reference management software, and other tools for data export, statistical analysis, and visualization.
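Because results are plain JSON, exporting them for a spreadsheet or reference manager needs only the standard library. The sketch below converts a JSON array of records to CSV. It assumes the output is a top-level list of objects and that the field names in `FIELDS` exist; both are assumptions for illustration, and `results_to_csv` is a hypothetical helper, not part of the skill.

```python
import csv
import io
import json

# Assumed field names; adjust to match the actual search output.
FIELDS = ["doi", "title", "authors", "date", "category"]


def results_to_csv(json_text, fields=FIELDS):
    """Convert a JSON array of result objects into CSV text.

    Fields missing from a record become empty cells; extra fields are dropped.
    """
    records = json.loads(json_text)
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fields, lineterminator="\n")
    writer.writeheader()
    for record in records:
        writer.writerow({name: record.get(name, "") for name in fields})
    return buffer.getvalue()


# Hypothetical search output with a placeholder DOI.
sample_json = json.dumps([
    {
        "doi": "10.1101/0000.00.00.000001",
        "title": "Example title",
        "authors": "Doe, J.; Roe, R.",
        "date": "2024-03-15",
        "category": "genomics",
        "abstract": "Placeholder abstract.",
    }
])

csv_text = results_to_csv(sample_json)
print(csv_text)
```

If Pandas is available, passing the parsed list to `pandas.DataFrame` gives the same table for further analysis.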

Frequently Asked Questions

What is a bioRxiv preprint?

bioRxiv is a preprint server for the life sciences where researchers post draft manuscripts that have not yet undergone peer review, with the goal of rapidly disseminating research findings. Preprints are often later submitted to formal journals for publication.

How do I search for papers on bioRxiv?

You can search by keywords, author name, date range, or DOI. For example, to search for CRISPR-related preprints in the genomics category from the last 90 days:

    python scripts/biorxiv_search.py --keywords "CRISPR" --days-back 90 --category genomics
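To make the combination of keyword, category, and look-back filters concrete, here is a minimal sketch of that filtering logic applied to records already held in memory. The script's actual implementation is not shown in this document, so the `matches` and `search` functions, the field names, and the choice to match any keyword (rather than all of them) are assumptions for illustration.

```python
from datetime import date, timedelta


def matches(record, keywords, category=None, since=None):
    """Return True if the record satisfies the keyword, category, and date filters."""
    text = " ".join(
        str(record.get(field, "")) for field in ("title", "abstract", "authors")
    ).lower()
    if not any(keyword.lower() in text for keyword in keywords):
        return False
    if category and record.get("category", "").lower() != category.lower():
        return False
    if since and date.fromisoformat(record["date"]) < since:
        return False
    return True


def search(records, keywords, category=None, days_back=None, today=None):
    """Filter records by keywords, an optional category, and an optional look-back window."""
    today = today or date.today()
    since = today - timedelta(days=days_back) if days_back is not None else None
    return [r for r in records if matches(r, keywords, category, since)]


# Hypothetical records with placeholder DOIs.
sample = [
    {"doi": "10.1101/0000.00.00.000001", "title": "CRISPR screening in human cells",
     "category": "genomics", "date": "2024-03-15"},
    {"doi": "10.1101/0000.00.00.000002", "title": "CRISPR base editing",
     "category": "genomics", "date": "2023-10-01"},
    {"doi": "10.1101/0000.00.00.000003", "title": "Cortical dynamics",
     "category": "neuroscience", "date": "2024-03-20"},
]

# A fixed `today` keeps the example deterministic.
hits = search(sample, ["CRISPR"], category="genomics", days_back=90,
              today=date(2024, 4, 1))
print([r["doi"] for r in hits])
```

Only the first record passes all three filters: the second falls outside the 90-day window and the third matches neither the keyword nor the category.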

Can I bulk download bioRxiv paper PDFs?

Yes. Bulk downloads go through the Python API: first run a search to get the list of papers, then extract the DOIs and download the PDFs one by one. To download a single paper directly, use the --download-pdf parameter.
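The search-then-download loop described above might be sketched as follows. The skill's own Python API is not documented here, so `pdf_url`, `pdf_filename`, and `download_pdfs` are hypothetical helpers. The URL pattern (DOI plus a version suffix and `.full.pdf` under `www.biorxiv.org/content/`) is an assumption about how bioRxiv serves PDFs and should be checked against the PDF link returned in the search metadata.

```python
import time
import urllib.request
from pathlib import Path


def pdf_url(doi, version=1):
    """Build a PDF URL from a DOI (assumed pattern; prefer the link in the metadata)."""
    return f"https://www.biorxiv.org/content/{doi}v{version}.full.pdf"


def pdf_filename(doi):
    """Turn a DOI into a filesystem-safe file name."""
    return doi.replace("/", "_") + ".pdf"


def download_pdfs(dois, out_dir, fetch=urllib.request.urlretrieve, delay=1.0):
    """Download PDFs one by one, pausing between requests.

    `fetch(url, path)` is injectable so the loop can be exercised without network access.
    Returns (saved_paths, failures), where failures pairs each DOI with its error message.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    saved, failed = [], []
    for doi in dois:
        target = out / pdf_filename(doi)
        try:
            fetch(pdf_url(doi), str(target))
            saved.append(target)
        except OSError as exc:  # urllib's URLError and HTTPError subclass OSError
            failed.append((doi, str(exc)))
        time.sleep(delay)  # be polite to the server
    return saved, failed


# Usage (performs real network requests; the DOIs would come from a prior search):
# saved, failed = download_pdfs(["10.1101/0000.00.00.000001"], "pdfs")
```

Collecting failures instead of raising lets a long batch finish and report which DOIs need a retry.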

What subject categories does bioRxiv have?

bioRxiv has 27 subject categories: Animal Behavior and Cognition, Biochemistry, Bioengineering, Bioinformatics, Biophysics, Cancer Biology, Cell Biology, Clinical Trials, Developmental Biology, Ecology, Epidemiology, Evolutionary Biology, Genetics, Genomics, Immunology, Microbiology, Molecular Biology, Neuroscience, Paleontology, Pathology, Pharmacology and Toxicology, Physiology, Plant Biology, Science Communication and Education, Synthetic Biology, Systems Biology, and Zoology.