baoyu-danger-gemini-web

Skill Overview

Generates AI content through a reverse-engineered Gemini Web API. It supports text generation, image generation, visual understanding of reference images, and multi-turn conversations.

Use Cases

Claude Skill Image Backend: When other skills require image generation capabilities, it can be integrated as a backend service

Free AI Content Creation: Scenarios where you need to generate text or images but want to avoid official API costs

Multimodal AI Interaction: Applications that need to handle both text and visual inputs while maintaining conversation context

Core Features

Image Generation: Generates images from text descriptions and supports image-to-image variations based on reference images

Text Generation: Generates text with Gemini models, with a choice of model (gemini-3-pro, gemini-2.5-pro, gemini-2.5-flash)

Session Management: Supports multi-turn conversations, automatically saving and restoring session state so that conversational context carries over between turns

Frequently Asked Questions

What is baoyu-danger-gemini-web?

It is an AI generation tool that exposes Gemini model capabilities through a reverse-engineered Gemini Web API. It provides text generation, image generation, and reference-image input, so you can call Gemini without using the official API.

How do I generate images with Gemini?

Run the command npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "your description" --image output.png. The generated image is written to output.png. On first use, a browser window opens automatically so you can complete Google login authentication.
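
When another skill uses this one as an image backend, it can help to assemble the command arguments in code rather than by string concatenation, so that prompts containing spaces or quotes are passed through intact. The following is a minimal sketch, not part of the skill itself: the function name buildGeminiImageArgs and its parameters are hypothetical, and only the script path and the --prompt and --image flags come from the command above.

```typescript
// Hypothetical helper (not part of the skill): builds the argument list for
// `npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "..." --image <file>`.
// Only --prompt and --image are taken from the documented command.
function buildGeminiImageArgs(
  skillDir: string,
  prompt: string,
  outputPath: string,
): string[] {
  if (prompt.trim() === "") {
    throw new Error("prompt must not be empty");
  }
  if (outputPath.trim() === "") {
    throw new Error("outputPath must not be empty");
  }
  // Each value is its own array element, so no shell quoting is needed
  // when the array is handed to a process-spawning API.
  return [
    "-y",
    "bun",
    `${skillDir}/scripts/main.ts`,
    "--prompt",
    prompt,
    "--image",
    outputPath,
  ];
}
```

The returned array could then be passed to a process-spawning call such as Node's child_process.spawnSync("npx", args, { stdio: "inherit" }). Inheriting stdio keeps any first-run login or disclaimer prompts visible to the user.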

Does the Gemini Web API require payment?

This skill calls the reverse-engineered Gemini Web interface rather than the official paid API, so it does not go through official API billing. Note, however, that this is an unofficial usage method, and on first use you must read and agree to the relevant disclaimers.
