Comprehensive Gemini integration supporting Veo 3 video generation, Gemini Flash text generation, chat completions, and multimodal AI capabilities via the Google Gemini API.
Counts the number of tokens in text using Gemini tokenization. Useful for estimating costs, checking input limits, and optimizing prompts before making API calls.
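As an illustration of how a token count might feed the cost and limit checks mentioned above, here is a minimal sketch. The helper names, the reserved-token parameter, and the per-million-token price are all hypothetical; actual limits and pricing depend on the model and are not stated in this document.

```python
def fits_input_limit(prompt_tokens, input_limit, reserved_tokens=0):
    """Return True if the prompt, plus any tokens held back for other
    content, stays within the model's input limit.

    All three values are supplied by the caller; nothing here is taken
    from a real model specification.
    """
    return prompt_tokens + reserved_tokens <= input_limit


def estimate_cost(prompt_tokens, price_per_million_tokens):
    """Estimate input cost from a token count.

    price_per_million_tokens is a hypothetical figure supplied by the
    caller, not a published Gemini price.
    """
    return prompt_tokens / 1_000_000 * price_per_million_tokens
```

In practice, `prompt_tokens` would be the number returned by the token-counting action for the prompt being checked.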
Downloads a generated Veo video to local storage. Takes the video URI from a completed operation and saves it to the specified file path.
Generates text embeddings using Gemini embedding models. Converts text into numerical vectors for semantic search, similarity comparison, clustering, and classification tasks.
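To make the similarity-comparison use case concrete, the sketch below ranks documents by cosine similarity to a query vector. It assumes embeddings are available as plain lists of floats of equal length; the function names are illustrative and not part of this integration.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        # A zero vector has no direction; treat it as dissimilar.
        return 0.0
    return dot / (norm_a * norm_b)


def rank_by_similarity(query_vector, document_vectors):
    """Return document names sorted from most to least similar.

    document_vectors maps a name to its embedding vector.
    """
    return sorted(
        document_vectors,
        key=lambda name: cosine_similarity(query_vector, document_vectors[name]),
        reverse=True,
    )
```

Cosine similarity is one common choice for comparing embeddings; dot product or Euclidean distance may suit other setups better.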
Generates text content from prompts using Gemini models. Supports models such as Gemini Flash and Pro, with configurable temperature, token limits, and safety settings.
Generates images from text prompts using the Gemini 2.5 Flash Image Preview model. Supports customizable parameters such as aspect ratio and safety settings, plus optional file saving.
Generates videos from text prompts using Google's Veo models. Supports customizable aspect ratio, duration, and style controls. Returns an operation ID for tracking progress.
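Because video generation returns an operation ID rather than a finished video, a caller typically polls until the operation completes and then passes the resulting URI to the download step. The sketch below shows one way to structure that loop. The `fetch_status` callable and the `"done"` key in its result are assumptions made for illustration; the actual status call and response shape are not described in this document.

```python
import time


def wait_for_operation(operation_id, fetch_status, poll_interval=10.0,
                       timeout=600.0, sleep=time.sleep):
    """Poll until an operation reports completion or the timeout passes.

    fetch_status is a hypothetical callable that takes the operation ID
    and returns a dict with a boolean "done" key (assumed shape).
    """
    deadline = time.monotonic() + timeout
    while True:
        status = fetch_status(operation_id)
        if status.get("done"):
            return status
        if time.monotonic() >= deadline:
            raise TimeoutError(
                f"operation {operation_id} did not finish within {timeout} s"
            )
        sleep(poll_interval)
```

Passing `sleep` as a parameter keeps the loop easy to test without real delays.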