
Introducing Extensions for Google’s Gemini CLI
Google has rolled out a new feature for its Gemini Command Line Interface (CLI)—the ability to utilize “extensions.”This innovative addition aims to enhance user experience by allowing individuals to integrate their preferred tools and tailor their AI-powered command line interactions.
Accessing and Utilizing Extensions
The extensions are publicly hosted on GitHub, enabling users to create and publish their own extensions without needing Google’s direct approval. Currently, the extension gallery features over 50 different options, including:
- A Stripe extension for seamless integration of its payment API into various workflows.
- A Terraform extension designed for automating infrastructure tasks.
- An extension connecting with Chrome DevTools to assist in coding agents.
- Google’s proprietary Nanobanana extension, which allows users to generate and modify images through text prompts directly from the terminal.
How Extensions Work
Once you have installed an extension, Gemini can automatically determine when to utilize these tools based on your prompting. Alternatively, you have the option to explicitly call upon them through custom commands defined by the extensions. To check which extensions are currently installed, simply use the command /extension
.
Creating Your Own Gemini CLI Extension
Developers interested in creating their own Gemini CLI extensions can get started using one of the provided templates. The development process begins with configuring the extension’s properties in a gemini-extension.json
manifest file. This file points to a Model Context Protocol (MCP) server, which acts as a small application that listens for commands from the CLI. You can also include a commands
directory for adding simple TOML files, enabling the creation of customized command shortcuts for more complex prompts.
The Evolution of Gemini CLI
Initially released earlier this year, Gemini CLI provided developers with a robust platform for directly interacting with AI through their terminals. From the outset, the tool offered significant functionality, such as reading local files and executing shell commands to troubleshoot issues. By logging in with your personal Google account, users benefit from a generous free tier of 60 requests per minute.
Subsequent updates have integrated Gemini CLI into various development environments, including Firebase Studio and the Zed Editor. These integrations have expanded its capabilities further, allowing users to engage in tasks such as code generation, refactoring existing functions, and providing clear explanations for intricate code blocks.
For more information on the latest developments, visit this source.
Leave a Reply