A powerful Python-based API wrapper and command-line interface for interacting with xAI's Grok language models. This toolkit provides seamless integration with xAI's Grok language models for chat, text completion, and vision analysis capabilities.
- π€ Interactive chat sessions with Grok by xAI language models
- π Single-shot text completion
- ποΈ Vision analysis with support for local images and URLs
- β‘ Real-time streaming responses
- π― Configurable model parameters
- π Conversation history management
- π€ File upload handling for images
- βοΈ Extensive parameter validation
- π¨ Beautiful loading animations
- Installation
- API Key Configuration
- Usage
- Special Commands
- Advanced Configuration
- Supported Models
- Error Handling and Safety
- Supported File Types
- Caching and Cleanup
- Contributing
- Issues and Support
- Feature Requests
- Versioning and Changelog
- Security
- License
-
Clone the repository:
git clone /~https://github.com/RMNCLDYO/grok-ai-toolkit.git
-
Navigate to the repository folder:
cd grok-ai-toolkit
-
Install the required dependencies:
pip install -r requirements.txt
-
Obtain an API key from xAI.
-
You have three options for managing your API key:
Click here to view the API key configuration options
-
Setting it as an environment variable on your device (recommended for everyday use)
- Navigate to your terminal.
- Add your API key like so:
export GROK_API_KEY=your_api_key
This method allows the API key to be loaded automatically when using the wrapper or CLI.
-
Using an .env file (recommended for development):
- Install python-dotenv if you haven't already:
pip install python-dotenv
. - Create a .env file in the project's root directory.
- Add your API key to the .env file like so:
GROK_API_KEY=your_api_key
This method allows the API key to be loaded automatically when using the wrapper or CLI.
- Install python-dotenv if you haven't already:
-
Direct Input:
-
If you prefer not to use a
.env
file, you can directly pass your API key as an argument to the CLI or the wrapper functions.CLI
--api_key "your_api_key"
Wrapper
api_key="your_api_key"
This method requires manually inputting your API key each time you initiate an API call.
-
-
For generating text based on a prompt or a set of instructions.
CLI
python cli.py --text --prompt "Write a story about AI."
Wrapper
from grok import Text
Text().run(prompt="Write a story about AI.")
For interactive conversations with the AI model.
CLI
python cli.py --chat
Wrapper
from grok import Chat
Chat().run()
For processing images alongside text, allowing for visual analysis and multimodal interactions. This mode enables you to upload images (from local paths or URLs) and chat with the AI about their content.
For processing local images.
CLI
python cli.py --vision --prompt "What's in this image?" --image_path "/path/to/image.jpg"
Wrapper
from grok import Vision
Vision().run(prompt="What's in this image?", image_path="/path/to/image.jpg")
For processing images from URLs.
CLI
python cli.py --vision --prompt "What's in this image?" --image_url "https://example.com/image.jpg"
Wrapper
from grok import Vision
Vision().run(prompt="What's in this image?", image_url="https://example.com/image.jpg")
During interaction with the toolkit, you can use the following special commands:
/exit
or/quit
: End the conversation and exit the program./clear
: Clear the conversation history./upload
: (Vision mode only) Upload an image for analysis.- Usage:
/upload file_path_or_url [optional prompt]
- Example:
/upload image.jpg What do you see in this image?
- Usage:
Description | CLI Flags | CLI Usage | Wrapper Usage |
---|---|---|---|
Chat mode | -c , --chat |
--chat |
See mode usage above. |
Text mode | -t , --text |
--text |
See mode usage above. |
Vision mode | -v , --vision |
--vision |
See mode usage above. |
User prompt | -p , --prompt |
--prompt "Your prompt here" |
prompt="Your prompt here" |
Image path | -ip , --image_path |
--image_path "image.jpg" |
image_path="image.jpg" |
Image URL | -iu , --image_url |
--image_url "https://example.com/image.jpg" |
image_url="https://example.com/image.jpg" |
Enable streaming | -s , --stream |
--stream |
stream=True |
API Key | -ak , --api_key |
--api_key "your_api_key" |
api_key="your_api_key" |
Model name | -m , --model |
--model "model_name" |
model="model_name" |
System prompt | -sp , --system_prompt |
--system_prompt "Set custom instructions" |
system_prompt="Set custom instructions" |
Max tokens | -mt , --max_tokens |
--max_tokens 1024 |
max_tokens=1024 |
Temperature | -tm , --temperature |
--temperature 0.7 |
temperature=0.7 |
Top-p | -tp , --top_p |
--top_p 0.9 |
top_p=0.9 |
Top-k | -tk , --top_k |
--top_k 40 |
top_k=40 |
Frequency penalty | -fp , --frequency_penalty |
--frequency_penalty 0.5 |
frequency_penalty=0.5 |
Presence penalty | -pp , --presence_penalty |
--presence_penalty 0.5 |
presence_penalty=0.5 |
Number of completions | -n |
--n 1 |
n=1 |
Stop sequences | -stop , --stop_sequences |
--stop_sequences [".", "\n"] |
stop=[".", "\n"] |
Model | Input | Output | Context ( Max Tokens) |
---|---|---|---|
grok-2-vision-1212 | Image, Text | Text | 8192 |
grok-2-1212 | Text | Text | 131072 |
grok-vision-beta | Image, Text | Text | 8192 |
grok-beta | Text | Text | 131072 |
The grok-2
and grok-2-latest
model names are aliased to the latest chat models, currently grok-2-1212
. You can use grok-2
and grok-2-latest
if you want to automatically access the latest model version, or choose a model with a specific version number to keep a consistent output.
Note
Note your model access might vary depending on various factors such as geographical location, account limitations, etc. For the most up-to-date information on your team's model access, visit the API Models page on your xAI Console Models Page.
The Grok AI Toolkit includes robust error handling to help you diagnose and resolve issues quickly. Here are some common error codes and their solutions:
HTTP Code | Status | Description | Solution |
---|---|---|---|
400 | Bad Request | Malformed request or invalid parameters | Check request format and parameters |
401 | Unauthorized | Invalid or missing API key | Verify your API key |
403 | Forbidden | Insufficient permissions | Check API key permissions |
404 | Not Found | Resource not found | Verify endpoint and model availability |
429 | Too Many Requests | Rate limit exceeded | Reduce request frequency |
500 | Internal Server Error | Server-side error | Retry after a short wait |
503 | Service Unavailable | Service temporarily down | Retry after a short wait |
The Grok AI Toolkit supports the following image formats for vision processing:
Category | File Extensions |
---|---|
Images | jpg , jpeg , png , gif , webp , bmp |
Warning
xAI has a limit of 10MB for file uploads.
The Grok AI Toolkit implements a caching mechanism for uploaded images to improve performance and reduce unnecessary network requests:
- When an image is uploaded from a URL, it's stored in a temporary cache folder (
.grok_ai_toolkit_cache
). - The cache is automatically cleaned up at the end of each session.
- Cached files are managed efficiently to prevent storage bloat.
The caching system is automated and requires no manual management.
Contributions are welcome!
Please refer to CONTRIBUTING.md for detailed guidelines on how to contribute to this project.
Encountered a bug? We'd love to hear about it. Please follow these steps to report any issues:
- Check if the issue has already been reported.
- Use the Bug Report template to create a detailed report.
- Submit the report here.
Your report will help us make the project better for everyone.
Got an idea for a new feature? Feel free to suggest it. Here's how:
- Check if the feature has already been suggested or implemented.
- Use the Feature Request template to create a detailed request.
- Submit the request here.
Your suggestions for improvements are always welcome.
Stay up-to-date with the latest changes and improvements in each version:
- CHANGELOG.md provides detailed descriptions of each release.
Your security is important to us. If you discover a security vulnerability, please follow our responsible disclosure guidelines found in SECURITY.md. Please refrain from disclosing any vulnerabilities publicly until said vulnerability has been reported and addressed.
Licensed under the MIT License. See LICENSE for details.