gpt-4o
Here are 123 public repositories matching this topic...
Composio equip's your AI agents & LLMs with 100+ high-quality integrations via function calling
-
Updated
Jan 18, 2025 - Python
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
-
Updated
Jan 15, 2025 - Python
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
-
Updated
Dec 25, 2024 - Python
Start building LLM-empowered multi-agent applications in an easier way.
-
Updated
Jan 13, 2025 - Python
We do NOT and WILL not have any Crypto Projects, they are a complete SCAM | Task oriented AI agent framework for digital workers and vertical AI agents
-
Updated
Jan 17, 2025 - Python
Multilingual Voice Understanding Model
-
Updated
Jan 8, 2025 - Python
TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compatible with popular workflow platforms like Dify and Coze.
-
Updated
Jan 14, 2025 - Python
Devon: An open-source pair programmer
-
Updated
Aug 27, 2024 - Python
这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。
-
Updated
Aug 23, 2024 - Python
⚡️ Build Your Own chatgpt Bot|🧀 Discord/Slack/Kook/Telegram |⛓ ToolCall|🔖 Plugin Support | 🌻 out-of-box | gpt-4o
-
Updated
Jan 9, 2025 - Python
Extract clean data from anywhere, powered by vision-language models ⚡
-
Updated
Jan 2, 2025 - Python
End-to-end platform for building voice first multimodal agents
-
Updated
Oct 28, 2024 - Python
RAG-GPT, leveraging LLM and RAG technology, learns from user-customized knowledge bases to provide contextually relevant answers for a wide range of queries, ensuring rapid and accurate information retrieval.
-
Updated
Jul 19, 2024 - Python
High-quality and streaming Speech-to-Speech interactive agent in a single file. 只用一个文件实现的流式全双工语音交互原型智能体!
-
Updated
Jan 10, 2025 - Python
Engy is an AI-powered development tool that generates fully functional web applications from natural language, streamlining the process from idea to working prototype.
-
Updated
Nov 5, 2024 - Python
MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not limited to end-to-end speech interaction, end-to-end speech translation and speech recognition.
-
Updated
Jan 8, 2025 - Python
Improve this page
Add a description, image, and links to the gpt-4o topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the gpt-4o topic, visit your repo's landing page and select "manage topics."