Stars
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
Open Source framework for voice and multimodal conversational AI
🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
Build datasets using natural language
A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.
Jobs_Applier_AI_Agent_AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
This repository contains the code for a virtual try-on application built using Flask, Twilio's WhatsApp API, and Gradio's virtual try-on model. Users can send images via WhatsApp to try on garments…
NVIDIA AI Blueprint for digital human for customer service.
The purpose of the "Meta Agent with More Agents" project is to dynamically solve complex queries by breaking them down into smaller tasks and assigning each to specialized AI agents. The Meta Agent…
The fastest way to build robust AI agents
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Deploy your agentic worfklows to production
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
ChatGPT CLI is a versatile tool for interacting with LLM models through OpenAI and Azure, as well as models from Perplexity AI and Llama. It supports prompts and history tracking for seamless, cont…
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
Access large language models from the command-line
oliverbob / gpt-ui
Forked from open-webui/open-webuiChatGPT-Style Web UI Client for Ollama 🦙. See customized version at https://askai.city by Bob Reyes
A dynamic form-building tool that allows users to create, customize, and validate forms seamlessly within web applications.
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
OpenMusic: SOTA Text-to-music (TTM) Generation
This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!