Skip to content
View harveywon's full-sized avatar

Block or report harveywon

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

CSP-J/S/X, NOIP, NOI, IOI, 信息学奥林匹克竞赛历年真题收录 | QQ交流群529507453

Rich Text Format 358 150 Updated Dec 19, 2024

Let AI be your browser operator.

HTML 5,438 284 Updated Jan 31, 2025

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 7,199 689 Updated Jan 23, 2025

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 12,258 2,286 Updated Jun 26, 2024

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python 4,794 598 Updated Jul 2, 2024

Convert any PDF into a podcast episode!

Python 1,978 215 Updated Dec 7, 2024

A generative speech model for daily dialogue.

Python 34,040 3,688 Updated Jan 25, 2025

Prompt Visualization | Art Gallery

Python 467 39 Updated Jun 12, 2024

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Python 25,130 1,897 Updated Jan 27, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 5,621 445 Updated Jan 27, 2025
Python 1,307 77 Updated Oct 30, 2024

Kolors的ComfyUI原生采样器实现(Kolors ComfyUI Native Sampler Implementation)

Python 529 29 Updated Jan 27, 2025

Diffusers wrapper to run Kwai-Kolors model

Python 575 31 Updated Oct 18, 2024

Kolors Team

Python 4,134 308 Updated Nov 13, 2024

real time face swap and one-click video deepfake with only a single image

Python 43,545 6,345 Updated Feb 1, 2025

State-of-the-art 2D and 3D Face Analysis Project

Python 24,136 5,489 Updated Dec 5, 2024

Improved file parsing for LLM’s

Python 2,669 104 Updated Nov 13, 2024

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

29,291 3,370 Updated Mar 25, 2024

[AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"

883 36 Updated Apr 28, 2024

Large Action Model framework to develop AI Web Agents

Python 5,844 532 Updated Jan 21, 2025

Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.

Python 998 277 Updated Oct 5, 2023

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

Python 2,807 719 Updated Jul 28, 2022

A curated list of image captioning and related area resources. :-)

1,066 184 Updated Mar 28, 2023
Python 50 4 Updated May 28, 2024

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,668 249 Updated Dec 12, 2023

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 45,887 5,479 Updated Dec 18, 2024

Web Scraping with GPT-4 Vision API and Puppeteer

JavaScript 555 252 Updated Jan 31, 2024

AI-Driven Children’s Storytelling Web App using Next.js, OpenAI, Stability.ai, and ElevenLabs

JavaScript 23 3 Updated Sep 12, 2023
Next