Hands-On Generative AI with Transformers and Diffusion Models

Hi 🤗 This repository contains all the code and exercise answers of the book Hands-On Generative AI with Transformers and Diffusion Models.

About the Authors

Omar Sanseviero (X, LinkedIn, website): Omar Sanseviero was the Chief Llama Officer and Head of Platform and Community at Hugging Face, leading the developer advocacy engineering, on-device, and moonshot teams. Omar has extensive engineering experience working at Google in Google Assistant and TensorFlow Graphics. Omar’s work at Hugging Face was at the intersection of open source, product, research, and technical communities.
Pedro Cuenca (X, LinkedIn): Pedro Cuenca is a machine learning engineer at Hugging Face working on diffusion software, models, and applications. He has 20+ years of software development experience in fields like internet applications and iOS. As a cofounder and CTO of LateNiteSoft, he worked on the technology behind Camera+, a successful iPhone app that used custom ML models for photography enhancement. He created deep-learning models for tasks such as photography enhancement and super-resolution. He was also involved in the development of and operations behind DALL·E mini. He brings a practical vision of integrating AI research into real-world services and the challenges and optimizations involved.
Apolinario Passos (X, LinkedIn, website): Apolinario Passos is a machine learning art engineer at Hugging Face working across different teams on multiple machine learning for art and creativity use cases. Apolinario has 10+ years of professional and artistic experience, alternating between holding art exhibitions, coding, and product management, having been a head of product at World Data Lab. Apolinario aims to ensure that the ML ecosystem supports and makes sense for artistic use cases. He is also an artist that works with interactive installations using AI.
Jonathan Whitaker (X, LinkedIn, website): Jonathan Whitaker is a data scientist and deep learning researcher focused on generative modeling. In addition to his R&D work at answer.ai, he focuses on sharing knowledge via the DataScienceCastnet YouTube channel and various free online resources he has created.

Getting the Book

The book is available on:

Usage

To get the most out of this book, we recommend running the code examples as you read along. Experimenting with the code by making changes and exploring different scenarios will enhance your understanding. Working with transformers and diffusion models can be computationally intensive, so having access to a computer with an GPU is needed.

There are multiple online options that you can use, such as Google Colaboratory and Kaggle Notebooks. Most code should work on any Google Colab instance. We recommend you use GPU runtimes, which provide a T4 for free, specially for chapters with training loops.

There are many support utilities and helper functions used throughout the book. To access them, please install the genaibook package:

pip install genaibook

This will, in turn, install the libraries required to run transformers and diffusion models, along with PyTorch, Matplotlib, NumPy, and other essentials.

Chapter	Colab
1. An Introduction to Generative Media
2. Transformers
3. Compressing and Representing Information
4. Diffusion Models
5. Stable Diffusion and Conditional Generation
6. Fine-Tuning Language Models
7. Fine-Tuning Stable Diffusion
8. Creative Applications of Text-To-Image Models
9. Generating Audio
Appendix C. End-to-End Retrieval Augmented Generation

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
.gitignore		.gitignore
01_introduction.ipynb		01_introduction.ipynb
02_transformers.ipynb		02_transformers.ipynb
03_compressing.ipynb		03_compressing.ipynb
04_diffusion.ipynb		04_diffusion.ipynb
05_stable_diffusion.ipynb		05_stable_diffusion.ipynb
06_fine_tuning_language_models.ipynb		06_fine_tuning_language_models.ipynb
07_fine_tuning_diffusion.ipynb		07_fine_tuning_diffusion.ipynb
08_creative_applications_of_t2i.ipynb		08_creative_applications_of_t2i.ipynb
09_generating_audio.ipynb		09_generating_audio.ipynb
13_rag.ipynb		13_rag.ipynb
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hands-On Generative AI with Transformers and Diffusion Models

About the Authors

Getting the Book

Usage

Table of Contents

About

Releases

Packages

Languages

davidcassagne/genaibook

Folders and files

Latest commit

History

Repository files navigation

Hands-On Generative AI with Transformers and Diffusion Models

About the Authors

Getting the Book

Usage

Table of Contents

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages