Train a toy autoregressive image generator and a VQ encoder/decoder.
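The pipeline follows the usual two-stage recipe: the VQ encoder maps an image to a grid of discrete codebook indices, and the transformer is trained to predict those indices autoregressively (the pre-tokenized dataset below is exactly those indices). Below is a minimal sketch of the nearest-neighbor lookup at the heart of the VQ step; `Codebook` and `quantize` are hypothetical names for illustration, not the actual API of this project.

```swift
// Minimal sketch of vector quantization, assuming a learned codebook of
// D-dimensional embeddings. Names here are illustrative, not this project's API.
struct Codebook {
    var entries: [[Double]]  // entries[i] is the i-th codebook vector

    // Index of the codebook entry nearest (in squared L2 distance) to `v`.
    // The decoder later maps that index back to its embedding to reconstruct pixels.
    func quantize(_ v: [Double]) -> Int {
        var bestIndex = 0
        var bestDistance = Double.infinity
        for (i, entry) in entries.enumerated() {
            var distance = 0.0
            for (a, b) in zip(v, entry) {
                distance += (a - b) * (a - b)
            }
            if distance < bestDistance {
                bestDistance = distance
                bestIndex = i
            }
        }
        return bestIndex
    }
}

// A toy 4-entry codebook over 2-D vectors; real codebooks are much larger.
let codebook = Codebook(entries: [[0, 0], [0, 1], [1, 0], [1, 1]])
print(codebook.quantize([0.9, 0.2]))  // prints 2
```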
I ran this for a few weeks on my laion-icons dataset. It does learn something: it clearly has some grasp of colors and basic shapes, as shown by the few cases where the model actually follows the prompt. For most prompts, however, the samples are pretty much garbage.
For a big dump of samples, see this page.
Here are some samples for the following prompts, sweeping guidance scales 1, 2, 4, and 8:
- a red heart icon, red heart vector graphic
- a green tree, a tree with green leaves
- A blue square. A simple blue square icon.
- a cute corgi vector graphic. corgi dog graphic
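The guidance scale here is standard classifier-free guidance: at each sampling step the model is evaluated with and without the text conditioning, and the two sets of logits are mixed. Here is a hedged sketch of that standard formulation (this project's exact sampling code may differ); `condLogits` and `uncondLogits` are placeholders for the model's outputs.

```swift
// Classifier-free guidance over next-token logits: scale 1 reproduces the
// conditional model, while larger scales push sampling harder toward the
// prompt (at the cost of diversity, which is what the 1/2/4/8 sweep shows).
func guidedLogits(cond: [Double], uncond: [Double], scale: Double) -> [Double] {
    precondition(cond.count == uncond.count)
    return (0..<cond.count).map { i in
        uncond[i] + scale * (cond[i] - uncond[i])
    }
}

// Toy example with made-up logits for three tokens.
let condLogits: [Double] = [2.0, 0.5, -1.0]
let uncondLogits: [Double] = [1.0, 1.0, 0.0]
print(guidedLogits(cond: condLogits, uncond: uncondLogits, scale: 4.0))
// -> [5.0, -1.0, -4.0]
```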
In my experience, the model fails outright on most complex prompts. I'd expect it to need a lot more compute before it produces anything particularly useful.
- Pre-trained VQ encoder and decoder: download here (26 MiB)
- Pre-trained 24-layer generative model: download here (898 MiB)
- Pre-tokenized laion-icons dataset: download here (7.7 GiB)
After downloading the above model checkpoints, you can run a local server for generating images.
```
$ swift run -c release HCText2Image server vqmodel_ssim_high.plist transformer_75e-5_d24_bs8.plist <port>
```
This will listen on http://localhost:<port>. You can load it in your browser to enter a prompt and sample an image.
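For example, with port 8080 (any free port works):

```
$ swift run -c release HCText2Image server vqmodel_ssim_high.plist transformer_75e-5_d24_bs8.plist 8080
```

Then open http://localhost:8080 in your browser.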