Blog post explainer: https://www.ethansmith2000.com/post/mimicking-diffusion-models-by-sequencing-frequency-coefficients
This repo contains three training scripts for this method:
- train_mixture_cov.py - GMM with covariance predictions
- train_mixture_unroll.py - univariate GMM
- train_quantized.py - quantizes the range of coefficient values into a discrete vocabulary (a rough sketch of the idea follows this list)
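For intuition, here is a minimal sketch of what "quantizing into a discrete vocabulary" can look like, assuming simple uniform binning over a clipped range. The bin count and range below are placeholders, not the values train_quantized.py actually uses, and the script may bin differently.

```python
import torch

def quantize(coeffs: torch.Tensor, num_bins: int = 256,
             lo: float = -4.0, hi: float = 4.0) -> torch.Tensor:
    """Map continuous frequency coefficients to integer token ids."""
    x = coeffs.clamp(lo, hi)
    # rescale to [0, 1], then round into num_bins discrete bins
    return ((x - lo) / (hi - lo) * (num_bins - 1)).round().long()

def dequantize(ids: torch.Tensor, num_bins: int = 256,
               lo: float = -4.0, hi: float = 4.0) -> torch.Tensor:
    """Map token ids back to approximate coefficient values (bin centers)."""
    return ids.float() / (num_bins - 1) * (hi - lo) + lo
```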
Getting set up and training should be as easy as:
```bash
pip install -r requirements.txt
python train_[method].py
```
The results that looked best came from the quantized method.
You can visualize the generation process by taking a finished image and running visualize_reconstruction (found in utils.py); the resulting frames can then be assembled into a video with moviepy. A sketch is below.
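The exact signature of visualize_reconstruction isn't documented here, so this sketch assumes it takes an image array and returns a list of frames; the file name, fps, and output path are illustrative. The moviepy part uses its ImageSequenceClip class.

```python
import numpy as np
from PIL import Image
from moviepy.editor import ImageSequenceClip  # moviepy >= 2.0: from moviepy import ImageSequenceClip

from utils import visualize_reconstruction

# Assumed interface: given a finished image, return a list of H x W x 3 uint8 frames
# showing the image being built up frequency band by frequency band.
image = np.array(Image.open("sample.png").convert("RGB"))
frames = visualize_reconstruction(image)

# stitch the frames into a video with moviepy
clip = ImageSequenceClip([np.asarray(f) for f in frames], fps=24)
clip.write_videofile("reconstruction.mp4")
```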
Things to try next:
- Experiment more with the GMM methods; see if we can fix the stds at the dataset's std, or use a fixed per-position learnable std (a rough sketch follows this list)
- Can we do this in latent space? I'd really like to somehow encode each concentric ring into a single token to make this scalable.
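A rough sketch of the "fixed per-position learnable std" idea for the GMM heads: keep the model's predicted means and mixture weights, but replace the predicted stds with one learnable log-std per sequence position, shared across the batch. The module name and shapes are hypothetical, not tied to the actual training scripts.

```python
import math

import torch
import torch.nn as nn

class PerPositionStd(nn.Module):
    """One learnable (log) std per sequence position, shared across the batch."""

    def __init__(self, seq_len: int, init_std: float = 1.0):
        super().__init__()
        # parameterize in log space so the std stays positive during training
        self.log_std = nn.Parameter(torch.full((seq_len,), math.log(init_std)))

    def forward(self, batch_size: int) -> torch.Tensor:
        # returns a (batch, seq_len) tensor of stds to plug into the GMM likelihood
        return self.log_std.exp().unsqueeze(0).expand(batch_size, -1)

# The other variant from this TODO: freeze the std at the dataset's empirical std,
# e.g. register coeffs.std(dim=0) (computed once over the training set) as a buffer.
```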