Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
-
Updated
May 11, 2024 - Python
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc
Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadnoun
min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN. Technique was originally created by https://twitter.com/advadnoun
A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
Turn any face into a video game character, pixel art, claymation, 3D or toy
[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
LTX-Video Support for ComfyUI
Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Stable diffusion webui based on diffusers.
official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"
Add a description, image, and links to the text-to-image topic page so that developers can more easily learn about it.
To associate your repository with the text-to-image topic, visit your repo's landing page and select "manage topics."