Layout Aware Parsing in Python

EmoTaG: Emotion-Aware Talking Head Synthesis

Official implementation of EmoTaG (CVPR 2026). HDTF multi-identity pre-training: 70 videos (one identity each, 90–240 s) sampled from HDTF, to learn the identity-agnostic audio-motion prior. MEAD ...

LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models

TL;DR: Text Prompt -> LLM as a Request Parser -> Intermediate Representation (such as an image layout) -> Stable Diffusion -> Image. [2023.8] Our repo has been largely improved: now we have a repo ...
