Blog(2)
In real-world scenarios, both face images and videos may suffer from unknown and varied types of degradation, such as down-sampling, noise, blur, and compression.
Text-To-Music (TTM) generation model generates music tracks from text descriptions such as “A rock and roll song played by guitar”.
Research Areas(0)
Publications(6)
ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning
AuthorRuchika Chavhan, Da Li, Timothy Hospedales
PublishedInternational Conference on Learning Representation (ICLR)
Date2025-04-25
Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP
AuthorShell Xu Hu
PublishedConference on Empirical Methods in Natural Language Processing (EMNLP)
Date2024-11-13
You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation
AuthorMehdi Noroozi, Isma Hadji, Brais Martinez, Adrian Bulat, Georgios Tzimiropoulos
PublishedEuropean Conference on Computer Vision (ECCV)
Date2024-09-30
News(3)
Stable Diffusion [1] for Super Resolution (i.e. SD-SR), has been shown to produce steep improvements compared to previous SR approaches.
In the rapidly evolving field of artificial intelligence, diffusion models have emerged as powerful tools for generating high-quality images.
Others(0)