Blog(2)
In real-world scenarios, both face images and videos may suffer from unknown and varied types of degradation, such as down-sampling, noise, blur, and compression.
A Text-To-Music (TTM) generation model generates music tracks from text descriptions such as “A rock and roll song played by guitar”.
Research Areas(0)
Publications(9)
Hearable Image: On-Device Image-Driven Sound Effect Generation for Hearing What You See
Author: Deokjun Eom, Nahyun Kim, Woohyun Nam, Kyung-Rae Kim, Chaebin Im, Jungwon Park
Published: International Conference on Information and Knowledge Management (CIKM)
Date: 2025-11-10
RestoreGrad: Signal Restoration Using Conditional Denoising Diffusion Models with Jointly Learned Prior
Author: Chinghua Lee, Chouchang Yang, Retiree, Yashas Malur Saidutta, Yilin Shen, Hongxia Jin
Published: International Conference on Machine Learning (ICML)
Date: 2025-05-01
ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning
Author: Ruchika Chavhan, Da Li, Timothy Hospedales
Published: International Conference on Learning Representations (ICLR)
Date: 2025-04-25
News(3)
Stable Diffusion [1] for Super Resolution (i.e., SD-SR) has been shown to produce steep improvements over previous SR approaches.
In the rapidly evolving field of artificial intelligence, diffusion models have emerged as powerful tools for generating high-quality images.
Others(0)