- FM4Music - nicolaus625: A Survey
- Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation, arXiv 2412.09428 (arxiv, pdf), citation: -1
  Baisen Wang, Le Zhuo, Zhaokai Wang, ..., Yue Liao, Si Liu · (VMB - wbs2788)
- MuMu-LLaMA: Multi-modal Music Understanding and Generation via Large Language Models, arXiv 2412.06660 (arxiv, pdf), citation: -1
  Shansong Liu, Atin Sakkeer Hussain, Qilong Wu, ..., Chenshuo Sun, Ying Shan
- GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks, arXiv 2409.13832 (arxiv, pdf), citation: 1
  Yu Zhang, Changhao Pan, Wenxiang Guo, ..., Xinyu Cheng, Zhou Zhao · (huggingface) · (GTSinger - GTSinger)
- MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization, arXiv 2410.12957 (arxiv, pdf), citation: -1
  Ruiqi Li, Siqi Zheng, Xize Cheng, ..., Shengpeng Ji, Zhou Zhao · (muvi-v2m.github)