Image to Music: Cross-Modal Melody Generation Through Image Captioning

Kaplan, Alper

Advances in machine learning in recent years have also been seen in computationally creative systems. Interest in machine-generated artifacts paved a way for creative models to evolve as such. But the earlier methods mostly explored a one domain approach and cross-modal learning has stayed relatively unexplored. Thus, the direct mapping between modalities for cross-modal creative models is not fully explored. This work proposes a novel methodology for generating symbolic music through images by directly mapping their features. A CNN encoder and deep stacked LSTM decoder are the base models as ...Daha fazlası

Preprint 2023 Yeditepe University Academic and Open Access Information System 49 Görüntülenme