Melody-Lyrics Matching with Contrastive Alignment Loss


📔 ArXiv    📔 HAL    💻 Code

This is the companion page for the paper: “Melody-Lyrics Matching with Contrastive Alignment Loss”, currently under review.

We provide more examples to supplement the example (Fig. 7) in the paper. Each example includes (1) Playable MIDI notes; (2) Reference and top 2 lyrics retrieved by our proposed method, all aligned with the melody in terms of words; (3) the same set of lyrics, but aligned with the melody at the syllable/sylphone level. Note that the MIDI data in the studied dataset includes only onset time, offset time, and pitch information, with no velocity (dynamics) information. To facilitate better understanding of the melody, we render the MIDI sequences into audio with both piano and violin timbres using FluidSynth.

Piano
Violin
Piano
Violin
Piano
Violin
Piano
Violin
Piano
Violin
Piano
Violin
Piano
Violin
Piano
Violin
Piano
Violin
Piano
Violin
Piano
Violin
Piano
Violin
Piano
Violin
Piano
Violin
Piano
Violin
Piano
Violin
Piano
Violin
Piano
Violin
Piano
Violin
Piano
Violin

Citation

If you use our work in your research, please cite our paper:

@article{wang2025melody,
  title={Melody-Lyrics Matching with Contrastive Alignment Loss},
  author={Wang, Changhong and Olvera, Michel and Richard, Ga{\"e}l},
  journal={arXiv preprint arXiv:2508.00123},
  year={2025}
}