Corpora

ACM_MIRUM
type: Audio
size: 1410 excerpts (60s)
metadata: tempo
ADC2004
type: Audio
size: 20 excerpts
metadata: predominant pitch
AMG1608
AMG1608 is a dataset for music emotion analysis. It contains frame-level acoustic features extracted from 1608 30-second music clips and corresponding valence-arousal (VA) annotations provided by 665 subjects.
type: Audio
size: 1,608
metadata: valence & arousal
APL
type: Audio
size: 620 segments
metadata: piano practice
artist20
type: Audio
size: 1413 songs
metadata: 20 artists
Audio Content Analysis Datasets
Companion datasets to the book Audio Content Analysis by Alexander Lerch
type: Audio
bach10
type: Audio / Symbolic
size: 10 chorales
metadata: multitrack & aligned MIDI
ballroom
type: Audio
size: 698 excerpts (30s)
metadata: 8 genres & tempo & (down-)beats
beatboxset1
type: Audio
size: 14 clips
metadata: perc. annotation
C224a
type: Audio
size: 224 artists
metadata: 14 genres
C3ka
type: Audio
size: 3000 artists
metadata: 18 genres
C49ka-C111ka
type: Audio
size: 48800/110588 artists
metadata: genres
CAL10k
type: Audio
size: 10870 songs
metadata: tags
CAL500
type: Audio
size: 502 songs
metadata: tags
CCMixter
type: Audio
size: 50 mixes
metadata: vocal & background track
Chopin22
type: Audio
size: 44 recordings
metadata: audio & aligned MIDI
CMMSD
type: Audio
size: 36 excerpts
metadata: note/rest/transition & onsets & vibrato
Coidach
type: Audio
size: 26420 songs
metadata: 55 genres
Compmusic Corpora
Data collections of cultural music from various sources that evolve and grow.
type: Audio
corpusCOFLA
type: Audio
size: 1800 flamenco recordings
metadata: editorial & predominant melody
covers80
type: Audio
size: 80 song pairs
metadata: cover songs
CREL Singing Voice Database
Dataset for research of physical characteristics of different singing expressions
type: Audio
metadata: segmented with temporal markers for each expression
DAMP
type: Audio
size: 34000 monophonic recordings
metadata: karaoke performances
DEAM
The largest publicly available music affect dataset, with 1802 songs. It provides the mean and standard deviation of the valence and arousal values for each excerpt, along with audio files, features, and annotations.
type: Audio
size: 1,802
metadata: valence & arousal
DEAPDataset
type: Audio
size: 120 music video excerpts
metadata: valence & arousal & dominance & physiological data
DREANSS
type: Audio
size: 18 excerpts
metadata: onset times & perc. instruments
DrumPt
type: Audio
size: approx. 2000 annotations
metadata: 4 playing techniques
emoMusic
type: Audio
size: 744 excerpts (45s)
metadata: arousal & valence
Emotify
The Emotify dataset has no arousal/valence values, but it provides the audio and is annotated with the GEMS. The discrete emotion tags include amazement, solemnity, tenderness, nostalgia, calmness, power, joyful activation, tension, and sadness.

type: Audio
size: 400 excerpts
metadata: induced emotion

ENST-Drums
type: Audio
size: 318 segments
metadata: onset times & perc. instruments & playing technique
Extendedballroom
type: Audio
size: 4000 excerpts (30s)
metadata: 9 genres & tempo
ffuhrmann
type: Audio
size: 6951 excerpts/220 songs
metadata: 11 predom. instr.
FlaBase
type: Audio
size: 1102 artists & 74 palos & 2860 albums & 13311 tracks
metadata: editorial & biographical & musicological information on flamenco
FMA-medium
type: Audio
size: 14511 excerpts (30s)
metadata: 20 genres
FMA-small
type: Audio
size: 4000 excerpts (30s)
metadata: 10 genres
Fugue
Reference data for computational music analysis. Now contains a dataset of ground truth structures for fugues.
type: Symbolic
size: 36 pieces
metadata: fugue analysis
GiantStepsKey
Datasets for automatic evaluation of tempo estimation and key detection algorithms.
type: Audio
size: 604 files
metadata: key
GiantStepsTempo
Datasets for automatic evaluation of tempo estimation and key detection algorithms.
type: Audio
size: 664 files
metadata: tempo
GNMID14
type: Audio
size: 110M music ID matches
metadata: timestamp & country
Good-sounds.org
type: Audio
size: 8750 notes
metadata: 12 instruments, pitch, sound quality
GPT
type: Audio
size: 6580 clips
metadata: 7 guitar playing techniques
GTZAN
type: Audio
size: 1000 excerpts (30s)
metadata: 10 genres & tempo & key1 & key2 & beat/downbeat & metrical levels
Hainsworth
type: Audio
size: 245 excerpts (60s)
metadata: tempo
HJDB
type: Audio
size: 236 excerpts
metadata: downbeat
holzapfel:onset
type: Audio
size: 78 excerpts
metadata: onset times
homburg
type: Audio
size: 1889 excerpts (10s)
metadata: 9 genres
IADS
type: Audio
size: 111 sound snippets
metadata: valence & arousal & dominance
IDMT-MT
type: Audio
size: 12 songs
metadata: multitrack & style
IDMT-SMT-Audio-Effects
type: Audio
size: 55044 recordings
metadata: effects on bass and guitar notes
IDMT-SMT-Bass
type: Audio
size: 4300 excerpts
metadata: bass performance styles
IDMT-SMT-Bass-SINGLE-TRACK
type: Audio
size: 17 bass lines (?)
metadata: style annotated bass lines
IDMT-SMT-Drums
type: Audio
size: 518 files
metadata: onset times & perc. instruments
IDMT-SMT-Guitar
type: Audio
size: 4700+400 note events
metadata: 9 guitar playing techniques
iKala Dataset
Comprises 252 30-second excerpts sampled from 206 iKala songs.
type: Audio
size: 252
metadata: Pitch contour, timestamped lyrics
INRIA:EuroVision
type: Audio
size: 124 songs
metadata: structure
INRIA:Quaero
type: Audio
size: 159 songs
metadata: structure
IRMAS
type: Audio
size: 2874 excerpts
metadata: 11 instruments
ISMIR2004Genre
type: Audio
size: 729 excerpts (30s)
metadata: 6 genres
ISMIR2004Tempo
type: Audio
size: 465 excerpts (20s)
metadata: tempo
Isophonics
Datasets, Ontologies, and other goodies.
type: Audio
J-DISC
J-DISC is a resource for searching and exploring jazz recordings created by the Center for Jazz Studies at Columbia University.
type: Audio
Jamendo
type: Audio
size: 61+16+16 songs
metadata: voice activity
JGDB
type: Audio
size: randomly generated excerpts
metadata: multitrack & MIDI
Jordan:Classical
type: Audio
size: 15 pieces
metadata: structure
Jordan:Jazz
type: Audio
size: 15 pieces
metadata: structure
LabROSA:APT
type: Audio
size: 29 piano excerpts
metadata: MIDI
LabROSA:MIDI
type: Audio / Symbolic (midi)
size: 4 songs
Lakh MIDI DataSet
The Lakh MIDI dataset is a collection of 176,581 unique MIDI files, 45,129 of which have been matched and aligned to entries in the Million Song Dataset.
type: Symbolic (midi)
size: 176,581
last.fm
type: Audio
size: 992 users
metadata: listening habits
Latin
type: Audio
size: 3160 songs
metadata: 10 genres
magnatagatune
type: Audio
size: 25863 excerpts (30s)
metadata: similarity
MAPS Database
A piano database for multipitch estimation and automatic transcription of music.
type: Audio
size: 238 pieces
metadata: Ground truth pitch information
MARD
type: Audio
size: 66566 songs
metadata: album reviews
MARG Note-level Singing Dataset
Dataset produced by the Music & Audio Research Group for work in automatic music transcription
type: Audio
metadata: ground truth pitch information (monophonic)
MARG-AMT
type: Audio
size: 30 melodies
metadata: MIDI pitch & onset/offset times
McGill Billboard
type: Audio
size: 740 songs
metadata: chords
McGill Billboard Annotations
Annotations and audio features for the first 1000 randomly selected entries from Billboard chart slots presented at ISMIR 2011, and the additional 300 entries used to evaluate audio chord estimation for MIREX 2012.
type: Audio
metadata: high-level structure, timestamped chord labels, instrument information
MedleyDB
type: Audio
size: 122 songs
metadata: multitrack & genre & melody f0 & instrument activation
Meertens Tunes Collections
The MTC consists of a number of melodic data sets of Dutch songs, both vocal and instrumental. It is openly available for research purposes and is especially valuable for MIR research.
MidiDB
MIDI transcriptions of many popular songs, including EDM.
type: Symbolic (midi)
Million Musical Tweets Dataset
The “Million Musical Tweets Dataset” (MMTD) contains listening histories inferred from microblogs. Each listening event identified via twitter-id and user-id is annotated with temporal (date, time, weekday, timezone), spatial (longitude, latitude, continent, country, county, state, city), and contextual (information on the country) information. In addition, pointers to artist and track are provided as a matter of course.
type: Audio
size: 1,000,000
Million Song Dataset
A collection of audio features and metadata for a million contemporary popular music tracks.
type: Audio
size: 1,000,000
MIR Datasets
A list of datasets maintained at the Music Information Retrieval Wiki.
MIR Lab
Corpora prepared by MIR Lab.
MIR-1K Dataset
A one-thousand-clip dataset for singing voice separation from MIR Lab.
type: Audio
size: 1,000
metadata: pitch contour, lyrics, indices and types for unvoiced frames.
mirex05Train
type: Audio
size: 13 excerpts
metadata: predominant pitch
mirex06Train
type: Audio
size: 20 excerpts (30s)
metadata: tempo & beats
MMTD
type: Audio
size: 1086808 tweets
metadata: listening behavior
Modal
type: Audio
size: 71 snippets
metadata: onset times
Mood Swing Dataset
Contains valence/arousal (V/A) values for 240 songs.
type: Audio
size: 240
MOODetector:Bi-Modal
type: Audio
size: 133 excerpts
metadata: lyrics & valence & arousal
MOODetector:Multi-Modal
type: Audio
size: 903 excerpts (30s)
metadata: lyrics & MIDI & mood
MSD
type: Audio
size: 1000000 songs
metadata: meta data & proprietary features
MTG-QBH
type: Audio
size: 118 queries/481 songs
metadata: title & artist
MuseData
An electronic library of Classical Music scores
type: Symbolic (Midi, MuseData, Humdrum)
size: 881
Music Mood Rating Dataverse
It contains average ratings of discrete emotion tags, including valence, arousal, atmosphere, happy, dark, sad, angry, sensual, sentimental.
type: Audio
size: 600
metadata: Annotations
Music Recommendation Dataset (KGRec-music)
Two datasets are provided, each with users, items, implicit feedback interactions between users and items, item tags, and item text descriptions: one for music recommendation (KGRec-music) and the other for sound recommendation (KGRec-sound).
type: Audio
Music Technology Group Datasets
Various datasets compiled as part of research projects carried out at the MTG.
MusiClef 2012
The MusiClef 2012 – Multimodal Music Data Set provides editorial metadata, various audio features, user tags, web pages, and expert labels on a set of 1355 popular songs. It was used in the MusiClef 2012 Evaluation Campaign.
type: Audio
size: 1355 songs
metadata: tags
MusicMicro
type: Audio
size: 136866 users
metadata: music listening patterns
MusicMicro Dataset
The “MusicMicro 11.11-09.12” data set contains listening histories inferred from microblogs. Each listening event identified via twitter-id and user-id is annotated with temporal (month and weekday) and spatial (longitude, latitude, country, and city) information. In addition, pointers to artist and track are provided as a matter of course.
MusicNet
type: Audio
size: 330 recordings
metadata: pitch and onsets
musiXmatch Database
Official lyrics collection of the Million Song Dataset.
size: 1,000,000
ODB
type: Audio
size: 19 excerpts
metadata: onset times
Onset_Leveau
type: Audio
size: 21 excerpts
metadata: onset times
Petrucci Music Library
The datasets backing the Music Ngram Viewer.
type: Symbolic
metadata: N-gram per year
Phonation Modes Dataset
A collection of datasets for detection of phonation modes: breathy, neutral, flow and pressed.
type: Audio
size: 900
metadata: Phonation mode ground truth
PlaylistDataset
type: Audio
size: 75262 songs/2840553 transitions
metadata: playlists
QBT-Extended
type: Symbolic
size: 3365 queries/51 songs
metadata: taps
QMUL:Beatles
type: Audio
size: 181 songs
metadata: structure & key & chords & beats
QMUL:King
type: Audio
size: 14 songs
metadata: structure & key & chords
QMUL:MichaelJackson
type: Audio
size: 38 songs
metadata: structure
QMUL:MultiTrack
type: Audio
size: 104 songs
metadata: structure & multitrack
QMUL:Queen
type: Audio
size: 51/31 songs
metadata: structure/key & chords
QMUL:RSS
type: Audio
size: 60 songs
metadata: structure
QMUL:Zweieck
type: Audio
size: 18 songs
metadata: structure & key & chords & beats
QUASI
type: Audio
size: 11 songs
metadata: multitrack
RECOLA Database
Multimodal recordings of spontaneous collaborative and affective interactions in French.
type: Audio
metadata: Segmentation of spoken utterances, probability of speech, acoustic low-level descriptors
Repovizz
A framework for remote storage, visual browsing, annotation, and exchange of multi-modal data.
RockCorpus
type: Audio
size: 200 songs
metadata: chords & melody & bars
RWC
type: Audio
size: 115 songs/50 classical/100 songs
metadata: lyrics & 10 genre & 50 instruments & chords & structure & aligned MIDI
RWC Music Database
The RWC (Real World Computing) Music Database is a copyright-cleared music database (DB) available to researchers as a common foundation for research.
type: Audio
size: 315
metadata: ground truth midi
Saarland Music Data
Saarland Music Data (SMD) – SMD supplies free music recordings of Western classical music (SMD Western Music) as well as MIDI-audio pairs (SMD MIDI-Audio Piano Music), which have been generated by using hybrid acoustic / digital pianos (Disklavier).
type: Symbolic / Audio
SALAMI
type: Audio
size: 779 songs
metadata: structure
Sargon
type: Audio
size: 4 songs
metadata: structure
SASD
type: Audio
size: 268+2336 artists
metadata: artist biographies & similarity
Schenker
A dataset of MusicXML excerpts and corresponding Schenkerian analyses in a computer-readable format.
type: Symbolic (musicXML)
size: 41 pieces
metadata: MusicXML & Schenker analysis
Seyerlehner:1517-Artists
type: Audio
size: 3180 songs
metadata: 19 genres
Seyerlehner:Annotated
type: Audio
size: 190 songs
metadata: 19 genres
Seyerlehner:Pop
type: Audio
size: 1105 songs
metadata: tempo
Seyerlehner:Unique
type: Audio
size: 3115 excerpts (30s)
metadata: 14 genres
SISEC
type: Audio
size: 5 excerpts
metadata: multitrack & mix
SMC:MIREX
type: Audio
size: 217 excerpts
metadata: tempo & beat positions
SMD
type: Audio
size: 50 recordings
metadata: audio & aligned MIDI
Soundtrack
The excerpts were selected according to dimensional and discrete emotion models (see the paper for details) and evaluated in a pilot study and a larger-scale study. They are short (approx. 15-second) excerpts from film soundtracks.
type: Audio
size: 360
metadata: valence & energy & tension & mood
SPAM
type: Audio
size: 50 songs
metadata: structure
Su-AMT
type: Audio
size: 10 excerpts
metadata: onset times & pitch
Suomen Kansan eSävelmät
Digital Archive of Finnish Folk Tunes.
type: Audio
size: 9,000
metadata: notation, key, meter, place of collection, lyrics
SymbTr
A Turkish Makam Music Symbolic Data Collection.
type: Audio
size: 2,000
metadata: Phrase boundaries, segment boundaries.
The Bellmann Corpus
Released in 2013, the corpus consists of musical scores for over 650 pieces (or complete sections of multi-movement works) for piano or harpsichord.
type: Symbolic (midi)
size: 650
The Meertens Tune Collections
type: Audio
size: 3000-7000 melodies
metadata: phrases & key & meter
Tonal Harmony Excerpts
MIDI files from the workbook and instructor’s manual for Tonal Harmony by Stefan Kostka and Dorothy Payne.
type: Symbolic (midi)
size: 46
metadata: Ground truth chord labels.
TONAS
type: Audio
size: 72 single-voiced excerpts
metadata: pitch
TPD
type: Audio
size: 23385 songs
metadata: popularity rating
TRIOS
type: Audio
size: 5 excerpts
metadata: multitrack & aligned MIDI
Tunebot
type: Audio
size: 10000 queries/? songs
metadata: title & artist
UMA-Piano
type: Audio
size: 275040 recordings
metadata: piano chords
uspop2002
type: Audio
size: 8752 songs
metadata: tags & genre & chords
Weimar Jazz Database (WJAZZD)
A component of the Jazzomat project, WJAZZD is a database of jazz solo transcriptions available to the public to further enhance and improve jazz and MIR research.
type: Symbolic