Corpora

GigaMIDI (upcoming dataset by Keon Ju Maverick Lee)

type: MIDI

size: To be released

metadata: To be released

MetaMIDI

type: MIDI

size: 436,631 MIDI files

metadata: Scraped artist + title metadata for 221,504 MIDIs, Scraped genre metadata for 143,868 MIDIs, Audio-MIDI matching procedure, which produced 10,796,557 audio-MIDI matches linking 237,236 MIDIs, including 168,032 MIDIs matched to MusicBrainz IDs via the Spotify/MusicBrainz linking procedure.

Drum Space

type: MIDI

size: 33,000 MIDI files

metadata: unique drum tracks and non-expressive synthetic data generated using neural network, offer 2-D latent space representation using t-SNE algorithm in the website link.

Groove MIDI Dataset (GMD)

type: MIDI & Audio

size: 1,150 MIDI files and over 22,000 measures

metadata: drummer ids, drumming style, distinction between drum beats and fills, expressively-performed MIDI tracks

MAESTRO (MIDI and Audio Edited for Synchronous TRacks and Organization)

type: MIDI & Audio

size: 1,276 MIDI files & audio waveforms

metadata: 200 hours of virtuosic piano expressive performances captured with fine alignment (~3 ms) between note labels and audio waveforms, including composer IDs available

AbaSynthphony MIDI Pack V001

type: MIDI

size: 50,000 MIDI files

metadata: melodic tracks with fixed velocity levels

AbaSynthphony MIDI Pack V002 Drum

type: MIDI

size: 50,000 MIDI files

metadata: drum tracks with fixed velocity levels

AbaSynthphony MIDI Pack V003 Dance Music Drum

type: MIDI

size: 50,000 MIDI files

metadata: dance music drum tracks with fixed velocity levels

Emo-Soundscapes

type: audio

size: 1,213 (6-second length) Creative Commons licensed audio clips

metadata: ground truth annotations of perceived emotion in 1213 soundscape recordings using a crowdsourcing listening experiment, where 1182 annotators from 74 different countries rank the audio clips according to the perceived valence/arousal

ASAP (Aligned Scores and Performances)

type: MIDI

size: 1,290 MIDI files

metadata: A dataset of 222 digital musical scores aligned with 1068 performances (more than 92 hours) of Western classical piano music.

IsoVAT

type: Audio and MIDI

size: 90 clips in total (30 clips per affective dimension)

metadata: Affective Music Composition (Valence, Arousal, and Tension)

ADC2004
type: Audio
size: 20 excerpts
metadata: predominant pitch

AMG1608
AMG1608 is a dataset for music emotion analysis. It contains frame-level acoustic features extracted from 1608 30-second music clips and corresponding valence-arousal (VA) annotations provided by 665 subjects.
type: Audio
size: 1,608
metadata: valence & arousal

APL
type: Audio
size: 620 segments
metadata: piano practice

artist20
type: Audio
size: 1413 songs
metadata: 20 artists

Audio Content Analysis Datasets
Companion datasets to the book Audio Content Analysis by Alexander Lerch
type: Audio

bach10
type: Audio / Symbolic
size: 10 chorales
metadata: multitrack & aligned MIDI

ballroom
type: Audio
size: 698 excerpts (30s)
metadata: 8 genres & tempo & (down-)beats

beatboxset1
type: Audio
size: 14 clips
metadata: perc. annotation

C224a
type: Audio
size: 224 artists
metadata: 14 genres

C3ka
type: Audio
size: 3000 artists
metadata: 18 genres

C49ka-C111ka
type: Audio
size: 48800/110588 artists
metadata: genres

CAL10k
type: Audio
size: 10870 songs
metadata: tags

CAL500
type: Audio
size: 502 songs
metadata: tags

Center for Computer Assisted Research in the Humanities
Musedata, Themefinder, Humdrum and Kern resources.
type: Symbolic
metadata: tags

CCMixter
type: Audio
size: 50 mixes
metadata: vocal & background track

Chopin22
type: Audio
size: 44 recordings
metadata: audio & aligned MIDI

CMMSD
type: Audio
size: 36 excerpts
metadata: note/rest/transition & onsets & vibrato

Coidach
type: Audio
size: 26420 songs
metadata: 55 genres

Compmusic Corpora
Data collections of cultural music from various sources that evolve and grow.
type: Audio

corpusCOFLA
type: Audio
size: 1800 flamenco recordings
metadata: editorial & predominant melody

covers80
type: Audio
size: 80 song pairs
metadata: cover songs

CREL Singing Voice Database
Dataset for research of physical characteristics of different singing expressions
type: Audio
metadata: segmented with temporal markers for each expression

DAMP
type: Audio
size: 34000 monophonic recordings
metadata: karaoke performances

DEAM
The biggest publicly available music affect dataset., which has 1802 songs. It contains average and std of valence and arousal value of each excerpt. It has audio files, feature and annotations.
type: Audio
size: 1,802
metadata: valence & arousal

DEAPDataset
type: Audio
size: 120 music video excerpts
metadata: valence & arousal & dominance & physiological data

DREANSS
type: Audio
size: 18 excerpts
metadata: onset times & perc. instruments

DrumPt
type: Audio
size: app. 2000 annotations
metadata: 4 playing techniques

emoMusic
type: Audio
size: 744 excerpts (45s)
metadata: arousal & valence

Emotify
Emotify dataset has no arousal/valence values, but it provides the audio and is annotated with the GEMS. The discrete emotion tags include amazement, solemnity, tenderness, nostalgia, calmness, power, joyful activation, tension, and sadness.

type: Audio
size: 400 excerpts
metadata: induced emotion

ENST-Drums
type: Audio
size: 318 segments
metadata: onset times & perc. instruments & playing technique

Extendedballroom
type: Audio
size: 4000 excerpts (30s)
metadata: 9 genres & tempo &amp

ffuhrmann
type: Audio
size: 6951 excerpts/220 songs
metadata: 11 predom. instr.

FlaBase
type: Audio
size: 1102 artists & 74 palos & 2860 albums & 13311 tracks
metadata: editorial & biographical & musicological information on flamenco

FMA-medium
type: Audio
size: 14511 excerpts (30s)
metadata: 20 genres

FMA-small
type: Audio
size: 4000 excerpts (30s)
metadata: 10 genres

Fugue
Reference data for computational music analysis. Now contains a dataset of ground truth structures for fugues.
type: Symbolic
size: 36 pieces
metadata: fugue analysis

GiantStepsKey
Datasets for automatic evaluation of tempo estimation and key detection algorithms.
type: Audio
size: 604 files
metadata: key

GiantStepsTempo
Datasets for automatic evaluation of tempo estimation and key detection algorithms.
type: Audio
size: 664 files
metadata: tempo

GNMID14
type: Audio
size: 110M music ID matches
metadata: timestamp & country

Good-sounds.org
type: Audio
size: 8750 notes
metadata: 12 instruments, pitch, sound quality

GPT
type: Audio
size: 6580 clips
metadata: 7 guitar playing techniques

GTZAN
type: Audio
size: 1000 excerpts (30s)
metadata: 10 genres & tempo & key1 & key2 & beat/downbeat & metrical levels

Hainsworth
type: Audio
size: 245 excerpts (60s)
metadata: tempo

HJDB
type: Audio
size: 236 excerpts
metadata: downbeat

holzapfel:onset
type: Audio
size: 78 excerpts
metadata: onset times

homburg
type: Audio
size: 1889 excerpts (10s)
metadata: 9 genres

IADS
type: Audio
size: 111 sound snippets
metadata: valence & arousal & dominance

IDMT-MT
type: Audio
size: 12 songs
metadata: multitrack & style

IDMT-SMT-Audio-Effects
type: Audio
size: 55044 recordings
metadata: effects on bass and guitar notes

IDMT-SMT-Bass
type: Audio
size: 4300 excerpts
metadata: bass performance styles

IDMT-SMT-Bass-SINGLE-TRACK
type: Audio
size: 17 bass lines (?)
metadata: style annotated bass lines

IDMT-SMT-Drums
type: Audio
size: 518 files
metadata: onset times & perc. instruments

IDMT-SMT-Guitar
type: Audio
size: 4700+400 note events
metadata: 9 guitar playing techniques

iKala Dataset
Comprised of 252 30-second excerpts sampled from 206 iKala songs
type: Audio
size: 252
metadata: Pitch contour, timestamped lyrics

INRIA:EuroVision
type: Audio
size: 124 songs
metadata: structure

INRIA:Quaero
type: Audio
size: 159 songs
metadata: structure

IRMAS
type: Audio
size: 2874 excerpts
metadata: 11 instruments

ISMIR2004Genre
type: Audio
size: 729 excerpts (30s)
metadata: 6 genres

ISMIR2004Tempo
type: Audio
size: 465 excerpts (20s)
metadata: tempo

Isophonics
Datasets, Ontologies, and other goodies.
type: Audio

J-DISC
J-DISC is a resource for searching and exploring jazz recordings created by the Center for Jazz Studies at Columbia University.
type: Audio

Jamendo
type: Audio
size: 61+16+16 songs
metadata: voice activity

JGDB
type: Audio
size: random generated excerpts
metadata: multitrack & MIDI

Jordan:Classical
type: Audio
size: 15 pieces
metadata: structure

Jordan:Jazz
type: Audio
size: 15 pieces
metadata: structure

LabROSA:APT
type: Audio
size: 29 piano excerpts
metadata: MIDI

LabROSA:MIDI
type: Audio / Symbolic (midi)
size: 4 songs

Lakh MIDI DataSet
The Lakh MIDI dataset is a collection of 176,581 unique MIDI files, 45,129 of which have been matched and aligned to entries in the Million Song Dataset.
type: Symbolic (midi)
size: 176,581

last.fm
type: Audio
size: 992 users
metadata: listening habits

Latin
type: Audio
size: 3160 songs
metadata: 10 genres

magnatagatune
type: Audio
size: 25863 excerpts (30s)
metadata: similarity

MAPS Database
A piano database for multipitch estimation and automatic transcription of music.
type: Audio
size: 238 pieces
metadata: Proud truth pitch information

MARD
type: Audio
size: 66566 songs
metadata: album reviews

MARG Note-level Singing Dataset
Dataset produced by the Music & Audio Research Group for work in automatic music transcription
type: Audio
metadata: ground truth pitch information (monophonic)

MARG-AMT
type: Audio
size: 30 melodies
metadata: MIDI pitch & onset/offset times

McGill Billboard
type: Audio
size: 740 songs
metadata: chords

McGill Billboard Annotations
Annotations and audio features for the first 1000 randomly selected entries from Billboard chart slots presented at ISMIR 2011, and the additional 300 entries used to evaluate audio chord estimation for MIREX 2012.
type: Audio
metadata: high-level structure, timestamped chord labels, instrument information

MedleyDB
type: Audio
size: 122 songs
metadata: multitrack & genre & melody f0 & instrument activation

Meertens Tunes Collections
The MTC consist of a number of melodic data sets (Dutch Songs), both vocal and instrumental. MTC is open access available for research purposes and is especially valuable for MIR research.

MidiDB
MIDI transcriptions of many popular songs, including EDM.
type: Symbolic (midi)

Million Musical Tweets Dataset
The “Million Musical Tweets Dataset” (MMTD) contains listening histories inferred from microblogs. Each listening event identified via twitter-id and user-id is annotated with temporal (date, time, weekday, timezone), spatial (longitude, latitude, continent, country, county, state, city), and contextual (information on the country) information. In addition, pointers to artist and track are provided as a matter of course.
type: Audio
size: 1,000,000

Million Song Dataset
A collection of audio features and metadata for a million contemporary popular music tracks.
type: Audio
size: 1,000,000

MIR Datasets
A list of datasets maintained at the Music Inforation Retrieval Wiki.

MIR Lab
Corpora prepared by MIR Lab.

MIR-1K Dataset
One thousand clip dataset for singing voice separation from MIR Lab,
type: Audio
size: 1,000
metadata: pitch contour, lyrics, indices and types for unvoiced frames.

mirex05Train
type: Audio
size: 13 excerpts
metadata: predominant pitch

mirex06Train
type: Audio
size: 20 excerpts (30s)
metadata: tempo & beats

MMTD
type: Audio
size: 1086808 tweets
metadata: listening behavior

Modal
type: Audio
size: 71 snippets
metadata: onset times

Mood Swing Dataset
It contains V/A value of 240 songs.
type: Audio
size: 240

MOODetector:Bi-Modal
type: Audio
size: 133 excerpts
metadata: lyrics & valence & arousal

MOODetector:Multi-Modal
type: Audio
size: 903 excerpts (30s)
metadata: lyrics & MIDI & mood

MSD
type: Audio
size: 1000000 songs
metadata: meta data & proprietary features

MTG-QBH
type: Audio
size: 118 queries/481 songs
metadata: title & artist

MuseData
An electronic library of Classical Music scores
type: Symbolic (Midi, MuseData, Humdrum)
size: 881

Music Mood Rating Dataverse
It contains average ratings of discrete emotion tags, including valence, arousal, atmosphere, happy, dark, sad, angry, sensual, sentimental.
type: Audio
size: 600
metadata: Annotations

Music Recommendation Dataset (KGRec-music)
Two different datasets with users, items, implicit feedback interactions between users and items, item tags, and item text descriptions are provided, one for Music Recommendation (KGRec-music), and other for Sound Recommendation (KGRec-sound)
type: Audio

Music Technology Group Datasets
Various datasets compiled as part of research projects carried out at the MTG.

MusicClef 2012
The MusiClef 2012 – Multimodal Music Data Set provides editorial metadata, various audio features, user tags, web pages, and expert labels on a set of 1355 popular songs. It was used in the MusiClef 2012 Evaluation Campaign.
type: Audio
size: 1355 songs
metadata: tags

MusicMicro
type: Audio
size: 136866 users
metadata: music listening patterns

MusicMicro Dataset
The “MusicMicro 11.11-09.12” data set contains listening histories inferred from microblogs. Each listening event identified via twitter-id and user-id is annotated with temporal (month and weekday) and spatial (longitude, latitude, country, and city) information. In addition, pointers to artist and track are provided as a matter of course.

MusicNet
type: Audio
size: 330 recordings
metadata: pitch and onsets

musiXmatch Database
Official lyrics collection of the Million Song Dataset.
size: 1,000,000

NSynth
type: Audio
size: 305,979 excerpts
metadata: 305,979 musical notes, each with a unique pitch, timbre, and envelope

ODB
type: Audio
size: 19 excerpts
metadata: onset times

Onset_Leveau
type: Audio
size: 21 excerpts
metadata: onset times

Petrucci Music Library
The datasets backing the Music Ngram Viewer.
type: Symbolic
metadata: N-gram per year

Phonation Modes Dataset
A collection of datasets for detection of phonation modes: breathy, neutral, flow and pressed.
type: Audio
size: 900
metadata: Phonation mode ground truth

PlaylistDataset
type: Audio
size: 75262 songs/2840553 transitions
metadata: playlists

QBT-Extended
type: Symbolic
size: 3365 queries/51 songs
metadata: taps

QMUL:Beatles
type: Audio
size: 181 songs
metadata: structure & key & chords & beats

QMUL:King
type: Audio
size: 14 songs
metadata: structure & key & chords

QMUL:MichaelJackson
type: Audio
size: 38 songs
metadata: structure

QMUL:MultiTrack
type: Audio
size: 104 songs
metadata: structure & multitrack

QMUL:Queen
type: Audio
size: 51/31 songs
metadata: structure/key & chords

QMUL:RSS
type: Audio
size: 60 songs
metadata: structure

QMUL:Zweieck
type: Audio
size: 18 songs
metadata: structure & key & chords & beats

QUASI
type: Audio
size: 11 songs
metadata: multitrack

RECOLA Database
Multimodal recordings of spontaneous collaborative and affective interactions in French.
type: Audio
metadata: Segmentation of spoken utterances, probability of speech, acoustic low-level descriptors

Repovizz
A framework for remote storage, visual browsing, annotation, and exchange of multi-modal data.

RockCorpus
type: Audio
size: 200 songs
metadata: chords & melody & bars

RWC
type: Audio
size: 115 songs/50 classical/100 songs
metadata: lyrics & 10 genre & 50 instruments & chords & structure & aligned MIDI

RWC Music Database
The RWC (Real World Computing) Music Database is a copyright-cleared music database (DB) available to researchers as a common foundation for research.
type: Audio
size: 315
metadata: ground truth midi

Saarland Music Data
Saarland Music Data (SMD) – SMD supplies free music recordings of Western classical music (SMD Western Music) as well as MIDI-audio pairs (SMD MIDI-Audio Piano Music), which have been generated by using hybrid acoustic / digital pianos (Disklavier).
type: Symbolic / Audio

SALAMI
type: Audio
size: 779 songs
metadata: structure

Sargon
type: Audio
size: 4 songs
metadata: structure

SASD
type: Audio
size: 268+2336artists
metadata: artist biographies & similarity

Schenker
A dataset of MusicXML excerpts and corresponding Schenkerian analyses in a computer-readable format.
type: Symbolic (musicXML)
size: 41 pieces
metadata: MusicXML & Schenker analysis

Seyerlehner:1517-Artists
type: Audio
size: 3180 songs
metadata: 19 genres

Seyerlehner:Annotated
type: Audio
size: 190 songs
metadata: 19 genres

Seyerlehner:Pop
type: Audio
size: 1105 songs
metadata: tempo

Seyerlehner:Unique
type: Audio
size: 3115 excerpts (30s)
metadata: 14 genres

SISEC
type: Audio
size: 5 excerpts
metadata: multitrack & mix

SMC:MIREX
type: Audio
size: 217 excerpts
metadata: tempo & beat positions

SMD
type: Audio
size: 50 recordings
metadata: audio & aligned MIDI

Soundtrack
The selection of the excerpts has been done in terms of dimensional and discrete emotion model (see the paper for details) and evaluated by pilot study and a larger scale study. The soundtracks are short (approx. 15 second) excerpts from film soundtracks.
type: Audio
size: 360
metadata: valence & energy & tension & mood

SPAM
type: Audio
size: 50 songs
metadata: structure

Su-AMT
type: Audio
size: 10 excerpts
metadata: onset times & pitch

Suomen Kansan eSävelmät
Digital Archive of Finnish Folk Tunes.
type: Audio
size: 9,000
metadata: notation, key, meter, place of collection, lyrics

SymbTr
A Turkish Makam Music Symbolic Data Collection.
type: Audio
size: 2,000
metadata: Phrase boundaries, segment boundaries.

The Bellmann Corpus
Released in 2013, consisting of musical scores for over 650 pieces (or complete sections of multi-movement works) for piano or harpsichord
type: Symbolic (midi)
size: 650

The Meertens Tune Collections
type: Audio
size: 3000-7000 melodies
metadata: phrases & key & meter

Tonal Harmony Excerpts
MIDI files from the workbook and instructor’s manual for Tonal Harmony by Stefan Kostka and Dorothy Payne.
type: Symbolic (midi)
size: 46
metadata: Ground truth chord labels.

TONAS
type: Audio
size: 72 single-voiced excerpts
metadata: pitch

TPD
type: Audio
size: 23385 songs
metadata: popularity rating

TRIOS
type: Audio
size: 5 excerpts
metadata: multitrack & aligned MIDI

Tunebot
type: Audio
size: 10000 queries/? songs
metadata: title & artist

UMA-Piano
type: Audio
size: 275040 recordings
metadata: piano chords

uspop2002
type: Audio
size: 8752 songs
metadata: tags & genre & chords

Weimar Jazz Database (WJAZZD)
A component of the Jazzomat project, WJAZZD is a database of jazz solo transcriptions available to the public to further enhance and improve jazz and MIR research.
type: Symbolic

ACM_MIRUM
type: Audio
size: 1410 excerpts (60s)
metadata: tempo