Skip to content

patchbanks/WaivOps-EDM-HSE

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 

Repository files navigation

EDM-HSE Dataset

EDM-HSE is an open audio dataset containing a collection of code-generated drum recordings in the style of modern electronic house music. It includes 8,000 audio loops recorded in uncompressed stereo WAV format, created using custom audio samples and a MIDI drum dataset. The dataset also comes with paired JSON files containing MIDI note numbers (pitch) and tempo data, intended for supervised training of generative AI audio models.

Overview

The EDM-HSE Dataset was developed using an algorithmic framework to generate probable drum notations commonly played by EDM music producers. For supervised training with labeled data, a variational mixing technique was applied to the rendered audio files. This method systematically includes or excludes drum notes, assisting the model in recognizing patterns and relationships between drum instruments, thereby enhancing its generalization capabilities.

The primary purpose of this dataset is to provide accessible content for machine learning applications in music and audio. Potential use cases include generative music, feature extraction, tempo detection, audio classification, rhythm analysis, drum synthesis, music information retrieval (MIR), sound design and signal processing.

Specifications

  • 8,000 audio loops (approximately 17 hours)
  • 16-bit WAV format
  • Tempo range: 120–130 BPM
  • Paired label data (WAV + JSON)
  • Variational drum patterns
  • Subgenre styles (Big room, electro, minimal, classic)

A JSON file is provided for referencing and converting MIDI note numbers to text labels. You can update the text labels to suit your preferences.

Examples

See examples folder to preview mp3 demos.

License

This dataset is developed by WaivOps, a crowdsourced music project managed by sound label company Patchbanks. All recordings have been compiled by verified sources for copyright clearance.

The EDM-HSE Dataset is licensed under Creative Commons Attribution 4.0 International (CC BY 4.0).

Download

The audio files are provided in 24-bit WAV format and encoded at 44.1kHz.

Direct WAV Download (7.6GB) edm_hse_id_001-004_wav.tar.gz

Direct JSON Download (245KB) edm_hse_id_001-004_json.tar.gz

DOI

File Name Reference

Label Reference
bpm The tempo of the audio file
edm Main genre (edm)
drm instrument (drums)
id Identification number
_00 Playlist track number

Citation

If you use this dataset for a research or development project, please cite the following references:

@misc{EDM-HSE,
author = {WaivOps},
title = {WaivOps EDM-HSE: Open Audio Resources for Machine Learning in Music},
year = {2024},
doi = {10.5281/zenodo.13769544},
url = {https://doi.org/10.5281/zenodo.13769544},
}

Additional Info

Please note that this dataset has not been fully reviewed and may contain minor notational errors or audio defects.

For any questions or feedback please email info@patchbanks.com.

About

EDM-HSE is an open audio dataset featuring 8000 house music drum loops.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors