Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding loader for Hainsworth dataset #617

Closed
wants to merge 248 commits into from
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
248 commits
Select commit Hold shift + click to select a range
4e84bf5
Functionality tests (#84)
rabitt Jul 12, 2019
47f8b50
[WIP] Data home reorg (#90)
rabitt Jul 18, 2019
ad3b674
make and update contributing md, add lightweight dev installation ins…
tkell Jul 13, 2019
92df466
fixed some typos
rabitt Jul 18, 2019
246c416
draft for mirdata rtd. including example, faq(placeholder) (#86)
keunwoochoi Jul 19, 2019
008a7be
Fix failing rtd build (#93)
rabitt Jul 19, 2019
abc670f
[WIP] Increase test coverage (#92)
rabitt Jul 20, 2019
bbb93ca
Add __repr__ to existing datasets (#94)
tkell Jul 21, 2019
0f74497
Adding RWC collection (#96)
magdalenafuentes Jul 22, 2019
8f78267
Fix when annotations are missing and test (#97)
magdalenafuentes Jul 22, 2019
78d6c2f
Common downloader (#99)
rabitt Aug 3, 2019
18f6d29
Small fixes (consistency and bugs) (#103)
magdalenafuentes Aug 5, 2019
cf9ea9e
update tox to run black, run black
tkell Aug 15, 2019
dafb3a1
replace print with logging.info (#108)
rabitt Aug 23, 2019
ff1d2ab
Implements GuitarSet loader (#109)
tomxi Aug 23, 2019
5cca53a
fix #110 (#112)
rabitt Aug 23, 2019
a932f5b
fix for guitarset.download() (#115)
tomxi Aug 27, 2019
14fdae7
Librosa now >= 0.7.0 (#123)
drubinstein Sep 6, 2019
459ee2b
make all subdirectories (#124)
rabitt Sep 10, 2019
4e9c2e6
Checksum and minor fixes (#119)
magdalenafuentes Sep 11, 2019
7cb88a5
multi channel support w/ GuitarSet (#125)
tomxi Sep 12, 2019
ffdd6ce
Medley-solos-DB (#131)
Sep 29, 2019
39f561e
bump version
Sep 29, 2019
4c218fb
remove validate from inside load (#132)
rabitt Sep 29, 2019
4850b5c
Move back from requests to urllib (#134)
drubinstein Oct 3, 2019
c4541a9
bump version
Oct 4, 2019
1e720c5
Speed up import mirdata (#137)
rabitt Oct 28, 2019
ce4ffc0
Tests for download errors (#141)
rabitt Oct 30, 2019
82dbab1
Adding DALI dataset (#78)
magdalenafuentes Nov 2, 2019
be60512
to JAMS functionality [WIP] (#130)
magdalenafuentes Nov 2, 2019
c603c66
Fix tar checksums (#144)
rabitt Nov 2, 2019
026cdfe
first update on docstrings (#143)
keunwoochoi Nov 2, 2019
7b2ca7a
bump version
rabitt Nov 2, 2019
9e69c30
146 - Uncomment audio loader asserts in salami (#148)
drubinstein Nov 3, 2019
0c95435
145 - Dali is now optional (#147)
drubinstein Nov 3, 2019
f4a1328
Rachel/0.1.0 release (#150)
rabitt Nov 4, 2019
81e3c68
Add GTZAN-Genre dataset (#155)
andreasjansson Nov 22, 2019
82289cc
bump version
Nov 22, 2019
2fed796
document validator + print message in case of success (#170)
Feb 23, 2020
018bff4
[CR] remove RWC-genre dataset (#180)
Mar 5, 2020
408d3e2
define version module (#181)
Mar 6, 2020
f788c7f
add project description on PyPI (#171)
Mar 6, 2020
d641af8
TinySOL (#165)
Mar 6, 2020
778b4aa
Fixing and updating the docs (#183)
Mar 6, 2020
049f6d2
[MRG] Add MultipitchData (#182)
Mar 6, 2020
94bec7d
set version to 0.2.0a0
Mar 6, 2020
7fe8ce7
set version to 0.2.dev0
Mar 6, 2020
e6fd791
Add default tests for loaders (#186)
rabitt Mar 6, 2020
fd440c5
remove six from requirements + remove unused imports (#190)
Mar 10, 2020
a3b3216
Fix docs for v0.2, add sndfile (#193)
Mar 11, 2020
7bf0ea3
patch docs (#194)
rabitt Mar 12, 2020
485b700
Updating NoteData type to be compatible with mir_eval trancription (#…
magdalenafuentes Mar 12, 2020
cfd4af5
better support for remote data (#188)
rabitt Mar 13, 2020
0cc1eba
Update contributing with updated checklist and contribution policy (#…
magdalenafuentes Apr 1, 2020
2c8bde8
Create pull_request_template.md (#199)
magdalenafuentes Apr 1, 2020
f86777b
update loader docstrings (#202)
Apr 1, 2020
f88e188
Docs beautification (#201)
rabitt Apr 1, 2020
e9e2a98
Table of all datasets (#203)
rabitt Apr 2, 2020
153f0ad
Update README.md
rabitt Apr 2, 2020
a920570
add a base Track class, remove all reprs (#205)
rabitt Apr 3, 2020
5ea6198
set version number to 0.2.0b0
Apr 4, 2020
c53d4a7
set version number to 0.2.0dev0 (after 0.2b0 release)
Apr 4, 2020
f149a14
The __future__ is now (#209)
Apr 6, 2020
6eea852
Improve PR template info (#214)
magdalenafuentes Apr 7, 2020
261ea4c
gtzan_genre.Track.to_jams (#212)
Apr 8, 2020
dbacd1f
Groove MIDI (#207)
kyungyunlee Apr 8, 2020
34d13e4
Document tempo. groove_midi.bpm is now groove_midi.tempo (#232)
rabitt Apr 9, 2020
2d66940
bugfix dali.download(), implement dali.Track.to_jams() (#230)
Apr 10, 2020
685a0ab
Fix groove_midi download structure (#239)
rabitt Apr 10, 2020
2ef1077
Add local exhaustive dataset tests (#238)
rabitt Apr 10, 2020
be5c3b8
module.load_* functions raise an IOError when file does not exist (#240)
rabitt Apr 13, 2020
62833c3
Update CONTRIBUTING.md
rabitt Apr 13, 2020
c026528
Download() refactor (#216)
magdalenafuentes Apr 16, 2020
3165e24
fixes #226 (#247)
rabitt Apr 17, 2020
fb71620
fixes #248 (#249)
rabitt Apr 22, 2020
191bd5d
Automatically push pypi version on release (#250)
rabitt Apr 22, 2020
3f8d1d0
Adding loader for Maestro (#260)
rabitt Apr 29, 2020
75585d7
Update LICENSE
rabitt Apr 30, 2020
53c45f9
fixes #246 (#264)
rabitt Jun 30, 2020
e9a1eb8
fix for issue 273 (#274)
nkundiushuti Sep 28, 2020
fcb2a87
Fixes #279 (#280)
rabitt Sep 29, 2020
7b61354
bump librosa version (#278)
rabitt Sep 30, 2020
cc58816
Adding loader for GiantSteps EDM key dataset (#275)
PRamoneda Oct 11, 2020
9983c2a
Adding loader for GiantSteps EDM tempo dataset (#281)
PRamoneda Oct 19, 2020
8c7a832
Adding loader for Beatport EDM key (#286)
PRamoneda Oct 22, 2020
1d01f53
Update pythonpublish.yml
rabitt Oct 23, 2020
773de73
Revert "Update pythonpublish.yml"
Oct 23, 2020
c028cbd
Docs fix 0.2.0 (#302)
rabitt Oct 27, 2020
801d01a
MultiTrack class (#295)
rabitt Oct 27, 2020
368c680
Revert "MultiTrack class (#295)" (#303)
rabitt Oct 27, 2020
71f100f
Multitrack class (#304)
rabitt Oct 28, 2020
91dd713
Adding loader for Mridangam Stroke Dataset (#285)
genisplaja Oct 29, 2020
fe6d692
fixes 309 (#310)
rabitt Nov 2, 2020
b149179
bugs beatport key dataset loader (#312)
PRamoneda Nov 3, 2020
ff201ca
Dataset object (#296)
rabitt Nov 3, 2020
2a972af
fix #306 ziplib issue : unzipping archives created on Windows (#319)
nkundiushuti Nov 4, 2020
01d3b7f
fix path to indexes in setup.py (#322)
nkundiushuti Nov 5, 2020
359c9a0
corrected track->core (#323)
nkundiushuti Nov 5, 2020
8bdc847
Update README.md (#324)
nkundiushuti Nov 5, 2020
fe41fd6
Adding loader for Saraga (no multitrack) (#330)
genisplaja Nov 9, 2020
4a0e7bd
Fix for issue #329 about datahome printing (#333)
genisplaja Nov 9, 2020
e69b5d2
Adding loader for IRMAS Dataset (#320)
genisplaja Nov 12, 2020
46bb009
Adding loader for cante100 (COFLA research corpus) (#317)
genisplaja Nov 12, 2020
036b336
Remote index (#335)
PRamoneda Nov 17, 2020
4d315ad
Index structure update (#327)
magdalenafuentes Nov 17, 2020
e4621e6
Fix RWC metadata (#338)
magdalenafuentes Nov 20, 2020
37c3b60
Marius/pytestemail (#337)
nkundiushuti Nov 20, 2020
4df7991
Fix validation of datasets without metadata (#342)
magdalenafuentes Nov 24, 2020
7948ce8
Fix tinysol localpath (#353)
magdalenafuentes Dec 1, 2020
e07baff
Adding loaders for Saraga Carnatic and Hindustani datasets (#354)
genisplaja Dec 11, 2020
9b6d9b2
fix remote bug (#352)
PRamoneda Dec 12, 2020
efbe510
Adding loader for acoustic brainz genre dataset (#318)
PRamoneda Dec 12, 2020
ae10ca7
fix cancelled download bug (#344)
PRamoneda Dec 12, 2020
468957f
Tonality classicalDB Dataset!!! (#331)
PRamoneda Dec 12, 2020
bbc9be3
Fix destination directory for cante100 (#361)
genisplaja Dec 15, 2020
ce4efcd
idea citation (#358)
magdalenafuentes Dec 17, 2020
33f6c43
fix docs warnings, restructure docs index (#363)
rabitt Dec 17, 2020
96bdfc4
Docs audit - further formatting (#367)
magdalenafuentes Dec 17, 2020
8f6dde7
Fix css folder (#368)
magdalenafuentes Dec 17, 2020
bf9e808
Docs - Update Contributing and Overview (#370)
magdalenafuentes Dec 17, 2020
92afa31
Rename tonality_classicalDB (#371)
genisplaja Dec 18, 2020
25d10e8
NamedTuples to Classes (#336)
magdalenafuentes Dec 18, 2020
3c4bcaf
Add bibtex Acousticbrainz (#375)
magdalenafuentes Dec 18, 2020
490cfd4
Using csv for data loading (#373)
genisplaja Dec 21, 2020
6b86211
Better docs for datasets (#377)
rabitt Dec 21, 2020
9ffacd3
Add acoustic_brainz test to gitignore (#386)
genisplaja Dec 21, 2020
74f200f
move legacy index creation scripts to legacy folder (#389)
rabitt Dec 22, 2020
beb7a02
cut files (#391)
PRamoneda Dec 22, 2020
184b60c
Collapsible Dataset Info (#387)
rabitt Dec 22, 2020
908f597
fixing consistency (#379)
PRamoneda Dec 23, 2020
a743b8f
Sama and tempo annotation bug fixes for Saraga Carnatic and Hindustan…
genisplaja Dec 23, 2020
b94dca2
Shorten audios and fix tests (#395)
genisplaja Jan 5, 2021
b767cdd
Docs nits (#393)
magdalenafuentes Jan 7, 2021
fc987ef
Writing tutorial documentation (#378)
genisplaja Jan 7, 2021
66676f9
remove line audio (#399)
PRamoneda Jan 7, 2021
5c48c6d
Fix RWC empty files and index (#403)
magdalenafuentes Jan 8, 2021
1dcf975
Dataset.readme (#408)
rabitt Jan 8, 2021
bd8db03
Fix beatport decoding issue (#404)
genisplaja Jan 8, 2021
7865c02
Force missing annotations to be None (#406)
rabitt Jan 9, 2021
996bda4
Add a list_datasets function (#407)
rabitt Jan 10, 2021
15f8e37
Add licenses loaders and doc nits (#410)
magdalenafuentes Jan 11, 2021
36b88f5
fix full dataset (#411)
PRamoneda Jan 11, 2021
b6043d0
resolves bug when downloading twice (#414)
rabitt Jan 12, 2021
1d97df7
Contributing conventions docs (#416)
rabitt Jan 12, 2021
27f93a5
Update version.py
magdalenafuentes Jan 12, 2021
3b315f0
fix typo from "RWC classical" to "RWC popular" (#418)
ooyamatakehisa Jan 14, 2021
94aab79
track docstring in dataset is incorrectly formatted (#419)
rabitt Jan 14, 2021
79d827b
string to structure (#376)
drubinstein Jan 14, 2021
92ec73f
formatting (#421)
drubinstein Jan 14, 2021
595aee2
removed Track.title in Tutorial (#430)
nkundiushuti Jan 18, 2021
13ceba8
Update links to documentation (#422)
rabitt Jan 19, 2021
fe36796
Add mypy (not strict) (#423)
drubinstein Jan 21, 2021
2136b36
Simplify track object + update contributing docs (#436)
rabitt Jan 26, 2021
fa8f581
Update version.py
rabitt Jan 26, 2021
45fb566
fix black command (#444)
PRamoneda Jan 28, 2021
896b913
# -*- coding: utf-8 -*-\n is not required on Python3 (#431)
PRamoneda Jan 29, 2021
a6b2799
Move development info to contributing (#446)
genisplaja Jan 29, 2021
f459a68
Simplify index loading (#443)
rabitt Feb 1, 2021
e132509
Increase track attribute test coverage (#460)
rabitt Feb 2, 2021
131e46f
Update version.py
magdalenafuentes Feb 2, 2021
67b8980
Update contributing.rst (#463)
nkundiushuti Feb 3, 2021
4396b1e
Adding loader for OTMM makam recognition dataset (CompMusic corpus) (…
genisplaja Feb 4, 2021
2011cfc
Adding loader for McGill Billboard (updated) (#466)
Spijkervet Feb 15, 2021
7146820
Building audio/annotation paths in core (#468)
genisplaja Feb 18, 2021
5b94bf8
Phenicx Anechoic / Multitrack (#461)
nkundiushuti Feb 19, 2021
37eca4f
Remode md5 from all make_index and import from validate (#478)
PRamoneda Mar 10, 2021
10d6c85
add readthedocs config file (#479)
rabitt Mar 11, 2021
b430c3c
Move datasets custom tests to subfolder (#475)
magdalenafuentes Mar 12, 2021
61a4a91
Closes #484: Move some test files from tests/datasets/ to tests/ (#485)
tyffical Mar 24, 2021
d53c107
Adding loader for Jingju A Cappella Dataset (Part 1) (#472)
genisplaja Apr 6, 2021
7ae5446
adding loader for Queen dataset (#356)
PRamoneda Apr 7, 2021
47f6ca1
Adding loader for TONAS (#471)
genisplaja Apr 7, 2021
65a1857
Marius/fixunzip487 (#490)
nkundiushuti Apr 9, 2021
43db2e5
Add non-standard case instructions to contributing docs (#491)
rabitt Apr 12, 2021
49b09e8
Adding loader for Dagstuhl ChoirSet (#481)
sebastianrosenzweig Apr 15, 2021
21bbfd1
Adding loader for Freesound One-Shot Percussive Sounds (#493)
genisplaja Apr 17, 2021
0db026c
Update version.py
rabitt Apr 17, 2021
aa7a152
Add Haydn op20 loader (#447)
PRamoneda Apr 18, 2021
e7703fb
Multiple version support (#496)
rabitt Apr 19, 2021
6f73f2e
adding loader for good sounds (#417)
PRamoneda Apr 22, 2021
fc2c3ab
Billboard audio is not available on request (#499)
chrisdonahue Apr 23, 2021
7204c26
litle fixes (#500)
PRamoneda Apr 24, 2021
ce2e001
Annotation Extensions (#504)
rabitt Apr 29, 2021
35bce80
Update version.py
rabitt Apr 29, 2021
d5e442c
Add note annotations for medleydb_pitch and ikala (#507)
rabitt May 2, 2021
4e248e3
to_matrix converter for NoteData and F0Data (#506)
rabitt May 2, 2021
cddc1bf
add note to multif0 converter (#508)
rabitt May 3, 2021
29336f7
Update version.py
rabitt May 3, 2021
7f39702
Add to_mir_eval for NoteData, MultiF0Data, and F0Data (#510)
rabitt May 5, 2021
f231512
Add mix-level note and multif0 annotations to dagstuhl_choirset (#511)
rabitt May 9, 2021
eb9f9a6
Update version.py
rabitt May 9, 2021
90bb732
speed up deduplication in NoteData (#513)
rabitt May 10, 2021
0dbf90b
Adding loader for Slakh (#498)
rabitt Jul 16, 2021
ab94fcc
Update version.py
rabitt Jul 19, 2021
35b17a3
fix documentation (#512)
rabitt Aug 5, 2021
91e1c4b
Adding loader for jamendo moodtheme (#505)
PRamoneda Sep 7, 2021
f8a6f94
pin music 21 version (#527)
rabitt Sep 27, 2021
fa0f66e
Use smart_open (#518)
rabitt Sep 27, 2021
8f3c538
Adding da tacos loader (#434)
PRamoneda Sep 27, 2021
e6b5c0e
patch slakh dataset docs (#523)
rabitt Sep 27, 2021
c83e503
Deprecate Dataset.load_* (#528)
rabitt Sep 28, 2021
074c7ed
Update version.py
rabitt Sep 28, 2021
e1249f3
ignore haydn test file (#535)
rabitt Oct 29, 2021
3ed7725
vocadito (#532)
drubinstein Oct 29, 2021
94ada5a
Dataset splits & Checksum inconsistencies (#530)
migperfer Nov 5, 2021
900e9cc
Update version.py
rabitt Nov 5, 2021
3449e42
Adding mini set, beat and tempo to GTZAN genre (#537)
magdalenafuentes Nov 5, 2021
388ec72
Patch new splitting functions (#538)
rabitt Nov 6, 2021
579bb3c
bump version
magdalenafuentes Nov 6, 2021
3ddccd2
tests currently fail for extractall_unicode after a package update (#…
nkundiushuti Jul 22, 2022
a9d44ea
[WIP] OpenMIC2018 (#544)
bmcfee Sep 21, 2022
937d118
Adding loader for Filosax (#536)
dave-foster Oct 4, 2022
f6c4d4e
Update quick_reference.rst to table.rst
magdalenafuentes Oct 13, 2022
83663c6
[Docs] fix orchset.track_ids() --> orchest.track_ids (#554)
mimbres Oct 19, 2022
2029f6e
Adding loader for Four-Way Tabla dataset (ISMIR 21) (#557)
genisplaja Oct 27, 2022
a926737
Use different testing example for Saraga Hindustani (#563)
genisplaja Nov 2, 2022
224dff4
Adding loader for CompMusic Raga Dataset (#559)
genisplaja Nov 22, 2022
6172225
Adding loader for (CompMusic) Indian Art Music Tonic (#565)
genisplaja Nov 22, 2022
9604bd0
Adding loader for (CompMusic) Hindustani Music Rhythm (#566)
genisplaja Nov 22, 2022
c2ce9d1
Adding loader for (Compmusic) Carnatic Music Rhythm (#560)
genisplaja Nov 22, 2022
fbc8dc8
Updating file paths and other issues with windows environment (#567)
harshpalan Nov 23, 2022
34ac34e
Adding loader for EGFxSet (#556)
iranroman Nov 25, 2022
7724ae6
Bump version
magdalenafuentes Nov 25, 2022
f46aef1
580 mirdata multiple fixes (#581)
guillemcortes Mar 7, 2023
9234877
Adding loader for BAF dataset (#583)
guillemcortes Mar 15, 2023
4dd2e33
Adding loader for CompMusic Carnatic Varnam (#579)
genisplaja Mar 27, 2023
fc9e99d
GitHub Actions migration (#596)
Oct 29, 2023
578cba8
Update badges url (#598)
magdalenafuentes Oct 29, 2023
4e03b58
example_metadta.csv -> example_metadata.csv (#591)
carlthome Oct 31, 2023
f48167e
fix broken link for "OMRAS2 Metadata Project 2009" (#555)
CyxouD Oct 31, 2023
22a2467
Fix setup extras installation (Add missing comma). (#603)
guillemcortes Oct 31, 2023
3094765
minor fixes: documentation (#600)
giovana-morais Nov 2, 2023
724bd68
New version of CompMusic Turkish Makam Dataset (#576)
genisplaja Nov 2, 2023
13a6efa
Adding loader for IDMT-SMT-AUDIO-EFFECTS (#595)
francescopapaleo Nov 2, 2023
636c1c8
Adding loader for CIPI dataset (#599)
PRamoneda Nov 2, 2023
f4df604
Adding loader for SCMS (#597)
genisplaja Nov 3, 2023
716fcbc
Adding loader for Candombe (#606)
tanmayy24 Nov 3, 2023
29a4bab
Bump version (#607)
magdalenafuentes Nov 3, 2023
12f2bc7
Addition of badges for release (#608)
tanmayy24 Dec 8, 2023
9500dad
IDMT-SMT-AUDIO-EFFECTS custom download function, tests and resources …
francescopapaleo Dec 20, 2023
de8d4c0
Adding loader for hainsworth
rythmm24 Jan 22, 2024
da6c1a6
Edited loader info
rythmm24 Jan 22, 2024
2dbc70f
Update in audio count
rythmm24 Jan 23, 2024
1c2f10a
Update in doc for Hainsworth
rythmm24 Jan 25, 2024
7a2fc9c
Adding loader for Ballroom dataset (#613)
tanmayy24 Feb 1, 2024
798a1ef
Fixes in black version (#622)
tanmayy24 Feb 2, 2024
9e48a3e
Merge branch 'master' into tanmay/hainsworth
tanmayy24 Feb 4, 2024
4592554
rerunning pipeline
rythmm24 Oct 10, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
80 changes: 0 additions & 80 deletions .circleci/config.yml

This file was deleted.

33 changes: 33 additions & 0 deletions .github/PULL_REQUEST_TEMPLATE/new_loader.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,33 @@
### Title

Please use the following title: "Adding loader for MyDATASET". If your pull request is work in progress, change your title to "[WIP] Adding loader for MyDATASET" to avoid reviews while the loader is not ready.

### Description

Please include the following information at the top level docstring for the dataset's module mydataset.py:

- [ ] Describe annotations included in the dataset
- [ ] Indicate the size of the datasets (e.g. number files and duration, hours)
- [ ] Mention the origin of the dataset (e.g. creator, institution)
- [ ] Describe the type of music included in the dataset
- [ ] Indicate any relevant papers related to the dataset
- [ ] Include a description about how the data can be accessed and the license it uses (if applicable)

#### Dataset loaders checklist:

- [ ] Create a script in `scripts/`, e.g. `make_my_dataset_index.py`, which generates an index file.
- [ ] Run the script on the canonical version of the dataset and save the index in `mirdata/indexes/` e.g. `my_dataset_index.json`.
- [ ] Create a module in mirdata, e.g. `mirdata/my_dataset.py`
- [ ] Create tests for your loader in `tests/datasets/`, e.g. `test_my_dataset.py`
- [ ] Add your module to `docs/source/mirdata.rst` and `docs/source/table.rst`
- [ ] Run `black`, `flake8` and `mypy` (see [Running your tests locally](https://mirdata.readthedocs.io/en/stable/source/contributing.html?highlight=contributing#running-your-tests-locally)).
- [ ] Run `tests/test_full_dataset.py` on your dataset.
- [ ] Check that codecov coverage does not decrease.

If your dataset **is not fully downloadable** there are two extra steps you should follow:
- [ ] Contacting the mirdata organizers by opening an issue or PR so we can discuss how to proceed with the closed dataset.
- [ ] Show that the version used to create the checksum is the "canonical" one, either by getting the version from the dataset creator, or by verifying equivalence with several other copies of the dataset.
- [ ] Make sure someone has run `pytest -s tests/test_full_dataset.py --local --dataset my_dataset` once on your dataset locally and confirmed it passes.

#### Please-do-not-edit flag
To reduce friction, we will make commits on top of contributor's pull requests by default unless they use the `please-do-not-edit` flag. If you don't want this to happen don't forget to add the flag when you start your pull request.
19 changes: 19 additions & 0 deletions .github/PULL_REQUEST_TEMPLATE/pull_request_template.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,19 @@
### Pull request name

While your working on your PR please add the [WIP] indication in the title, to avoid reviewing before it's ready. Also, please try to have a descriptive title.

### Description

Please include a summary of the change and which issue is fixed.

Fixes # (issue)

### Type of change

Please add a description and delete options that are not relevant.

- [ ] Change 1 (description)
- [ ] Change 2 (description)


Please describe the tests that you ran to verify your changes if it applies.
41 changes: 41 additions & 0 deletions .github/environment-ci.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,41 @@
name: mirdata-dev
channels:
- conda-forge
- defaults
dependencies:
# required
- pip
- chardet>=5.0.0
- Deprecated>=1.2.14
- h5py>=3.7.0
- librosa>=0.10.1
- numpy>=1.21.6
- openpyxl>=3.0.10
- pandas>=1.3.5
- pyyaml>=6.0
- requests>=2.31.0
- scipy>=1.7.3
- tqdm>=4.66.1
#######
- libvorbis

# optional, but required for testing
- pytest>=7.4.3
- pytest-pep8>=1.0.6
- pytest-cov>=4.1.0
- pytest-mock>=3.11.1
- pytest-localserver>=0.8.1
- pytest-xdist>=3.3.1

- pip:
- attrs>=23.1.0
- black>=23.3.0
- coveralls>=3.3.1
- dali-dataset>=1.1
- decorator>=5.1.1
- future>=0.18.3
- jams>=0.3.4
- music21>=7.3.3
- pretty_midi>=0.2.10
- smart_open[all]>=6.4.0
- testcontainers>=3.7.1
22 changes: 22 additions & 0 deletions .github/environment-lint.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,22 @@
name: lint
channels:
- conda-forge
- defaults
dependencies:
# required.
- pip
- Deprecated >= 1.2.14
- flake8>=5.0.4
- librosa>=0.10.1
- mypy>=0.982
- numpy>=1.21.6
- pandas>=1.3.5
- pytest>=7.4.3
- requests>=2.31.0
- tqdm>=4.66.1
- types-decorator>=5.1.8.3

- pip:
- jams>=0.3.4
- types-Deprecated
- types-PyYAML
115 changes: 115 additions & 0 deletions .github/workflows/ci.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,115 @@
name: CI

on:
push:
branches:
- master
pull_request:
branches:
- master

concurrency:
group: ${{ github.workflow }}-${{ github.ref }}
cancel-in-progress: True

jobs:
test:
name: "Python ${{ matrix.python-version }} on ${{ matrix.os }}"
runs-on: ${{ matrix.os }}

strategy:
fail-fast: false
matrix:
include:
- os: ubuntu-latest
python-version: "3.7"
envfile: ".github/environment-ci.yml"
channel-priority: "strict"

- os: ubuntu-latest
python-version: "3.8"
channel-priority: "strict"
envfile: ".github/environment-ci.yml"

- os: ubuntu-latest
python-version: "3.9"
channel-priority: "strict"
envfile: ".github/environment-ci.yml"

- os: ubuntu-latest
python-version: "3.10"
channel-priority: "strict"
envfile: ".github/environment-ci.yml"

- os: macos-latest
python-version: "3.10"
channel-priority: "strict"
envfile: ".github/environment-ci.yml"

steps:
- uses: actions/checkout@v3
with:
submodules: true

- name: Install OS dependencies
shell: bash -l {0}
run: |
case "${{ runner.os }}" in
Linux)
sudo apt-get update -yy
sudo apt-get install -yy wget libsndfile-dev libsox-fmt-mp3 ffmpeg zip
;;
macOS)
brew install libsamplerate wget libsndfile sox ffmpeg zip
;;
esac

- name: Cache conda
uses: actions/cache@v3
env:
CACHE_NUMBER: 2
with:
path: ~/conda_pkgs_dir
key: ${{ runner.os }}-${{ matrix.python-version }}-conda-${{ env.CACHE_NUMBER }}-${{ hashFiles( matrix.envfile ) }}
restore-keys: |
${{ runner.os }}-${{ matrix.python-version }}-conda-${{ env.CACHE_NUMBER }}-
${{ runner.os }}-${{ matrix.python-version }}-conda-
${{ runner.os }}-conda-

- name: Setup conda
uses: conda-incubator/setup-miniconda@v2
with:
miniforge-variant: Mambaforge
miniforge-version: latest
use-mamba: true
channel-priority: ${{ matrix.channel-priority }}
activate-environment: test
auto-update-conda: false
python-version: ${{ matrix.python-version }}
use-only-tar-bz2: false

- name: Install conda dependencies
run: mamba env update -n test -f ${{ matrix.envfile }}
if: steps.cache.outputs.cache-hit != 'true'

- name: Conda info
shell: bash -l {0}
run: |
conda info -a
conda list

- name: Run pytest
shell: bash -l {0}
run: pytest -n auto -vv --cov-report term-missing --cov-report=xml --cov=mirdata tests

- name: Upload coverage to Codecov
uses: codecov/codecov-action@v3
with:
token: ${{ secrets.CODECOV_TOKEN }}
files: ./coverage.xml
directory: ./coverage/reports/
flags: unittests
env_vars: OS,PYTHON
name: codecov-umbrella
fail_ci_if_error: true
verbose: true
25 changes: 25 additions & 0 deletions .github/workflows/formatting.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,25 @@
name: Formatting

on:
push:
branches:
- master
pull_request:
branches:
- master

jobs:
black:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v3
- name: black on mirdata
uses: psf/black@stable
with:
options: "--check --verbose"
src: "mirdata"
- name: black on tests
uses: psf/black@stable
with:
options: "--check --verbose"
src: "tests"
Loading