Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: support yaml dataset definitions #914

Merged
merged 123 commits into from
Mar 15, 2025
Merged

Conversation

SiQube
Copy link
Member

@SiQube SiQube commented Jan 1, 2025

💥 Breaking Changes

  • DatasetLibrary.get() now returns an instantiated DatasetDefinition object instead of a DatasetDefinition class.

Description

Supports specifying a DatasetDefinition as a yaml file.

Example file:

name: YourDataset

has_files:
  gaze: true
  precomputed_events: false
  precomputed_reading_measures: false

mirrors:
  gaze:
    - https://osf.io/download/

resources:
  gaze:
    - resource: your_resource/
      filename: target_filename.zip

Copy link

codecov bot commented Jan 1, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 100.00%. Comparing base (a84d433) to head (52073b1).
Report is 1 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff            @@
##              main      #914   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           78        79    +1     
  Lines         3564      3588   +24     
  Branches       629       645   +16     
=========================================
+ Hits          3564      3588   +24     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@SiQube
Copy link
Member Author

SiQube commented Jan 8, 2025

  • pymovements.datasets.BSC => pymovements.DatasetDefinition('BSC')

SiQube and others added 7 commits February 12, 2025 10:30
Co-authored-by: SiQube <reich.davidr@gmail.com>
Co-authored-by: Daniel G. Krakowczyk <daniel.krakowczyk@uni-potsdam.de>
…10 (#913)

Co-authored-by: Daniel G. Krakowczyk <daniel.krakowczyk@uni-potsdam.de>
updates:

- github.com/asottile/pyupgrade: v3.19.0 → v3.19.1
- github.com/jsh9/pydoclint: 0.5.10 → 0.6.0
- github.com/hhatto/autopep8: v2.3.1 → v2.3.2
- github.com/PyCQA/pylint: v3.3.2 → v3.3.3


Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Daniel G. Krakowczyk <daniel.krakowczyk@uni-potsdam.de>
Co-authored-by: senisioi <sergiu.nisioi@gmail.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
@dkrako dkrako enabled auto-merge (squash) March 15, 2025 14:53
Copy link
Contributor

@dkrako dkrako left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This PR is a major breakthrough for pymovements!

YAML dataset definitions will simplify the dataset contribution process by a margin.

Thank you for all the hard work and keeping up with all my change requests!

As a side note: this PR set up a new record for the most comments in a PR, with a whopping count of 155 comments.

You now occupy the top 3 in the list. I may have played a part in this 😇

@dkrako dkrako merged commit 42f33a2 into main Mar 15, 2025
25 checks passed
@dkrako dkrako deleted the yaml-public-datasets-loader branch March 15, 2025 15:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request essential important highlight
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants