We want to explore the implementation of a simple torch.utils.data.Dataset-based data pipeline for sleap-nn. Our initial implementation used IterDataPipe, which is being deprecated, with new multi-threaded implementations still in progress. We then transitioned to LitData, which significantly improved training speed. However, LitData increases the disk footprint because it generates bin files.
We want to let users choose between a torch.utils.data.Dataset-based data pipeline with a custom caching implementation (Ref: Ultralytics BaseDataset) and the current LitData pipeline. Additionally, we will benchmark the performance of this new pipeline to evaluate its feasibility as a replacement for LitData and to optimize resource utilization.
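For the benchmarking step, a minimal sketch of the two measurements that seem most relevant here, throughput and on-disk footprint, might look like the following. The helper names `items_per_second` and `dir_size_bytes` are hypothetical and not part of sleap-nn or LitData; the pipeline argument can be any iterable (a Dataset wrapped in a DataLoader, a LitData loader, and so on).

```python
import time
from pathlib import Path
from typing import Any, Iterable


def items_per_second(pipeline: Iterable[Any], n_warmup: int = 2) -> float:
    """Iterate once over `pipeline` and return items (or batches) per second.

    The first `n_warmup` items are consumed before timing starts so that
    one-off startup costs do not dominate the measurement.
    """
    it = iter(pipeline)
    for _ in range(n_warmup):
        next(it, None)  # ignore warm-up items; tolerate short pipelines
    n_items = 0
    start = time.perf_counter()
    for _ in it:
        n_items += 1
    elapsed = time.perf_counter() - start
    return n_items / elapsed if elapsed > 0 else float("inf")


def dir_size_bytes(root: str) -> int:
    """Total size of all regular files under `root`, e.g. a chunk or cache directory."""
    return sum(p.stat().st_size for p in Path(root).rglob("*") if p.is_file())
```

Running both helpers on each pipeline with the same data and batch size would give a like-for-like comparison of speed against disk usage.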
PR1:
Vanilla torch.utils.data.Dataset implementation with no caching
Have individual torch dataset classes for each model type.
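As a rough illustration of the PR1 plan, the sketch below shows a map-style dataset with no caching and one subclass per model type. Everything in it is an assumption made for illustration: the class names, the frame dictionary layout, and the choice of "single instance" and "centroid" as example model types are not taken from the sleap-nn code. The import fallback only exists so the sketch also runs where PyTorch is not installed.

```python
from typing import Any, Dict, List, Sequence, Tuple

try:
    from torch.utils.data import Dataset
except ImportError:  # fallback: a map-style dataset only needs __len__/__getitem__
    class Dataset:  # type: ignore[no-redef]
        pass

Point = Tuple[float, float]


class BaseFrameDataset(Dataset):
    """Shared indexing logic: one item per labeled frame, nothing is cached."""

    def __init__(self, frames: Sequence[Dict[str, Any]]):
        # Assumed layout: {"image": ..., "instances": [[(x, y), ...], ...]}
        self.frames = frames

    def __len__(self) -> int:
        return len(self.frames)

    def __getitem__(self, idx: int) -> Dict[str, Any]:
        frame = self.frames[idx]
        return self.make_sample(frame["image"], frame["instances"])

    def make_sample(self, image: Any, instances: List[List[Point]]) -> Dict[str, Any]:
        raise NotImplementedError  # each model type defines its own sample


class SingleInstanceDataset(BaseFrameDataset):
    def make_sample(self, image, instances):
        # Only the first instance is used for this model type.
        return {"image": image, "keypoints": instances[0]}


class CentroidDataset(BaseFrameDataset):
    def make_sample(self, image, instances):
        # One centroid per instance: the mean of its keypoints.
        centroids = [
            (sum(x for x, _ in pts) / len(pts), sum(y for _, y in pts) / len(pts))
            for pts in instances
        ]
        return {"image": image, "centroids": centroids}
```

Keeping the indexing in a base class and the target construction in `make_sample` means each model type differs only in the part that is actually model-specific. With PyTorch installed, such datasets can be handed to torch.utils.data.DataLoader, though a custom collate_fn may be needed when the number of instances varies between frames.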
PR2:
Add caching, with Ultralytics BaseDataset as a reference (https://github.com/ultralytics/ultralytics/blob/1a5c35366ef4577b00c35f9e8c5d5d0f05a61859/ultralytics/data/base.py#L189)
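One possible shape for the caching option is a dataset that keeps already-loaded items in memory, so no extra files are written to disk. This is a minimal sketch under stated assumptions: `CachedDataset`, `load_fn`, and `max_items` are hypothetical names, and the code does not reproduce the referenced Ultralytics BaseDataset. The import fallback only exists so the sketch runs without PyTorch installed.

```python
from typing import Any, Callable, Dict, Optional

try:
    from torch.utils.data import Dataset
except ImportError:  # fallback: a map-style dataset only needs __len__/__getitem__
    class Dataset:  # type: ignore[no-redef]
        pass


class CachedDataset(Dataset):
    """Wrap a slow per-index loader and keep its results in an in-memory dict."""

    def __init__(
        self,
        n_items: int,
        load_fn: Callable[[int], Any],
        max_items: Optional[int] = None,
    ):
        self.n_items = n_items
        self.load_fn = load_fn      # e.g. reads and decodes one video frame
        self.max_items = max_items  # cap on cached items; None means unlimited
        self._cache: Dict[int, Any] = {}

    def __len__(self) -> int:
        return self.n_items

    def __getitem__(self, idx: int) -> Any:
        if idx in self._cache:
            return self._cache[idx]  # cache hit: skip the loader entirely
        item = self.load_fn(idx)
        # Store the item only while the cache is below its cap.
        if self.max_items is None or len(self._cache) < self.max_items:
            self._cache[idx] = item
        return item
```

One caveat for this design: when a DataLoader runs with num_workers > 0, each worker process holds its own copy of the dataset, so a lazily filled in-memory cache is built per worker rather than shared. Filling the cache before the workers start, or caching to disk instead, are alternatives worth comparing in the benchmark.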