-
Notifications
You must be signed in to change notification settings - Fork 2.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OSError: Not enough disk space. #2972
Comments
Maybe we can change the disk space calculating API from
|
Hi @qqaatw, thanks for reporting. Could you please try: dataset = load_dataset("natural_questions", cache_dir=os.path.abspath(args.dataset_cache_dir)) |
@albertvillanova it works! Thanks for your suggestion. Is that a bug of |
|
I had encountered the same error when running a command In case anyone encountered the same issue, this was my fix: # starting the container (mount project directory onto /app, so that the code and data in my project directory are available in the container)
docker run -it --rm -v $(pwd):/app my-demo:latest bash # other code ...
ds = load_dataset('food101', cache_dir="/app/data") # set cache_dir to the absolute path of a directory (e.g. /app/data) that's mounted from the host (MacOS in my case) into the docker container
# this assumes ./data directory exists in your project folder. If not, create it or point it to any other existing directory where you want to store the cache Thanks @albertvillanova for posting the fix above :-) |
import datasets |
Describe the bug
I'm trying to download
natural_questions
dataset from the Internet, and I've specified the cache_dir which locates in a mounted disk and has enough disk space. However, even though the space is enough, the disk space checking function still reports the space of root/
disk having no enough space.The file system structure is like below. The root
/
has115G
disk space available, and thesda1
is mounted to/mnt
, which has1.2T
disk space available:Steps to reproduce the bug
Expected results
Can download the dataset without an error.
Actual results
The following error raised:
Environment info
datasets
version: 1.9.0The text was updated successfully, but these errors were encountered: