Testing open_virtual_mfdataset on GES DISC Cloud OPeNDAP collections #912
Hey @battistowx, thanks for reporting the bugs and opening this discussion! Here is my understanding of the codecs error: older dmrpp versions do not record the zlib (aka "deflate") compression level (relevant issue here), so the dmrpp parser has to choose the recommended default zlib compression level.
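As a small aside on why a guessed level is harmless for reading the data: a zlib stream decodes the same way regardless of the level used to write it, so the level only matters when the codec metadata of two arrays is compared. A minimal sketch using Python's standard zlib module (the payload is made up for illustration and is not MERRA-2 data):

```python
import zlib

# Made-up payload standing in for one chunk of array data (assumption).
payload = bytes(range(256)) * 64

# Compress the same bytes at the two levels seen in the error message.
level2 = zlib.compress(payload, 2)
level6 = zlib.compress(payload, 6)

# Decompression takes no level argument: both streams decode to the
# original bytes, whichever level produced them.
decoded2 = zlib.decompress(level2)
decoded6 = zlib.decompress(level6)

print(decoded2 == payload, decoded6 == payload)
```

This only illustrates the zlib behaviour. It says nothing about how the dmrpp parser or ManifestArray handle the level internally.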
Testing M2TMNXSLV (Monthly)
For this one, I have success when opening 1980-2010. When I extend the range to 1980-2020, however, I get a similar but slightly different error:
When looking at these three dmrpp files throughout the extent, it looks like
I've started to test out the functionality of open_virtual_mfdataset on a few NASA GES DISC collections, to prepare for upcoming tutorials. I wanted to record my testing experience, along with a few thoughts and questions, in this thread. For my testing, I am using our cloud OPeNDAP-enabled MERRA-2 collection, whose granules are all NetCDF files of roughly 300-400 MB each. As of today, our cloud OPeNDAP-enabled collections consist only of NLDAS and MERRA-2, which are all in NetCDF.

Errors:
KeyError: 0 (triggered when running access="direct" outside of us-west-2)
TypeError: Union[arg, ...]: each arg must be a type. Got {} (triggered when h5py is not installed)
NotImplementedError: The ManifestArray class cannot concatenate arrays which were stored using different codecs, But found codecs Codec(compressor=None, filters=[{'id': 'shuffle', 'elementsize': 4}, {'id': 'zlib', 'level': 2}]) vs Codec(compressor=None, filters=[{'id': 'shuffle', 'elementsize': 4}, {'id': 'zlib', 'level': 6}]) .See https://github.com/zarr-developers/zarr-specs/issues/288 (raised when opening certain granules which contain different zlib compression levels for some reason)

More about the codec error:
This one is particularly interesting. When accessing the first 10 granules of M2T1NXSLV, which cover 1980-01-01 to 1980-01-10, no errors appear and the ManifestArrays are all created. However, once I include the 11th granule, I get the codec error saying that the compression level is different. Indeed, when opening that 11th granule and inspecting its attributes, I see that the level is 6 instead of 2. When picking other random date ranges, such as in 1990, there are no errors and the compression levels are all the same. I will report this back to our internal staff, and will also try other MERRA-2 collections to see if this is a trend.

As I test, I'll continue to add posts to this thread.
Great work on this new functionality, everyone!
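P.S. For anyone who wants to pin down where the compression level changes across a date range, here is a rough sketch of how the per-granule filter lists could be collapsed into runs of identical codecs. Everything in it is an assumption for illustration: the helper names (filters_key, codec_runs, strip_zlib_level) and the granule names are hypothetical, and the filter dicts are hand-typed from the error message above rather than read from real files.

```python
from itertools import groupby


def filters_key(filters):
    """Turn a list of filter dicts into a hashable, comparable key."""
    return tuple(tuple(sorted(f.items())) for f in filters)


def codec_runs(granule_filters):
    """Collapse consecutive granules with identical filters into runs.

    granule_filters: list of (granule_name, filters) pairs in time order.
    Returns a list of (filters, [granule_name, ...]) tuples.
    """
    runs = []
    for _, group in groupby(granule_filters, key=lambda item: filters_key(item[1])):
        members = list(group)
        runs.append((members[0][1], [name for name, _ in members]))
    return runs


def strip_zlib_level(filters):
    """Copy the filters, dropping only the 'level' key of zlib entries."""
    return [
        {k: v for k, v in f.items() if not (f.get("id") == "zlib" and k == "level")}
        for f in filters
    ]


# Hand-written stand-in for the first 11 granules (hypothetical names).
shuffle = {"id": "shuffle", "elementsize": 4}
sample = [
    (f"granule_{i:02d}", [shuffle, {"id": "zlib", "level": 2}]) for i in range(1, 11)
]
sample.append(("granule_11", [shuffle, {"id": "zlib", "level": 6}]))

runs = codec_runs(sample)
for filters, names in runs:
    print(filters, names[0], "to", names[-1], f"({len(names)} granules)")

# Ignoring the zlib level, all 11 granules fall into a single run.
runs_ignoring_level = codec_runs(
    [(name, strip_zlib_level(filters)) for name, filters in sample]
)
```

Each run boundary marks the first granule whose codec differs from its predecessor, which is the point where concatenation would fail. The second call only shows that the level is the sole difference in this made-up sample.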