The implementation of open_datatree
works, but is inefficient, because it calls open_dataset
once for every group in the file. We should refactor this to improve the performance, which would fix issues like xarray-contrib/datatree#330.
We discussed this in the datatree meeting, and my understanding is that concretely we need to:
open_datatree
, probably involving first writing then benchmarking the opening of a special netCDF file that has no data but lots of groups. (tracked in Add benchmark test for open_datatree #9100)NetCDFDatastore
class to only create one CachingFileManager
object per file, not one per group, see manager = CachingFileManager( .NetCDF4BackendEntrypoint.open_datatree
to use an implementation that goes through NetCDFDatastore
without calling the top-level xr.open_dataset
again.xr.open_datatree
on a netCDF file has actually improved.It would be great to get this done soon as part of the datatree integration project. @kmuehlbauer I know you were interested - are you willing / do you have time to take this task on?
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4