NLDAS-icechunk

Original Dataset

https://ldas.gsfc.nasa.gov/nldas/v3

Deployed Dataset

The NLDAS virtual icechunk store is available on s3://nasa-waterinsight/virtual-zarr-store/NLDAS-3-icechunk/ `

You can use the data as follows:

Usage Example

import icechunk
import xarray as xr

storage = icechunk.s3_storage(
    bucket='nasa-waterinsight',
    prefix=f"virtual-zarr-store/NLDAS-3-icechunk",
    region="us-west-2",
    anonymous=True,
)

chunk_url = "s3://nasa-waterinsight/NLDAS3/forcing/daily/"
virtual_credentials = icechunk.containers_credentials({
    chunk_url: icechunk.s3_anonymous_credentials()
})

repo = icechunk.Repository.open(
    storage=storage,
    authorize_virtual_chunk_access=virtual_credentials,
)

session = repo.readonly_session('main')
ds = xr.open_zarr(session.store, consolidated=False, zarr_version=3, chunks={})
ds

Dependency management

This repo uses uv as package/project manager.

Running Jupyter Notebooks on the NASA VEDA hub

To reproduce results in the notebooks, you need to build a custom kernel with

uv sync
uv run bash
python -m ipykernel install --user --name=lndasenv --display-name="LNDAS-VENV"

Then select the "LNDAS-VENV" kernel on the upper right corner drop-down in your notebook (you might have to refresh the browser to see it).

Running scripts

You can run the scripts with

uv run <scriptname>

Benchmark timings

See the notebooks/benchmark.ipynb for the code. I ran these on a 16 core/64 GB server on the VEDA JupyterHub. I did not randomize the points, but instead aimed that every point is on a different spatial chunk. This should be the worst case scenario, and more regional cases should run faster than these times.

Single point: 2min 11s
5 points: 6min 37s
10 points: 10m 24s

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
notebooks		notebooks
scripts		scripts
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
README.md		README.md
lithops_config.json		lithops_config.json
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NLDAS-icechunk

Original Dataset

Deployed Dataset

Usage Example

Dependency management

Running Jupyter Notebooks on the NASA VEDA hub

Running scripts

Benchmark timings

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

NLDAS-icechunk

Original Dataset

Deployed Dataset

Usage Example

Dependency management

Running Jupyter Notebooks on the NASA VEDA hub

Running scripts

Benchmark timings

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages