
Commit 6beeb33

update links for gpu blog post to v1.0 (#788)
* update links to v1.0
* [pre-commit.ci] auto fixes from pre-commit.com hooks; for more information, see https://pre-commit.ci

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
1 parent fd6b25b commit 6beeb33

2 files changed (+4, −6 lines changed)


src/posts/gpu-pipeline/index.md

Lines changed: 4 additions & 4 deletions
@@ -107,7 +107,7 @@ During the hackathon, we tested the following strategies to improve the data loa
 
 The copy of the ERA5 dataset we were using initially had a suboptimal chunking scheme of `{'time': 10, 'channel': C, 'height': H, 'width': W}`, which meant that a minimum of 10 time steps of data was being read even if we only needed 2 consecutive time steps.
 We decided to rechunk the data to align with our access pattern of 1 timestep at a time, while reformatting to Zarr format 3.
-The full script is available [here](https://github.com/pangeo-data/ncar-hackathon-xarray-on-gpus/blob/main/rechunk/era5_rechunking.ipynb), with the main code looking like so:
+The full script is available [here](https://github.com/pangeo-data/ncar-hackathon-xarray-on-gpus/blob/v1.0/rechunk/era5_rechunking.ipynb), with the main code looking like so:
 
 ```python
 import xarray as xr
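The diff truncates the snippet, but for context, a minimal sketch of such a rechunking step might look like the following (the store paths are placeholders, and `zarr_format=3` assumes a recent xarray/zarr-python 3 stack; the linked notebook is the authoritative version):

```python
# Hypothetical sketch of the rechunking step; store paths are placeholders,
# and zarr_format=3 assumes a recent xarray with zarr-python 3 support --
# see rechunk/era5_rechunking.ipynb for the real code.
import xarray as xr

# Open the original store lazily so nothing is read eagerly.
ds = xr.open_dataset("era5_original.zarr", engine="zarr", chunks={})

# Re-chunk to one timestep per chunk, keeping each spatial field whole,
# so reading 2 consecutive timesteps touches exactly 2 chunks.
ds = ds.chunk({"time": 1})

# Write out as a Zarr format 3 store.
ds.to_zarr("era5_rechunked.zarr", mode="w", zarr_format=3)
```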
@@ -198,7 +198,7 @@ With nvCOMP, all steps of data loading including reading from disk, decompressio
 
 To unlock this, we would need zarr-python to support GPU-based decompression codecs, with one for Zstandard (Zstd) currently being implemented in [this PR](https://github.com/zarr-developers/zarr-python/pull/2863).
 
-We tested the performance of GPU-based decompression using nvCOMP with Zarr-Python 3 and KvikIO, and compared it to CPU-based decompression using [this data reading benchmark here](https://github.com/pangeo-data/ncar-hackathon-xarray-on-gpus/blob/main/benchmark/era5_zarr_benchmark.py).
+We tested the performance of GPU-based decompression using nvCOMP with Zarr-Python 3 and KvikIO, and compared it to CPU-based decompression using [this data reading benchmark](https://github.com/pangeo-data/ncar-hackathon-xarray-on-gpus/tree/v1.0/benchmarks/era5_zarr_benchmark.py).
 
 Here are the results:
 
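For context on what the benchmark exercises, here is a rough, hypothetical sketch of a CPU-vs-GPU read comparison with zarr-python 3 (the store path and array name are invented, and GPU-side Zstd decompression still depends on the codec PR referenced above):

```python
# Rough sketch; the store path and array name are invented, not taken
# from the actual benchmark script.
import time

import zarr

def timed_read(label):
    t0 = time.perf_counter()
    data = zarr.open("era5_rechunked.zarr", mode="r")["temperature"][:16]
    print(f"{label} read: {time.perf_counter() - t0:.2f}s ({type(data).__name__})")

# CPU baseline: chunks are read and decompressed on the host into NumPy arrays.
timed_read("CPU")

# GPU buffers: zarr-python places output buffers on the device as CuPy
# arrays; decompression itself remains on the CPU until a GPU Zstd codec
# (e.g. the nvCOMP-backed one) becomes available.
with zarr.config.enable_gpu():
    timed_read("GPU")
```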

@@ -220,7 +220,7 @@ Ideally, we want to minimize idle time on both the CPU and GPU by overlapping th
 
 To address this inefficiency, we adopted [NVIDIA DALI (Data Loading Library)](https://docs.nvidia.com/deeplearning/dali/user-guide/docs/index.html), which provides a flexible, GPU-accelerated data pipeline with built-in support for asynchronous execution across CPU and GPU stages. DALI helps reduce CPU pressure, enables concurrent preprocessing, and increases training throughput by pipelining operations.
 
-First, we began with a minimal example in the [zarr_DALI directory](https://github.com/pangeo-data/ncar-hackathon-xarray-on-gpus/tree/main/zarr_DALI) with short, contained examples of a DALI pipeline loading directly from Zarr stores. This example shows how to build a custom DALI `pipeline` that uses an `ExternalSource` operator to load batched image data from a Zarr store and transfer them directly to GPU memory using CuPy arrays.
+First, we began with a minimal example in the [zarr_dali_example directory](https://github.com/pangeo-data/ncar-hackathon-xarray-on-gpus/tree/v1.0/zarr_dali_example) with short, contained examples of a DALI pipeline loading directly from Zarr stores. This example shows how to build a custom DALI `pipeline` that uses an `ExternalSource` operator to load batched image data from a Zarr store and transfer it directly to GPU memory using CuPy arrays.
 
 In short, to use DALI with Zarr for data loading, you need to:
 
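The enumerated steps fall outside this hunk, but for context, a hypothetical `ExternalSource`-based pipeline along the lines the example describes could look like this (store path, array name, and batch size are invented, not the code from zarr_dali_example):

```python
# Hypothetical sketch of a DALI pipeline fed from a Zarr store via
# ExternalSource; the store path and array name are placeholders.
import cupy as cp
import zarr
from nvidia.dali import fn, pipeline_def

array = zarr.open("era5_rechunked.zarr", mode="r")["temperature"]

def zarr_source(sample_info):
    # Read one timestep from the Zarr store, then hand DALI a CuPy array
    # so the sample lands directly in GPU memory.
    idx = sample_info.idx_in_epoch % array.shape[0]
    return cp.asarray(array[idx])

@pipeline_def(batch_size=4, num_threads=2, device_id=0)
def zarr_pipeline():
    return fn.external_source(source=zarr_source, batch=False, device="gpu")

pipe = zarr_pipeline()
pipe.build()
(batch,) = pipe.run()  # batch is a TensorListGPU living on the device
```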

@@ -273,7 +273,7 @@ output = pipe.run()
 images_gpu, labels_gpu = output
 ```
 
-Next, checkout the [end-to-end example](https://github.com/pangeo-data/ncar-hackathon-xarray-on-gpus/tree/main/zarr_ML_optimization) directory, where we showed how to use DALI to load data from Zarr stores, preprocess it on the GPU, and feed it into a PyTorch model for training.
+Next, check out the [end-to-end example](https://github.com/pangeo-data/ncar-hackathon-xarray-on-gpus/tree/v1.0/zarr_ML_optimization) directory, where we showed how to use DALI to load data from Zarr stores, preprocess it on the GPU, and feed it into a PyTorch model for training.
 
 Profiling results show that the DALI pipeline enables efficient overlap of CPU and GPU operations, significantly reducing GPU idle time and boosting overall training throughput.
 
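For a sense of the end-to-end flow, a hypothetical wrapper that feeds the DALI pipeline sketched above into a PyTorch training loop might look like this (the model, dataset size, and output name are placeholders; the zarr_ML_optimization directory holds the real implementation):

```python
# Hypothetical sketch; the model, dataset size, and output name are
# placeholders -- see zarr_ML_optimization for the actual training code.
import torch
from nvidia.dali.plugin.pytorch import DALIGenericIterator

# Wrap the DALI pipeline so each iteration yields dicts of CUDA tensors.
loader = DALIGenericIterator([pipe], output_map=["images"], size=64)

# Assume each sample is (channel, height, width) with 3 channels.
model = torch.nn.Conv2d(in_channels=3, out_channels=1, kernel_size=3).cuda()
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)

for batch in loader:
    images = batch[0]["images"].float()  # already on the GPU, no host copy
    loss = model(images).mean()          # dummy loss just to drive the loop
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```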

src/posts/pangeo-ml-ecosystem-2023/index.md

Lines changed: 0 additions & 2 deletions
@@ -94,13 +94,11 @@ Lastly, we highlighted some of the high-level Pangeo ML libraries enabling user
 ## Where to learn more
 
 - Educational resources:
-
   - [Project Pythia Cookbooks](https://cookbooks.projectpythia.org)
   - [GeoSMART Machine Learning Curriculum](https://geo-smart.github.io/mlgeo-book)
   - [University of Washington Hackweeks as a Service](https://guidebook.hackweek.io)
 
 - Pangeo ML Working Group:
-
   - [Monthly meetings](https://pangeo.io/meeting-notes.html#working-group-meetings)
   - [Discourse Forum](https://discourse.pangeo.io/tag/machine-learning)
 
