Fix batchnorm in testmode without track stats #576

paulnovo · 2024-04-21T18:50:01Z

In test mode, the CUDA cuDNN implementation of batchnorm was not matching the CPU batchnorm in FLUX. In FLUX, with track_stats=False, the mean and variance of the current batch are used. Here, mean and variance were initialized to 0 and 1, respectively, and passed to cudnnBatchNormalizationForwardInference.

To fix this, we need to calculate the mean and variance over the current batch to match the CPU implementation. Unfortunately, cudnnBatchNormalizationForwardInference requires a trained running mean and variance. However, batchnorm train and test should be identical without tracked stats since they both normalize over the current batch. As a result we can use cudnnBatchNormalizationForwardTraining in test mode as well, which works without a running mean and variance.

This is needed to help address FluxML/Flux.jl#1606 along with Flux.jl PR 2427.

PR Checklist

Tests are added
Documentation, if applicable

In test mode, the CUDA cuDNN implementation of batchnorm was not matching the CPU batchnorm in FLUX. In FLUX, with track_stats=False, the mean and variance of the current batch are used. Here, mean and variance were initialized to 0 and 1, respectively, and passed to cudnnBatchNormalizationForwardInference. To fix this, we need to calculate the mean and variance over the current batch to match the CPU implementation. Unfortunately, cudnnBatchNormalizationForwardInference requires a trained running mean and variance. However, batchnorm train and test should be identical without tracked stats since they both normalize over the current batch. As a result we can use cudnnBatchNormalizationForwardTraining in test mode as well, which works without a running mean and variance.

paulnovo mentioned this pull request Apr 21, 2024

Allow BatchNorm on CUDA with track_stats=False FluxML/Flux.jl#2427

Merged

3 tasks

CarloLucibello merged commit e8e7572 into FluxML:master Apr 23, 2024
11 of 13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix batchnorm in testmode without track stats #576

Fix batchnorm in testmode without track stats #576

paulnovo commented Apr 21, 2024 •

edited

Loading

Fix batchnorm in testmode without track stats #576

Fix batchnorm in testmode without track stats #576

Conversation

paulnovo commented Apr 21, 2024 • edited Loading

PR Checklist

paulnovo commented Apr 21, 2024 •

edited

Loading