Allow user to change the channel axis for BatchNorm function and the likes #1666


Open
wants to merge 6 commits into master
Conversation

vboussange

Proposal for issue #1664

This PR allows the user to customize the channel axis for normalisation functions (BatchNorm, GroupNorm and InstanceNorm).

Example

```julia
using Flux

channel_size = 3
channel_axis = 1

# Place the channels along the first axis instead of the default second-to-last
BN = BatchNorm(channel_size, dim = channel_axis)
x = randn(channel_size, 10, 10)
BN(x)
```

```
@@ -243,9 +245,11 @@ mutable struct BatchNorm{F,V,N,W}
    track_stats::Bool
    active::Union{Bool, Nothing}
    chs::Int # number of channels
    dim::Union{Int, Nothing} # channel dimension
```
Member
This should never be `nothing`; it should default to N - 1.

Member

How do you know N - 1 a priori, though? The question is what an appropriate sentinel value is. Perhaps a negative offset from the end, since by default the channels are at dim `end - 1`?

Member

Right, so it will need to be resolved at runtime.
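A minimal sketch of the runtime-resolution idea discussed above, assuming a hypothetical helper `resolve_channel_dim` (not part of this PR): a non-positive `dim` is treated as an offset from the last dimension of the input, so a sentinel of `-1` recovers Flux's current default of `ndims(x) - 1` without knowing N in advance.

```julia
# Hypothetical helper (illustration only): resolve the channel
# dimension once the input's rank N is known at call time.
# A positive `dim` is used as-is; a non-positive `dim` is an offset
# from the end, so the sentinel -1 means N - 1 (Flux's current
# convention of channels in the second-to-last dimension).
function resolve_channel_dim(dim::Int, N::Int)
    d = dim > 0 ? dim : N + dim   # e.g. dim = -1, N = 4 → d = 3
    1 <= d <= N || throw(ArgumentError("invalid channel dim $d for $N-dim input"))
    return d
end

resolve_channel_dim(-1, 4)  # default: second-to-last dim of a 4D input
resolve_channel_dim(1, 3)   # explicit channels-first, as in the example above
```

This keeps the stored field a plain `Int` (no `Union{Int, Nothing}`), since the sentinel is only interpreted when an input is seen.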

@DhairyaLGandhi
Member

How well can cuDNN handle this?

@ToucheSir
Member

Looping back to answer the cuDNN question: it supports exactly one other configuration for the channel dim, via CUDNN_TENSOR_NHWC. I think adding that to NNlibCUDA at https://github.com/FluxML/NNlibCUDA.jl/blob/96a334633ef3a3707c85fc1754c2c7eb8849db4e/src/cudnn/batchnorm.jl#L27 would be a good first step towards getting this going again. Here are a couple of example implementation PRs from MXNet and PyTorch.
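To illustrate the constraint: since Julia arrays are column-major, Flux's default WHCN layout (channels at `ndims(x) - 1`) corresponds to cuDNN's CUDNN_TENSOR_NCHW, while channels-first (`dim = 1`, channels fastest-varying in memory) corresponds to CUDNN_TENSOR_NHWC. A hypothetical dispatch sketch (not the NNlibCUDA implementation; the helper name and symbols are assumptions for illustration):

```julia
# Hypothetical sketch: map the requested channel dimension of an
# N-dimensional input to one of the two cuDNN-supported tensor formats.
# Any other layout has no cuDNN fast path and would need a fallback
# (e.g. permutedims into a supported layout, or a generic GPU kernel).
function cudnn_format_for(dim::Int, N::Int)
    dim == N - 1 && return :CUDNN_TENSOR_NCHW   # Flux's default WHCN layout
    dim == 1     && return :CUDNN_TENSOR_NHWC   # channels fastest-varying
    return nothing                              # unsupported: take a fallback path
end

cudnn_format_for(3, 4)  # default channel dim of a 4D input → NCHW
cudnn_format_for(1, 4)  # channels-first → NHWC
cudnn_format_for(2, 4)  # anything else → nothing (fallback)
```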
