Conv2DTranspose not working with tensorflow QAT per-channel quantization with range learning scheme #3973

deltacosos · 2025-04-09T11:56:45Z

If I run test-case from below link, it works as expected and prints correctly: https://github.com/quic/aimet/blob/ce8e344685e1949bb5fbd0c7836b16defce33981/TrainingExtensions/tensorflow/test/python/test_per_channel_quantization_keras.py

Skipping registering GPU devices...
1/1 [==============================] - 0s 305ms/step
2025-04-09 14:30:56.376729: W tensorflow/core/grappler/optimizers/loop_optimizer.cc:907] Skipping loop optimization for Merge node with control input: StatefulPartitionedCall_1/StatefulPartitionedCall/StatefulPartitionedCall/cond/branch_executed/_1032
1/1 [==============================] - 3s 3s/step - loss: 350.1171
1/1 [==============================] - 0s 15ms/step - loss: 123.7629
1/1 [==============================] - 0s 23ms/step - loss: 101.5782
1/1 [==============================] - 0s 23ms/step - loss: 132.9027
1/1 [==============================] - 0s 24ms/step - loss: 144.4747
1/1 [==============================] - 0s 17ms/step - loss: 100.9962
1/1 [==============================] - 0s 23ms/step - loss: 47.0214
1/1 [==============================] - 0s 16ms/step - loss: 29.8525
1/1 [==============================] - 0s 14ms/step - loss: 47.3242
1/1 [==============================] - 0s 23ms/step - loss: 66.4983

But if I modify the network in a way that it contains Conv2DTranspose, I am facing following error:

            inputs = tf.keras.layers.Input(shape=(32, 32, 4,))
            conv_op = tf.keras.layers.Conv2D(2, (3, 3),
                                             kernel_initializer=tf.random_uniform_initializer(-1, 2),
                                             bias_initializer='random_uniform',
                                             padding='SAME')(inputs)
            relu_op = tf.keras.layers.ReLU()(conv_op)
            up = tf.keras.layers.Conv2DTranspose(filters=32, kernel_size=2, strides=2, activation='relu', padding='same')(relu_op)
            reshape = tf.keras.layers.Flatten()(up)
            dense = tf.keras.layers.Dense(10, bias_initializer='random_uniform')(reshape)
            model = tf.keras.Model(inputs=inputs, outputs=dense, name="conv_functional")

    File "../anaconda3/envs/envaimet/lib/python3.8/site-packages/aimet_tensorflow/keras/quantsim.py", line 697, in train_step
      self._fill_missing_encoding_min_max_gradients(gradients)
    File "../anaconda3/envs/envaimet/lib/python3.8/site-packages/aimet_tensorflow/keras/quantsim.py", line 661, in _fill_missing_encoding_min_max_gradients
      dloss_by_dmin, dloss_by_dmax = param_quantizer.get_gradients_for_encoding_min_max(weight_tensor,
    File "../anaconda3/envs/envaimet/lib/python3.8/site-packages/aimet_tensorflow/keras/quant_sim/tensor_quantizer.py", line 884, in get_gradients_for_encoding_min_max
      gradients = quantsim_per_channel_custom_grad_learned_grid(weight_tensor,
    File "../anaconda3/envs/envaimet/lib/python3.8/site-packages/aimet_tensorflow/keras/quant_sim/quantsim_straight_through_grad.py", line 364, in quantsim_per_channel_custom_grad_learned_grid
      dloss_by_dmin, dloss_by_dmax, dloss_by_dx = \
    File "../anaconda3/envs/envaimet/lib/python3.8/site-packages/aimet_tensorflow/keras/quant_sim/quantsim_straight_through_grad.py", line 332, in _compute_dloss_by_dmin_dmax_and_dx_for_per_channel
      dloss_by_dmin, dloss_by_dmax, dloss_by_dx = \
    File "../anaconda3/envs/envaimet/lib/python3.8/site-packages/aimet_tensorflow/keras/quant_sim/quantsim_straight_through_grad.py", line 233, in _compute_dloss_by_dmin_dmax_and_dx
      dloss_by_dmax = tf.cast(_compute_dloss_by_dmax(x, grad, scaling, rounded_offset, bitwidth, is_symmetric),
    File "../anaconda3/envs/envaimet/lib/python3.8/site-packages/aimet_tensorflow/keras/quant_sim/quantsim_straight_through_grad.py", line 167, in _compute_dloss_by_dmax
      r_x_by_s_minus_x_by_s = tf.round(x / scaling) - (x / scaling)
Node: 'truediv_4'
Incompatible shapes: [2,2,32,2] vs. [32]
	 [[{{node truediv_4}}]] [Op:__inference_train_function_9818]

It looks like that there is some dimensional mismatch because of the fact, that in tensorflow weights of transpose convolutions are in shape [k,k,out_channels,in_channels] while in regular convolution the shape is [k,k,in_channels,out_channels].

Is this known issue like missing feature? It would be nice to use also transpose convolutions with range learning schemes.

The text was updated successfully, but these errors were encountered:

deltacosos · 2025-04-16T10:14:46Z

Any update on this?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Conv2DTranspose not working with tensorflow QAT per-channel quantization with range learning scheme #3973

Conv2DTranspose not working with tensorflow QAT per-channel quantization with range learning scheme #3973

deltacosos commented Apr 9, 2025

deltacosos commented Apr 16, 2025

Conv2DTranspose not working with tensorflow QAT per-channel quantization with range learning scheme #3973

Conv2DTranspose not working with tensorflow QAT per-channel quantization with range learning scheme #3973

Comments

deltacosos commented Apr 9, 2025

deltacosos commented Apr 16, 2025