Haste RNN support #8

Closed
thegodone opened this issue Jun 23, 2021 · 8 comments

Comments

@thegodone
Contributor

Do you see a way to replace the Keras GRU with the haste_tf GRU version from here: https://github.com/lmnt-com/haste/blob/master/docs/tf/haste_tf.md ?

There are a lot of advantages to this package, e.g. regularization at the CUDA level.

Unfortunately, I don't see a way to plug haste_tf in at the Keras level (see lmnt-com/haste#33).

Your help would be appreciated.

@thegodone
Contributor Author

thegodone commented Jun 23, 2021

I was able to do this:

import numpy as np
import tensorflow as tf
import haste_tf as haste

train_x = np.random.rand(500, 40, 20)
train_y = np.random.rand(500, 40, 1)

inputs = tf.keras.Input(shape=(train_x.shape[1], train_x.shape[2]))
haste1 = haste.LayerNormGRU(20, direction='unidirectional', zoneout=0.1, dropout=0.1)
fc1 = tf.keras.layers.Dense(60, activation='relu', kernel_initializer='he_uniform')
dr1 = tf.keras.layers.Dropout(0.2)
fc2 = tf.keras.layers.Dense(1)

x, state = haste1(inputs, training=True)
x = fc1(x)  # feed the GRU output forward, not the raw inputs
x = dr1(x)
outputs = fc2(x)

model = tf.keras.Model(inputs=inputs, outputs=outputs)
model.summary()

opt = tf.keras.optimizers.Adam(learning_rate=0.01)
model.compile(loss='mse', optimizer=opt)  # continuous targets, so MSE rather than categorical cross-entropy

model_hist = model.fit(train_x, train_y, epochs=10, batch_size=32, verbose=1)

Is it possible to add this layer in place of the plain Keras "GRU"?
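
For reference, one way this might look is a thin wrapper layer, as in the sketch below. HasteGRULayer is a hypothetical name; the sketch only assumes the haste_tf call signature used above, i.e. that the layer returns (sequence_output, final_state):

import tensorflow as tf
import haste_tf as haste

class HasteGRULayer(tf.keras.layers.Layer):
    # Hypothetical wrapper so haste's LayerNormGRU drops in where a Keras GRU would go.
    def __init__(self, units, zoneout=0.0, dropout=0.0, **kwargs):
        super().__init__(**kwargs)
        self.gru = haste.LayerNormGRU(units, direction='unidirectional',
                                      zoneout=zoneout, dropout=dropout)

    def call(self, inputs, training=False):
        # haste returns (sequence_output, final_state); keep the full sequence
        outputs, _ = self.gru(inputs, training=training)
        return outputs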

@PatReis
Collaborator

PatReis commented Jun 28, 2021

I will look into it. However, since kgcnn is fully based on Keras, I am not sure we should add external dependencies beyond Keras. But as a layer module it should be fine.

@thegodone
Contributor Author

I really want to test this special layer if possible. Maybe you can make an exception for this one.

@thegodone
Contributor Author

There is a typo here:
[screenshot of code containing the misspelling]

"dopout" => "dropout"

@thegodone
Contributor Author

Another issue, then:
[screenshot of an error traceback]

@PatReis
Collaborator

PatReis commented Jun 30, 2021

Thanks, I am still trying to get haste installed. But I think this error is not from us; we should report it to haste, since I think there must be a
tf.compat.v1.get_variable("gamma", shape=1, initializer=tf.compat.v1.keras.initializers.ones())
because
tf.compat.v1.get_variable("gamma", initializer=tf.compat.v1.keras.initializers.ones())
will always throw an error regardless of the input, as a callable initializer carries no shape information of its own.

I am not sure there is a great advantage to using the haste cells; I think the advantage lies in the full haste GRU. But because the recurrent step requires attention pooling over the nodes (for graph networks), we are stuck with the GRUCells; see the sketch below. Still, I will think about this, maybe I made a mistake in my logic/implementation.
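
To illustrate the point, here is a hypothetical graph-update loop using the standard Keras GRUCell, with a Dense layer as a stand-in for attention pooling (shapes are made up for the example):

import tensorflow as tf

cell = tf.keras.layers.GRUCell(32)
attention_pool = tf.keras.layers.Dense(32)  # stand-in for attention pooling over nodes

nodes = tf.random.normal([8, 32])  # 8 graph nodes with 32 features each
state = [tf.zeros([8, 32])]

for _ in range(3):  # three message-passing steps
    messages = attention_pool(nodes)      # pooling happens between recurrent steps,
    nodes, state = cell(messages, state)  # so a fused whole-sequence GRU kernel
                                          # (like haste's) cannot be dropped in here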

@thegodone
Contributor Author

thegodone commented Jun 30, 2021 via email

@PatReis
Collaborator

PatReis commented Jun 30, 2021

Okay, thanks. I opened an issue in haste.
