How to register your own gradient for an operation consisting of tf operations

Question

In particular, I have a simple forward pass (fprop) that is a composition of TensorFlow operations. I want to override TensorFlow's gradient calculation with my own gradient function, registered via RegisterGradient.

What is wrong with this code?

    import tensorflow as tf
    from tensorflow.python.framework import ops

    @ops.RegisterGradient("MyopGrad")
    def frop_grad(op, grad):
        x = op.inputs[0]
        return 0 * x  # zero out to see the difference:

    def fprop(x):
        x = tf.sqrt(x)
        out = tf.maximum(x, .2)
        return out

    a = tf.Variable(tf.constant([5., 4., 3., 2., 1.], dtype=tf.float32))
    h = fprop(a)
    h = tf.identity(h, name="Myop")
    grad = tf.gradients(h, a)

    g = tf.get_default_graph()
    with g.gradient_override_map({'Myop': 'MyopGrad'}):
        with tf.Session() as sess:
            sess.run(tf.initialize_all_variables())
            result = sess.run(grad)

    print(result[0])

I expect the printed gradient to be all zeros, but instead I get:

 [ 0.2236068 0.25000003 0.28867513 0.35355341 0.5 ]

python machine-learning tensorflow

google_addict Apr 6 '17 at 13:22

1 answer

Mzhm · Accepted Answer · 2017-06-16T15:19:08+0000

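You need to create the op inside the with g.gradient_override_map({...}) scope. The override is attached to ops at the moment they are created, so entering the scope only around the Session has no effect on an op that already exists.

In addition, the key of the map must be the op type, Identity, not the op name, Myop.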
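To see why both changes matter, here is a toy model in plain Python. It is only a sketch of the mechanism described above and does not use or reproduce TensorFlow's internals: ToyGraph, register_gradient, backprop and GRADIENT_REGISTRY are hypothetical names made up for illustration. The only behaviour it assumes is the one stated in this answer: the override is looked up by op type and fixed when the op is created.

```python
import contextlib

# Toy stand-ins for illustration only; none of these names are TensorFlow APIs.
GRADIENT_REGISTRY = {}  # gradient name -> gradient function


def register_gradient(name):
    """Decorator that stores a gradient function under `name`."""
    def decorator(fn):
        GRADIENT_REGISTRY[name] = fn
        return fn
    return decorator


class ToyGraph:
    def __init__(self):
        self._override = {}  # op type -> gradient name, active inside the scope

    @contextlib.contextmanager
    def gradient_override_map(self, mapping):
        saved = self._override
        self._override = {**saved, **mapping}
        try:
            yield
        finally:
            self._override = saved

    def create_op(self, op_type, name):
        # The lookup uses the op *type*, and the result is frozen into the op
        # at creation time. The op *name* plays no part in the lookup.
        grad_name = self._override.get(op_type, op_type)
        return {"type": op_type, "name": name, "grad": grad_name}


@register_gradient("Identity")
def identity_grad(upstream):
    return upstream  # default behaviour: pass the gradient through


@register_gradient("MyopGrad")
def zero_grad(upstream):
    return 0.0 * upstream  # custom behaviour: zero the gradient


def backprop(op, upstream=1.0):
    return GRADIENT_REGISTRY[op["grad"]](upstream)


g = ToyGraph()

# Pattern from the question: op created outside the scope, map keyed by name.
op_outside = g.create_op("Identity", name="Myop")
with g.gradient_override_map({"Myop": "MyopGrad"}):
    grad_question = backprop(op_outside)

# Pattern from this answer: map keyed by type, op created inside the scope.
with g.gradient_override_map({"Identity": "MyopGrad"}):
    op_inside = g.create_op("Identity", name="Myop")
grad_answer = backprop(op_inside)

print(grad_question, grad_answer)  # prints: 1.0 0.0
```

In this toy model the question's pattern fails for two independent reasons: the op already exists when the scope is entered, and "Myop" never matches the op type "Identity". Fixing only one of the two would still leave the default gradient in place.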

Here is the complete code:

    import tensorflow as tf
    from tensorflow.python.framework import ops

    @ops.RegisterGradient("MyopGrad")
    def frop_grad(op, grad):
        x = op.inputs[0]
        return 0 * x  # zero out to see the difference:

    def fprop(x):
        x = tf.sqrt(x)
        out = tf.maximum(x, .2)
        return out

    a = tf.Variable(tf.constant([5., 4., 3., 2., 1.], dtype=tf.float32))
    h = fprop(a)

    g = tf.get_default_graph()
    # Key the map by op type ('Identity') and create the op inside the scope.
    with g.gradient_override_map({'Identity': 'MyopGrad'}):
        h = tf.identity(h, name="Myop")
        grad = tf.gradients(h, a)

    with tf.Session() as sess:
        sess.run(tf.initialize_all_variables())
        result = sess.run(grad)

    print(result[0])

Output:

 [ 0. 0. 0. 0. 0.]
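For comparison, the non-zero values reported in the question appear to be just the default analytic gradient of maximum(sqrt(x), 0.2), which is 1 / (2 * sqrt(x)) wherever sqrt(x) exceeds 0.2. The short check below uses plain Python rather than TensorFlow; default_grad and floor are names introduced here for illustration, and ties between the two arguments of maximum are ignored because they do not occur for these inputs.

```python
import math

xs = [5.0, 4.0, 3.0, 2.0, 1.0]  # values of the variable `a` in the question
floor = 0.2                     # second argument of tf.maximum in fprop


def default_grad(x):
    """Analytic d/dx of max(sqrt(x), floor), ignoring the tie case."""
    root = math.sqrt(x)
    # maximum passes the gradient through only where sqrt(x) is the larger
    # argument; there the derivative of sqrt(x) is 1 / (2 * sqrt(x)).
    return 0.5 / root if root > floor else 0.0


grads = [default_grad(x) for x in xs]
print(grads)
```

The computed values agree with the output in the question to float32 precision, which is consistent with the custom gradient never having been applied there.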
