如何在张量流中随机设置恢复的权重为零?

时间:2017-01-20 15:22:04

标签: python tensorflow neural-network pycharm data-science

下午好。 我是一个张力流新手,目前正试图解决这个问题: 1)获得一个简单的神经网络,训练它,打印精度(完成) 2)保存(完成) 3)恢复它(完成) 4)随机设置恢复的权重为零。 (安培)

我已经阅读了这个主题:Dynamically changing weights in TensorFlow并尝试了几件事,但无济于事。 这是我的代码:

from __future__ import print_function

import tensorflow as tf

# Import MNIST data
from tensorflow.examples.tutorials.mnist import input_data
mnist = input_data.read_data_sets("/tmp/data/", one_hot=True)

# Parameters
learning_rate = 0.01
training_epochs = 20

batch_size = 100
display_step = 1

# tf Graph Input
x = tf.placeholder(tf.float32, [None, 784])  # mnist data image 28*28
y = tf.placeholder(tf.float32, [None, 10])  # 0-9 digits recognition => 10 classes

# Set model weights
W = tf.Variable(tf.zeros([784, 10]))
b = tf.Variable(tf.zeros([10]))

# Construct model
pred = tf.nn.softmax(tf.matmul(x, W) + b)   # Softmax

# Minimize error using cross entropy
cost = tf.reduce_mean(-tf.reduce_sum(y*tf.log(pred), reduction_indices=1))
# Gradient Descent
optimizer = tf.train.GradientDescentOptimizer(learning_rate).minimize(cost)

# Initializing the variables
init_op = tf.global_variables_initializer()
saver = tf.train.Saver()

# Launch the graph
with tf.Session() as sess:
    sess.run(init_op)

# Training cycle
for epoch in range(training_epochs):
    avg_cost = 0.
    total_batch = int(mnist.train.num_examples/batch_size)
    # Loop over all batches
    for i in range(total_batch):
        batch_xs, batch_ys = mnist.train.next_batch(batch_size)
        # Run optimization op (backprop) and cost op (to get loss value)
        _, c = sess.run([optimizer, cost], feed_dict={x: batch_xs,
                                                      y: batch_ys})
    # Compute average loss
    avg_cost += c / total_batch
    # Display logs per epoch step
    if (epoch+1) % display_step == 0:
      print("Epoch:", '%04d' % (epoch+1), "cost=", "{:.9f}".format(avg_cost))

print("Optimization Finished!")

# Test model
correct_prediction = tf.equal(tf.argmax(pred, 1), tf.argmax(y, 1))
# Calculate accuracy
accuracy = tf.reduce_mean(tf.cast(correct_prediction, tf.float32))

# Save the variables to disk.

save_path = saver.save(sess,"/Users/mac/PycharmProjects/untitled1/MyModel",
write_meta_graph=True)
print("Model saved in file: %s" % save_path)
print("Accuracy_old:", accuracy.eval({x: mnist.test.images, y:mnist.test.labels})) 

new_saver = tf.train.import_meta_graph('MyModel.meta')
new_saver.restore(sess, tf.train.latest_checkpoint('./'))
all_vars = tf.get_collection('vars')
for v in all_vars:
    v_ = sess.run(v)
    print(v_)

#Rand = tf.Variable(tf.random_normal([784, 10]))
#Zeroes = tf.mul(tf.zeros([784, 10]),Rand)
#W = tf.mul(Zeroes,Rand)
W = tf.mul(W, 0)
print("Accuracy_new:", accuracy.eval({x: mnist.test.images,     y:mnist.test.labels}))

我尝试使用随机分布乘以零,而不是简单的0,没有任何变化,即使我试图将W = 0,精度也是一样的。

非常感谢某人的建议。

1 个答案:

答案 0 :(得分:1)

该行

W = tf.mul(W, 0)

在图表中创建一个未被任何人使用的新节点 - accuracy仍在使用旧的W,这就是您没有看到任何变化的原因。更改W的方法是使用TensorFlow分配并运行它(请参阅How to assign value to a tensorflow variable?),例如

assign_op = W.assign(tf.mul(W,0))
sess.run(assign_op)