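Common TensorFlow Functions Explained (Part 1)

One thing to be clear on from the start: in a nested literal such as [[[ ]]], the outermost pair of brackets is the first dimension, which is axis 0; the next pair in is axis 1; and in a multi-dimensional tensor the last (innermost) dimension can also be addressed as axis -1. These axis numbers are used throughout what follows.

1 Matrix transformations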

tf.shape(Tensor)

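Returns the shape of a tensor. Note that tf.shape itself returns a tensor, and in TensorFlow (1.x graph mode, as used here) a tensor's concrete value is only obtained by evaluating it with sess.run(Tensor).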

import tensorflow as tf

x = [[1, 2, 3], [4, 5, 6]]
shape = tf.shape(x)
with tf.Session() as sess:
    print(shape)            # the symbolic tensor, not its value
    print(sess.run(shape))  # the evaluated shape

# Output
Tensor("Shape:0", shape=(2,), dtype=int32)
[2 3]

tf.expand_dims(Tensor,dim)

Adds a dimension of size 1 to a tensor, raising its rank by one.

The example from the official documentation:

# 't' is a tensor of shape [2]
shape(expand_dims(t, 0)) ==> [1, 2]   # dim 0: the vector of shape [2] becomes 1x2
shape(expand_dims(t, 1)) ==> [2, 1]   # dim 1: the vector of shape [2] becomes 2x1
shape(expand_dims(t, -1)) ==> [2, 1]  # dim -1 means the last dimension, so the result is again 2x1

import tensorflow as tf

sess = tf.InteractiveSession()
labels = [1, 2, 3]
x = tf.expand_dims(labels, 0)  # becomes 1x3
print(sess.run(x))
x = tf.expand_dims(labels, 1)  # becomes 3x1
print(sess.run(x))
# >>> [[1 2 3]]
# >>> [[1]
#      [2]
#      [3]]

# 't2' is a tensor of shape [2, 3, 5]
shape(expand_dims(t2, 0)) ==> [1, 2, 3, 5]  # new dimension inserted at the first position
shape(expand_dims(t2, 2)) ==> [2, 3, 1, 5]  # new dimension inserted at the third position
shape(expand_dims(t2, 3)) ==> [2, 3, 5, 1]  # new dimension inserted at the fourth position

tf.squeeze()

Purpose: removes dimensions of size 1 from the shape of a tensor.

tf.squeeze(input, axis=None, name=None, squeeze_dims=None)

If squeeze_dims (the deprecated alias of axis) is not specified, every dimension of size 1 is removed.

  # 't' is a tensor of shape [1, 2, 1, 3, 1, 1]
  tf.shape(tf.squeeze(t))  # [2, 3]

  # 't' is a tensor of shape [1, 2, 1, 3, 1, 1]; here we name the positions to remove
  tf.shape(tf.squeeze(t, [2, 4]))  # [1, 2, 3, 1]
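The two squeeze cases above can be checked without building a graph. The following is a minimal sketch that uses NumPy as a stand-in (an assumption of this example, not something the original post uses); np.squeeze follows the same shape rule, and the array t is a hypothetical placeholder filled with zeros.

```python
import numpy as np

# Hypothetical tensor with the same shape as 't' above: [1, 2, 1, 3, 1, 1].
t = np.zeros((1, 2, 1, 3, 1, 1))

# No axis given: every size-1 dimension is dropped.
squeezed_all = np.squeeze(t)

# Only the size-1 dimensions at positions 2 and 4 are dropped.
squeezed_some = np.squeeze(t, axis=(2, 4))

print(squeezed_all.shape)   # (2, 3)
print(squeezed_some.shape)  # (1, 2, 3, 1)
```

Naming a position whose size is not 1 is an error in NumPy, which makes the explicit form a useful guard against silently dropping the wrong dimension.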

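tf.pack() == tf.stack()

tf.pack is the older name; it was renamed to tf.stack in TensorFlow 1.0. It stacks a list of rank-R tensors along the given axis into a single rank-(R+1) tensor.

  # 'x' is [1, 4]
  # 'y' is [2, 5]
  # 'z' is [3, 6]
  tf.stack([x, y, z]) => [[1, 4], [2, 5], [3, 6]]  # Pack along first dim.
  tf.stack([x, y, z], axis=1) => [[1, 2, 3], [4, 5, 6]]
  tf.stack([6, 10]) => [6, 10]  # in some applications this is used as a 6x10 shape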

tf.concat

tf.concat(values, axis, name='concat')

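Concatenates tensors along the specified dimension. Its usage resembles the pack/stack described earlier, except that concat joins along an existing dimension and leaves the rank unchanged, whereas stack creates a new dimension.

axis: must be a single integer indicating the dimension along which to concatenate.

If axis is 0, the tensors are joined along the first dimension of their shape; in practice the rows of the second tensor are placed below those of the first, so the columns grow longer.

t1 = [[1, 2, 3], [4, 5, 6]]     # shape(t1)=[2,3]; the 2 is the first dimension, the 3 is the second
t2 = [[7, 8, 9], [10, 11, 12]]  # shape(t2)=[2,3]
tf.concat([t1, t2], 0) ==> [[1, 2, 3], [4, 5, 6], [7, 8, 9], [10, 11, 12]]  # shape=[4,3]

If axis is 1, the tensors are joined along the second dimension of their shape.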

t1 = [[1, 2, 3], [4, 5, 6]]#shape(t1)=[2,3]

t2 = [[7, 8, 9], [10, 11, 12]]#shape(t2)=[2,3]

tf.concat([t1, t2],1) ==> [[1, 2, 3, 7, 8, 9], [4, 5, 6, 10, 11, 12]]##shape=[2,6]

If axis is -1, the tensors are joined along the last dimension of their shape.

t1 = [[[1, 2], [2, 3]], [[4, 4], [5, 3]]]

t2 = [[[7, 4], [8, 4]], [[2, 10], [15, 11]]]
# both have shape [2, 2, 2]
w = tf.concat([t1, t2], -1)  # last dimension: the innermost one, the final 2 in the shape
e = tf.concat([t1, t2], 1)   # second dimension: the middle 2 in the shape
q = tf.concat([t1, t2], 0)   # first dimension: the outermost one, the first 2 in the shape
with tf.Session() as sess:
    print(sess.run(tf.shape(t1)))
    print("result:w=", ' ', sess.run(w))
    print("result:e=", ' ', sess.run(e))
    print("result:q=", ' ', sess.run(q))

#################### Output ################
[2 2 2]
result:w= 
 [[[ 1  2  7  4]
  [ 2  3  8  4]]

 [[ 4  4  2 10]
  [ 5  3 15 11]]]
result:e= 
 [[[ 1  2]
  [ 2  3]
  [ 7  4]
  [ 8  4]]

 [[ 4  4]
  [ 5  3]
  [ 2 10]
  [15 11]]]
result:q= 
 [[[ 1  2]
  [ 2  3]]

 [[ 4  4]
  [ 5  3]]

 [[ 7  4]
  [ 8  4]]

 [[ 2 10]
  [15 11]]]

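values: the two (or more) tensors to be concatenated.

Note that two vectors cannot be passed to

tf.concat([t1, t2], 1)

because a vector's shape has only one dimension, so there is no second dimension to join along. Conceptually two vectors can be placed side by side, but in code this raises an error.

The fix is the tf.expand_dims described earlier, which adds the missing dimension:

t1 = tf.constant([1, 2, 3])
t2 = tf.constant([4, 5, 6])
# concated = tf.concat([t1, t2], 1)  # raises an error: rank-1 tensors have no axis 1
# concated = tf.concat([t1, t2], 0)  # fine: gives shape [6]
t1 = tf.expand_dims(tf.constant([1, 2, 3]), 1)  # shape [3, 1]
t2 = tf.expand_dims(tf.constant([4, 5, 6]), 1)  # shape [3, 1]
concated = tf.concat([t1, t2], 1)  # now valid: shape [3, 2]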
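The vector case can be reproduced outside a session. This is a rough sketch using NumPy as a stand-in (an assumption of this example; the post itself uses TensorFlow), where np.concatenate, np.expand_dims and np.stack play the roles of tf.concat, tf.expand_dims and tf.stack.

```python
import numpy as np

t1 = np.array([1, 2, 3])
t2 = np.array([4, 5, 6])

# Rank-1 arrays have no axis 1, so this concatenation is rejected.
try:
    np.concatenate([t1, t2], axis=1)
    axis1_failed = False
except ValueError:  # NumPy's AxisError is a subclass of ValueError
    axis1_failed = True

# Axis 0 exists, so the vectors are joined end to end: shape (6,).
joined = np.concatenate([t1, t2], axis=0)

# Give each vector a second dimension first, then join along it: shape (3, 2).
cols = np.concatenate([np.expand_dims(t1, 1), np.expand_dims(t2, 1)], axis=1)

# Stacking along a new axis 1 yields the same (3, 2) result in a single call.
stacked = np.stack([t1, t2], axis=1)

print(axis1_failed)   # True
print(joined.shape)   # (6,)
print(cols.tolist())  # [[1, 4], [2, 5], [3, 6]]
```

The last two lines of the computation show why concat and stack feel similar: stacking is equivalent to expanding each input by one dimension and then concatenating along it.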

tf.random_shuffle

tf.random_shuffle(value,seed=None,name=None)
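Randomly reorders value along its first dimension; the contents of each slice along that dimension are left intact.

sess = tf.InteractiveSession()
a = [[1, 2], [3, 4], [5, 6]]
x = tf.random_shuffle(a)
print(sess.run(x))
# ===> for example [[3 4], [5 6], [1 2]] (the order varies from run to run)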

tf.argmax | tf.argmin

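tf.argmax(input=tensor, dimension=axis)
Returns the position (index) of the largest value along the given axis of tensor; tf.argmin does the same for the smallest value.

a = tf.get_variable(name='a',
                    shape=[3, 4],
                    dtype=tf.float32,
                    initializer=tf.random_uniform_initializer(minval=-1, maxval=1))
b = tf.argmax(input=a, dimension=0)  # along axis 0: row index of the maximum in each column
c = tf.argmax(input=a, dimension=1)  # along axis 1: column index of the maximum in each row
sess = tf.InteractiveSession()
sess.run(tf.global_variables_initializer())  # replaces the deprecated tf.initialize_all_variables()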
print(sess.run(a))
#[[ 0.04261756 -0.34297419 -0.87816691 -0.15430689]
# [ 0.18663144  0.86972666 -0.06103253  0.38307118]
# [ 0.84588599 -0.45432305 -0.39736366  0.38526249]]
print(sess.run(b))
#[2 1 1 2]
print(sess.run(c))
#[0 1 0]

tf.equal

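tf.equal(x, y, name=None)
Compares two tensors element by element. It returns a tensor of type bool with the same shape, holding True where the elements are equal and False where they differ; it does not return a single True/False for the whole tensor.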
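Since the element-wise behaviour is easy to misread, here is a small sketch of it. It uses NumPy's np.equal as a stand-in for tf.equal (an assumption of this example), and the input values are made up for illustration.

```python
import numpy as np

# Hypothetical inputs that agree at positions 0 and 2 only.
x = np.array([1, 2, 3, 4])
y = np.array([1, 0, 3, 0])

# One boolean per element, same shape as the inputs.
elementwise = np.equal(x, y)

# Collapsing to a single answer is a separate, explicit step.
all_equal = bool(np.all(elementwise))

print(elementwise.tolist())  # [True, False, True, False]
print(all_equal)             # False
```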

tf.cast

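tf.cast(x, dtype, name=None)
Converts x to the data type dtype. For example, if x is of type bool, casting it to float turns it into a sequence of 0s and 1s. The reverse conversion works as well.

a = tf.Variable([1, 0, 0, 1, 1])
b = tf.cast(a, dtype=tf.bool)
c = tf.cast(True, dtype=tf.float32)
sess = tf.InteractiveSession()
sess.run(tf.global_variables_initializer())  # replaces the deprecated tf.initialize_all_variables()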
print(sess.run(b))
print(sess.run(c))
#[ True False False  True  True]
#1.0
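The three functions just covered (argmax, equal, cast) are often chained to turn class scores into an accuracy figure. The sketch below shows that chain with NumPy equivalents; the scores and labels are invented for illustration, and the use of NumPy rather than TensorFlow is an assumption of this example.

```python
import numpy as np

# Hypothetical class scores for 4 samples over 3 classes, with made-up labels.
scores = np.array([[0.1, 0.7, 0.2],
                   [0.8, 0.1, 0.1],
                   [0.3, 0.3, 0.4],
                   [0.2, 0.5, 0.3]])
labels = np.array([1, 0, 1, 1])

# argmax along axis 1 picks the highest-scoring class of each row.
predicted = np.argmax(scores, axis=1)

# equal gives one boolean per sample: was the prediction right?
correct = np.equal(predicted, labels)

# Casting bool to float maps True/False to 1.0/0.0, so the mean is the accuracy.
accuracy = correct.astype(np.float32).mean()

print(predicted.tolist())  # [1, 0, 2, 1]
print(correct.tolist())    # [True, True, False, True]
print(float(accuracy))     # 0.75
```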

tf.reshape

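tf.reshape(tensor, shape, name=None)
As the name suggests, this rearranges tensor into the new shape; the total number of elements must stay the same. The shape argument is generally used in one of three ways:
If shape=[-1], the tensor is flattened into a 1-D tensor.
If shape=[a,b,c,…] with every one of a, b, c, … greater than 0, this is the ordinary usage.
If shape=[a,-1,c,…], so that b=-1 while a, c, … are still greater than 0, TensorFlow computes the value of b automatically from the original size of the tensor. At most one entry may be -1.
The official examples are already detailed, so they are reproduced here in place of my own sample code.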

# tensor 't' is [1, 2, 3, 4, 5, 6, 7, 8, 9]
# tensor 't' has shape [9]
reshape(t, [3, 3]) ==> [[1, 2, 3],
                        [4, 5, 6],
                        [7, 8, 9]]

# tensor 't' is [[[1, 1], [2, 2]],
#                [[3, 3], [4, 4]]]
# tensor 't' has shape [2, 2, 2]
reshape(t, [2, 4]) ==> [[1, 1, 2, 2],
                        [3, 3, 4, 4]]

# tensor 't' is [[[1, 1, 1],
#                 [2, 2, 2]],
#                [[3, 3, 3],
#                 [4, 4, 4]],
#                [[5, 5, 5],
#                 [6, 6, 6]]]
# tensor 't' has shape [3, 2, 3]
# pass '[-1]' to flatten 't' into a 1-D tensor
reshape(t, [-1]) ==> [1, 1, 1, 2, 2, 2, 3, 3, 3, 4, 4, 4, 5, 5, 5, 6, 6, 6]

# -1 can also be used to infer the shape (it stands for the one unknown size)
# -1 is inferred to be 9:
reshape(t, [2, -1]) ==> [[1, 1, 1, 2, 2, 2, 3, 3, 3],
                         [4, 4, 4, 5, 5, 5, 6, 6, 6]]

# -1 is inferred to be 2:
reshape(t, [-1, 9]) ==> [[1, 1, 1, 2, 2, 2, 3, 3, 3],
                         [4, 4, 4, 5, 5, 5, 6, 6, 6]]

# -1 is inferred to be 3:
reshape(t, [ 2, -1, 3]) ==> [[[1, 1, 1],
                              [2, 2, 2],
                              [3, 3, 3]],
                             [[4, 4, 4],
                              [5, 5, 5],
                              [6, 6, 6]]]
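The -1 inference in the examples above is plain division: the unknown size is the total element count divided by the product of the sizes that are given. A minimal sketch follows, using NumPy's reshape as a stand-in for tf.reshape (an assumption of this example); the array contents are placeholder values.

```python
import numpy as np

# Placeholder data with the same shape as the last 't' above: [3, 2, 3], 18 elements.
t = np.arange(18).reshape(3, 2, 3)

flat = t.reshape(-1)      # flatten: shape (18,)
a = t.reshape(2, -1)      # -1 inferred as 18 / 2 = 9
b = t.reshape(-1, 9)      # -1 inferred as 18 / 9 = 2
c = t.reshape(2, -1, 3)   # -1 inferred as 18 / (2 * 3) = 3

# The same arithmetic done by hand for the last case.
inferred = t.size // (2 * 3)

# If the known sizes do not divide the element count, no value fits and reshape fails.
try:
    t.reshape(4, -1)
    bad_shape_failed = False
except ValueError:
    bad_shape_failed = True

print(flat.shape, a.shape, b.shape, c.shape)  # (18,) (2, 9) (2, 9) (2, 3, 3)
print(inferred)                               # 3
print(bad_shape_failed)                       # True
```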

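tf.nn.embedding_lookup(params, ids, partition_strategy="mod", name=None, validate_indices=True)

Put simply, this converts a sequence of integer ids into the corresponding sequence of embeddings.
Suppose params.shape=[v,h] and ids.shape=[m]; the function then returns a tensor of shape [m,h]. In mathematical terms:

output[i, :] = params[ids[i], :]   for i = 0, …, m-1

So what is this good for? If you are familiar with word2vec, you know that a vector can be learned for every word from a corpus of documents, and that these word vectors can then be used to measure word similarity and so on. Suppose we have already obtained the vector of every word and stored them all in params. Then, given a sequence of word ids, embedding_lookup returns the corresponding sequence of embeddings.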
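For the simple unpartitioned case, the lookup described above amounts to selecting rows of params by index. The sketch below illustrates that with NumPy integer-array indexing as a stand-in (an assumption of this example); the table sizes v and h and the ids are made-up values, and partition_strategy is not modelled.

```python
import numpy as np

# Hypothetical embedding table: v = 5 "words", each with an h = 3 dimensional vector.
v, h = 5, 3
params = np.arange(v * h, dtype=np.float32).reshape(v, h)  # row i is [3i, 3i+1, 3i+2]

# A sequence of m = 4 word ids; an id may repeat.
ids = np.array([3, 0, 3, 1])

# Row selection: embedded[i] is params[ids[i]], giving shape (m, h).
embedded = params[ids]

print(embedded.shape)        # (4, 3)
print(embedded[0].tolist())  # [9.0, 10.0, 11.0]
```

Repeated ids simply return the same row more than once, which is why positions 0 and 2 of the result are identical here.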

tf.trainable_variables

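Returns all trainable variables.
When a variable is created (with tf.Variable, tf.get_variable, and so on), it takes a trainable option that states whether the variable can be trained. This function returns every variable in the graph that has trainable=True.
tf.get_variable(…) and tf.Variable(…) default to trainable=True, whereas tf.constant(…) creates a constant rather than a variable and so is never trainable.

import tensorflow as tf
from pprint import pprint

a = tf.get_variable('a', shape=[5, 2])    # trainable=True by default
b = tf.get_variable('b', shape=[2, 5], trainable=False)
c = tf.constant([1, 2, 3], dtype=tf.int32, shape=[8], name='c')  # a constant, so never trainable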
d = tf.Variable(tf.random_uniform(shape=[3,3]),name='d')
tvar = tf.trainable_variables()
tvar_name = [x.name for x in tvar]
print(tvar)
# [<tensorflow.python.ops.variables.Variable object at 0x7f9c8db8ca20>, <tensorflow.python.ops.variables.Variable object at 0x7f9c8db8c9b0>]
print(tvar_name)
# ['a:0', 'd:0']

sess = tf.InteractiveSession()
sess.run(tf.global_variables_initializer())  # replaces the deprecated tf.initialize_all_variables()
pprint(sess.run(tvar))
#[array([[ 0.27307487, -0.66074866],
#       [ 0.56380701,  0.62759042],
#       [ 0.50012994,  0.42331111],
#       [ 0.29258847, -0.09185416],
#       [-0.35913971,  0.3228929 ]], dtype=float32),
# array([[ 0.85308731,  0.73948073,  0.63190091],
#       [ 0.5821209 ,  0.74533939,  0.69830012],
#       [ 0.61058474,  0.76497936,  0.10329771]], dtype=float32)]

