tf.nn.embedding_lookup中关于partition_strategy参数详解|电子爱好者

admin管理员组
文章数量:1568433

tfnnembedding_lookup
- 数学上的原理
- API介绍
- 简单示例
  - 程序
  - 注解
- partition_strategy参数的示例
  - mod案例1
  - mod案例2
  - div案例1
  - div案例2
参考资料

tf.nn.embedding_lookup

embedding_lookup常用于NLP中将one-hot编码转换我对应的向量编码。

数学上的原理

假设一共有 $m$ 个物体，每个物体有自己唯一的id，那么从物体的集合到 $\mathbb R^m$ 有一个trivial的嵌入，就是把它映射到 $\mathbb R^m$ 中的标准基，这种嵌入叫做One-hot embedding/encoding.

应用中一般将物体嵌入到一个低维空间 $\mathbb R^n(n \ll m)$ ，只需要再compose上一个从 $\mathbb R^m$ 到 $\mathbb R^n$ 的线性映射就好了。每一个 $n\times m$ 的矩阵M都定义了 $\mathbb R^m$ 到 $\mathbb R^n$ 的一个线性映射: $x \mapsto Mx$ 。当 $x$ 是一个标准基向量的时候， $Mx$ 对应矩阵 M <script type="math/tex" id="MathJax-Element-26">M</script>中的一列，这就是对应id的向量表示。这个概念用神经网络图来表示如下：

从id(索引)找到对应的One-hot encoding，然后红色的weight就直接对应了输出节点的值(注意这里没有activation function)，也就是对应的embedding向量。

API介绍

依据inputs_ids来寻找embedding_params中对应的元素.

 embedding_lookup(
     params,   # embedding_params 对应的转换向量
     ids,      # inputs_ids，标记着要查询的id
     partition_strategy='mod',   #分割方式 
     name=None,
     validate_indices=True, # deprecated
     max_norm=None
 )

参数	description	注解
params	A single tensor representing the complete embedding tensor, or a list of P tensors all of same shape except for the first dimension, representing sharded embedding tensors. Alternatively, a PartitionedVariable, created by partitioning along dimension 0. Each element must be appropriately sized for the given partition_strategy.	params是由一个tensor或者多个tensor组成的列表(多个tensor组成时，每个tensor除了第一个维度其他维度需相等)
ids	A Tensor with type int32 or int64 containing the ids to be looked up in params.	ids是一个整型的tensor，ids的每个元素代表要在params中取的每个元素的第0维的逻辑index.
partition_strategy	A string specifying the partitioning strategy, relevant if len(params) > 1. Currently “div” and “mod” are supported. Default is “mod”.	逻辑index是由partition_strategy指定，partition_strategy用来设定ids的切分方式，目前有两种切分方式’div’和’mod’.
返回值	The results of the lookup are concatenated into a dense tensor. The returned tensor has shape shape(ids) + shape(params)[1:].	返回值是一个dense tensor.返回的shape为shape(ids)+shape(params)[1:]

embedding_lookup中的partition_strategy参数比较难理解(this function is hard to understand, until you get the point!),下面会有特别的解释。

简单示例

下面我们通过一个常见的案例来解释embedding_lookup的用法：

程序

# coding:utf8
import tensorflow as tf
import numpy as np

input_ids = tf.placeholder(dtype=tf.int32, shape=[None])
_input_ids = tf.placeholder(dtype=tf.int32, shape=[3, 2])

embedding_param = tf.Variable(np.identity(8, dtype=np.int32))   # 生成一个8x8的单位矩阵
input_embedding = tf.nn.embedding_lookup(embedding_param, input_ids)
_input_embedding = tf.nn.embedding_lookup(embedding_param, _input_ids)

sess = tf.InteractiveSession()
sess.run(tf.global_variables_initializer())


print('embedding:')
print(embedding_param.eval())

var1 = [1, 2, 6, 4, 2, 5, 7]
print('\n var1:')
print(var1)

print('\nprojecting result:')
print(sess.run(input_embedding, feed_dict={input_ids: var1}))

var2 = [[1, 4], [6, 3], [2, 5]]
print('\n _var2:')
print(var2)

print('\n _projecting result:')
print(sess.run(_input_embedding, feed_dict={_input_ids: var2}))

'''

输出:
embedding:
[[1 0 0 0 0 0 0 0]
 [0 1 0 0 0 0 0 0]
 [0 0 1 0 0 0 0 0]
 [0 0 0 1 0 0 0 0]
 [0 0 0 0 1 0 0 0]
 [0 0 0 0 0 1 0 0]
 [0 0 0 0 0 0 1 0]
 [0 0 0 0 0 0 0 1]]

 var1:
[1, 2, 6, 4, 2, 5, 7]

projecting result:
[[0 1 0 0 0 0 0 0]
 [0 0 1 0 0 0 0 0]
 [0 0 0 0 0 0 1 0]
 [0 0 0 0 1 0 0 0]
 [0 0 1 0 0 0 0 0]
 [0 0 0 0 0 1 0 0]
 [0 0 0 0 0 0 0 1]]

 _var2:
[[1, 4], [6, 3], [2, 5]]

 _projecting result:
[[[0 1 0 0 0 0 0 0]
  [0 0 0 0 1 0 0 0]]

 [[0 0 0 0 0 0 1 0]
  [0 0 0 1 0 0 0 0]]

 [[0 0 1 0 0 0 0 0]
  [0 0 0 0 0 1 0 0]]]

'''

注解

embedding_param参数是一个8*8的单位矩阵(这个这是由一个tensor构成的params，即len(params)=1，partition_strategy只在len(params)>1时才作用)。

embedding_param=          # embedding_param只由一个tensor组成  故len(embedding_param) = 1
[[1 0 0 0 0 0 0 0]
 [0 1 0 0 0 0 0 0]
 [0 0 1 0 0 0 0 0]
 [0 0 0 1 0 0 0 0]
 [0 0 0 0 1 0 0 0]
 [0 0 0 0 0 1 0 0]
 [0 0 0 0 0 0 1 0]
 [0 0 0 0 0 0 0 1]]

我们ids为var1，照着此id从embedding_param取对应的行元素.

    var1 = [1, 2, 6, 4, 2, 5, 7]
    # 1即取第2行  --> [0 1 0 0 0 0 0 0]
    # 2即取第3行  --> [0 0 1 0 0 0 0 0]
    # etc.

我们ids为var2，照着此id从embedding_param取对应的行元素

    var2 = [[1, 4], [6, 3], [2, 5]]
    '''
    [1, 4] 即取2,5行 
    [[0 1 0 0 0 0 0 0]
     [0 0 0 0 1 0 0 0]]

    后面同理
    '''

partition_strategy参数的示例

关于partition_strategy参数的示例

api描述	注解
If len(params) > 1, each element id of ids is partitioned between the elements of params according to the partition_strategy. In all strategies, if the id space does not evenly divide the number of partitions, each of the first (max_id + 1) % len(params) partitions will be assigned one more id.	如果len(params) > 1，params的元素分割方式是依据partition_strategy的。如果分段不能整分的话，则前(max_id + 1) % len(params)多分一个id.
If partition_strategy is “mod”, we assign each id to partition p = id % len(params). For instance, 13 ids are split across 5 partitions as: [[0, 5, 10], [1, 6, 11], [2, 7, 12], [3, 8], [4, 9]]	例如，如果partition_strategy =’mod’.如果我们的params是由5个tensor组成，他们的第一个维度相加为13，则分割策略为[[0, 5, 10], [1, 6, 11], [2, 7, 12], [3, 8], [4, 9]]
If partition_strategy is “div”, we assign ids to partitions in a contiguous manner. In this case, 13 ids are split across 5 partitions as: [[0, 1, 2], [3, 4, 5], [6, 7, 8], [9, 10], [11, 12]]	例如，如果partition_strategy =’div’.如果我们的params是由5个tensor组成，他们的第一个维度相加为13，则分割策略为[[0, 1, 2], [3, 4, 5], [6, 7, 8], [9, 10], [11, 12]]

看api迷迷糊糊的，就看下面的四个例子，就会明白这个函数的操作方法了~

‘mod’案例1

   # coding:utf8
   import tensorflow as tf
   import numpy as np


   def test_embedding_lookup():
       a = np.arange(12).reshape(3, 4)
       b = np.arange(12, 16).reshape(1, 4)
       c = np.arange(16, 28).reshape(3, 4)
       print(a)
       print('\n')
       print(b)
       print('\n')
       print(c)
       print('\n')

       a = tf.Variable(a)
       b = tf.Variable(b)
       c = tf.Variable(c)

       t = tf.nn.embedding_lookup([a, b, c],
           partition_strategy='mod', ids=[0, 3, 6, 1, 2, 5, 8])

       init = tf.global_variables_initializer()
       sess = tf.Session()
       sess.run(init)
       m = sess.run(t)
       print(m)


   test_embedding_lookup()

        '''
        分析:
         这里我们注意到params是由[a, b, c]这三个tensor组成。即len(params)=3,且a,b,c这三个tensor的第一维度分别为3,1,3。
         在把这个三个tensor组合过程中，我们按照partition_strategy='mod'策略分割。即每个tensor的元素之间相差len(params).这里分割方式为[a, b, c]  == [[0,3,6], [1,4,7], [2,5,8]]  
         这里程序还不知道4和7是找不到对应的元素的，在获取元素时候会报错
        a=[[ 0  1  2  3]     = [0, 3, 6]  -->  [0  1  2  3]  = 0
           [ 4  5  6  7]                  -->  [4  5  6  7]  = 3
           [ 8  9 10 11]]                 -->  [8  9 10 11]  = 6

        b=[[12 13 14 15]]    = [1, 4, 7]  -->  [12 13 14 15] = 1
                                          -->  运行时报错  = 4
                                          -->  运行时报错  = 7


        c = etc..      


        输出:
        [[ 0  1  2  3]
         [ 4  5  6  7]
         [ 8  9 10 11]]

        [[12 13 14 15]]

        [[16 17 18 19]
         [20 21 22 23]
         [24 25 26 27]]

        [[ 0  1  2  3]  # 0
         [ 4  5  6  7]  # 3
         [ 8  9 10 11]  # 6
         [12 13 14 15]  # 1
         [16 17 18 19]  # 2
         [20 21 22 23]  # 5
         [24 25 26 27]] # 8

        '''

‘mod’案例2

   # coding:utf8
   import tensorflow as tf
   import numpy as np


   def test_embedding_lookup():
       a = np.arange(12).reshape(3, 4)
       b = np.arange(12, 16).reshape(1, 4)
       c = np.arange(16, 28).reshape(3, 4)
       print(a)
       print('\n')
       print(b)
       print('\n')
       print(c)
       print('\n')

       a = tf.Variable(a)
       b = tf.Variable(b)
       c = tf.Variable(c)

       t = tf.nn.embedding_lookup([a, c, b],
           partition_strategy='mod', ids=[0, 3, 6, 1, 4, 7, 2])

       init = tf.global_variables_initializer()
       sess = tf.Session()
       sess.run(init)
       m = sess.run(t)
       print(m)


   test_embedding_lookup()

        '''
        分析:
         这里我们把params从[a, b, c]改为[a, c, b]这三个tensor组成。a,c,b这三个tensor的第一维度分别为3,3,1。
         在把这个三个tensor组合过程中，依旧是每个tensor的元素之间相差len(params).这里分割方式为[a, c, b]  == [[0,3,6], [1,4,7], [2,5,8]]  
         这里程序还不知道4和7是找不到对应的元素的，在获取元素时候会报错
        a=[[ 0  1  2  3]     = [0, 3, 6]  -->  [0  1  2  3]  = 0
           [ 4  5  6  7]                  -->  [4  5  6  7]  = 3
           [ 8  9 10 11]]                 -->  [8  9 10 11]  = 6

        c=[[16 17 18 19]     = [1, 4, 7]  -->  [16 17 18 19]  = 1
           [20 21 22 23]                  -->  [20 21 22 23]  = 4
           [24 25 26 27]]                -->  [24 25 26 27]  = 7


        b=[[12 13 14 15]]    = [2, 5, 8]  -->  [12 13 14 15] = 2
                                          -->  运行时报错  = 5
                                          -->  运行时报错  = 8      


        输出:
        [[ 0  1  2  3]
         [ 4  5  6  7]
         [ 8  9 10 11]]

        [[12 13 14 15]]

        [[16 17 18 19]
         [20 21 22 23]
         [24 25 26 27]]

        [[ 0  1  2  3]  # 0
         [ 4  5  6  7]  # 3
         [ 8  9 10 11]  # 6
         [16 17 18 19]  # 1
         [20 21 22 23]  # 4
         [24 25 26 27]  # 7
         [12 13 14 15]] # 2

        '''

‘div’案例1

   # coding:utf8
   import tensorflow as tf
   import numpy as np


   def test_embedding_lookup():
       a = np.arange(12).reshape(3, 4)
       b = np.arange(12, 16).reshape(1, 4)
       c = np.arange(16, 28).reshape(3, 4)
       print(a)
       print('\n')
       print(b)
       print('\n')
       print(c)
       print('\n')

       a = tf.Variable(a)
       b = tf.Variable(b)
       c = tf.Variable(c)

       t = tf.nn.embedding_lookup([a, b, c],
           partition_strategy='div', ids=[0, 1, 2, 3, 5, 6])

       init = tf.global_variables_initializer()
       sess = tf.Session()
       sess.run(init)
       m = sess.run(t)
       print(m)


   test_embedding_lookup()

        '''
        分析:
         这里我们把params依旧是[a, b, c]，三个tensor的第一维度分别为3,1,3。

         在把这个三个tensor组合过程中，这我们按照partition_strategy='div'策略分割。即每个tensor的元素之间相差1.如果不够等分的话，前面(max_id+1)%len(params)多分一个元素。这里一共7个元素，分为3组，即3、2、2分配。

         这里分割方式为[a, b, c]  == [[0,1,2], [3,4], [5,6]]  

         这里程序还不知道4和7是找不到对应的元素的，在获取元素时候会报错
        a=[[ 0  1  2  3]     = [0, 1, 2]  -->  [0  1  2  3]  = 0
           [ 4  5  6  7]                  -->  [4  5  6  7]  = 1
           [ 8  9 10 11]]                 -->  [8  9 10 11]  = 2




        b=[[12 13 14 15]]    = [3, 4]  -->  [12 13 14 15] = 3
                                          -->  运行时报错  = 4

        c=[[16 17 18 19]     = [5, 6]  -->  [16 17 18 19]  = 5
           [20 21 22 23]                  -->  [20 21 22 23]  = 6
           [24 25 26 27]]                -->  [24 25 26 27]  = 这个是找不到的了   


        输出:
        [[ 0  1  2  3]
         [ 4  5  6  7]
         [ 8  9 10 11]]

        [[12 13 14 15]]

        [[16 17 18 19]
         [20 21 22 23]
         [24 25 26 27]]

        [[ 0  1  2  3]  # 0
         [ 4  5  6  7]  # 1
         [ 8  9 10 11]  # 2
         [12 13 14 15]  # 3
         [16 17 18 19]  # 5
         [20 21 22 23]] # 6

        '''

‘div’案例2

   # coding:utf8
   import tensorflow as tf
   import numpy as np


   def test_embedding_lookup():
       a = np.arange(12).reshape(3, 4)
       b = np.arange(12, 16).reshape(1, 4)
       c = np.arange(16, 28).reshape(3, 4)
       print(a)
       print('\n')
       print(b)
       print('\n')
       print(c)
       print('\n')

       a = tf.Variable(a)
       b = tf.Variable(b)
       c = tf.Variable(c)

       t = tf.nn.embedding_lookup([a, c, b],
           partition_strategy='div', ids=[0, 1, 2, 3, 4, 5])

       init = tf.global_variables_initializer()
       sess = tf.Session()
       sess.run(init)
       m = sess.run(t)
       print(m)


   test_embedding_lookup()

        '''
        分析:
         这里我们把params改为[a, c, b]，三个tensor的第一维度分别为3,3,1。

         在把这个三个tensor组合过程中，这我们按照partition_strategy='div'策略分割。这里一共7个元素，分为3组，即3、2、2分配。

         这里分割方式为[a, c, b]  == [[0,1,2], [3,4], [5,6]]  

         这里程序还不知道4和7是找不到对应的元素的，在获取元素时候会报错
        a=[[ 0  1  2  3]     = [0, 1, 2]  -->  [0  1  2  3]  = 0
           [ 4  5  6  7]                  -->  [4  5  6  7]  = 1
           [ 8  9 10 11]]                 -->  [8  9 10 11]  = 2

        c=[[16 17 18 19]     = [3, 4]  -->  [16 17 18 19]  = 3
           [20 21 22 23]                  -->  [20 21 22 23]  = 4
           [24 25 26 27]]                -->  [24 25 26 27]  = 这个是找不到的了   

        b=[[12 13 14 15]]    = [5, 6]  -->  [12 13 14 15] = 5
                                          -->  运行时报错  = 6


        输出:
        [[ 0  1  2  3]
         [ 4  5  6  7]
         [ 8  9 10 11]]

        [[12 13 14 15]]

        [[16 17 18 19]
         [20 21 22 23]
         [24 25 26 27]]

        [[ 0  1  2  3]  # 0
         [ 4  5  6  7]  # 1
         [ 8  9 10 11]  # 2
         [16 17 18 19]  # 3
         [20 21 22 23]  # 4
         [16 17 18 19]] # 5

        '''

参考资料

https://stackoverflow/questions/34870614/what-does-tf-nn-embedding-lookup-function-do/41922877#41922877?newreg=5119f86ea49b43aa8988a833294ceb3e

https://www.zhihu/question/52250059

本文标签：详解参数 nn TF partitionstrategy

版权声明：本文标题：tf.nn.embedding_lookup中关于partition_strategy参数详解内容由热心网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：https://www.elefans.com/dongtai/1725700404a1037398.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

电子爱好者 - 最新技术资讯及电子产品介绍！

tf.nn.embedding_lookup中关于partition_strategy参数详解

tf.nn.embedding_lookup

数学上的原理

API介绍

简单示例

程序

注解

partition_strategy参数的示例

‘mod’案例1

‘mod’案例2

‘div’案例1

‘div’案例2

参考资料

更多相关文章

计算机视觉入门大全：基础概念、运行原理、应用案例详解

详解四种生成和解析XML文档的方法（附各方法优缺点）

android 输入法更换_详解安卓手机输入法和键盘切换方式的教程

win7计算机凭证,windows凭证管理,详解win7电脑添加和管理windows凭证的方法

Linux操作系统之文件系统详解

Linux文件系统管理之挂载命令详解

最新Linux挂载命令mount用法及参数详解_mount指定网卡

2024年运维最新Linux挂载命令mount用法及参数详解_mount指定网卡，真的太香了

如何使用计算机网络打印机,如何共享网络打印机 共享网络打印机教程详解

Android Studio设计APP实现与51单片机通过WIFI模块（ESP8266-01S）通讯控制LED灯亮灭的设计源码【详解】

JVM详解----java

windows上VMware虚拟机安装mac os系统步骤详解

cad连筋字体怎么安装_CAD图纸怎么快速统一修改字体大小等参数？【AutoCAD教程】...

详解六大QQ病毒特征及清除方法

SQL Server性能优化详解

超级详细的Oracle安装图文详解！手把手教会您从下载到安装！

贝壳一键还原1.0 使用详解

linux 查看u盘文件,图文详解Linux下使用U盘的方法

win10安装qt5.12保姆级教程（图文详解，简单快捷）

ibm用u盘安装linux,详解使用磁盘克隆和U盘安装IBM AIX操作系统

发表评论

推荐文章

两台计算机怎样互相访问文件,单位的多台电脑如何互访和调取共享文件？

cad和mysql_在建筑行业，BIM 最终会取代 CAD 吗？

Debian11安装帆软Finereport11服务器保姆版

windows系统应用

官方正式版安装包-AVID Pro Tools 2020.12 WiN-MAC

热门文章

同一计算机打印机无法连接,共享打印机无法连接怎么办 共享打印机无法连接解决方法【图文教程】...

ESP8266wifi模块连接原子云实现手机与单片机的通信

使用mac简体中文，导入百度输入法字库

为什么php-fpm会使用内存一直增加_百度输入法重大BUG：内存占用随用户词库增大而暴涨...

Java基础编程 （尚硅谷）

操作系统常见面试题大总结

软件测试之【单元测试、系统测试、集成测试】

VMware虚拟机安装Win11正式版

win10桌面排序计算机,Windows10正式版下设置桌面自动排列图标的详细步骤

Linux 磁盘配额,RAID,LVM

最新文章

acer笔记本装金泰克SSD&amp;重装win10系统

小米笔记本15.6装win10系统

不重装系统解决win10更新错误0x800f0922

Window10企业版激活

windows系统激活了还可以取消吗

重装win7系统后打开截图工具显示“截图工具当前未在计算机上运行”怎么办

NUC装win10问题

加装固态后怎么重装window10系统

配置Windows Server 2012

windows系统不激活会怎样

联想G40-70mwin8.1重装win10详细方法

重装系统（win10企业版）

怎么把ide改成ahci_Win10系统无需重装，硬盘IDE改为AHCI模式的方法

正版windows系统是永久激活的吗

如何查看windows系统激活状态

小米手机肿么还原时钟

15000流明是多少瓦

一般普通投影机功率多大?

苹果绿联转换器有些投影机不能用

坚果V9投影机具体参数?

有关九年级作文850字精选

80后90后_高一作文

中级卫生专业资格中医全科学主治医师中级模拟题2021年(9)案与解析

(精品)师范大学招考硕士研究生课程八六0试卷

ZXMVC8900(V3

【模拟人生4（The Sims 4）性感露背黑色亮片礼服MOD V20190313】模拟人生4（The Sims 4）性感露背黑色亮片礼服MOD V20190313 官方免费下载

如何使用计算机网络打印机,如何共享网络打印机共享网络打印机教程详解

同一计算机打印机无法连接,共享打印机无法连接怎么办共享打印机无法连接解决方法【图文教程】...

Java基础编程（尚硅谷）

acer笔记本装金泰克SSD&重装win10系统

【鬼泣5（Devil May Cry V）v1.0十四项修改】鬼泣5（Devil May Cry V）v1.0十四项修改官方免费下载