代码错误记录：TypeError: dropout(): argument ‘input‘ (position 1) must be Tensor, not str|电子爱好者

admin管理员组
文章数量:1611401

TypeError: dropout（）: argument 'input' （position 1） must be Tensor, not str

背景
解决方法 1 （直接在输出上进行修改）
- 整体代码
解决方法2 （直接在模型上进行修改）
参考链接

背景

使用 hugging face 中的预训练模型完成文本分类任务的过程中。出现了这个问题。

问题排查的过程中，发现这里定义的 cls_layer() 出现问题。

问题是数据类型错误，因此需要检查pooler_output的数据产生的位置和输出类型

解决方法 1 （直接在输出上进行修改）

定位位置，寻找pooler_output的输出

这个pooler_output是关于 bert_layer 中 [CLS]的输出向量，这里的返回值是一个字典类型，因此我们需要设置它的返回是不是字典类型

整体代码

class SentencePairClassifier(nn.Module):
    def __init__(self, bert_model="albert-base-v2", freeze_bert=False):
        super(SentencePairClassifier, self).__init__()
        #  Instantiating BERT-based model object
        self.bert_layer = AutoModel.from_pretrained(bert_model)
        
        #  Fix the hidden-state size of the encoder outputs (If you want to add other pre-trained models here, search for the encoder output size)
        if bert_model == "albert-base-v2":  # 12M parameters
            hidden_size = 768
        elif bert_model == "albert-large-v2":  # 18M parameters
            hidden_size = 1024
        elif bert_model == "albert-xlarge-v2":  # 60M parameters
            hidden_size = 2048
        elif bert_model == "albert-xxlarge-v2":  # 235M parameters
            hidden_size = 4096
        elif bert_model == "bert-base-uncased": # 110M parameters
            hidden_size = 768
            
        # Freeze bert layers and only train the classification layer weights
        if freeze_bert:
            for p in self.bert_layer.parameters():
                p.requires_grad = False
                
        # Classification layer
        self.cls_layer = nn.Linear(hidden_size, 1)
        self.dropout = nn.Dropout(p=0.1)
        
        
    @autocast()  # run in mixed precision
    
    def forward(self, input_ids, attn_masks, token_type_ids):
        '''
        Inputs:
            -input_ids : Tensor  containing token ids
            -attn_masks : Tensor containing attention masks to be used to focus on non-padded values
            -token_type_ids : Tensor containing token type ids to be used to identify sentence1 and sentence2
        
        outputs:
            - last_hidden_state: 最后一层的隐藏层向量表征
            - pooler_output: 最后一层 输出 
            - all_hidden_state: 全部层的 隐藏层向量表征 
        注：all_hidden_state可以将后面的4层取出来，做mean，然后在拼接到 classifier上。
        '''
        # Feeding the inputs to the BERT-based model to obtain contextualized representations
        cont_reps, pooler_output = self.bert_layer(input_ids, attn_masks, token_type_ids, return_dict=False) ## , return_dict=False)
        
        # Feeding to the classifier layer the last layer hidden-state of the [CLS] token further processed by a
        # Linear Layer and a Tanh activation. The Linear layer weights were trained from the sentence order prediction (ALBERT) or next sentence prediction (BERT)
        # objective during pre-training.
        logits = self.cls_layer(self.dropout(pooler_output))
        
        return logits

思路是：

解决方法2 （直接在模型上进行修改）

参考链接

https://stackoverflow/questions/65082243/dropout-argument-input-position-1-must-be-tensor-not-str-when-using-bert

本文标签：错误代码 dropout TypeError argument

版权声明：本文标题：代码错误记录：TypeError: dropout(): argument ‘input‘ (position 1) must be Tensor, not str 内容由热心网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：https://www.elefans.com/dongtai/1728604320a1165259.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

更多相关文章

xp系统

sqlalchemy下连接MYSQL出现的错误：This session is in ‘prepared‘ state； no further SQL can be emitted ...

21小时前

InvalidRequestError: This session is in 'prepared' state; no further SQL can be emitted within this transactio

java代码连接hadoop FileSystem 连接hdfs报错：Connection refused: no further information

21小时前

java代码连接hadoop FileSystem 连接hdfs报错：Connection refused: no further information 确保应用正常启动、JPS该起的进程都要启动的情况下&#

关于Error parsing HTTP request header Note: further occurrences of HTTP header parsing errors错误的原因

20小时前

今天对项目进行维护，突然发现修改内容后不能保存，前端页面显示白屏，后台tomcat在debug模式下出现：Error parsing HTTP requ

further occurrences of HTTP header parsing errors will be logged at DEBUG level.错误

20小时前

今天进行项目测试的时候出现了further occurrences of HTTP header parsing errors will be logged at DEBUG level.错误，查了半天资料&#

Proxmox VE(PVE)开启IOMMU功能实现硬件直通及直通错误解决

20小时前

一、写在前面什么是硬件直通(Passthrough) VT-d 、DirectPath IO，通过 DirectPath IO，虚拟机可以使用 IO 内存管理单元访问平台上的物理 PCI

Hbase代码运行报错：no route to host......

20小时前

Hbase 代码运行报错：no route to host:no further information… 解决方法 ： window系统： 检查window的hosts

基于YOLOv8YOLOv7YOLOv6YOLOv5的夜视行人检测系统（Python+PySide6界面+训练代码）

20小时前

摘要：开发高效的夜视行人检测系统对于提升夜间安全和监控效能至关重要。本篇博客详尽介绍了如何利用深度学习技术搭建一个夜视行人检测系统，并提供了完整的实现代码。本系统采用了先进的YOLOv8算法&am

【错误解决】解决anaconda下python不能被激活

18小时前

【错误解决】解决anaconda下python不能被激活出现如下报错：This Python interpreter is in a conda environment, but the environment h

maple激活后不联网就会初始化错误解决方案——添加虚拟网卡

18小时前

引言最近安装了maple2018版，激活后发现一个问题，程序启动显示“maple初始化错误”。软件安装过程已经参照其他教程进行了激活。经过一番摸索发现，是因为maple启动时会检测是否联网，没有联网就会报错。补充说明：除了联网，还有计算

使用精灵标注助手制作yolov3训练数据集（附解析xml代码）

10小时前

一、标注数据 1、将获取图片存放到同一个文件夹下本次标注数据供分为4类（person，dog，tiger，car）&

Python错误UnicodeDecodeError: ‘utf-8‘ codec can‘t decode byte 0xc7 in position 0: invalid continuation

5小时前

Python错误UnicodeDecodeError: ‘utf-8’ codec can’t decode byte 0xc7 in position 0: invalid continuation 问题描述： 换了

报错Unexpected token 「 in JSON at position 0 的错误解析

5小时前

报错Unexpected token < in JSON at position 0 的错误解析 Unhandled Rejection (SyntaxError): Unexpected token < in JSON at

解码错误。‘gb2312‘ codec can‘t decode byte 0xf3 in position 307307: illegal multibyte sequence

5小时前

一般在decode加errors"ignore"就可以了。例如： decode(gb2312,errorsignore)

解决错误:SyntaxError: Unexpected token % in JSON at position 1

4小时前

问题描述:当在HTML页面中,进行jSON对象的传递时,出现了SyntaxError: Unexpected token % in JSON at position 1异常传递页面:被传递的页面:问题定位: 当查看URL的值

Pytorch调用预训练模型输出结果时报错argument ‘input‘ (position 1) must be Tensor, not collections.OrderedDict

4小时前

在使用pytorch中的torchvision.models.segmentation.fcn_resnet50进行获得已经训练好的预训练模型时，所得结果的输出给我提示说argument 'input' (positio

vue开发错误记录：Unexpected token o in JSON at position 1

4小时前

在使用 Element ui 开发 table 表格的时候遇到如下错误问题一 Invalid prop: type check failed for prop "data". Expected Array, got

惠普笔记本LED灯闪烁代码故障含义

4小时前

原文地址双灯闪烁在CQ40这类机器中比较常见，一般是开机加电屏无显示，加电10多秒后大小写和数字键盘灯开始闪烁，其实这些闪烁的次数是有含义的！就类似于

Win7串口开发的的一些错误以及解决方案

1小时前

文章目录 [toc] 背景遇到的问题1 看得到串口，但是一直打开失败，GetLastError4332 看得到串口(COM16)，但是一直打开失败,GetLastError2

java queue capacity_Java BlockingQueue remainingCapacity()用法及代码示例

1小时前

BlockingQueue的remainingCapacity()方法返回可以添加到BlockingQueue而不会阻塞的更多元素的数量。返回的容量在以下三种情况下出现： 如果剩余容量为零，则不

MessageBox.Show方法出现“容量超出了最大容量。参数名: capacity”错误！

40分钟前

我遇到一个奇怪的问题，就是 MessageBox.Show方法中的title属性赋值时不能使用过多的汉字。我当时用了四个汉字，就报了“容量超出了最大容量。参数名: capacity”错误。

电子爱好者 - 最新技术资讯及电子产品介绍！

代码错误记录：TypeError: dropout(): argument ‘input‘ (position 1) must be Tensor, not str

TypeError: dropout（）: argument 'input' （position 1） must be Tensor, not str

背景

解决方法 1 （直接在输出上进行修改）

整体代码

解决方法2 （直接在模型上进行修改）

参考链接

更多相关文章

sqlalchemy下连接MYSQL出现的错误：This session is in ‘prepared‘ state； no further SQL can be emitted ...

java代码连接hadoop FileSystem 连接hdfs报错：Connection refused: no further information

关于Error parsing HTTP request header Note: further occurrences of HTTP header parsing errors错误的原因

further occurrences of HTTP header parsing errors will be logged at DEBUG level.错误

Proxmox VE(PVE)开启IOMMU功能实现硬件直通及直通错误解决

Hbase代码运行报错：no route to host......

基于YOLOv8YOLOv7YOLOv6YOLOv5的夜视行人检测系统（Python+PySide6界面+训练代码）

【错误解决】解决anaconda下python不能被激活

maple激活后不联网就会初始化错误解决方案——添加虚拟网卡

使用精灵标注助手制作yolov3训练数据集（附解析xml代码）

Python错误UnicodeDecodeError: ‘utf-8‘ codec can‘t decode byte 0xc7 in position 0: invalid continuation

报错Unexpected token 「 in JSON at position 0 的错误解析

解码错误。‘gb2312‘ codec can‘t decode byte 0xf3 in position 307307: illegal multibyte sequence

解决错误:SyntaxError: Unexpected token % in JSON at position 1

Pytorch调用预训练模型输出结果时报错argument ‘input‘ (position 1) must be Tensor, not collections.OrderedDict

vue开发错误记录：Unexpected token o in JSON at position 1

惠普笔记本LED灯闪烁代码故障含义

Win7串口开发的的一些错误以及解决方案

java queue capacity_Java BlockingQueue remainingCapacity()用法及代码示例

MessageBox.Show方法出现“容量超出了最大容量。参数名: capacity”错误！

发表评论

推荐文章

Error parsing HTTP request header Note: further occurrences of HTTP request parsing

彻底解决Qt中文乱码以及汉字编码的问题(UTF-8GBK)

惠普735G5笔记本摒弃HP自带全家桶，全新安装win10无需激活，HP软件按需安装即可

【小5聊】jquery基础之offset和position的top、left值

【PyTorch问题】CUDA out of memory. Tried to allocate 4.69 GiB (GPU 0； 8.00 GiB total capacity...略

热门文章

全面解读Google Chrome浏览器特性与技术

基于hexo和aws云搭建个人博客，0基础0费用，有点豪横（2W字超详细图文教程）

安卓第三方友盟登录与分享

【转】app测试基本步骤

excel制作简单账本

光模块行业术语之名词interpretation（三）

【小5聊】jquery基础之offset和position的top、left值

项目中遇到的position:fixed;无效问题

Fortran语言初探及Win7 64位下Fortran开发环境配置

YARN Capacity Scheduler（容量调度器）

最新文章

低配电脑装深度linux,低配电脑装什么系统

计算机重新装xp系统软件,关于安装软件重启XP电脑后软件不见的处理方法

为什么电脑用久了，就算重新安装系统也会变得很慢？

苹果电脑可以装windows系统吗_给苹果电脑安装Windows系统

win7怎么重新安装系统,win7如何重新装系统

多台电脑安装系统的快捷方式--使用系统镜像

电脑已经有了一个Windows10，再多装一个Windows10组成双系统

鸿蒙系统平板电脑能安装吗,平板电脑已预装鸿蒙系统，我们来看看效果

电脑版html5安装教,电脑安装系统教程|如何为电脑安装系统

华为鸿蒙系统怎么装到电脑上,华为正式发布鸿蒙系统 鸿蒙系统怎么样如何操作安装？...

dell电脑如何安装ubuntu系统_Dell Win10系统安装成Ubuntu16.04

树莓派如何重新装Linux系统,如何给树莓派Raspberry重新安装修复操作系统

计算机重做系统有什么好处,为什么要重新安装系统？重新安装有什么好处？

关于装电脑系统的心得总结

电脑测试软件怎么重装,如何重新安装电脑系统

小米手机肿么还原时钟

15000流明是多少瓦

一般普通投影机功率多大?

苹果绿联转换器有些投影机不能用

坚果V9投影机具体参数?

有关九年级作文850字精选

80后90后_高一作文

中级卫生专业资格中医全科学主治医师中级模拟题2021年(9)案与解析

(精品)师范大学招考硕士研究生课程八六0试卷

ZXMVC8900(V3

【模拟人生4（The Sims 4）性感露背黑色亮片礼服MOD V20190313】模拟人生4（The Sims 4）性感露背黑色亮片礼服MOD V20190313 官方免费下载

【生化危机2：重制版（Resident Evil 2 Remake）克莱尔红头发深色服装MOD】生化危机2：重制版（Resident Evil 2 Remake）克莱尔红头发深色服装MOD 官方免费下载

【模拟人生4（The Sims 4）性感露背深V领吊带裙MOD V20190311】模拟人生4（The Sims 4）性感露背深V领吊带裙MOD V20190311 官方免费下载

【模拟人生4（The Sims 4）科幻风宇宙飞船家庭住宅MOD V20190311】模拟人生4（The Sims 4）科幻风宇宙飞船家庭住宅MOD V20190311 官方免费下载

【鬼泣5（Devil May Cry V）v1.0十四项修改】鬼泣5（Devil May Cry V）v1.0十四项修改 官方免费下载

如何实现高效的treenode搜索算法

treenode与链表有何本质区别

华为鸿蒙系统怎么装到电脑上,华为正式发布鸿蒙系统鸿蒙系统怎么样如何操作安装？...

【鬼泣5（Devil May Cry V）v1.0十四项修改】鬼泣5（Devil May Cry V）v1.0十四项修改官方免费下载