1. Framework

1) two distributions of image augmentations

2)

3)

4)

5) q uses the same architecture as g (FC + BN + ReLU + FC); see the sketch after this list

6) the exponential moving average parameter τ starts from τ_base = 0.996 and is increased to one during training, following τ = 1 - (1 - τ_base)·(cos(πk/K) + 1)/2, with k the current training step and K the maximum number of training steps
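A minimal PyTorch sketch of these framework pieces (the FC + BN + ReLU + FC head of item 5, the cosine EMA schedule of item 6, and the normalized MSE loss between online prediction and target projection). The encoder, all sizes, and the helper names (mlp, tau_schedule, ema_update, byol_loss) are illustrative assumptions, not the paper's exact implementation:

```python
import copy
import math
import torch
import torch.nn as nn
import torch.nn.functional as F


def mlp(in_dim, hidden_dim=4096, out_dim=256):
    # FC + BN + ReLU + FC head, shared architecture for projector g and predictor q (item 5).
    return nn.Sequential(
        nn.Linear(in_dim, hidden_dim),
        nn.BatchNorm1d(hidden_dim),
        nn.ReLU(inplace=True),
        nn.Linear(hidden_dim, out_dim),
    )


def tau_schedule(k, K, tau_base=0.996):
    # Item 6: cosine schedule moving the EMA parameter tau from tau_base to 1 over training.
    return 1.0 - (1.0 - tau_base) * (math.cos(math.pi * k / K) + 1.0) / 2.0


@torch.no_grad()
def ema_update(online, target, tau):
    # Target parameters xi are an exponential moving average of the online parameters theta.
    for p_online, p_target in zip(online.parameters(), target.parameters()):
        p_target.mul_(tau).add_(p_online, alpha=1.0 - tau)


def byol_loss(prediction, target_projection):
    # Mean squared error between L2-normalized vectors, i.e. 2 - 2 * cosine similarity.
    p = F.normalize(prediction, dim=-1)
    z = F.normalize(target_projection, dim=-1)
    return (2.0 - 2.0 * (p * z).sum(dim=-1)).mean()


# Toy online/target networks; the encoder and all sizes here are placeholders.
encoder = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 512))
online = nn.Sequential(encoder, mlp(512))        # f_theta followed by g_theta
predictor = mlp(256, out_dim=256)                # q_theta, same architecture as g
target = copy.deepcopy(online)                   # f_xi, g_xi: EMA copy, no gradients
for p in target.parameters():
    p.requires_grad = False
```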

2. Intuitions on BYOL's behavior

BYOL has no explicit term preventing collapse to a constant representation, yet in practice it does not converge to such a collapsed solution.

1) BYOL's updates of the target parameters ξ are not in the direction of ∇_ξ L^{BYOL}_{θ,ξ}. There is therefore no a priori reason why BYOL's parameters would converge to a minimum of L^{BYOL}_{θ,ξ}.

2) assuming BYOL's predictor to be optimal, i.e., the predictor outputs the conditional expectation of the target projection given the online projection, BYOL's update of θ follows in expectation the gradient of the expected conditional variance of the target projection (see the formulas below)

====> conditioning on more information never increases variance, so discarding information from the online projection z_θ cannot decrease this conditional variance, and a constant (collapsed) z_θ attains the largest possible value; hence our hypothesis on these collapsed constant equilibria being unstable.
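The two steps above can be written out explicitly; this is only a restatement of the argument in the paper's notation (z_θ the online projection, z'_ξ the target projection, Z any additional variability), not new material:

```latex
% Optimal predictor = conditional expectation, and the objective it induces on theta:
q^{\star}(z_\theta) = \mathbb{E}\!\left[ z'_\xi \mid z_\theta \right],
\qquad
\mathbb{E}\!\left[ \left\| q^{\star}(z_\theta) - z'_\xi \right\|_2^2 \right]
  = \mathbb{E}\!\left[ \sum_i \operatorname{Var}\!\left( z'_{\xi,i} \mid z_\theta \right) \right].

% Conditioning on more information never increases variance, so a constant
% (collapsed) z_theta attains the largest possible conditional variance:
\operatorname{Var}\!\left( z'_\xi \mid z_\theta, Z \right)
  \le \operatorname{Var}\!\left( z'_\xi \mid z_\theta \right).
```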

3. Building intuitions with ablations

batch size

performance only drops for smaller batch sizes, due to the batch normalization layers in the encoder

image augmentation

bootstrapping

ablations relating BYOL to contrastive methods

 

To evaluate the influence of the target network, the predictor, and the coefficient β, we perform an ablation over them.

1)

2) target network 

Using a target network is beneficial, but it has two distinct effects (stopping the gradient through the prediction targets, and stabilizing the targets with averaging), and we would like to understand which of the two the improvement comes from; both are sketched below.

Conclusion: making the prediction targets stable and stale is the main cause of the improvement, rather than the change in the objective due to the stop-gradient.
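A minimal sketch of the two effects being disentangled here, using toy PyTorch modules (all names and sizes are illustrative, not the paper's implementation):

```python
import copy
import torch
import torch.nn as nn

# Toy online network and a batch of augmented views (placeholder data).
online_net = nn.Sequential(nn.Linear(32, 16), nn.ReLU(), nn.Linear(16, 8))
view = torch.randn(4, 32)

# (a) Stop-gradient only: targets come from the online network itself;
#     gradients are simply not propagated through them.
with torch.no_grad():
    targets_stop_grad = online_net(view)

# (b) Stable / stale targets: targets come from a separate, slowly moving copy
#     of the online network (updated elsewhere by EMA, never by gradients).
target_net = copy.deepcopy(online_net)
for p in target_net.parameters():
    p.requires_grad = False
with torch.no_grad():
    targets_stable = target_net(view)
```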

3) predictor

In this setup, we remove the exponential moving average (i.e., set τ = 0 over the full training in Eq. 1) and multiply the learning rate of the predictor by a constant λ compared to the learning rate used for the rest of the network; all other hyperparameters are unchanged. As shown in Table 21, using sufficiently large values of λ provides a reasonably good level of performance, while the performance sharply decreases as λ shrinks, down to 0.01% top-1 accuracy (no better than random) for λ = 0.
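One way to reproduce this setup is a single optimizer with separate parameter groups, so only the predictor's learning rate is multiplied by λ; this PyTorch sketch uses made-up module sizes and learning rates purely for illustration:

```python
import torch
import torch.nn as nn

# Stand-in modules for the online encoder, projector g, and predictor q (toy sizes).
online_encoder = nn.Linear(32, 16)
projector = nn.Sequential(nn.Linear(16, 8), nn.BatchNorm1d(8), nn.ReLU(), nn.Linear(8, 4))
predictor = nn.Sequential(nn.Linear(4, 8), nn.BatchNorm1d(8), nn.ReLU(), nn.Linear(8, 4))

base_lr = 0.2   # assumed base learning rate for everything but the predictor
lam = 10.0      # lambda: multiplier applied only to the predictor's learning rate

# With tau = 0 the target is a hard copy of the online network, so no EMA state is kept;
# the predictor simply trains faster (or slower) than the rest of the network.
optimizer = torch.optim.SGD(
    [
        {"params": online_encoder.parameters(), "lr": base_lr},
        {"params": projector.parameters(), "lr": base_lr},
        {"params": predictor.parameters(), "lr": base_lr * lam},
    ],
    momentum=0.9,
)
```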

 

 

To show that this effect is directly related to a change of behavior in the predictor, and not only to a change of learning rate in some subpart of the network, we perform a similar experiment using a multiplier λ on the predictor's learning rate and a different multiplier µ on the projector's.

Conclusion: one of the contributions of the target network is to maintain a near-optimal predictor at all times.

 

 

Optimal linear predictor in closed form


At 300 epochs, when using the closed-form optimal predictor and directly hard-copying the weights of the online network into the target, we obtain a top-1 accuracy of [fill].
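A hedged sketch of computing an optimal linear predictor in closed form over a batch via (ridge-regularized) least squares; the exact regularization and normalization used in the paper may differ, and the tensors here are random placeholders:

```python
import torch

def optimal_linear_predictor(z_online, z_target, eps=1e-6):
    # Closed-form least squares: W* = argmin_W ||z_online @ W - z_target||^2,
    # with a small ridge term eps for numerical stability (an assumption).
    d = z_online.shape[1]
    gram = z_online.T @ z_online + eps * torch.eye(d)
    return torch.linalg.solve(gram, z_online.T @ z_target)

# Toy usage: random projections standing in for online / target batches.
z_online = torch.randn(128, 256)
z_target = torch.randn(128, 256)
W = optimal_linear_predictor(z_online, z_target)
predictions = z_online @ W  # used in place of the learned predictor q
```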

 

Network hyperparameters

removing the weight decay in either BYOL or SimCLR leads to network divergence