Multi-Modal Knowledge Graph Construction and Application: A Survey

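Abstract:

Problems: 1. Real-world knowledge is growing explosively; 2. Existing KGs are represented with pure symbols, which makes them hard for machines to understand.

-> Proposed solution: multi-modal KGs, which are a key step towards human-level machine intelligence.

-> Result: MMKG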

Overview:

1. Definition of MMKGs;

2. The preliminaries on multi-modal tasks and techniques;

3. A systematic review of the challenges, progress, and opportunities in the construction and application of MMKGs;

4. Analyses of the strengths and weaknesses of different solutions.

1.Introduction

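On the one hand: the symbol "dog" and the experience of dogs. Symbols need to be connected to their meaning in the physical world.

On the other hand:

1. Relations, attributes, and similar information are easier to extract from images (e.g., PartOf: a keyboard and a screen are parts of a laptop).

2. With an MMKG, a system can produce a more informative entity-level sentence instead of a vague concept-level one (e.g., "Donald Trump is making a speech" with an MMKG, versus "A tall man with blond hair is making a speech" without one).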

Construction (covers two opposite directions) [challenges, progress, opportunities]:

One is from images to symbols, i.e., labeling images with symbols in the KG
The other is from symbols to images, i.e., grounding symbols in the KG to images

Application:

In-MMKG: applications that address quality or integration problems of the MMKG itself;
Out-of-MMKG: general multi-modal tasks that an MMKG can help with.

2.Definition and Preliminaries

2.1 first defines two ways of representing MMKGs;

2.2 review some preliminaries on multi-modal tasks and techniques;

2.3 followed with a discussion on the connections between MMKGs and the existing multi-modal tasks and techniques.

2.1 Definitions and Representation of MMKGs

Two different ways of representing MMKGs:

A-MMKG: takes multi-modal data as particular attribute values of entities/concepts
N-MMKG: takes multi-modal data as entities in the KG
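
A minimal sketch of the difference between the two representations, assuming a simple triple store. The relation names (`hasPart`, `hasImage`, `imageOf`, `contain`), the file names, and the `tail_is_literal` flag are hypothetical and chosen only for illustration; they are not taken from the survey.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Triple:
    head: str
    relation: str
    tail: str
    # True when the tail is an attribute value (a literal), not a graph node.
    tail_is_literal: bool = False


def entities(triples):
    """Collect everything that is treated as a node (entity) in the graph."""
    nodes = set()
    for t in triples:
        nodes.add(t.head)
        if not t.tail_is_literal:
            nodes.add(t.tail)
    return nodes


# A-MMKG: the image is only an attribute value attached to the entity "Laptop".
a_mmkg = [
    Triple("Laptop", "hasPart", "Keyboard"),
    Triple("Laptop", "hasImage", "laptop.jpg", tail_is_literal=True),
]

# N-MMKG: images are entities themselves, so they can take part in relations.
n_mmkg = [
    Triple("Laptop", "hasPart", "Keyboard"),
    Triple("laptop.jpg", "imageOf", "Laptop"),
    Triple("keyboard.jpg", "imageOf", "Keyboard"),
    Triple("laptop.jpg", "contain", "keyboard.jpg"),
]

print(sorted(entities(a_mmkg)))
print(sorted(entities(n_mmkg)))
```

In the A-MMKG variant the image never becomes a node, so no triple can start from it. In the N-MMKG variant image-to-image relations such as the hypothetical `contain` become expressible.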

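An N-MMKG usually abstracts an image into several image descriptors, which are typically summarized as pixel-level feature vectors of the image entity. Relations between images can therefore be obtained with simple computations (e.g., the similarity of two images from the inner product of their image descriptor vectors).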
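
The inner-product similarity mentioned above can be sketched as follows. The descriptor values and file names are made up for illustration, and the L2 normalization step is an added assumption (it turns the inner product into cosine similarity so that scores are comparable across images); the notes themselves only mention the inner product.

```python
def inner_product(u, v):
    """Plain inner product of two descriptor vectors of equal length."""
    if len(u) != len(v):
        raise ValueError("descriptor vectors must have the same length")
    return sum(a * b for a, b in zip(u, v))


def l2_normalize(v):
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = sum(x * x for x in v) ** 0.5
    return [x / norm for x in v] if norm > 0 else list(v)


# Hypothetical image descriptors (real ones would come from a vision model).
descriptors = {
    "dog_1.jpg": [0.9, 0.1, 0.0],
    "dog_2.jpg": [0.8, 0.2, 0.1],
    "laptop_1.jpg": [0.0, 0.2, 0.9],
}


def similarity(image_a, image_b):
    """Inner product of the normalized descriptors of two images."""
    return inner_product(
        l2_normalize(descriptors[image_a]),
        l2_normalize(descriptors[image_b]),
    )


print(similarity("dog_1.jpg", "dog_2.jpg"))
print(similarity("dog_1.jpg", "laptop_1.jpg"))
```

With these toy vectors the two dog images score much higher than a dog image against the laptop image, which is the kind of signal an N-MMKG could store as a `similar` edge between image entities.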

2.2 Preliminaries on Multi-Modal Tasks and Techniques

well-studied multi-modal tasks
multi-modal learning techniques
important progress on multi-modal pretrained language models

Multi-Modal tasks

(a problem is characterized as multi-modal if it involves data of multiple modalities)

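Multi-modal tasks integrate and model multiple communicative modalities in order to obtain knowledge or understanding from multi-modal data.

Multi-Modal Learning

Multi-modal learning mainly models the correspondences between modalities in order to understand multi-modal data.

Challenges: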

Multi-Modal Representation
Multi-Modal Translation
Multi-Modal Alignment
Multi-Modal Fusion
Multi-Modal Co-Learning

Multi-Modal Pretrained Language Models

In recent years, researchers have designed a number of self-supervised pretraining tasks.

In terms of the Transformer-based fusion process across modalities, multi-modal pretrained language models can be divided into:

single-stream models
two-stream models
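
A rough sketch of how the two families differ in where fusion happens, under strong simplifying assumptions: one attention layer, no learned projections, no positional or modality embeddings, and toy 2-dimensional token vectors. The function names are hypothetical. A single-stream model feeds the concatenated text and image tokens through one shared encoder, while a two-stream model encodes each modality separately and then lets the streams attend to each other.

```python
import math


def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def attend(queries, keys_values):
    """Scaled dot-product attention without learned projections (a simplification)."""
    d = len(queries[0])
    outputs = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in keys_values]
        weights = softmax(scores)
        outputs.append(
            [sum(w * kv[i] for w, kv in zip(weights, keys_values)) for i in range(d)]
        )
    return outputs


def single_stream(text_tokens, image_tokens):
    """Concatenate both modalities and run joint self-attention over all tokens."""
    joint = text_tokens + image_tokens
    return attend(joint, joint)


def two_stream(text_tokens, image_tokens):
    """Encode each modality on its own, then fuse with cross-modal attention."""
    text = attend(text_tokens, text_tokens)
    image = attend(image_tokens, image_tokens)
    text_fused = attend(text, image)   # text queries look at image tokens
    image_fused = attend(image, text)  # image queries look at text tokens
    return text_fused, image_fused


text = [[1.0, 0.0], [0.0, 1.0]]
image = [[1.0, 1.0], [0.5, 0.5], [0.0, 1.0]]

print(len(single_stream(text, image)))
text_out, image_out = two_stream(text, image)
print(len(text_out), len(image_out))
```

The single-stream output has one vector per token of the joint sequence, whereas the two-stream output keeps the modalities apart and returns one fused sequence per stream.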

2.3 Discussion

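Although there has been substantial research on using multi-modal learning techniques and multi-modal pretrained language models for a variety of multi-modal tasks, introducing multi-modal knowledge to improve the performance of existing multi-modal tasks is still an emerging trend. An MMKG can benefit these downstream tasks in the following ways:

MMKG provides sufficient background knowledge to enrich the representation of entities and concepts, especially the long-tail ones.
MMKG enables the understanding of unseen objects in images.
MMKG enables multi-modal reasoning.
MMKG usually provides multi-modal data as additional features to bridge the information gaps in some NLP tasks.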

PS:

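The long-tail problem:

The long-tail problem refers to a data distribution that is common in real production data. Its key characteristic is that the part accounting for a relatively small share of the total contains a large number of distinct instances. For example, take 1,000,000 Weibo posts on a given topic and rank the characters by frequency: apart from the head of the distribution, there is a very large number of low-frequency characters.

Common solutions to the long-tail problem:

For the high-frequency part, use manual filtering plus manual annotation to produce high-quality usable data.
For the low-frequency part, use automated construction to produce usable data that meets a specified quality level.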
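
A small sketch of the head/tail split described above, assuming the head is defined as the most frequent items that together cover a given percentage of all occurrences. The 80% threshold, the function name, and the toy data are assumptions for illustration only.

```python
from collections import Counter


def split_head_tail(tokens, head_percent=80):
    """Split distinct tokens into a frequent head and a long tail.

    Tokens are taken in descending frequency order and assigned to the head
    until the head covers at least `head_percent` percent of all occurrences.
    Integer arithmetic is used to avoid floating-point threshold issues.
    """
    counts = Counter(tokens)
    total = sum(counts.values())
    head, tail, covered = [], [], 0
    for token, freq in counts.most_common():
        if covered * 100 < head_percent * total:
            head.append(token)
            covered += freq
        else:
            tail.append(token)
    return head, tail


# Toy corpus: two very frequent characters and six rare ones.
corpus = list("a" * 16 + "b" * 8 + "cdefgh")
head, tail = split_head_tail(corpus)

print(head)  # → ['a', 'b']
print(tail)  # → ['c', 'd', 'e', 'f', 'g', 'h']
```

Here two distinct characters make up 80% of the occurrences, while the remaining 20% is spread over six distinct characters. In the workflow above, the head would go to manual filtering and annotation, and the tail to automated construction.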

To sum up:

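Without the support of a large-scale MMKG, previous efforts to use multi-modal information remain limited. We envision that many tasks can be further improved once large-scale, high-quality MMKGs become available.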

