【论文阅读】Evaluating Mixed-initiative Conversational Search Systems via User Simulation|电子爱好者

admin管理员组
文章数量:1564700

文章目录

- Original Paper
- Motivation
- Contribution
- Methods
- - Semantically-controlled text generation
  - GPT2-based simulated user
- Datasets
- - Qulac and ClariQ
  - Multi-turn conversational data
- Future Work
- Knowledge

Original Paper

Evaluating Mixed-initiative Conversational Search Systems via User Simulation:

Motivation

Propose a conversational User Simulator, called USi, for automatic evaluation of such conversational search system.

Contribution

propose a user simulator, USi, for conversational search system evaluation, capable of answering clarifying questions prompted by the search system
perform extensive set of experiments to evaluate the feasibility of substituting real users with the user simulator
release a dataset of multi-turn interactions acquired through crowdsourcing

Methods

Semantically-controlled text generation

We define the task of generating answers to clarifying questions as a sequence generation task.

Current SOTA language models formulate the task as next-word prediction task:

generated text are prone to hallucination and in general lack semantic guidance

Answer generation needs to be conditioned on the underlying information need:

a i a_i ai is the current token of the answer
a < i a_{<i} a<i are all the previous ones
i n , q , c q in,q,cq in,q,cq correspond to the information need, initial query, current clarifying question

GPT2-based simulated user

base USi in the GPT-2 model with language modelling and classification losses(DoubleHead GPT-2)
- learn to generate the appropriate seq through the language modelling loss
- learn to distinguish a correct answer to the distractor one
- the two losses are linearly combined

Singel-turn responses:

GPT-2 input:
- accept as input two sequences: one with the original target answer in the end, the other with the distractor answer
- sample distractor answer from ClariQ dataset.

Conversation history-aware model:

history-aware GPT-2 input:
- [user] and [system] indicate the conversational turns between user and the conversational system respectively.

Inference:

omit the answer a a a from the input seq.
In order to generate answers, we use a combination of SOTA sampling techniques to generate a textual sequence from the trained model

The results are mainly about the setting of single-turn. Only some qualitative analysis for multi-turn are provided.

Datasets

Qulac and ClariQ

both built for single-turn offline evaluation.

Qulac: (topic, facet, clarifying_question, answer). ClariQ is an extension of Qulac and contains additional non-ambiguous topics.

facet from Qulac and ClariQ represents the underlying information need, as it describes in detail what the intent behind the issued query is. Moreover, question represents the current asked question, while answer is our language modelling target.

Multi-turn conversational data

A major drawback of above datasets is that they are both built for single-turn offline evaluation.

we construct multi-turn data that resembles a more realistic interaction between a user and the system. Our user simulator USi is then further fine-tuned on this data.

construct a crowdsourcing-based human-to-human interaction
construct in 500 conversations up to depth of three
construct edge cases: provide answers to additional 500 clarifying questions of poor quality, up to the depth of two

Future Work

a pair-wise comparison of multi-turn conversations.
aim to observe user simulator behaviour in unexpected, edge case scenarios
- for example, people will repeat the answer is the clarifying question is repeated. We want USi to do so.

Knowledge

Multi-turn passage retrieval: The system needs to understand the conversational context and retrieve appropriate passages from the collection.

Document-retrieval task with the answer to the prompted clarifying question: the initial query is expanded with the text of the clarifying question and the user’s answer and the fed into a retrieval model.

本文标签：论文 mixed Initiative Conversational Evaluating

版权声明：本文标题：【论文阅读】Evaluating Mixed-initiative Conversational Search Systems via User Simulation 内容由热心网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：https://www.elefans.com/dianzi/1727253331a1105095.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

更多相关文章

xp系统

电子爱好者 - 最新技术资讯及电子产品介绍！

【论文阅读】Evaluating Mixed-initiative Conversational Search Systems via User Simulation

文章目录

Original Paper

Motivation

Contribution

Methods

Semantically-controlled text generation

GPT2-based simulated user

Datasets

Qulac and ClariQ

Multi-turn conversational data

Future Work

Knowledge

更多相关文章

利用Adobe Photoshop 2020导入和批量输出论文中的图片

基于STC89C52单片机的智能灯光毕业设计论文

程序开发类本科论文结构【2024年修改】

android 毕业设计答辩ppt,别小看毕业答辩PPT，它和你的论文一样重要

计算机组装与维修拆卸论文,浅谈计算机组装维修论文

七个简单步骤撰写课程论文（academic essay）

论文翻译：ChatGPT: Bullshit spewer or the end of traditional assessments in higher education?

Introduction:论文引言句式积累

推荐开源项目：Text Encoding Initiative Repository

ADNI（Alzheimer`s disease neuroimaging initiative）介绍

理解OCI（Open Container Initiative）及docker的OCI实现(转)

推荐开源项目：Bug Bounty Standardization Initiative

**探索Anime Translation Initiative：开启你的动画翻译新纪元**

Agile Initiative, Epic, and StoryTask

UVa 497 - Strategic Defense Initiative

WAI(Web. Accessibility Initiative)标准

The i'm initiative is available only in the US

[论文阅读笔记04]GFTE：Graph-based Financial Table Extraction

基于java实现Android移动应用商店设计与实现演示【附项目源码+论文说明】

【论文解读|IJCAI2021】Towards a New Generation of Cognitive Diagnosis

发表评论

推荐文章

产品与运营之应用商店推广

远程时出现黑屏状况的处理方法

win10安装MinGW-W64的一点心得

Win10系统安装3dsmax2014常见问题及解决方案

【五一专属】阿里云ECS大测评#五一专属|向所有热爱分享的“技术劳动者”致敬#

热门文章

电脑硬件名词基础扫盲

APP上架到各大应用商店的小总结

VS code无法连接商店安装拓展应用

windows在安装双系统ubuntu过程中遇到的各种问题

mac打开ppt陷入报错循环

ubuntu18.04能够连wifi，但无法上网

关于讯飞语音听写RecognizerDialog 去除这个弹框view中的任何控件 更改其中内容

Android SDK删除内置的触宝输入法

如何用个人电脑搭建一台本地服务器，并部署云原生开发工具TitanIDE到服务器详细教程

魔兽3无法启动此程序因为计算机中丢失,win10运行war3出错无法启动怎么办_win10系统war3不能启动如何解决...

最新文章

Mac输入法设置

lubuntu输入法设置_Ubuntu 设置中文输入法

Linux 搜狗输入法 繁简切换 输入框显示 解决方案 WebStorm快捷键冲突 Ctrl+Shift+F

windows11 删除输入法

android百度日语输入法下载,百度日文输入法

百度词库bdict、搜狗细胞词库scel 转 txt 格式

centos图形化界面安装,中文输入法,mysql安装

android手机软件入门,新手入门Android手机必装软件之输入法篇

百度AI的2020

ubuntu输入法崩溃问题

【ubuntu】 输入法消失，重启（sogou）

都2021年了，输入法还能怎么玩出花？百度智慧输入：toB商业化！

百度输入法开放API 宣称可随意移植使用

Android10 内置第三方输入法

PC端输入法双拼皮肤分享

小米手机肿么还原时钟

15000流明是多少瓦

一般普通投影机功率多大?

苹果绿联转换器有些投影机不能用

坚果V9投影机具体参数?

有关九年级作文850字精选

80后90后_高一作文

中级卫生专业资格中医全科学主治医师中级模拟题2021年(9)案与解析

(精品)师范大学招考硕士研究生课程八六0试卷

ZXMVC8900(V3

【模拟人生4（The Sims 4）性感露背黑色亮片礼服MOD V20190313】模拟人生4（The Sims 4）性感露背黑色亮片礼服MOD V20190313 官方免费下载

探索Anime Translation Initiative：开启你的动画翻译新纪元

关于讯飞语音听写RecognizerDialog 去除这个弹框view中的任何控件更改其中内容

Linux 搜狗输入法繁简切换输入框显示解决方案 WebStorm快捷键冲突 Ctrl+Shift+F

【ubuntu】输入法消失，重启（sogou）

【鬼泣5（Devil May Cry V）v1.0十四项修改】鬼泣5（Devil May Cry V）v1.0十四项修改官方免费下载