【Langchain实践】FewShotPromptTemplate实践总结|电子爱好者

admin管理员组
文章数量:1666728

FewShotPromptTemplate总结

加载本地模型完成嵌入

一般常用的模型是OpenAIEmbeddings,但是这要求很好的网络和一个OpenAI的key。因此能用本地模型实现嵌入是一个理想的替代方案。

langchain提供了多种嵌入的方式，这个网址里面包含了所有langchain支持的嵌入模型。
由于Huggingface支持的模型很多，加载本地模型采用HuggingFaceEmbedding的方式适用面更广。而且，如果目标语言是中文，需要嵌入中文效果非常好的模型。为此，使用本地模型完成嵌入的代码如下：

# 嵌入
from langchain.embeddings.huggingface import HuggingFaceEmbeddings
# 加载从huggingface下载到本地的模型
embedding = HuggingFaceEmbeddings(model_name='path-on-machine')

注意：需要将huggingface仓库里的所有文件下载下来，尤其是1_Pooling文件夹，这个文件夹没有下载模型是无法从本地加载成功的。

构建模版

在prompt模版构建过程中，langchain采用的format格式化的方式填充数据。format的底层原理是f-string语法，通过识别{}来填充数据，而当模版中存在{}时，使用f-string来填充数据会报missing some input keys的错误。这个问题的解决方法在github参考上有讨论，详细见参考[2]。一种很直接的方式是换一种模版。langchain还支持其他两种模版，jinja2和mustache。

# langchain_core/prompts/few_shot.py#121
template_format: Literal["f-string", "mustache", "jinja2"] = "f-string"

使用jinja2模版语法的代码如下，jinja相关的语法见参考[3]：

context_prompt = """
问题:{{ prompt }}
回答：{{ response }}"""
example_prompt = PromptTemplate(input_variables=["prompt", "response"], template=context_prompt,template_format="jinja2")

向量数据库

langchain支持了很多的向量数据库，如Chroma、FAISS等，见参考[4]。如果一个向量数据不起作用，可以尝试另一个向量数据库。笔者在实践过程中，Chroma对于k设置为任何值都只返回第一个查询结果，然后重复k次，后面换了FAISS数据库才得以解决。

# 构造选择器
example_selector = SemanticSimilarityExampleSelector.from_examples(
    # This is the list of examples available to select from.
    examples,
    # This is the embedding class used to produce embeddings which are used to measure semantic similarity.
    embedding,
    # This is the VectorStore class that is used to store the embeddings and do a similarity search over.
    FAISS,
    # Chroma只能返回一个结果
    # Chroma,
    # This is the number of examples to produce.
    k=3,
)

整个FewShotPromptTemplate的代码框架如下：

# 数据加载

# 嵌入
from langchain.embeddings.huggingface import HuggingFaceEmbeddings
# 向量数据库
from langchain_chroma import Chroma
from langchain_community.vectorstores import FAISS

# 选择器
from langchain_core.example_selectors import SemanticSimilarityExampleSelector
# prompt模板
from langchain_core.prompts.few_shot import FewShotPromptTemplate
from langchain_core.prompts.prompt import PromptTemplate


import pandas as pd
import json

# 原始数据
examples = [{"prompt":11,"response":22},{"prompt":333,"response":444}]

# 嵌入模型
embedding = HuggingFaceEmbeddings(model_name='./huggingface/text2vec-base-chinese')


context_prompt = """
问题:{{ prompt }}
回答：{{ response }}"""

example_prompt = PromptTemplate(input_variables=["prompt", "response"], template=context_prompt,template_format="jinja2")

prompt = FewShotPromptTemplate(
    examples=examples,
    example_prompt=example_prompt,
    suffix="Question: {{ input }}",
    input_variables=["input"],
    template_format = 'jinja2'
)

# print(prompt.format(input="今天天气真好"))

# 构造选择器
example_selector = SemanticSimilarityExampleSelector.from_examples(
    # This is the list of examples available to select from.
    examples,
    # This is the embedding class used to produce embeddings which are used to measure semantic similarity.
    embedding,
    # This is the VectorStore class that is used to store the embeddings and do a similarity search over.
    FAISS,
    # Chroma只能返回一个结果
    # Chroma,
    # This is the number of examples to produce.
    k=3,
)
# Select the most similar example to the input.
# question = "who am I?"
# selected_examples = example_selector.select_examples({"prompt": question})
# print(f"Examples most similar to the input: {question}")
# for example in selected_examples:
#     print("\n")
#     print(example)
#     for k, v in example.items():
#         print(f"{k}: {v}")
# 使用FewShotPromptTemplate
prompt = FewShotPromptTemplate(
    example_selector=example_selector,
    example_prompt=example_prompt,
    suffix="Question: {{ input }}",
    input_variables=["input"],
    template_format = 'jinja2'
)

# # 字段
print(prompt.format(input="Love yourself!"))

参考

[1] 大规模文本嵌入的排行榜

[2] langchain的template构造过程缺少输入关键词missing input keys

[3] jinja2语法

[4] langchain中的vector store

本文标签： LangChain FewShotPromptTemplate

版权声明：本文标题：【Langchain实践】FewShotPromptTemplate实践总结内容由热心网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：https://www.elefans.com/xitong/1730076042a1221789.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

更多相关文章

xp系统

电子爱好者 - 最新技术资讯及电子产品介绍！

【Langchain实践】FewShotPromptTemplate实践总结

FewShotPromptTemplate总结

加载本地模型完成嵌入

构建模版

向量数据库

参考

更多相关文章

Langchain 的 Conversation buffer memory

【Langchain多Agent实践】一个有推销功能的旅游聊天机器人

手把手教你Langchain-chatchat 接入Dify

【2024最全最细Langchain教程-10】Langchain记忆模块

【langchain学习】LLMChain和ConversationChain的用法以及区别（附代码示例）

【LangChain】内存管理简介及实践

大模型从入门到应用——LangChain：代理（Agents）-[代理执行器（Agent Executor）：使用Agents的异步API和创建ChatGPT克隆]

使用 LangChain 和 Elasticsearch 的隐私优先 AI 搜索

基于LangChain-Chatchat实现智能问答系统

Elasticsearch：使用 Langchain 和 OpenAI 进行问答

突破界限：LangChain 引领 AI 应用构建的新时代

初识langchain：LLM大模型+Langchain实战

基于huggingface和langchain快速开发大模型应用

[LangChain核心模块]模型的输入和输出-＞Prompts

LangChain入门：24.通过Baby AGI实现自动生成和执行任务

很火的 LangChain 是个什么东东？

基于LangChain+LLM的相关技术研究及初步实践

LangChain 48 终极解决 实战Langchain访问OpenAI ChatGPT API Account deactivated的另类方法，访问跳板机API

LangChain入门：2.OpenAPI调用ChatGPT模型

开源模型应用落地-LangChain实用小技巧-ChatPromptTemplate的partial方法（一）

发表评论

推荐文章

公司邮箱如何注册？免费公司邮箱域名如何注册？

WinRAR去广告教程

认识cocos2d-x jsbinding

点心发布新版安卓优化大师

SQL SERVER 19安装 SQL Prompt 10.02版本

热门文章

通过js唤醒app或者跳转应用市场

苹果Mac电脑的复制粘贴不能用了

【Watir Webdriver】自动登录QQ邮箱并发送电子邮件

前端单点登录（SSO）实现方法（二级域名与主域名）

全网邮箱email地址采集api接口及实现分析

国际顶级会议期刊级别介绍

高层游戏引擎——基于OGRE所实现的高层游戏引擎框架

w10系统怎样打开计算机策略,Win10系统组策略在哪里打开

鸿蒙智慧屏安装应用,谁说华为智慧屏不能装APP，我来打脸了，附零难度安装APP教程...

bootice工具修复linux,bootice工具怎么修复引导win7

最新文章

ubuntu24.04安装搜狗输入法，解决系统自带fcitx5与fcitx冲突问题

解决ubuntu安装完搜狗输入法只能使用英文，无法输入中文

安装搜狗输入法无法切到搜狗

Python自建chatgpt服务器：使用Flask实现类似Chat服务器流式API接口

Vue3实现类ChatGPT聊天式流式输出(vue-sse实现)

linux 18.04安装搜狗输入法后不能输出中文

ubuntu20.04下搜狗输入法不能输中文问题解决

linux下搜狗安装目录,搜狗输入法Linux版配置文件详解

Ubuntu20.10 安装搜狗输入法

ChatGPT有哪些应用场景

Ubuntu 20.04安装搜狗输入法无法输入中文

解决ubuntu20搜狗输入法输入不了中文问题

linux安装搜狗输入法后无法输入中文

如何在Android中构建一个ChatGPT以图像形式输出的程序

[Cursor Tool] 面向编程的ChatGPT工具的入门使用指南

小米手机肿么还原时钟

15000流明是多少瓦

一般普通投影机功率多大?

苹果绿联转换器有些投影机不能用

坚果V9投影机具体参数?

有关九年级作文850字精选

80后90后_高一作文

中级卫生专业资格中医全科学主治医师中级模拟题2021年(9)案与解析

(精品)师范大学招考硕士研究生课程八六0试卷

ZXMVC8900(V3

【模拟人生4（The Sims 4）性感露背黑色亮片礼服MOD V20190313】模拟人生4（The Sims 4）性感露背黑色亮片礼服MOD V20190313 官方免费下载

【生化危机2：重制版（Resident Evil 2 Remake）克莱尔红头发深色服装MOD】生化危机2：重制版（Resident Evil 2 Remake）克莱尔红头发深色服装MOD 官方免费下载

【模拟人生4（The Sims 4）性感露背深V领吊带裙MOD V20190311】模拟人生4（The Sims 4）性感露背深V领吊带裙MOD V20190311 官方免费下载

【模拟人生4（The Sims 4）科幻风宇宙飞船家庭住宅MOD V20190311】模拟人生4（The Sims 4）科幻风宇宙飞船家庭住宅MOD V20190311 官方免费下载

【鬼泣5（Devil May Cry V）v1.0十四项修改】鬼泣5（Devil May Cry V）v1.0十四项修改 官方免费下载

如何实现高效的treenode搜索算法

treenode与链表有何本质区别

在哪些场景下应优先考虑使用treenode

LangChain 48 终极解决实战Langchain访问OpenAI ChatGPT API Account deactivated的另类方法，访问跳板机API

【鬼泣5（Devil May Cry V）v1.0十四项修改】鬼泣5（Devil May Cry V）v1.0十四项修改官方免费下载