面向产品构建基于RAG的LLM应用

1,076次阅读

详细介绍了如何从头开始构建一个基于检索增强生成(RAG)的大型语言模型(LLM)应用。

主要步骤包括：

加载数据、分割文本、嵌入数据、索引数据、检索相关文本块、生成回复。
为了扩展应用，实现了在Ray Data上进行并行计算的功能。
为评估不同系统配置，实现了组件级评估和端到端评估。
比较了不同的文本块大小、块数、嵌入模型和LLM的性能。
实现了查询路由，根据查询复杂性将其发送到合适的LLM。
使用Ray Serve架构应用，实现弹性伸缩。
讨论了LLM应用的一阶和二阶影响。
提出后续工作，包括持续更新、微调嵌入模型和LLM、收集用户反馈等。
强调了Ray和Anyscale如何帮助构建、扩展和产品化LLM应用。

GitHub: github.com/ray-project/llm-applications
Notebook: github.com/ray-project/llm-applications/blob/main/notebooks/rag.ipynb

正文完

可以使用微信扫码关注公众号（ID：xzluomor）

AI AR HTML RSS 产品大型语言模型架构

发表至：智源

2023-09-14

EMNLP2023论文：基于机器翻译模型采用约束束搜索算法生成优化的机器翻译质量评估伪数据

哥伦比亚大学 | Zero-1-to-3: 由一张图像生成三维物体

无需额外训练提升模型30%性能！DeepMind科学家点赞MIT博士生实习成果

ACL2023 | 使用文本包裹分子进行生成式预训练

Stability最新发布的生成式音频模型Stable Audio

卡内基梅隆大学计算机学院推出「机器人」本科学位

评论（没有评论）

文章搜索

最新评论

ufabet มีเกมให้เลือกเล่นมากมาย: เกมเดิมพันหลากหลาย ครบทุกค่ายดัง

tornado crypto mixer Discover the power of privacy with TornadoCash! Learn how this decentralized mixer ensures your transactions remain confidential.

ดูบอลสด Very well presented. Every quote was awesome and thanks for sharing the content. Keep sharing and keep motivating others.

ดูบอลสด Pretty! This has been a really wonderful post. Many thanks for providing these details.

ดูบอลสด Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

Obrazy Sztuka Nowoczesna Thank you for this wonderful contribution to the topic. Your ability to explain complex ideas simply is admirable.

ufabet Hi there to all, for the reason that I am genuinely keen of reading this website’s post to be updated on a regular basis. It carries pleasant stuff.

ufabet You’re so awesome! I don’t believe I have read a single thing like that before. So great to find someone with some original thoughts on this topic. Really.. thank you for starting this up. This website is something that is needed on the internet, someone with a little originality!

ufabet Very well presented. Every quote was awesome and thanks for sharing the content. Keep sharing and keep motivating others.

热评文章

Generated by Feedzy

面向产品构建基于RAG的LLM应用

n8n实战：Webhook、条件判断与API集成详解

谷歌太壕了！编程Agent大招至简：开源且免费，百万上下文、多模态、MCP全支持

百度文心快码AI IDE上线，首创设计稿一键转代码、支持MCP

老黄新鲜一刀，RTX 5050正式官宣

国产GPU历史性时刻！摩尔线程、沐曦同日获IPO受理

00后投身具身智能创业，剑指机器人界「Model 3」！已推出21个自由度灵巧手

监督学习也能从错误中学习反思？！清华英伟达联合提出隐式负向策略爆炸提升数学能力

AI也会闹情绪了！Gemini代码调试不成功直接摆烂，马斯克都来围观

百度文心快码AI IDE上线，首创设计稿一键转代码、支持MCP

曝苹果拟收购Perplexity AI，人才一并拿走