嵌入+知识图:RAG系统的终极工具
The advent of large language models (LLMs) , trained on vast amounts of text data, has been one of the most significant breakthroughs in natural language processing. The ability of these models to generate remarkably fluent and coherent text with just a short prompt has opened up new possibilities for conversational AI, creative writing, and a wide array of other applications.
TimeGPT:时间序列预测的第一个基础模型
写杀手提示:掌握人工智能惊人结果的提示工程
Prompt engineering, or prompt design, is crafting instructions for LLMs to get desired responses. It’s essential for ensuring accurate, high-quality responses from a large language model.
高级RAG 07:探索表格的RAG
Implementing RAG presents a challenge, especially when it comes to effectively parsing and understanding tables in unstructured documents. This is particularly difficult with scanned documents or documents in image format. There are at least three aspects of these challenges:
使用LLM从非结构化文本中提取结构化数据
This is Part 1 of my “Understanding Unstructured Data” series. Part 2 focuses on analyzing structured data extracted from unstructured text with a LangChain agent.
Apache Kafka+矢量数据库+LLM=实时GenAI
Generative AI (GenAI) enables advanced AI use cases and innovation but also changes how the enterprise architecture looks like. Large Language Models (LLM), Vector Databases, and Retrieval Augmentation Generation (RAG) require new data integration patterns and data engineering best practices.
如何在LLM应用程序中提高RAG结果:从基础到高级
If you’re building any meaningful product/feature with LLMs (large language models), you’ll probably use the technique called RAG (retrieval-augmented generation). It can allow you to integrate external data that was not available in the LLM’s training data into the LLM’s text generation process, which can greatly reduce the nightmare of hallucination and improve the relevance of the text responses.
告别Python中的循环,欢迎矢量化!
如何使用RAG改进LLM
This article is part of a larger series on using large language models in practice. In the previous post, we fine-tuned Mistral-7b-Instruct to respond to YouTube comments using QLoRA.
如何制作RAG系统以获得对您数据的强大访问
A RAG system is an innovative approach to information retrieval. It utilizes traditional information retrieval approaches like vector similarity search combined with state-of-the-art large language model technology. Combined, these technologies make up a robust system that can access vast amounts of information from a simple prompt.