Engineering Practices for LLM Application Development
LLM engineering involves much more than just prompt design or prompt engineering. In this article, we share a set of engineering practices that helped us deliver a prototype LLM application rapidly and reliably in a recent project. We'll share techniques for automated testing and adversarial testing of LLM applications, refactoring, as well as considerations for architecting LLM applications and responsible AI.
An LLM-Based Chatbot for Advanced Data Analytics, Visualization, and Automated Insight Extraction
7 Top Large Language Model Use Cases and Applications
Step into the realm of language magic with our in-depth tutorial on Large Language Model (LLM) use cases and applications. From personalized recommendations to smart chatbots, discover how these linguistic powerhouses are revolutionizing industries.
LangChain Agents
```
Now, I will execute this query to get the total sales per country.

[('USA', 523.0600000000003), ('Canada', 303.9599999999999), ('France', 195.09999999999994), ('Brazil', 190.09999999999997), ('Germany', 156.48), ('United Kingdom', 112.85999999999999), ('Czech Republic', 90.24000000000001), ('Portugal', 77.23999999999998), ('India', 75.25999999999999), ('Chile', 46.62)]

The total sales per country are as follows:
1. USA: $523.06
2. Canada: $303.96
3. France: $195.10
4. Brazil: $190.10
5. Germany: $156.48
6. United Kingdom: $112.86
7. Czech Republic: $90.24
8. Portugal: $77.24
9. India: $75.26
10. Chile: $46.62
```
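The aggregation the agent performed can be reproduced with a short script. This is a minimal sketch assuming a Chinook-style invoices table; the table name, columns, and sample values below are illustrative stand-ins, not the real dataset.

```python
import sqlite3

# Tiny in-memory stand-in for a Chinook-style invoices table
# (schema and values are illustrative assumptions).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE invoices (billing_country TEXT, total REAL)")
conn.executemany(
    "INSERT INTO invoices VALUES (?, ?)",
    [("USA", 523.06), ("Canada", 303.96), ("France", 195.10)],
)

# The same aggregation the agent ran: total sales per country, descending.
rows = conn.execute(
    "SELECT billing_country, SUM(total) FROM invoices "
    "GROUP BY billing_country ORDER BY SUM(total) DESC"
).fetchall()

# Format the raw tuples the way the agent presented them.
for rank, (country, total) in enumerate(rows, start=1):
    print(f"{rank}. {country}: ${total:.2f}")
```

The agent's final formatting step is just rounding each summed total to two decimal places and numbering the rows.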
Retrieval-Augmented Generation with Milvus and LlamaIndex
This guide demonstrates how to build a Retrieval-Augmented Generation (RAG) system using LlamaIndex and Milvus.
A RAG system combines a retrieval component with a generative model: it first retrieves relevant documents from a corpus using a vector similarity search engine such as Milvus, then passes those documents to a generative model, which produces an answer grounded in the retrieved content.
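The retrieve-then-generate flow can be sketched end to end. This is a toy illustration only: the bag-of-words "embedding" and in-memory index stand in for a real embedding model and a Milvus collection, and the `generate` function is a placeholder for the LLM call.

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Toy "embedding": bag-of-words counts. A real system would use a
    # neural embedding model and store the vectors in Milvus.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

corpus = [
    "Milvus is a vector database for similarity search",
    "LlamaIndex connects LLMs to external data",
    "RAG retrieves documents and feeds them to a generator",
]
index = [(doc, embed(doc)) for doc in corpus]

def retrieve(query: str, k: int = 2) -> list[str]:
    # Rank documents by vector similarity to the query and keep the top k.
    q = embed(query)
    ranked = sorted(index, key=lambda pair: cosine(q, pair[1]), reverse=True)
    return [doc for doc, _ in ranked[:k]]

def generate(query: str, context: list[str]) -> str:
    # Placeholder for the LLM call: a real system would prompt the model
    # with the retrieved context plus the user query.
    return f"Answer to {query!r} grounded in {len(context)} retrieved docs"

docs = retrieve("what is a vector database")
print(generate("what is a vector database", docs))
```

Swapping the toy pieces for LlamaIndex's Milvus integration preserves the same shape: embed, retrieve by similarity, then generate from the retrieved context.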
Evaluating Large Language Models (LLMs): A Standard Set of Metrics for Accurate Evaluation
Large Language Models (LLMs) are a type of artificial intelligence model that can generate human-like text. They are trained on large amounts of text data and can be used for a variety of natural language processing tasks, such as language translation, question answering, and text generation.
Evaluating LLMs is important to ensure that they are performing well and generating high-quality text. This is especially important for applications where the generated text is used to make decisions or provide information to users.
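Two of the most common answer-quality metrics are exact match and token-overlap F1 (the SQuAD-style scoring scheme). A minimal sketch, with only light normalization (lowercasing and whitespace trimming); production evaluators typically also strip punctuation and articles.

```python
def exact_match(pred: str, ref: str) -> bool:
    # Strict string equality after light normalization.
    return pred.strip().lower() == ref.strip().lower()

def token_f1(pred: str, ref: str) -> float:
    # Token-overlap F1: harmonic mean of token precision and recall.
    p, r = pred.lower().split(), ref.lower().split()
    remaining = list(r)
    common = 0
    for tok in p:
        if tok in remaining:   # count each reference token at most once
            remaining.remove(tok)
            common += 1
    if common == 0:
        return 0.0
    precision = common / len(p)
    recall = common / len(r)
    return 2 * precision * recall / (precision + recall)

print(token_f1("Paris is the capital", "the capital is Paris"))  # 1.0
```

Note that token F1 is order-insensitive, which is why the word-reordered example above still scores 1.0; exact match would reject it.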
How to Evaluate LLMs: A Complete Metric Framework
Over the past year, excitement around Large Language Models (LLMs) skyrocketed. With ChatGPT and BingChat, we saw LLMs approach human-level performance on everything from standardized exams to generative art. However, many of these LLM-based features are new, carry many unknowns, and hence require careful rollout to preserve privacy and social responsibility. While offline evaluation is suitable for early feature development, it cannot assess how model changes benefit or degrade the user experience in production.
Evaluating the Effectiveness of Question-Answering Models Based on Semantic Search and LLMs
Question-answering systems based on semantic search and LLMs are currently among the most popular applications of LLM functionality. But what happens after we build one? How do we evaluate how well a QnA system works?
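One simple starting point for evaluating the retrieval side of such a system is answer hit rate: the fraction of questions for which at least one retrieved passage contains the gold answer string. A crude but common sanity check; the sample passages and answers below are made up for illustration.

```python
def answer_hit_rate(retrieved: list[list[str]], gold_answers: list[str]) -> float:
    # Fraction of questions where some retrieved passage contains
    # the gold answer (case-insensitive substring match).
    hits = 0
    for passages, answer in zip(retrieved, gold_answers):
        if any(answer.lower() in p.lower() for p in passages):
            hits += 1
    return hits / len(gold_answers)

# Illustrative data: two questions, with their retrieved passages.
retrieved = [
    ["Paris is the capital of France.", "France is in Europe."],
    ["The Nile flows through Egypt."],
]
gold = ["Paris", "Amazon"]
print(answer_hit_rate(retrieved, gold))  # 0.5
```

Substring matching is brittle (paraphrased answers are missed), so this is best treated as a lower bound before moving to semantic-similarity or LLM-judged metrics.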
LLM Evaluation Metrics: Everything You Need for LLM Evaluation
Although evaluating the outputs of Large Language Models (LLMs) is essential for anyone looking to ship robust LLM applications, LLM evaluation remains a challenging task for many. Whether you are refining a model’s accuracy through fine-tuning or enhancing a Retrieval-Augmented Generation (RAG) system’s contextual relevancy, understanding how to develop and decide on the appropriate set of LLM evaluation metrics for your use case is imperative to building a bulletproof LLM evaluation pipeline.
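For the RAG contextual-relevancy side mentioned above, a standard building block is recall@k: the share of known-relevant documents that appear in the top-k retrieved results. A minimal sketch, with made-up document IDs:

```python
def recall_at_k(retrieved_ids: list[str], relevant_ids: set[str], k: int) -> float:
    # Share of known-relevant documents found in the top-k results.
    if not relevant_ids:
        return 0.0
    top_k = set(retrieved_ids[:k])
    return len(top_k & relevant_ids) / len(relevant_ids)

# Two relevant docs exist; only "d1" appears in the top 2 results.
print(recall_at_k(["d3", "d1", "d7"], {"d1", "d2"}, k=2))  # 0.5
```

Tracking recall@k alongside generation metrics helps separate retrieval failures from generation failures when an evaluation pipeline flags a bad answer.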