Empowering Large Language Models for Textual Data Augmentation - 代码天地

Empowering Large Language Models for Textual Data Augmentation

企业开发 2024-11-01 21:07:59 阅读次数: 0

本文是LLM系列文章，针对《Empowering Large Language Models for Textual Data Augmentation》的翻译。

为大型语言模型提供文本数据增强

摘要
1 引言
2 相关工作
3 前言
4 提出的方法-自我 LLMDA
5 实验
6 结论
7 局限性

摘要

凭借理解和执行自然语言指令的能力，大型语言模型（LLM）有可能成为文本数据增强的强大工具。但是，增强数据的质量在很大程度上取决于提供的增强指令，并且有效性可能会因不同的下游任务而波动。虽然手动制作和选择指令可以提供一些改进，但由于下游任务的多样性，这种方法在实践中面临可扩展性和一致性问题。在这项工作中，我们通过提出一种新的解决方案来解决这些限制，该解决方案可以自动生成大量增强指令并选择最合适的任务知情指令，从而使 LLM 能够为不同的下游任务创建高质量的增强数据。从实证上讲，与非 LLM 和基于 LLM 的数据增强方法相比，所提出的方法始终生成质量更好的增强数据，从而在来自广泛应用领域的 26 个小样本学习任务上获得最佳性能。

1 引言

2 相关工作

3 前言

4 提出的方法-自我 LLMDA

5 实验

6 结论

在这项工作中，我们介绍了

猜你喜欢

转载自blog.csdn.net/c_cpp_csharp/article/details/143332950

Empowering Large Language Models for Textual Data Augmentation

WizardKM:Empowering Large Language Models to Follow Complex Instructions

Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation

【开源AI大模型】WizardCoder: Empowering Code Large Language Models with Evol-Instruct

Are Large Language Models Chameleons?

Bring Your Data！Self- supervised Evolution of Large Language Models

Challenges and Applications of Large Language Models

A Survey of Large Language Models Attribution

Artificial Agency and Large Language Models

Large Language Models in Finance: A Survey

Instruction Mining:High-Quality Instruction Data Selection for Large Language Models

Re74 读论文：DataGemma Knowing When to Ask - Bridging Large Language Models and Data

微调大模型（Finetuning Large Language Models）—Data_preparation（四）

【论文精读】Emergent Abilities of Large Language Models

Are Emergent Abilities of Large Language Models a Mirage?

论文阅读 A Survey of Large Language Models 1

论文阅读 A Survey of Large Language Models 3

论文阅读 A Survey of Large Language Models 2

Augmented Large Language Models with Parametric Knowledge Guiding

Enabling Large Language Models to Generate Text with Citations

A Survey on Model Compression for Large Language Models

Recommender Systems in the Era of Large Language Models (LLMs)

TASKBENCH: BENCHMARKING LARGE LANGUAGE MODELS FOR TASK AUTOMATION

Trends in Integration of Knowledge and Large Language Models

A Survey of Text Watermarking in the Era of Large Language Models

A Survey on Multimodal Large Language Models for Autonomous Driving

Large Language Models in Targeted Sentiment Analysis for Russian

A Survey on Multimodal Large Language Models论文解读

ADELIE: Aligning Large Language Models on Information Extraction

SEO: Stochastic Experience Optimization for Large Language Models

今日推荐

Electron中的关于静态资源加载问题解决方案

《Cursor-AI编程》基础篇-界面指南

《Cursor-AI编程》基础篇-Tab代码智能补充

《Cursor-AI编程》基础篇-Composer功能详解

《Cursor-AI编程》基础篇-Chat功能详解

《Cursor-AI编程》进阶篇-自定义模型

《Cursor-AI编程》进阶篇-上下文详解

【大模型系列篇】最强检索增强技术GraphRAG基本原理详解

【大模型系列篇】基于Ollama和GraphRAG v2.0.0快速构建知识图谱

解释什么是迁移学习？在 CNN 中如何应用？（面试题200合集，高频、关键）

解释数据增强（Data Augmentation）的概念和方法（（面试题200合集，高频、关键））

揭秘大模型“魔法”：Function Calling 让 AI 不止会说，更能“做”！

周排行

ConfigurationClassParser类的parse方法源码解析

基础大讲堂-java 位运算符

ConsecutiveInteger判断给定的整数n能否表示成连续的m(m>1)个正整数之和

多项式问题之六——多项式快速幂

Spring Security技术栈开发企业级认证与授权（四）RESTful API服务异常处理

Linux基础命令---apachectl

MATLAB中的线性插值

Unity编辑器拓展之十七：NGUI ComponentSelector增加搜索框

SqlServer 备份还原教程

[Unity动画]01.

每日归档

更多

2025-04-12(10529)

2025-04-11(9561)

2025-04-10(1213)

2025-04-09(10354)

2025-04-08(12998)

2025-04-07(0)

2025-04-06(0)

2025-04-05(0)

2025-04-04(0)

2025-04-03(0)