Democratizing LLMs: An Exploration of Cost-Performance Trade-offs in Self-Refine

发布人

论文简述：在这篇名为Democratizing LLMs: An Exploration of Cost-Performance Trade-offs in Self-Refined Open-Source Models的论文中，作者们探讨了开源语言模型（LLM）在成本性能权衡方面的民主化问题。他们注意到，目前市场上主要依赖于专有LLM服务，这导致了访问受限和信息隐私担忧的问题。为了应对这一挑战，作者提出了一种名为自我反思的开源模型迭代改进方法，以及一种新的评估指标——性能、精炼和推理成本得分（PeRFICS）来为特定任务选择最佳模型。实验结果显示，针对不同大小的开源模型（从7B到65B），其平均性能提高了8.2%。尤为引人注意的是，即使模型的内存占用非常小（如Vicuna-7B），整体性能也提高了11.74%，在高创造力和开放性任务方面甚至达到了25.39%的提升。Vicuna-13B进一步在Vicuna基准测试中超越了ChatGPT。这一工作对于资源受限和关注信息隐私的环境具有重要意义，它们可以在不产生高昂成本、牺牲性能和隐私的情况下利用LLM。通过域无关的自我反思过程以及新的评估指标PeRFICS，论文为模型选择提供了明智的决策支持，从而降低了成本和实现了LLM的民主化。
论文链接： https://arxiv.org/pdf/2310.07611

打开封面下载高清视频观看高清视频视频下载器

Democratizing LLMs: An Exploration of Cost-Performance Trade-offs in Self-Refine

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

An Image is Worth Multiple Words: Learning Object Level Concepts using Multi-Con

Large Language Models Cannot Self-Correct Reasoning Yet

FineControlNet：基于空间对齐文本控制注入的图像生成细粒度文本控制

【AI视频】图生视频支持ControlNet pose AI视频动作驱动作 CogvideoX-Fun-pose ComfyUI工作流

英伟达最强开源模型 Llama 3.1 Nemotron 70B ：超越GPT-4o？| 英伟达 Nim 平台值得探索

AutoMix: Automatically Mixing Language Models

Improving Large Language Model Fine-tuning for Solving Math Problems

MusicAgent: An AI Agent for Music Understanding and Generation with Large Langua

小红书颜值博主视频：Flux+ CogvideoX AI视频 ComfyUI工作流对比

LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models

TOFU: A Task of Fictitious Unlearning for LLMs

HallusionBench: You See What You Think? Or You Think What You See? An Image-Cont

ChatCoder: Chat-based Refine Requirement Improves LLMs' Code Generation

Editing Personality for LLMs

TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarizat

单张图像到3D的高效生成：基于分摊生成的3D高斯模型

TopoMLP: An Simple yet Strong Pipeline for Driving Topology Reasoning

Ziya2: Data-centric Learning is All LLMs Need

基于多模态大模型开源OCR2.0模型

LLM Lies: Hallucinations are not Bugs, but Features as Adversarial Examples

Label Supervised LLaMA Finetuning

Efficient Tool Use with Chain-of-Abstraction Reasoning

基于LLM的输入输出安全保障模型：Llama Guard在人机对话中的应用

DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation

System 2 Attention (is something you might need too)

FreeControl：实现任意文本到图像扩散模型的无训练空间控制

FlowVid：利用不完美光流实现一致性视频到视频合成

基于文本到图像扩散模型的文本到视频生成架构研究

Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language M

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting

TrustLLM: Trustworthiness in Large Language Models

Question Aware Vision Transformer for Multimodal Reasoning

Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild

多模态基础模型研究综述

In-Context Learning Creates Task Vectors

SyncDreamer：基于单图像生成3D物体新方法

Large Language Model Cascades with Mixture of Thoughts Representations for Cost-

AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort