[ICLR2024] Skeleton-of-Thought: Prompting Large Language Models for Efficient Pa

[NeurIPS2023 Workshop] LLM-MQ: Mixed-precision Quantization for Efficient LLM De

[MLSys2024] FlashDecoding++: Faster Large Language Model Inference with Asynchro

[ICML2024] Evaluating Quantized Large Language Models

[FPGA2024] FlightLLM: Efficient Large Language Model Inference with a Complete M

An Introduction to Large Language Model Quantization

[MLSYS2023] Exploiting Hardware Utilization and Adaptive Dataflow for Efficient

[ASPLOS2024] FEASTA: A Flexible and Efficient Accelerator for Sparse Tensor Alge

[ICML2024] Language Agents with Reinforcement Learning for Strategic Play in the

"Sparse Compute-in-Memory Computers and Architecture", Yue Jinshan

[WACV2024] TCP: Triplet Contrastive-relationship Preserving for Class-Incrementa

[AAAI2023] Memory-Oriented Structural Pruning for Efficient Image Restoration

[MLCOS workshop] dgsparse Towards Efficient Graph Neural Networks on GPUs

BAAI Conference 2024, Yuan Jinhui: What Important Problems Remain in AI Systems?

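An Alibaba veteran spent 6 months recording a complete video course on large AI models: easy to follow, job-ready once you finish, aiming for a million-yuan annual salary!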

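17.5k stars on GitHub, with both Chinese and English versions available before publication! Build a large language model with just a laptop!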

[AAAI2024] Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with

[FPT2023] UAV Swarm Planning Accelerator on FPGA with low latency and fixed-poin

[CVPR2024] FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diff

(AI4Med Series) Towards Building Multilingual Language Model for Medicine

Beginning Llamafile for Local Large Language Models (LLMs)

[WSDM2024] DeSCo: Towards Generalizable and Scalable Deep Subgraph Counting

[DAC2023] Processing-In-Hierarchical-Memory Architecture for Billion-Scale Appro

[AAMAS2023] Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-T

[DATE2024] DyPIM: Dynamic-inference-enabled Processing-In-Memory Accelerator

How to Make 20 Million US Dollars Using AI Plus the Automation Tool playwright

Transformer combined with U-Net lands in a Nature sub-journal! Latest research, with accuracy and efficiency improving together

To Believe or Not to Believe Your LLM(Google 2024)

[DAC2023] An Efficient Accelerator for Point-based and Voxel-based Point Cloud

Andrew Ng's LLM course series: June 2024, "Building Your Own Database Agent" (with code and course materials)

[HPCA2023] Realizing Extreme Endurance Through Fault-aware Wear Leveling and Imp

Aligner: A Large Language Model Aligner Based on the Residual Idea (group meeting talk)

Deploying Zhipu GLM-4-9B-Chat on a single 4090 card: a quick 30-second experience!

[Official tutorial] GLM-4-9B hands-on deployment and fine-tuning

[FCCM20] Enabling Efficient and Flexible FPGA Virtualization for Deep Learning

"Intelligent Computing Systems: From Deep Learning to Large Models, 2nd Edition", Chapter 9: Large Model Design Systems

[ICCV2023] Ada3D: Exploiting the Spatial Redundancy with Adaptive Inference for

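The LLM "black book" that everyone working on large models is studying, rated as high as 9.9 on Douban! The LLM applications book that is hugely popular abroad is here (Chinese PDF included)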

Andrew Ng's 2024 LLM course "Building Agentic RAG with LlamaIndex", Chinese and English subtitles (with study materials)

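The AI community is arguing again! An ACL 2024 paper delivers its verdict: large language models ≠ world simulators, sparking heated debate. Turing Award winner Yann LeCun: I agree!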

The best guide to training your own LLM is here! The book "Building a Large Language Model from Scratch" is just too good (PDF included)