[ICLR2024] Skeleton-of-Thought: Prompting Large Language Models for Efficient Pa
[NeurIPS2023 Workshop] LLM-MQ: Mixed-precision Quantization for Efficient LLM De
[MLSys2024] FlashDecoding++: Faster Large Language Model Inference with Asynchro
[ICML2024] Evaluating Quantized Large Language Models
[FPGA2024] FlightLLM: Efficient Large Language Model Inference with a Complete M
An Introduction to Large Language Model Quantization
[MLSYS2023] Exploiting Hardware Utilization and Adaptive Dataflow for Efficient
[ASPLOS2024] FEASTA: A Flexible and Efficient Accelerator for Sparse Tensor Alge
[ICML2024] Language Agents with Reinforcement Learning for Strategic Play in the
"Sparse Compute-in-Memory Computers and Architecture", Yue Jinshan
[WACV2024] TCP: Triplet Contrastive-relationship Preserving for Class-Incrementa
[AAAI2023] Memory-Oriented Structural Pruning for Efficient Image Restoration
[MLCOS workshop] dgsparse Towards Efficient Graph Neural Networks on GPUs
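2024 BAAI Conference, Yuan Jinhui: What Important Problems Remain in the AI Systems Field?
An Alibaba expert spent 6 months recording a complete video course on large AI models: easy to follow, job-ready once you finish, aim for a million-yuan annual salary!
17.5k on GitHub, and both the Chinese and English versions are available before publication! Build a large language model with just a laptop!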
[AAAI2024] Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with
[FPT2023] UAV Swarm Planning Accelerator on FPGA with low latency and fixed-poin
[CVPR2024] FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diff
(AI4Med Series) Towards Building Multilingual Language Model for Medicine
Beginning Llamafile for Local Large Language Models (LLMs)
[WSDM2024] DeSCo: Towards Generalizable and Scalable Deep Subgraph Counting
[DAC2023] Processing-In-Hierarchical-Memory Architecture for Billion-Scale Appro
[AAMAS2023] Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-T
[DATE2024] DyPIM: Dynamic-inference-enabled Processing-In-Memory Accelerator
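How to Make 20 Million US Dollars with AI and the Automation Tool playwright
Transformer Combined with U-Net Lands in a Nature Sub-journal! Latest Research with Gains in Both Accuracy and Efficiency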
To Believe or Not to Believe Your LLM (Google 2024)
[DAC2023] An Efficient Accelerator for Point-based and Voxel-based Point Cloud
Andrew Ng's Large Model Tutorial Series, June 2024: Building Your Own Database Agent (code and course materials included)
[HPCA2023] Realizing Extreme Endurance Through Fault-aware Wear Leveling and Imp
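Aligner: A Residual-Inspired Aligner for Large Language Models (group meeting talk)
Deploying Zhipu GLM-4-9B-Chat on a Single 4090: Try It in 30 Seconds!
[Official Tutorial] GLM-4-9B Hands-on Deployment and Fine-tuning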
[FCCM20] Enabling Efficient and Flexible FPGA Virtualization for Deep Learning
"Intelligent Computing Systems: From Deep Learning to Large Models, 2nd Edition", Chapter 9: Large Model Design Systems
[ICCV2023] Ada3D: Exploiting the Spatial Redundancy with Adaptive Inference for
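The LLM "black book" everyone in the large-model field is studying, rated as high as 9.9 on Douban! The LLM application book that is hugely popular abroad is here (Chinese PDF included)
2024 Andrew Ng LLM Tutorial: Building Agentic RAG with LlamaIndex, with Chinese and English subtitles (course materials included)
The AI community is arguing again! An ACL 2024 paper delivers its verdict: large language models ≠ world simulators, sparking heated debate. Turing Award winner Yann LeCun: I agree!
The best guide to training your own LLM is here! The book "Building a Large Language Model from Scratch" is really that good (PDF included)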