V
主页
LLaMA2 explained KV-Cache, Rotary Positional Embedding, RMS Norm
发布人
00:00 Main talk starts - intro & motivation 10:20 Behind the scenes: how Tri got started with Flash Attention 16:40 Motivation: modelling long sequences 19:50 Brief recap of attention 23:15 Memory bottleneck, IO awareness 35:10 Flash Attention 2.0 improvements 37:20 Behind the scenes of Flash Attention 2.0 (refactor of CUTLASS 3) 43:45 Future directions 46:45 Q&A
打开封面
下载高清视频
观看高清视频
视频下载器
The Mastermind Behind GPT-4 and the Future of AI Ilya Sutskever
OpenAI's Ilya Sutskever The man who made AI work
Flash Attention 2.0 with Tri Dao (author)! _ Discord server talks
EMNLP 2022 Tutorial Modular and Parameter-Efficient Fine-Tuning for NLP Model
A Theory for Emergence of Complex Skills in Language Models
A.I. Could Solve Some of Humanity’s Hardest Problems. It Already Has.
Modular and Composable Transfer Learning with Jonas Pfeiffer
谷歌Gemini全能大模型震撼发布,全部多模态能力演示,Gemini击败GPT-4,官方全部演示视频收录,Google高管出镜,谷歌背水一战击败GPT
AlphaGeometry 成功解决 IMO 几何问题!Google DeepMind 击败 ChatGPT,难以置信的智能水平,大模型数学大突破
ACL 2023 Tutorial Retrieval-based Language Models and Applications
Interview with Dr. Ilya Sutskever, co-founder of OPEN AI - at the Open Universit
Retrieval Meets Large Language Models Unlocking New Capabilities
Neural Scaling Laws and GPT-3 - Jared Kaplan
Policy Gradient Theorem Explained - Reinforcement Learning
Neural Scaling Laws and GPT-3
DeepMind揭秘ChatGPT为什么能掌握复杂技能,普林斯顿教授揭秘大模型为什么能获得复杂技能,涌现能力,OpenAI ChatGPT深层原理揭秘
[1hr Talk] Intro to Large Language Models
Learning linear models in-context with transformers
DPO算法详解 : Direct Preference Optimization 算法详解 (RLHF的替代算法)
The Paperclip Maximizer
What, if anything,do AIs understand? Talk with ChatGPT Co-Founder Ilya Sutskever
OpenAI Q* 比大家想的还要恐怖?大的真的要来了?奥特曼的秘密项目Q*,OpenAI实现AGI,ChatGPT实现通用人工智能,GPT走向通用人工智能
图灵奖得主 Yoshua Bengio 谈 Deep Learning I
GitHub Universe 2023 opening keynote
AI Safety, RLHF, and Self-Supervision
Mistral AI's Open Source Initiative CEO of Mistral AI
强推!不愧是公认的讲的最好的【AI大模型全套教程】李宏毅大佬12小时带你从入门到进阶,一套全解决!
一种对于泛化的观察 :压缩即智能
Microsoft AI Ignite : 追击OpenAI, 微软深夜发布100+更新, 人人定制Copilot, 自研两款AI芯片
Let's build the GPT Tokenizer
2024吴恩达《基于LangChain的大模型应用开发+构建和评估高级RAG模型应用》带你实现大模型落地!
【2024最新】从入门到提示词工程师:全网最通俗易懂Prompt-Learning提示词学习教程!学不会我退出IT圈!
首发:解读OpenAI Q* GPT走向通用人工智能AGI,毁灭人类的Q*算法?OpenAI秘密Q*曝光,OpenAI实现AGI,ChatGPT实现通用人工智能
墙裂推荐!2024最新的大模型学习路线整理出来了,17分钟全程介绍AI大模型干货内容,各个知识点逐一讲解,配套视频资料,轻松搞定AI大模型!
Meta's Chief AI Scientist Yann LeCun talks about the future of AI
Google GEMINI 1.5 Capabilities SHOCKED everyone!
奥特曼的GPT5最新访谈,ChatGPT能力大揭秘,Sam Altman的秘密项目,OpenAI GPT5大揭秘
Reality is a Paradox - Mathematics, Physics, Truth & Love
【200集全】机器学习十大算法,线性回归、逻辑回归、KNN、决策树、K-means、SVM、集成学习、EM算法、GMM高斯混合模型一口气学完,原理+代码讲解
研一研二必看【Matlab论文复现】,原文解读+代码复现+手推公式,看完这个教程简直醍醐灌顶!!!(人工智能/神经网络/优化算法)