[Paper Quick Look] OpenVLA: An Open-Source Vision-Language-Action Model [2406.09246]
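Paper title: OpenVLA: An Open-Source Vision-Language-Action Model
Paper: https://arxiv.org/abs/2406.09246
Code: https://openvla.github.io/
* This video is meant to let you know the paper exists and to recommend it to interested readers; it is not a detailed walkthrough. Owing to the uploader's limitations, mixed Chinese and English and broken English come up often, so please bear with it. If the coverage of the paper gets something wrong, feel free to call it out.
** To recommend new papers or look up previously covered ones, you are welcome to edit this document: https://docs.qq.com/sheet/DSUdOTG9xWUdydVB6
*** Slides are uploaded every 1-2 months to the address in the pinned post.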