[Paper Overview] A Self-Improving Generalist Agent for Robotic Manipulation [2306.11706]
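Paper title: RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation
Paper URL: https://arxiv.org/abs/2306.11706
GATO: BV1EK421k7Sy
VQ-GAN: BV1ym4y1d7iP
* This video is meant to bring the paper to your attention and recommend it to interested viewers; it is not a detailed walkthrough. Because of the uploader's limitations, the narration often mixes Chinese and English and includes broken English; thank you for your understanding. If the coverage of the paper is inaccurate, feel free to call it out.
** To recommend new papers or look up previously covered ones, you are welcome to edit this document: https://docs.qq.com/sheet/DSUdOTG9xWUdydVB6
*** Slides are uploaded every 1-2 months to the address in the pinned post.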