[Paper Quick Look] VLMs are Zero-Shot Reward Models for RL [2310.12921]
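Paper title: Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning
Paper: https://arxiv.org/abs/2310.12921
Project page: https://sites.google.com/view/vlm-rm
* This video is meant to point out that the paper exists and to recommend it to interested viewers; it is not a detailed walkthrough. Owing to the uploader's limitations, the narration often mixes Chinese and English or lapses into broken English; please bear with it. If the paper is misreported, feel free to call it out.
** To recommend new papers or look up previously covered ones, you are welcome to edit this document: https://docs.qq.com/sheet/DSUdOTG9xWUdydVB6
*** Slides are uploaded every 1-2 months to the address given in the pinned post.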
[Paper Quick Look] NeRF-RL: Reinforcement Learning with Neural Radiance Fields [2206.01634]
[Paper Quick Look] GENIMA: Generative Image as Action Models [2407.07875]
[Paper Brief Analysis] Does SSL Really Improve RL from Pixels? [2206.05266]
[Recommended Read] Janus: Decoupling Visual Encoding for Unified Multimodal Und and Gen
[Paper Quick Look] Denoising Diffusion Implicit Models / DDIM [2010.02502]
[Paper Quick Look] Open-vocabulary Object Segmentation with Diffusion Models [2301.05221]
[Paper Quick Look] Ferret-v2: An Improved...for Referring and Grounding with LLMs [2404.07973]
[Paper Quick Look] Structured Denoising Diffusion Models in Discrete State-Spaces [2107.03006]
[Paper Brief Analysis] VoxPoser: Composable 3D Value Maps for Robotic... [2307.05973]
[Paper Quick Look] OpenVLA: An Open-Source Vision-Language-Action Model [2406.09246]
[Paper Quick Look] BLIP-2 ...with Frozen Image Encoders and Large Language Models [2301.12597]
[Paper Brief Analysis] Location-Aware Self-Supervised Transformers for Semantic Seg. [2212.02400]
[Paper Quick Look] Bootstrapping Language-Image Pre-training... [2201.12086]
[Paper Quick Look] Mask-based Latent Reconstruction for Reinforcement Learning [2201.12096]
[Paper Brief Analysis] C-Learning: Learning to .. via Recursive Classification [2011.08909]
[Paper Brief Analysis] SimSiam: Exploring Simple Siamese Representation Learning [2011.10566]
[Paper Quick Look] Synthetic Experience Replay [2303.06614]
[Paper Brief Analysis] RL-CycleGAN: Reinforcement Learning Aware Simulation-To-Real [2006.09001]
[Paper Quick Look] Object Discovery and Representation Networks [2203.08777]
[Paper Brief Analysis] The Dormant Neuron Phenomenon in Deep Reinforcement Learning [2302.12902]
[Paper Quick Look] LaFTer: Label-Free Tuning of Zero-shot Classifier... [2305.18287]
[Paper Brief Analysis] Reinforcement Learning with Augmented Data: RAD [2004.14990]
[Paper Quick Look] Autoregressive Image Generation using Residual Quantization [2203.01941]
[Paper Quick Look] Ferret: Refer and Ground Anything Anywhere at Any Granularity [2310.07704]
[Paper Quick Look] VRL3: A Data-Driven Framework for Visual Deep Reinforcement L. [2202.10324]
[Paper Brief Analysis] PolyFormer: Referring Image Seg. as Sequential Polygon Gen [2302.07387]
[Paper Quick Look] RetNet: A Successor to Transformer for Large Language Models [2307.08621]
[Paper Brief Analysis] MViT: Multiscale Vision Transformers [2104.11227]
[Paper Code Reproduction 122] Path planning with reinforcement learning || How do reinforcement learning and swarm intelligence optimization algorithms differ?
[Paper Brief Analysis] Rethinking Pre-training and Self-training [2006.06882]
[Paper Brief Analysis] FlowNet3D: Learning Scene Flow in 3D Point Clouds [1806.01411]
[Paper Brief Analysis] Large Language Models as General Pattern Machines [2307.04721]
[Paper Quick Look] A Simple LLM Framework for Long-Range Video Question-Answering [2312.17235]
[Paper Quick Look] Flamingo: a Visual Language Model for Few-Shot Learning [2204.14198]
[Paper Quick Look] Rethinking the Truly Unsupervised Image-to-Image Translation [2006.06500]
[Paper Quick Look] Multi-Object ... with Iterative Variational Inference [1903.00450]
[Paper Brief Analysis] Vision Transformers Need Registers [2309.16588]
[Paper Brief Analysis] MobileNet V2: Inverted Residuals and Linear Bottlenecks [1801.04381]
[Paper Quick Look] Learning to Learn with Generative Models of NN Checkpoints [2209.12892]
[Paper Brief Analysis] Directional SSL for Heavy Image Augmentations [2110.13555]