V
主页
京东 11.11 红包
[论文速览]A Simple LLM Framework for Long-Range Video Question-Answering[2312.17235]
发布人
论文题目:A Simple LLM Framework for Long-Range Video Question-Answering / LLoVi 论文地址:http://arxiv.org/abs/2312.17235 代码:https://github.com/CeeZh/LLoVi Lilian's blog: LLM Powered Autonomous Agents https://lilianweng.github.io/posts/2023-06-23-agent/ What's this? https://github.com/L1Xu4n/Awesome-ChatGPT-prompts-ZH_CN * 本视频旨在传递一篇论文的存在推荐感兴趣的您阅读,并不是详细介绍,受up能力限制经常出现中英混杂,散装英语等现象,请见谅。如论文报道出了偏差,欢迎各位怒斥。 ** 新论文推荐,过往论文查找,欢迎编辑这个文档: https://docs.qq.com/sheet/DSUdOTG9xWUdydVB6 *** Slides每1-2月会上传到置顶动态地址
打开封面
下载高清视频
观看高清视频
视频下载器
神经网络杀疯了,登上nature:35年首次被证明具有泛化能力,能像人类一样思考!人工智能/机器学习/神经网络/深度学习/计算机视觉
[论文速览]LLaVA: Visual Instruction Tuning[2304.08485]
[论文速览]Denoising Diffusion Probabilistic Models / DDPM[2006.11239]
70年老算法Dijkstra被证明普遍最优 斩获顶会FOCS 2024最佳论文
[论文速览]LongLoRA: Efficient Fine-tuning of Long-Context LLMs[2309.12307]
研一,研二必看!花12800买来的【MATLAB论文复现】教程,手把手教你MATLAB论文如何从代码到公式完整复现!-人工智能/MATLAB教程/深度学习
[论文简析]VAE: Auto-encoding Variational Bayes[1312.6114]
[论文速览]OpenVLA: An Open-Source Vision-Language-Action Model[2406.09246]
[论文速览]Ferret-v2: An Improved...for Referring and Grounding with LLMs[2404.07973]
小波变换+注意力机制,数据处理领域的“王炸”组合,创新性拉满!
[论文夕拾]Diffusion Models for Robotics
[论文速览]BLIP-2 ...with Frozen Image Encoders and Large Language Models[2301.12597]
[论文速览]Flamingo: a Visual Language Model for Few-Shot Learning[2204.14198]
深度学习环境配置一套搞定:anaconda+pytorch+pycharm+cuda全详解,带你从0配置环境到跑通代码!
[论文速览]Deformable Convolutional Networks; DCN[1703.06211]
[论文简析]SimCLR: A simple framework for contrastive learning[2002.05709]
[论文简析]DINO Emerging Properties in SelfSupervised Vision Transformers[2104.14294]
[论文速览]MixUp: Beyond Empirical Risk Minimization[1710.09412]
[论文速览]NeRF-RL: Reinforcement Learning with Neural Radiance Fields[2206.01634]
[论文速览]Finite Scalar Quantization: VQ-VAE Made Simple[2309.15505]
[论文速览]Diffusion Policy: Visuomotor Policy Learning via Action Diff.[2303.04137]
[论文速览]Align before Fuse / ALBEF: ...[2107.07651]
[论文速览]Rethinking the Truly Unsupervised Image-to-Image Translation[2006.06500]
[论文速览]Scalable Video Object Segmentation with Simplified Framework[2308.09903]
[论文简析]NeRF: Representing Scenes as Neural Radiance Fields...[2003.08934]
[论文速览]CRG: Improving Grounding in VLM w/o training[2403.02325]
[论文简析]DETR: End-to-End Object Detection with Transfromers[2005.12872]
[论文速览]VLMs are Zero-Shot Reward Models for RL[2310.12921]
当师妹说怀了你的孩子时……
[论文速览]Synthetic Experience Replay[2303.06614]
[论文简析]Point Transformer V2[2210.05666]
[论文速览]Masked-attention Mask Tr. for Universal Image Segmentation[2112.01527]
关于我只用两个月就结合AI发了SCI1区论文——经验分享和本人案例
[论文简析]Toolformer: Language Models Can Teach Themselves to Use Tools[2302.04761]
[论文速览]Aggregating Nested Transformers[2105.12723]
[论文速览]Theia: Distilling Diverse Vision Foundation Models for Robot..[2407.20179]
研究生话题:入门机器学习深度学习需要多久时间?这次终于能搞明白了!
[论文简析]DeiT: Data-efficient Image Transformers[2012.12877]
[论文简析]DAT: Vision Transformer with Deformable Attention[2201.00520]
[论文速览]LaFTer: Label-Free Tuning of Zero-shot Classifier...[2305.18287]