[Paper Quick Look] Autoregressive Image Generation using Residual Quantization [2203.01941]
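Paper title: Autoregressive Image Generation using Residual Quantization / RQ-VAE / RQ-Transformer
Paper URL: http://arxiv.org/abs/2203.01941
VQ-GAN: BV1bb4y1i7j6
So why is there no code…
* This video is only meant to make you aware that the paper exists and to recommend it to interested readers; it is not a detailed walkthrough. Owing to the uploader's limitations, mixed Chinese and English and broken English come up often; please bear with it. If the paper is misreported here, feel free to call it out.
** To recommend new papers or look up past ones, you are welcome to edit this document: https://docs.qq.com/sheet/DSUdOTG9xWUdydVB6
*** Slides are uploaded every 1-2 months to the address in the pinned post.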