V
主页
[论文速览]Autoregressive Image Generation using Residual Quantization[2203.01941]
发布人
论文题目:Autoregressive Image Generation using Residual Quantization / RQ-VAE / RQ-Transformer 论文地址:http://arxiv.org/abs/2203.01941 VQ-GAN: BV1bb4y1i7j6 所以为啥没有代码呢… * 本视频旨在传递一篇论文的存在推荐感兴趣的您阅读,并不是详细介绍,受up能力限制经常出现中英混杂,散装英语等现象,请见谅。如论文报道出了偏差,欢迎各位怒斥。 ** 新论文推荐,过往论文查找,欢迎编辑这个文档: https://docs.qq.com/sheet/DSUdOTG9xWUdydVB6 *** Slides每1-2月会上传到置顶动态地址
打开封面
下载高清视频
观看高清视频
视频下载器
[论文速览]Taming Transformers for High-Resolution Image Synthesis[2012.09841]
[论文简析]Regularized Vector Quantization for Tokenized Image Synthesis[2303.06424]
[论文速览]EViT: Expediting Vision Transformers via Token Reorganizations[2202.07800]
[论文速览]Masked-attention Mask Tr. for Universal Image Segmentation[2112.01527]
[论文速览]Bootstrapping Language-Image Pre-training...[2201.12086]
[论文简析]PolyFormer: Referring Image Seg. as Sequential Polygon Gen [2302.07387]
[论文简析]Contrastive Learning for Unpaired Image-to-Image Translation[2007.15651]
[论文简析]Visual Autoregressive Modeling: ...via Next-Scale Prediction[2404.02905]
[论文简析]NeRF in the Wild: NeRF for Unconstrained Photo Collections[2008.02268]
[论文速览]Deformable Convolutional Networks; DCN[1703.06211]
[论文简析]Deep Unsupervised Learning using Nonequilibrium Thermodynamics[1503.03585]
为什么神经网络可以学习任何东西?计算机博士精讲CNN_RNN_GAN_GNN_Transformer,全程大白话绝对通俗易懂!
[论文速览]BLIP-2 ...with Frozen Image Encoders and Large Language Models[2301.12597]
[论文速览]Diffusion Policy: Visuomotor Policy Learning via Action Diff.[2303.04137]
【李宏毅】2024秋《生成式人工智能导论》教程!LLM大模型入门到进阶,一套全解决!2024生成式人工智慧-附带课件代码
[论文速览]Rethinking the Truly Unsupervised Image-to-Image Translation[2006.06500]
[论文速览]iFormer: Inception Transformer[2205.12956]
[论文简析]NeRF: Representing Scenes as Neural Radiance Fields...[2003.08934]
[论文速览]Generative Modeling by Estimating Gradients of the Data Dist[1907.05600]
[论文速览]iBOT: Image BERT Pre-Training with Online Tokenizer[2111.07832]
[论文速览]Socratic Models: Zero-Shot Multimodal Reasoning[2204.00598]
深度学习BIBM2024 | 医学图像分割 | SMAFormer
[论文简析]Towards Better Understanding of Self-Supervised Representation[2203.01881]
这也太全了!CNN卷积+RNN循环+GAN+自编码器+LSTM+Transformer+GNN+CapsuleNet等八大深度学习神经网络算法教程一口气学完!
吹爆!这绝对是南京大学最出名的LLAMA3教程了没有之一,llama3原理代码精讲与微调量化部署实战,通俗易懂太适合小白了!人工智能|机器学习|深度学习
[论文速览]Personalizing Text2Img Generation using Textual Inversion[2208.01618]
[论文速览]SODA: Bottleneck Diffusion Models for Representation Learning[2311.17901]
[论文速览]Flamingo: a Visual Language Model for Few-Shot Learning[2204.14198]
[论文速览]Denoising Diffusion Implicit Models / DDIM[2010.02502]
[论文简析]DAT: Vision Transformer with Deformable Attention[2201.00520]
[论文速览]DDPG&TD3[1509.02971][1802.09477]
[论文简析]MixMAE & Mixed AE[2205.13137][2303.17152]
[论文简析]BiFormer: Vision Transformer with Bi-Level Routing Attention[2303.08810]
[论文简析]Location-Aware Self-Supervised Transformers for Semantic Seg.[2212.02400]
[论文速览]CRG: Improving Grounding in VLM w/o training[2403.02325]
[论文简析]Transf. Meta-learners for Implicit Neural Representations[2208.02801]
[论文速览]Ferret: Refer and Ground Anything Anywhere at Any Granularity[2310.07704]
[论文简析]DeiT: Data-efficient Image Transformers[2012.12877]
[论文速览]Aggregating Nested Transformers[2105.12723]
[论文简析]A Generalist Agent / Gato[2205.06175]