V
主页
[论文简析]Visual Autoregressive Modeling: ...via Next-Scale Prediction[2404.02905]
发布人
论文题目:Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction 论文地址:https://arxiv.org/abs/2404.02905 代码:https://github.com/FoundationVision/VAR VQ-VAE:BV1bb4y1i7j6 VQGAN:BV1ym4y1d7iP MaskGIT:BV1qS4y1r7p1 RQ-Transformer:BV1gY411E7ge * 视频受up能力限制经常出现中英混杂,散装英语等现象,请见谅。如论文理解报道出了偏差,欢迎各位怒斥。 ** 新论文推荐,过往论文查找,欢迎编辑这个文档: https://docs.qq.com/sheet/DSUdOTG9xWUdydVB6 *** Slides每1-2月会上传到置顶动态地址
打开封面
下载高清视频
观看高清视频
视频下载器
[论文速览]LLaVA: Visual Instruction Tuning[2304.08485]
[论文速览]Ferret-v2: An Improved...for Referring and Grounding with LLMs[2404.07973]
[论文简析]SimSiam: Exploring Simple Siamese Representation Learning[2011.10566]
[论文简析]Regularized Vector Quantization for Tokenized Image Synthesis[2303.06424]
[论文速览]Theia: Distilling Diverse Vision Foundation Models for Robot..[2407.20179]
[论文速览]Denoising Diffusion Probabilistic Models / DDPM[2006.11239]
斯坦福李飞飞终于把【计算机视觉】给讲明白了!全程16集,草履虫都能听明白!深度学习cs231n公开课【中英字幕】
强推!草履虫都能一口气学完CNN、RNN、GAN、GNN、DQN、Transformer、LSTM、DBN等八大深度学习神经网络算法!真的比刷剧还爽!
[论文简析]DAT: Vision Transformer with Deformable Attention[2201.00520]
[论文简析]Deconstructing Denoising Diffusion Models for SSL[2401.14404]
超全超简单!一口气刷完CNN、RNN、GAN、GNN、DQN、Transformer、LSTM、DBN等八大深度学习神经网络算法!真的比刷剧还爽!
[论文简析]MViT: Multiscale Vision Transformers[2104.11227]
[论文简析]DeiT: Data-efficient Image Transformers[2012.12877]
[论文简析]NeRF in the Wild: NeRF for Unconstrained Photo Collections[2008.02268]
吹爆!用Transformer结合目标检测做跨领域研究!真的超级容易出论文!(SCI丨论文写作丨科研丨研究生丨创新点丨idea)
问卷数据实操!问卷星下载数据如何处理才能导入spss分析? #问卷 #问卷调查 #spss #实证分析 #论文写作
[论文简析]NeRF: Representing Scenes as Neural Radiance Fields...[2003.08934]
B站强推!2024公认最通俗易懂的时间序列预测教程,从入门到精通!草履虫都能听懂!(LSTM/Informer/ARIMA/PandasTransformer)
[论文速览]DDPG&TD3[1509.02971][1802.09477]
图像分割、目标检测、特征提取、边缘检测、图像滤波、人脸识别...终于有人把OpenCV那些必备的知识点讲透彻了!从入门到图像处理实战!
[论文简析]SAC: Soft Actor-Critic Part 2[1812.05905]
[论文简析]DINO Emerging Properties in SelfSupervised Vision Transformers[2104.14294]
[论文简析]Large Language Models as General Pattern Machines[2307.04721]
[论文简析]Transformers are Sample Efficient World Models[2209.00588]
[论文简析]SlowFast Networks for Video Recognition[1812.03982]
[论文简析]Representation Learning with Contrastive Predictive Coding[1807.03748]
[论文简析]Deep Unsupervised Learning using Nonequilibrium Thermodynamics[1503.03585]
【LLM前沿】6小时精讲四大多模态大模型CLIP BLIP VIT MLLM及对话机器人办公助手!绝对的通俗易懂的大模型应用教程!
吹爆!不愧是中科院大佬,7天就把Transformer、RNN、BERT和迁移学习讲透了!整整100集付费,全程干货讲解,这还学不会up直接退出IT圈!
[论文简析]BiFormer: Vision Transformer with Bi-Level Routing Attention[2303.08810]
[论文速览]RetNet: A Successor to Transformer for Large Language Models[2307.08621]
西交|深度学习研讨班-4|从Attention到Transformer再到Mamba
[论文简析]DETR: End-to-End Object Detection with Transfromers[2005.12872]
[论文速览]Generative Modeling by Estimating Gradients of the Data Dist[1907.05600]
B站最全的【Transformer教程】中科院58集付费课程,最适合新手入门Transformer模型实战系列,绝对通俗易懂,允许白嫖!
[论文速览]GENIMA: Generative Image as Action Models[2407.07875]
[论文简析]How Do Vision Transformers Work?[2202.06709]
[论文简析]Toolformer: Language Models Can Teach Themselves to Use Tools[2302.04761]
强推!从入门到精通CNN、RNN、GAN、GNN、DQN、Transformer、LSTM等八大深度学习神经网络一口气全部学完!比刷剧还爽!
[论文简析]Unsupervised Image-to-Image Translation Networks[1703.00848]