V
主页
京东 11.11 红包
[论文简析]Swin Transformer: Hierarchical ViT using Shifted Windows[2103.14030]
发布人
论文题目:Swin Transformer: Hierarchical Vision Transformer using Shifted Windows 论文地址:http://arxiv.org/abs/2103.14030 论文代码:https://github.com/microsoft/Swin-Transformer * 本视频旨在隔离期间维持up思维清晰能说人话,受能力限制经常出现中英混杂,散装英语等现象,请见谅。涉及论文理解报道出了偏差,欢迎各位怒斥。
打开封面
下载高清视频
观看高清视频
视频下载器
[论文简析]GroupViT: Semantic Segmentation Emerges from Text Supervision[2202.11094]
[论文简析]Location-Aware Self-Supervised Transformers for Semantic Seg.[2212.02400]
[论文简析]MViT: Multiscale Vision Transformers[2104.11227]
[论文简析]Tokens-to-Token ViT: Training ViT from Scratch on ImageNet[2101.11986]
Ultralytics 代码框架解读 - 参数相关(上)| 默认参数有哪些?放在哪里 ?
[论文简析]Per-Pixel Classification is Not All You Need for Semantic Seg[2107.06278]
[论文简析]Crossway Diffusion: Improving Diffusion-based ... via SSL[2307.01849]
Ultralytics 环境配置 + Featurize 蒜粒方块(GPU租赁平台)|【粉丝专属福利】注册即送1小时免费试用券 + 云盘免费配额100GB
[论文简析]SPR: Data-Efficient RL with Self-Predictive Representations[2007.05929]
[论文简析]EfficientNet V1/V2[1905.11946/2104.00298]
【CVPR 2024】全新轻量级CNN架构,RepViT模型解读!
[论文简析]Unsupervised Image-to-Image Translation Networks[1703.00848]
[论文简析]Finding an Unsupervised Image Segmenter in .. Generative Model[2105.08127]
[论文简析]Transformers are Sample Efficient World Models[2209.00588]
[论文简析]Regularized Vector Quantization for Tokenized Image Synthesis[2303.06424]
[论文简析]SAC: Soft Actor-Critic Part 2[1812.05905]
[论文简析]End-to-End Video-Language Transformers..Masked Visual-token..[2111.12681]
[论文简析]Learning Invariant Representations for RL without Reconstructi[2006.10742]
[论文简析]TransRank: SS Video...Ranking-based Transformation Recognition[2205.02028]
[论文简析]VATT: Video-Audio-Text Transformer[2104.11178]
[论文速览]Denoising Diffusion Implicit Models / DDIM[2010.02502]
搞定一切图像分割问题!Mask2former论文解读+源码复现:语义分割、实例分割、全景分割一套搞定!(深度学习/计算机视觉)
[论文速览]Scalable Video Object Segmentation with Simplified Framework[2308.09903]
太强了!【GNN+Transformer】2024年最容易研究论文方向的内容!论文精讲+代码复现!小白都能轻松看懂!建议收藏!(图神经网络、机器学习、AI)
[论文简析]MixMAE & Mixed AE[2205.13137][2303.17152]
[论文速览]Efficient Visual Pretraining with Contrastive Detection[2103.10957]
[论文简析]Big Self-Supervised Models are Strong Semi-Supervised Learners[2006.10029]
[论文简析]NeRV: Neural Representations for Videos[2110.13903]
[论文速览]Ferret-v2: An Improved...for Referring and Grounding with LLMs[2404.07973]
[论文简析]Exploring Self-attention for Image Recognition[2004.13621]
[论文速览]Personalizing Text2Img Generation using Textual Inversion[2208.01618]
[论文简析]How Do Vision Transformers Work?[2202.06709]
[论文简析]C-Learning: Learning to .. via Recursive Classification[2011.08909]
[论文简析]MnasNet: Platform-Aware Neural Architecture Search for Mobile[1807.11626]
[论文速览]Generative Modeling by Estimating Gradients of the Data Dist[1907.05600]
[论文速览]Deformable Convolutional Networks; DCN[1703.06211]
[论文简析]Deep Unsupervised Learning using Nonequilibrium Thermodynamics[1503.03585]
[论文简析]TAN: Temporal Alignment Networks for Long-term Video[2204.02968]
2024终于有人把Transformer架构应用及其算法讲清楚了!迪哥精讲BERT、Swin、DETR、VIT四大核心模型,原理讲解+论文解读+代码复现!
[论文简析]PPG: Phasic Policy Gradient[2009.04416]