V
主页
[Short Review] Deduplicating Training Data Makes Language Models Better
发布人
打开封面
下载高清视频
观看高清视频
视频下载器
[Long Review] Deduplicating Training Data Makes Language Models Better
[Short Review]Conformer Convolution-augmented Transformer for Speech Recognition
十分钟看懂谷歌W2v-BERT: Combining Contrastive Learning and Masked Language Modeling
十分钟看懂脸书虎爪绝户手 - 虎BERT - HuBERT: Self-Supervised Speech Representation Learning
语音文本技术论文阅读 Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recogni
[Long Review] Transfer Learning from Speaker Verification to Multispeaker TTS
语音NLP论文阅读 Token-level Sequence Labeling for SLU using Compositional E2E Models
[Long Review] Xception: Deep Learning with Depthwise Separable Convolution
CV论文阅读OpenAI CLIP(2/3):Learning Transferable Visual Models From Natural Language
[Long Review] Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using
[Short Review] Cascaded Diffusion Models for High Fidelity Image Generation
[Short Review] Xception: Deep Learning with Depthwise Separable Convolution
[Short Review] Towards Zero-Label Language Learning
语音文本技术论文阅读 RNN-T: Sequence Transduction with Recurrent Neural Networks
[Long Review] Axial Attention in Multidimensional Transformers
语音文本技术论文阅读 Scaling Laws for Neural Language Models
语音文本技术论文阅读 SNRi Target Training for Joint Speech Enhancement and Recognition
[Short Review] Axial Attention in Multidimensional Transformers
语音文本技术论文阅读 Improving Speech Recognition Accuracy of Local POI Using Geographical
[Long Review] Cascaded Diffusion Models for High Fidelity Image Generation
语音文本技术论文阅读 RefineGAN - Universally Generating Waveform Better than Ground ...
十分钟看懂微软大力金刚掌WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack
[Long Review] Fully Sharded Data Parallel: faster AI training with fewer GPUs
详解OpenAI GPT-3: Language Models are Few-Shot Learners(2/3)
十分钟看懂谷歌易筋经BERT
十分钟看懂谷歌铁布衫BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised ...
语音文本技术论文阅读 XLS-R: Self-supervised Cross-lingual Speech Representation Learning a
十分钟看懂谷歌金钟罩Transformer以及语音的LAS模型
三分钟搞定微软零样本语音合成VALL-E
[Olewave's Long Review] Efficient Training of Neural Transducer for Speech Recog
福奇博士小声嘟囔议员蠢货,结果忘记关麦克风 -- analysis from a research perspective
详解OpenAI GPT-3: Language Models are Few-Shot Learners(1/3)
[Long Review] CLAS: Deep context: end-to-end contextual speech recognition
语音文本技术论文阅读 Joint Unsupervised and Supervised Training for Multilingual ASR
[Long Review] Towards Zero-Label Language Learning
三分钟搞定ChatGPT
[Long Review] Conformer: Convolution-augmented Transformer for Speech Recogniti
语音文本技术论文阅读 OpenAI最新的Whisper ASR也会像GPT-3一样火起来吗?
[Long Review] GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
详解微软零样本语音合成VALL-E