Review of End-to-End Speech Translation in 2023 (Recent Advances in Direct Speech-to-text Translation)

The content of this review comes from the literature:

[1] Xu C, Ye R, Dong Q, et al. Recent Advances in Direct Speech-to-text Translation[J]. arXiv preprint arXiv:2306.11646, 2023.

Table of contents

1 Introduction

2 Tackling Modeling Burden

2.1 Transformer and Variants

Speech-Transformer

Conformer

SSL-Transformer

2.2 Multitask Frameworks

Decoupled Decoder

Decoupled Encoder

Two-stream Encoder

2.3 Non-autoregressive Modeling

3 Tackling Data Scarcity

3.1 Data Augmentation

Expanding ST data

Speech augmentation

3.2 Pre-training

3.3 Knowledge Distillation

3.4 Multilingual Training

4 Tackling Application Issues

Real-time

Segmentation

Named entity

Code-switching

Gender bias

5 Future

LLM (Large Language Model)

Multimodality


1 Introduction

Glossary:

  • Error accumulation: in a multi-step pipeline, errors made in an earlier step propagate into later steps, so the quality of the final result gradually degrades. This typically happens in pipelines that combine a speech-to-text (Automatic Speech Recognition, ASR) system with a text-to-text (machine translation or text processing) system: the audio signal is first transcribed into text, and the text is then translated into the target language or otherwise processed. Any errors made during transcription are passed on to the later steps and degrade the quality of the final translation or transcription.

  • Autoregressive: in an E2E ST (End-to-End Speech Translation) model, "autoregressive" means the model generates the translated text one word or subword at a time, with each step conditioned on what was generated at previous time steps. This is a step-by-step, serial generation process. Typical autoregressive models include recurrent neural networks (RNN), long short-term memory networks (LSTM), and the Transformer.

  1. Early speech-to-text translation (ST) solutions split the task into multiple subtasks handled by a cascade system.

    • For example, the speech is first transcribed into text through the ASR (Automatic Speech Recognition) system, and then the text is translated into another language using the MT (Machine Translation) system.

    • For such cascade systems, research mainly focuses on mitigating error accumulation.

  2. End-to-end speech translation (E2E ST) has the following benefits:

    • Reduced error accumulation

    • Lower latency

    • Richer contextual modeling

    • Applicability to unwritten languages

  3. Basic modeling:

    • An ST corpus usually contains triples of speech s, source-language transcription x, and translation y

    • The basic E2E ST model framework is based on the Encoder-Decoder architecture

    • However, training an E2E ST model is not easy; its performance only approaches that of cascade systems and is not yet the best-performing approach. (A minimal form of the standard training objective is sketched below.)
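For reference, a minimal sketch of the standard autoregressive training objective on a triple (s, x, y), written in assumed notation (the transcription x is used only by auxiliary tasks such as those in Section 2.2):

```latex
% Minimal sketch (assumed notation): s = speech input, y = target translation of length T,
% \theta = parameters of the encoder-decoder model.
\mathcal{L}_{\mathrm{ST}}(\theta) = -\sum_{t=1}^{T} \log p_{\theta}\left(y_t \mid y_{<t},\, s\right)
```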

  4. At present, the research directions of the E2E ST model are mainly:

    • Modeling Burden:

      • The model must handle the cross-modal (speech to text) and cross-lingual (source language to target language) mappings at the same time, which makes modeling very complicated.

      • This makes convergence difficult and hurts performance.

    • Data scarcity:

      • ASR and MT corpora are plentiful, and some of them are very large.

      • However, ST corpora are much harder to annotate, so ST data is scarce.

    • Application issues:

      • Issues in practical applications need to be considered, such as real-time translation, long-form audio segmentation, etc.

  5. Based on the above problems and the corresponding solutions, the survey organizes the literature into a classification diagram.

The following sections discuss these three aspects:

  • Section 2 describes how to mitigate modeling burden challenges in the existing literature. Modeling methods can be divided into three categories: Transformers and their variants, multi-task frameworks, and non-autoregressive modeling.

  • Section 3 summarizes approaches to address the data scarcity problem, including data augmentation, pre-training, knowledge distillation, and multilingual training.

  • Section 4 briefly introduces practical application issues.

  • Section 5 predicts some promising directions for future ST research.

2 Tackling Modeling Burden

In response to the problems raised above, this section covers three aspects:

  • Section 2.1: for long-sequence inputs such as speech signals, high-capacity end-to-end models are used, usually the Transformer and its variant architectures.

  • Section 2.2: to ease the modeling burden, a multi-task learning framework is usually used to modify the original Transformer-based model.

  • Section 2.3: to improve decoding efficiency, non-autoregressive models are used to speed up decoding.

2.1 Transformer and Variants

ST is usually modeled with a Seq2Seq Encoder-Decoder architecture, and the Transformer stands out among this class of models. Several Transformer variants are described below.

Speech-Transformer
  • Based on text-to-text Transformer

  • The main change is that the acoustic features are first compressed by convolutional layers (usually two layers with stride 2, reducing the sequence length by a factor of 4), followed by a normalization layer, before entering the self-attention encoder (a minimal sketch follows).
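A minimal sketch of this convolutional subsampling front-end (illustrative layer sizes, not the exact configuration of any specific paper):

```python
import torch
import torch.nn as nn

class ConvSubsampler(nn.Module):
    """Two stride-2 convolutions: (B, T, F) filter-bank features -> (B, T/4, d_model)."""
    def __init__(self, feat_dim: int = 80, d_model: int = 256):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(1, d_model, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
            nn.Conv2d(d_model, d_model, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
        )
        self.proj = nn.Linear(d_model * ((feat_dim + 3) // 4), d_model)
        self.norm = nn.LayerNorm(d_model)

    def forward(self, feats: torch.Tensor) -> torch.Tensor:
        x = self.conv(feats.unsqueeze(1))           # (B, C, T/4, F/4)
        b, c, t, f = x.shape
        x = x.transpose(1, 2).reshape(b, t, c * f)  # (B, T/4, C*F/4)
        return self.norm(self.proj(x))              # (B, T/4, d_model)

# e.g. 3 s of 10 ms frames: 300 frames are compressed to 75 frames
print(ConvSubsampler()(torch.randn(2, 300, 80)).shape)  # torch.Size([2, 75, 256])
```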

Conformer
  • The main change is that, in each encoder block, a convolution module is inserted between the multi-head self-attention module and the feed-forward layer.

  • Each block sandwiches the attention and convolution components between two Macaron-net-style feed-forward layers, with residual connections around the modules (a simplified sketch of the ordering follows).
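A minimal sketch of the module ordering inside one Conformer-style block (simplified: relative positional attention, gating, and dropout are omitted):

```python
import torch
import torch.nn as nn

class ConformerBlockSketch(nn.Module):
    """Macaron-style ordering: 1/2 FFN -> self-attention -> convolution module -> 1/2 FFN,
    each wrapped in a residual connection."""
    def __init__(self, d: int = 256, heads: int = 4, kernel: int = 15):
        super().__init__()
        self.ffn1 = nn.Sequential(nn.LayerNorm(d), nn.Linear(d, 4 * d), nn.SiLU(), nn.Linear(4 * d, d))
        self.norm_attn = nn.LayerNorm(d)
        self.attn = nn.MultiheadAttention(d, heads, batch_first=True)
        self.norm_conv = nn.LayerNorm(d)
        self.depthwise = nn.Conv1d(d, d, kernel, padding=kernel // 2, groups=d)
        self.ffn2 = nn.Sequential(nn.LayerNorm(d), nn.Linear(d, 4 * d), nn.SiLU(), nn.Linear(4 * d, d))
        self.norm_out = nn.LayerNorm(d)

    def forward(self, x: torch.Tensor) -> torch.Tensor:        # x: (B, T, d)
        x = x + 0.5 * self.ffn1(x)                              # first half-step feed-forward
        a = self.norm_attn(x)
        x = x + self.attn(a, a, a, need_weights=False)[0]       # multi-head self-attention
        c = self.norm_conv(x).transpose(1, 2)                   # (B, d, T) for Conv1d
        x = x + self.depthwise(c).transpose(1, 2)               # depthwise convolution module
        x = x + 0.5 * self.ffn2(x)                              # second half-step feed-forward
        return self.norm_out(x)

print(ConformerBlockSketch()(torch.randn(2, 75, 256)).shape)    # torch.Size([2, 75, 256])
```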

SSL-Transformer
  • This is a speech representation model combined with self-supervised learning (SSL)

  • SSL has been successfully applied to the task of extracting speech features

  • The SSL-Transformer feeds the raw audio waveform into the self-supervised model, which processes it through multiple convolutional and encoder layers to extract speech features.

  • In the SSL-Transformer setup, the self-supervised model can be integrated into the ST model in two ways: as a standalone encoder, or as a speech feature extractor whose outputs are then fed into a full Transformer model (a minimal sketch of the second option follows).
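A minimal sketch of the feature-extractor variant, assuming the Hugging Face `transformers` package is available and using `facebook/wav2vec2-base-960h` purely as an example checkpoint (downloaded on first use):

```python
import torch
import torch.nn as nn
from transformers import Wav2Vec2Model  # assumes the Hugging Face `transformers` package

class SSLTransformerST(nn.Module):
    """A pre-trained SSL model used as a (frozen) speech feature extractor in front of a
    standard Transformer encoder-decoder for translation (sketch)."""
    def __init__(self, vocab_size: int = 8000, d_model: int = 768):
        super().__init__()
        self.ssl = Wav2Vec2Model.from_pretrained("facebook/wav2vec2-base-960h")
        self.ssl.requires_grad_(False)              # one option: keep the SSL features fixed
        self.transformer = nn.Transformer(d_model=d_model, batch_first=True)
        self.embed = nn.Embedding(vocab_size, d_model)
        self.out = nn.Linear(d_model, vocab_size)

    def forward(self, waveform: torch.Tensor, prev_tokens: torch.Tensor) -> torch.Tensor:
        feats = self.ssl(waveform).last_hidden_state        # (B, T', 768) speech representations
        dec = self.transformer(feats, self.embed(prev_tokens))
        return self.out(dec)                                # (B, L, vocab) translation logits

# waveform: (B, num_samples) raw 16 kHz audio; prev_tokens: (B, L) shifted target tokens
```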

2.2 Multitask Frameworks

To ease the modeling burden, the core idea of multi-task learning is to use auxiliary tasks, such as ASR and MT, to assist the target task. Some parameters of the main-task modules and the auxiliary modules can be shared, which is what makes the auxiliary tasks feasible. There are currently three types of multitask frameworks:

Decoupled Decoder

Additional decoders are used to guide the model to learn the text transcription while the model is still trained in an end-to-end manner. There are two main ideas: one is to better support translation with the generated transcription, for example with a two-pass decoder; the other is to generate the transcription and the translation at the same time (dual decoder). A minimal sketch of the two-pass idea follows the list.

  • Two-pass decoder: the acoustic features are first decoded into a transcription, and the second pass then combines the transcription and the first decoder's states to produce the translation. However, because the two passes run sequentially, the inherent low-latency advantage is lost, so some works decode the first pass non-autoregressively.

  • Dual decoder: interactive decoding uses two decoders to generate the transcript and the translation simultaneously, with an additional cross-attention module exchanging information between the two decoders. A wait-k policy lets the transcription tokens be predicted slightly ahead, providing more useful information for decoding the translation tokens.
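A minimal sketch of the two-pass idea (hypothetical module names and sizes; a dual decoder would instead run both decoders in parallel and exchange information via cross-attention):

```python
import torch
import torch.nn as nn

class TwoPassDecoderSketch(nn.Module):
    """Pass 1 decodes the transcription; pass 2 decodes the translation while attending to
    both the speech encoder states and the pass-1 decoder states (masks omitted for brevity)."""
    def __init__(self, vocab: int = 8000, d: int = 256):
        super().__init__()
        self.embed = nn.Embedding(vocab, d)
        self.asr_decoder = nn.TransformerDecoder(nn.TransformerDecoderLayer(d, 4, batch_first=True), 2)
        self.st_decoder = nn.TransformerDecoder(nn.TransformerDecoderLayer(d, 4, batch_first=True), 2)
        self.asr_out = nn.Linear(d, vocab)
        self.st_out = nn.Linear(d, vocab)

    def forward(self, enc: torch.Tensor, prev_src: torch.Tensor, prev_tgt: torch.Tensor):
        h_asr = self.asr_decoder(self.embed(prev_src), enc)   # pass 1 over encoder output (B, T, d)
        memory = torch.cat([enc, h_asr], dim=1)               # pass 2 sees encoder + pass-1 states
        h_st = self.st_decoder(self.embed(prev_tgt), memory)
        return self.asr_out(h_asr), self.st_out(h_st)

enc = torch.randn(2, 75, 256)
asr_logits, st_logits = TwoPassDecoderSketch()(enc, torch.zeros(2, 10, dtype=torch.long),
                                               torch.zeros(2, 12, dtype=torch.long))
```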

Decoupled Encoder

For decoupled decoders, design and latency issues can arise from the multiple inference passes. An alternative is to both recognize and understand the semantics of the raw speech input with a decoupled encoder: a low-level speech (acoustic) encoder first encodes the acoustic information from the speech input, and a semantic encoder then learns the semantic representation needed by the translation decoder. A minimal sketch follows the list.

  • Each stage of encoding can be supervised with transcribed information

  • Transcription also provides speech alignment, which can alleviate the encoding burden
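A minimal sketch of the decoupled encoder with CTC-style transcription supervision on the acoustic stage (illustrative layer counts and sizes):

```python
import torch
import torch.nn as nn

class DecoupledEncoderSketch(nn.Module):
    """An acoustic encoder (supervised with a CTC loss on the transcription) followed by a
    semantic encoder whose output is consumed by the translation decoder."""
    def __init__(self, d: int = 256, src_vocab: int = 8000):
        super().__init__()
        make_layer = lambda: nn.TransformerEncoderLayer(d, 4, batch_first=True)
        self.acoustic = nn.TransformerEncoder(make_layer(), num_layers=4)
        self.semantic = nn.TransformerEncoder(make_layer(), num_layers=4)
        self.ctc_head = nn.Linear(d, src_vocab)      # predicts transcription labels for the CTC loss

    def forward(self, feats: torch.Tensor):
        h_ac = self.acoustic(feats)                  # low-level acoustic representation
        ctc_logits = self.ctc_head(h_ac)             # supervised with the transcription (CTC)
        h_sem = self.semantic(h_ac)                  # semantic representation for the decoder
        return h_sem, ctc_logits

h_sem, ctc_logits = DecoupledEncoderSketch()(torch.randn(2, 75, 256))
```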

Two-stream Encoder

ASR data can be used to strengthen individual components; can MT data be used as well? During training, the model can take speech and text input at the same time, each with its own encoder, followed by a shared encoder. This structure is usually optimized with multi-task training losses, such as negative log-likelihood (NLL) losses for speech translation (ST) and machine translation (MT); a minimal training-step sketch follows the list below. The advantage is that, by sharing with the MT encoder, a better semantic representation can be learned, improving translation performance.

In the inference process, the speech data is input, passes through the speech encoder, shared encoder, and decoder, and finally generates the translated text.

  • Speech encoder: it must be able to extract acoustic features from the speech input on its own. Pre-trained speech models such as Wav2vec2 can serve as the speech encoder for better ST performance.

  • Text encoder: the text encoder can be a text embedding layer or a few layers of a text Transformer encoder. Phonemes can also be used instead of the original transcription as the text input, which reduces the modality gap between the two inputs.

  • Interaction: there are also many variants of how the speech encoder and the text encoder interact.

    • Some use contrastive learning to reduce the representation gap between speech and text.

    • The Chimera model has been proposed to align the length of speech and text expressions.

    • Other methods consider both the representation and the length differences by adding a cross-attentive regularization module after the shared encoder. The module first applies self-attention or cross-attention to the text or speech encoder outputs to generate two reconstructed sequences of the same length, and then minimizes the L2 distance between them. (I find this approach appealing.)
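A minimal sketch of one two-stream training step, with the encoders and decoder passed in as callables (hypothetical; the interaction losses above would be added on top):

```python
import torch.nn.functional as F

def two_stream_step(speech_enc, text_enc, shared_enc, decoder,
                    speech_feats, src_tokens, prev_tgt, gold_tgt, pad_id: int = 0):
    """One multi-task step: NLL(ST) from the speech stream plus NLL(MT) from the text stream,
    both passing through the shared encoder and the same decoder (simplified sketch)."""
    h_speech = shared_enc(speech_enc(speech_feats))   # speech -> shared semantic space
    h_text = shared_enc(text_enc(src_tokens))         # transcription/phonemes -> same space
    st_logits = decoder(prev_tgt, h_speech)           # (B, L, V)
    mt_logits = decoder(prev_tgt, h_text)             # (B, L, V)
    nll = lambda logits: F.cross_entropy(logits.transpose(1, 2), gold_tgt, ignore_index=pad_id)
    return nll(st_logits) + nll(mt_logits)            # optionally + contrastive/regularization terms
```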

2.3 Non-autoregressive Modeling

Compared with cascade systems of similar capacity, the end-to-end model already reduces computation latency considerably, but with autoregressive decoding the output is still generated token by token, which limits decoding speed. Research on non-autoregressive ST follows two routes:

  • Non-autoregressive speech translation models are developed with reference to methods from automatic speech recognition (ASR) and machine translation (MT) tasks, such as conditional masking language models and rescoring techniques.

  • Explore more efficient architectures that rely purely on CTC (Connectionist Temporal Classification) for prediction to increase speed. CTC is a loss function for sequence labeling tasks that trains a model to map input sequences to output sequences (a minimal usage sketch follows this list).
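For reference, a minimal sketch of training a non-autoregressive output layer with PyTorch's built-in CTC loss (generic setup, not any specific paper's configuration):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Encoder outputs projected to target-vocabulary logits, shaped (T, B, V); blank index 0.
T, B, V = 75, 2, 8000
log_probs = F.log_softmax(torch.randn(T, B, V), dim=-1)
targets = torch.randint(1, V, (B, 20))                   # target translation token ids (no blanks)
input_lengths = torch.full((B,), T, dtype=torch.long)
target_lengths = torch.full((B,), 20, dtype=torch.long)

ctc = nn.CTCLoss(blank=0, zero_infinity=True)
loss = ctc(log_probs, targets, input_lengths, target_lengths)   # all tokens predicted in parallel
print(loss.item())
```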

3 Tackling Data Scarcity

Compared with MT or ASR, ST has very little training data. There are two main lines of work:

  • Extended datasets and data augmentation: introduced in 3.1

  • Mining useful information from MT or ASR data:

    • Pre-training: introduced in 3.2

    • Knowledge distillation: introduced in 3.3

3.1 Data Augmentation

This is the most straightforward approach when training data is very sparse.

Expanding ST data
  • Directly use a high-quality MT model to translate the transcripts of large amounts of ASR data, producing pseudo ST triples. This method is also called "pseudo-labeling" or sequence-level knowledge distillation (SeqKD); a minimal pipeline sketch follows this list.

  • There is also bidirectional SeqKD, which combines forward SeqKD and reverse SeqKD and is very useful for bilingual end-to-end speech translation (bilingual E2E-ST) models.

  • Augmentation in the reverse direction, i.e., augmenting the speech side, is also possible: a text-to-speech (TTS) model converts the source-language text of MT corpora into synthetic speech.
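A minimal sketch of the pseudo-labeling pipeline, with a hypothetical `mt_translate` callable standing in for any high-quality MT system:

```python
from typing import Callable, Iterable

def expand_st_data(asr_corpus: Iterable[tuple[str, str]],
                   mt_translate: Callable[[str], str]) -> list[tuple[str, str, str]]:
    """Turn (speech_path, transcription) ASR pairs into (speech_path, transcription, pseudo_translation)
    ST triples by translating each transcription with an MT teacher (sequence-level KD)."""
    st_corpus = []
    for speech_path, transcription in asr_corpus:
        pseudo_translation = mt_translate(transcription)   # the MT model acts as the teacher
        st_corpus.append((speech_path, transcription, pseudo_translation))
    return st_corpus

# Toy usage with a dummy "MT system":
print(expand_st_data([("utt1.wav", "hello world")], lambda s: f"<de> {s}"))
```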

Speech augmentation
  • SpecAugment: operates on the filter-bank coefficients of the speech input, including time warping and masking of blocks of frequency channels and time steps (a minimal masking sketch follows this list).

  • SkinAugment: uses autoencoding speaker conversion to transform the original speaker's voice into another speaker's voice, which helps the model adapt to different speakers.

  • Data diversity: the value of the original speech translation data can be increased by segmenting and recombining it in various ways.
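A minimal sketch of the masking part of SpecAugment (time warping omitted; the mask-size parameters are illustrative assumptions):

```python
import torch

def spec_augment(feats: torch.Tensor, freq_mask: int = 27, time_mask: int = 40,
                 n_freq: int = 2, n_time: int = 2) -> torch.Tensor:
    """Zero out random frequency-channel blocks and time-step blocks of a (T, F) filter-bank matrix."""
    x = feats.clone()
    T, F = x.shape
    for _ in range(n_freq):                                   # frequency masking
        f = int(torch.randint(0, freq_mask + 1, (1,)))
        f0 = int(torch.randint(0, max(1, F - f), (1,)))
        x[:, f0:f0 + f] = 0.0
    for _ in range(n_time):                                   # time masking
        t = int(torch.randint(0, time_mask + 1, (1,)))
        t0 = int(torch.randint(0, max(1, T - t), (1,)))
        x[t0:t0 + t, :] = 0.0
    return x

augmented = spec_augment(torch.randn(300, 80))   # 300 frames x 80 filter-bank channels
```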

3.2 Pre-training

Pre-training has delivered very good results on many tasks in the AI field, and the current state-of-the-art E2E ST models almost all involve pre-training. It falls into two categories:

  • Separate pre-training: pre-training some of the model parameters, or pre-training different sub-modules on different tasks. Earlier work explored better pre-training methods to strengthen the encoder's semantic understanding, for example curriculum learning, the self-supervised Masked Acoustic Modeling (MAM) method, and the MAM-based FAT.

  • Joint pre-training: the model (including all encoder and decoder modules) participates in pre-training as a whole. Joint pre-training usually uses a multi-task learning framework (i.e., what is introduced in 2.2). Building a unified model in multi-task pre-training and then fine-tuning it on specific tasks can improve performance on multiple speech- and text-related tasks while reducing the cost of data annotation.

3.3 Knowledge Distillation

Knowledge Distillation is a technique for training deep neural networks in which one neural network (usually a large, complex model) teaches its knowledge to another neural network (usually a small, simple model). The goal of this process is to transfer the complexity and performance of the large model to the small model, allowing the small model to achieve similar performance as the large model while having lower computational and memory requirements.

Knowledge distillation (KD) is often used for model compression: the output of a larger, usually better-performing teacher model guides the learning of a student model, with the expectation that the student reaches similar performance. With limited data, how can ST performance be brought close to that of an MT teacher? There are several methods:

  • Use the ST model and the MT model to predict translation tokens separately, and use the MT model's predicted probabilities as a teacher to guide the ST output (a minimal word-level KD sketch follows this list).

  • Use the two-stream encoder framework (mentioned in 2.2; it bridges the representation gap between speech and text) and distill knowledge from its mixed speech-text sequence into the speech-to-text translation module. This helps the module better understand and translate the speech input.
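A minimal sketch of the first idea, word-level knowledge distillation against an MT teacher (generic formulation; temperature scaling omitted):

```python
import torch
import torch.nn.functional as F

def word_level_kd_loss(st_logits: torch.Tensor, mt_teacher_logits: torch.Tensor,
                       gold_tgt: torch.Tensor, pad_id: int = 0, alpha: float = 0.5) -> torch.Tensor:
    """Mix the usual NLL against the gold translation with a KL term pulling the ST student's
    token distributions toward the MT teacher's. Logits: (B, L, V); gold_tgt: (B, L)."""
    nll = F.cross_entropy(st_logits.transpose(1, 2), gold_tgt, ignore_index=pad_id)
    kd = F.kl_div(F.log_softmax(st_logits, dim=-1),
                  F.softmax(mt_teacher_logits, dim=-1).detach(),
                  reduction="batchmean")
    return (1 - alpha) * nll + alpha * kd

loss = word_level_kd_loss(torch.randn(2, 12, 8000), torch.randn(2, 12, 8000),
                          torch.randint(1, 8000, (2, 12)))
```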

3.4 Multilingual Training

Multilingual translation is a separate research category. As with MT, adding language indicators (e.g. <2de>, <2fr>) to the decoder is the most direct and efficient way to evolve from a bilingual ST to a multilingual ST. In fact, with limited data in each translation direction, training a many-to-many multilingual ST model is better than training a bilingual ST model alone, because the multilingual model can capture more pronunciation similarities between languages.
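A minimal sketch of the language-indicator trick (where exactly the tag is inserted, e.g. on the decoder side or the source side, varies across systems; this is only an illustration):

```python
def add_language_tag(target_tokens: list[str], tgt_lang: str) -> list[str]:
    """Prepend a target-language indicator such as <2de> or <2fr>, so that a single decoder
    (with the tags added to its vocabulary) can serve many target languages."""
    return [f"<2{tgt_lang}>"] + target_tokens

print(add_language_tag(["Hallo", "Welt"], "de"))            # ['<2de>', 'Hallo', 'Welt']
print(add_language_tag(["Bonjour", "le", "monde"], "fr"))   # ['<2fr>', 'Bonjour', 'le', 'monde']
```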

Current research on multilingual ST mainly focuses on:

  • In terms of pre-training, such as how to build a unified multi-language speech and text pre-training model and how to design various effective pre-training tasks

  • Efficient fine-tuning, such as

    • Fine-tuning only the parameters of the layer-norm and attention layers is more effective than fine-tuning all parameters; tuning just these specific layers can improve system performance.

    • Freeze the pre-trained ASR encoder and the mBART decoder, and fine-tune only the language-specific adapter modules to perform one-to-many speech translation, on the basis of a multilingual system with a parameter scale of only tens of millions. This has also proven effective for improving the performance of multilingual speech translation systems (a minimal parameter-freezing sketch follows this list).
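A minimal sketch of such parameter-freezing recipes (the keyword strings are hypothetical and depend on the actual model's parameter names):

```python
import torch.nn as nn

def freeze_except(model: nn.Module, trainable_keywords: tuple[str, ...]) -> None:
    """Freeze every parameter whose name does not contain one of the given keywords."""
    for name, param in model.named_parameters():
        param.requires_grad = any(key in name for key in trainable_keywords)

# Recipe 1 (hypothetical parameter names): fine-tune only layer-norm and attention parameters.
# freeze_except(st_model, ("layer_norm", "self_attn", "encoder_attn"))

# Recipe 2: freeze the pre-trained ASR encoder and mBART decoder, train only the adapters.
# freeze_except(st_model, ("adapter",))
```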

4 Tackling Application Issues

Most current research is still conducted with manually segmented audio in noise-free environments, but the requirements of practical applications also need to be addressed.

Real-time

Real-time (simultaneous) translation is achieved by trading off quality against latency. The key decision is whether to wait (READ) for more of the audio or to translate (WRITE) some tokens first; a minimal READ/WRITE policy sketch follows the list below. Specific techniques include:

  • Speech segmenter: segments speech in real time based on the CTC criterion.

  • Continuous Integrate-and-Fire (CIF) module: used to implement adaptive policies and make WRITE decisions at each firing step.

  • Cross Attention Augmented Transducer: Extended from RNN-T, jointly optimizes decoding strategy and translation quality by considering all possible READ and WRITE action paths
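As background for the READ/WRITE decisions above, a minimal sketch of a fixed wait-k policy (a generic illustration rather than one of the three techniques listed; `translate_prefix` is a hypothetical incremental decoder):

```python
from typing import Callable, Iterable, Iterator

def wait_k_policy(speech_chunks: Iterable[str], k: int,
                  translate_prefix: Callable[[list[str], int], str]) -> Iterator[str]:
    """READ the first k speech chunks, then alternate: after every further READ, WRITE one token."""
    seen: list[str] = []
    written = 0
    for chunk in speech_chunks:
        seen.append(chunk)                                # READ action
        if len(seen) >= k:
            written += 1
            yield translate_prefix(seen, written)         # WRITE action: emit the next target token
    # When the source ends, keep WRITING until the translation is complete (omitted here).

# Toy usage: "translate" by emitting placeholder tokens.
for tok in wait_k_policy(["c1", "c2", "c3", "c4"], k=2, translate_prefix=lambda s, n: f"y{n}"):
    print(tok)   # y1, y2, y3
```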

Segmentation

The ST model cannot handle very long speech sequences (e.g., movies), so the audio needs to be segmented into short pieces first. Specific techniques include:

  • Supervised Hybrid Audio Segmentation (SHAS): Uses Wav2vec2 and trains a classifier to predict segmentation locations supervised by manual segmentation information.

Named entity

That is, the translation of named entities. This is a critical requirement in real-world scenarios. Specific studies include:

  • One study found that the key cause of failures in person-name translation is the nationality (and hence pronunciation) of the referenced person, so a multilingual model was proposed to improve robustness to different pronunciations.

  • Two methods have also been designed to perform ST translation and NE recognition at the same time: an inline method that generates NE tags interleaved with the translation tokens, and a parallel method that predicts NE labels alongside the tokens.

Code-switching

Code-switching refers to speech translation where the input mixes different languages (usually two or more). For example, if a speaker uses both English and French in a conversation, the E2E ST model must handle this language mixture and convert it into text or speech output in a single target language (such as English or French).

  • Current work builds corpora for the CS task and explores the performance differences between cascaded systems and end-to-end models on it.

  • A unified Language Agnostic E2E ST model (LAST) has also been proposed.

Gender bias

This means addressing gender bias in translation and ensuring that speech recognition and translation systems do not introduce inequality or bias due to gender.

5 Future

This section discusses some topics for future research.

LLM (Large Language Model)

LLMs such as ChatGPT and Bloom have very powerful capabilities, so it is worth studying how to integrate the powerful generation abilities of LLMs into ST tasks, and how to incorporate speech data into LLM training.

  • As a first step, we can optimize the representation of speech so that it is comparable to the representation of text.

    • Treating discrete speech representations as a pseudo-language is a promising direction.

  • In addition, pre-training large-scale acoustics-aware LLMs is also a promising direction.

Multimodality

The explosion of multi-modal information such as text, images, speech, and video generated with artificial intelligence has pushed the ST field to explore more complex human-computer interaction (HCI) scenarios, such as speech-to-speech translation and video translation.

The explosive growth of multi-modal data has also made In-Context Learning (ICL) on multi-modal data a promising research direction: better understanding and exploiting the correlations between data of different modalities can enable more accurate and comprehensive multi-modal analysis and applications.

Multimodal pre-training has also been proven to be effective in many fields.

The information interaction and correlation between modalities also need to be explored, for example between a character's voice in a video and the image frames and prosodic context of that character in the same time span: cues such as tone, pitch, volume, speaking rate, and pauses convey the emotion and attitude of the language.

Origin blog.csdn.net/m0_56942491/article/details/134035089