-
Notifications
You must be signed in to change notification settings - Fork 48
Issues
is:issue state:open
is:issue state:open
Issue creation is restricted in this repository
Search results
[Auto-suggested] ReNikud: Audio-Supervised Hebrew Grapheme-to-Phoneme Conversion
auto-suggestedOpened by scripts/arxiv_watcher.py — needs maintainer triage.Opened by scripts/arxiv_watcher.py — needs maintainer triage.new-entrySuggests adding a new item.Suggests adding a new item.Status: Open.#92 In BinWang28/audio-ai-hub;[Auto-suggested] Low-Burden Data Augmentation for Dysarthric ASR via Zero-Shot Voice Cloning
auto-suggestedOpened by scripts/arxiv_watcher.py — needs maintainer triage.Opened by scripts/arxiv_watcher.py — needs maintainer triage.new-entrySuggests adding a new item.Suggests adding a new item.Status: Open.#91 In BinWang28/audio-ai-hub;[Auto-suggested] Transcript-Free Flow-Matching Text-to-Speech via Speech Feature Conditioning
auto-suggestedOpened by scripts/arxiv_watcher.py — needs maintainer triage.Opened by scripts/arxiv_watcher.py — needs maintainer triage.new-entrySuggests adding a new item.Suggests adding a new item.Status: Open.#90 In BinWang28/audio-ai-hub;[Auto-suggested] Exploring Pre-training Benefits on Phoneme Addition through Fine-tuning in Speech Synthesis
auto-suggestedOpened by scripts/arxiv_watcher.py — needs maintainer triage.Opened by scripts/arxiv_watcher.py — needs maintainer triage.new-entrySuggests adding a new item.Suggests adding a new item.Status: Open.#89 In BinWang28/audio-ai-hub;[Auto-suggested] Systematic Study of Dysarthric Speech Recognition: Spectral Features and Acoustic Models
auto-suggestedOpened by scripts/arxiv_watcher.py — needs maintainer triage.Opened by scripts/arxiv_watcher.py — needs maintainer triage.new-entrySuggests adding a new item.Suggests adding a new item.Status: Open.#88 In BinWang28/audio-ai-hub;[Auto-suggested] Improving End-to-End Speech Recognition for Dysarthric Speech through In-Domain Data Augmentation
auto-suggestedOpened by scripts/arxiv_watcher.py — needs maintainer triage.Opened by scripts/arxiv_watcher.py — needs maintainer triage.new-entrySuggests adding a new item.Suggests adding a new item.Status: Open.#87 In BinWang28/audio-ai-hub;[Auto-suggested] Investigating Human-Model Discrepancies in Speech Quality Assessment via Acoustic and Prosodic Perturbations
auto-suggestedOpened by scripts/arxiv_watcher.py — needs maintainer triage.Opened by scripts/arxiv_watcher.py — needs maintainer triage.new-entrySuggests adding a new item.Suggests adding a new item.Status: Open.#86 In BinWang28/audio-ai-hub;[Auto-suggested] PASQA: Pitch-Accent-Focused Speech Quality Assessment Model Trained on Synthetic Speech with Accent Errors
auto-suggestedOpened by scripts/arxiv_watcher.py — needs maintainer triage.Opened by scripts/arxiv_watcher.py — needs maintainer triage.new-entrySuggests adding a new item.Suggests adding a new item.Status: Open.#85 In BinWang28/audio-ai-hub;[Auto-suggested] BayLing-Duplex: Native Full-Duplex Speech Dialogue with a Single Autoregressive LLM
auto-suggestedOpened by scripts/arxiv_watcher.py — needs maintainer triage.Opened by scripts/arxiv_watcher.py — needs maintainer triage.new-entrySuggests adding a new item.Suggests adding a new item.Status: Open.#84 In BinWang28/audio-ai-hub;[Auto-suggested] Mask, Sample, Revise: A Revisable CTMC Inference Stack for Guided Discrete Flow Matching Text-to-Speech
auto-suggestedOpened by scripts/arxiv_watcher.py — needs maintainer triage.Opened by scripts/arxiv_watcher.py — needs maintainer triage.new-entrySuggests adding a new item.Suggests adding a new item.Status: Open.#83 In BinWang28/audio-ai-hub;[Auto-suggested] FoleyGenEx: Unified Video-to-Audio Generation with Multi-Modal Control, Temporal Alignment, and Semantic Precision
auto-suggestedOpened by scripts/arxiv_watcher.py — needs maintainer triage.Opened by scripts/arxiv_watcher.py — needs maintainer triage.new-entrySuggests adding a new item.Suggests adding a new item.Status: Open.#82 In BinWang28/audio-ai-hub;[Auto-suggested] Spatio-Temporal Audio Language Modeling for Dynamic Sound Sources
auto-suggestedOpened by scripts/arxiv_watcher.py — needs maintainer triage.Opened by scripts/arxiv_watcher.py — needs maintainer triage.new-entrySuggests adding a new item.Suggests adding a new item.Status: Open.#81 In BinWang28/audio-ai-hub;