Pseudo2Real: Task Arithmetic for Pseudo-Label Correction in Automatic Speech Recognition Paper • 2510.08047 • Published Oct 9, 2025 • 7
Awesome papers from 臺大李宏毅 (Hung-yi Lee) Collection Recent papers authored by Hung-yi Lee. Sorted by ID • 8 items • Updated Oct 24, 2025 • 17
SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models Paper • 2510.16917 • Published Oct 19, 2025 • 19
Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations Paper • 2510.16893 • Published Oct 19, 2025 • 17
SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models Paper • 2510.06917 • Published Oct 8, 2025 • 34
Game-Time: Evaluating Temporal Dynamics in Spoken Language Models Paper • 2509.26388 • Published Sep 30, 2025 • 26
TAU: A Benchmark for Cultural Sound Understanding Beyond Semantics Paper • 2509.26329 • Published Sep 30, 2025 • 2
AudioLens: A Closer Look at Auditory Attribute Perception of Large Audio-Language Models Paper • 2506.05140 • Published Jun 5, 2025
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment Paper • 2507.02768 • Published Jul 3, 2025 • 18
Analyzing Mitigation Strategies for Catastrophic Forgetting in End-to-End Training of Spoken Language Models Paper • 2505.17496 • Published May 23, 2025 • 2
STITCH: Simultaneous Thinking and Talking with Chunked Reasoning for Spoken Language Models Paper • 2507.15375 • Published Jul 21, 2025 • 30
Mitigating Object Hallucinations via Sentence-Level Early Intervention Paper • 2507.12455 • Published Jul 16, 2025 • 8
Einstein Fields: A Neural Perspective To Computational General Relativity Paper • 2507.11589 • Published Jul 15, 2025 • 9
Evaluations of Large Audio-Language Models (LALMs) Collection This collection contains papers for various LALM evaluation frameworks. • 45 items • Updated Jul 17, 2025 • 4