
Chih-Kai Yang

zenyn

AI & ML interests

None yet

Recent Activity


upvoted a paper 2 months ago

Pseudo2Real: Task Arithmetic for Pseudo-Label Correction in Automatic Speech Recognition

Paper • 2510.08047 • Published Oct 9, 2025 • 7

upvoted a collection 2 months ago

Awesome papers from 臺大李宏毅 (Hung-yi Lee)

Collection

Recent papers authored by Hung-yi Lee. Sorted by ID • 8 items • Updated Oct 24, 2025 • 17

upvoted a paper 2 months ago

SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models

Paper • 2510.16917 • Published Oct 19, 2025 • 19

commented on a paper 2 months ago

SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models

Paper • 2510.16917 • Published Oct 19, 2025 • 19

upvoted a paper 2 months ago

Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations

Paper • 2510.16893 • Published Oct 19, 2025 • 17

commented on a paper 2 months ago

Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations

Paper • 2510.16893 • Published Oct 19, 2025 • 17

authored 2 papers 2 months ago

SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models

Paper • 2510.16917 • Published Oct 19, 2025 • 19

Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations

Paper • 2510.16893 • Published Oct 19, 2025 • 17

upvoted 3 papers 3 months ago

authored a paper 4 months ago

AudioLens: A Closer Look at Auditory Attribute Perception of Large Audio-Language Models

Paper • 2506.05140 • Published Jun 5, 2025

upvoted a paper 5 months ago

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

Paper • 2507.02768 • Published Jul 3, 2025 • 18

authored 2 papers 5 months ago

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

Paper • 2507.02768 • Published Jul 3, 2025 • 18

Analyzing Mitigation Strategies for Catastrophic Forgetting in End-to-End Training of Spoken Language Models

Paper • 2505.17496 • Published May 23, 2025 • 2

upvoted 3 papers 6 months ago

STITCH: Simultaneous Thinking and Talking with Chunked Reasoning for Spoken Language Models

Paper • 2507.15375 • Published Jul 21, 2025 • 30

Mitigating Object Hallucinations via Sentence-Level Early Intervention

Paper • 2507.12455 • Published Jul 16, 2025 • 8

Einstein Fields: A Neural Perspective To Computational General Relativity

Paper • 2507.11589 • Published Jul 15, 2025 • 9

upvoted a collection 6 months ago

Evaluations of Large Audio-Language Models (LALMs)

Collection

This collection contains papers for various LALM evaluation frameworks. • 45 items • Updated Jul 17, 2025 • 4

updated a collection 6 months ago

Evaluations of Large Audio-Language Models (LALMs)

Collection

This collection contains papers for various LALM evaluation frameworks. • 45 items • Updated Jul 17, 2025 • 4
