I have several years of research experience in speech signal processing, focusing on spatial hearing and speech enhancement in challenging environments. At JAIST, I worked on monaural 3D sound localization using HRTF features under Prof. Masashi Unoki. For my Ph.D. at Nagoya University with Prof. Tomoki Toda, my main topic was directional target speaker extraction (TSE) in noisy and underdetermined conditions, resulting in publications such as TASLP. My future goal is to extend statistical signal processing (such as independent/low-rank and spatial covariance modeling) by coupling it with DNN priors and latest LLM-based context, aiming for identifiable, sample-efficient, and real-time/low-latency streaming speech enhancement.
Research Areas: Spatial audio, Speech signal processing, Speech enhancement/separation, Target speaker extraction, Deep learning
2026.4- | Tokyo, Japan
Specially Appointed Researcher: Research on speech enhancement-related topic
2025.5-2026.4 | Shanghai, China
Research Engineer: Research on robust multi-task speech interaction system in challenge environments; Research on speech llm and device agent
2022.3-2022.4 | Tokyo, Japan
Winter internship: Research on robust speech separation
2021.8-2021.10 | Kyoto, Japan
Summer internship: Research on robust speech recognition
2021.4-2025.3 | Nagoya, Japan
Doctor's degree: Computer Science, focus on target speaker extraction in challenge environments
Toda Laboratory of speech
2018.10-2021.3 | Ishikawa, Japan
Master's degree: Computer Science, focus on HRTF-based DOA estimation and spatial hearing
Akagi & Unoki Laboratory of speech
2016.9-2018.8 | Beijing, China
Master's course: Fluid Mechanics (Dropout due to lack of interest)
2012.9-2016.6 | Hangzhou, China
BS degree: Measurement and Control Technology and Instruments