Satellite Events

The Organising Committee of Interspeech 2024 is proud to announce the following satellite events which all have received ISCA approval.

Some Satellite Events offer a co-registration option and are listed on the Interspeech 2024 registration form. Their fees are listed in the grid below followed by Satellite Events where delegates can register directly with the organisers.

Satellite Events that will be available for booking through the Interspeech 2024 registration form:

Description

1. 2024 CHiME Workshop on Speech Processing in Everyday Environments

Date: September 6, 2024

Time: 8:00 – 17:00

Description: This one-day workshop, a satellite event of Interspeech 2024, will bring together researchers from the fields of speech enhancement, speech and speaker recognition, computational hearing, and machine learning to discuss the robustness of speech processing in everyday environments, i.e., real-world conditions with acoustic clutter, where the number and nature of the sound sources is unknown and changing over time.

Web page: https://www.chimechallenge.org/

 

 

 

Non-member registration: € 70
Non-member student registration: € 50

 

 

 

Fee ISCA Member: € 60
Fee ISCA Student: € 40

2. Synthetic Data’s Transformative Role in Foundational Speech Models

Date: August 31, 2024

Time: 8:30 – 18:00

Description:

The field of speech technology has undergone a revolution in recent years, driven by the rise of foundational models underpinning automatic speech recognition (ASR) and text-to-speech synthesis (TTS). However, a key challenge persists: the reliance on large quantities of real human speech data, raising privacy concerns and regulatory hurdles. Synthetic data offers a groundbreaking alternative, empowering researchers to develop speech models that are ethical, inclusive, and adaptable to diverse scenarios. While the use of synthetic data  has been studied extensively, its role in foundational models has not been explored. This workshop will bring together research under the following themes specifically targeted to this era of foundational models that have been pre-trained on every available and usable data source. The workshop will be organized around the key themes of generative modeling, privacy, bridging data gap for low resource languages, domain robustness and privacy. We invite submissions on novel techniques for generating realistic and diverse speech and text datasets, multimodal modality matching to make effective use of synthetic data, methods for fine-tuning/adapting  foundational models with synthetic data to improve performance, comparative studies evaluating the effectiveness of synthetic data versus real data in training models for speech and language processing and applications of synthetic data for language resource development in low-resource settings (domain specific challenges).

Contact: syndata4genai@gmail.com

Web page: https://syndata4genai.org/#details

 

 

 

 

Regular registration: € 50

3. Young Female Researchers in Speech Workshop (YFRSW)

Date: August 31, 2024

Time: 9:00 – 17:00

Location: Kos Island, Greece

Description: YFRSW is a workshop for female* Bachelor’s and Master’s students currently working in speech science and technology. The workshop aims to promote interest in research in our field among women* who have not yet committed to pursuing a PhD in speech science or technology, but who have already gained research experience at their universities through individual or group projects.
*The workshop is open for marginalized genders, including women, as well as non-binary and gender non-conforming people who are comfortable in a space that is centered on women’s experiences in the speech science and technology community. We aim to offer an inclusive and accessible program. If you are unsure if this workshop is for you, please don’t hesitate to reach out to us!

Contact: Iona Gessinger, Leda Sari, and Georgia Maniati (youngfemaleresearchersinspeech@gmail.com)

Web page: https://sites.google.com/view/yfrsw-2024

 

 

 

 

Registration for this event is free of charge

4. 3rd COG-MHEAR Audio-Visual Speech Enhancement Challenge (AVSEC-3)

Date: September 1, 2024

Time: Half-Day Afternoon Workshop 

Description: The Audio-Visual Speech Enhancement Challenge (AVSEC) brings together researchers from interdisciplinary fields working on audio-visual speech technologies and hearing research. Building upon the two previous successful editions of the challenge, we expect AVSEC-3 to continue advancing audio-visual speech technologies.
The workshop encompasses a set of oral and poster presentations where participants can actively discuss their ongoing research and present their systems. Invited keynotes will provide a space to reflect on the scope and limitations of current speech and hearing technologies. Furthermore, results of the third edition of the challenge will be announced during the workshop.

Web page: https://challenge.cogmhear.org/#/

 

 

 

Non-member registration: € 45
Non-member student registration: € 30

 

 

 

Fee ISCA Member: € 40
Fee ISCA Student: € 25

5. 4th SPSC Symposium

Date: September 6, 2024

Time: 9:00 – 17:00

Description: The fourth edition of the Symposium on Security & Privacy in Speech Communication (SPSC) focuses on speech and voice through which we express ourselves. As speech communication can be used to command virtual assistants to transport emotion or to identify oneself, the symposium encourages participants to give answers on how we can strengthen security and privacy for speech representation types in user-centric human/machine interaction. The VoicePrivacy initiative is spearheading the effort to develop privacy preservation solutions for speech technology. It aims to consolidate the newly formed community to develop the task and metrics and to benchmark progress in anonymization solutions using common datasets, protocols, and metrics. This edition focuses on preserving the emotional state, which is the key paralinguistic attribute in many real-world applications of voice anonymization. All the participants are encouraged to submit to the SPSC Symposium papers related to their challenge entry, as well as other scientific papers related to speaker anonymization and voice privacy. For the general SPSC Symposium, we welcome contributions to related topics, as well as progress reports, project dissemination, or theoretical discussions and “work in progress”. In addition, guests from academia, industry and public institutions as well as interested students are welcome to attend the workshop without having to make their own contribution.

Web page for SPSC Symposium: http://www.spsc2024.mobileds.de
Web page for VoicePrivacy Challenge: https://www.voiceprivacychallenge.org

Please see the SPSC Symposium website for more details regarding the satellite workshop event and submission deadlines, and the VoicePrivacy Challenge website for more details about the challenge, system development, and evaluation plan.

 

 

 

Non-member registration: € 77 
Non-member student registration: € 46

 

 

 

Fee ISCA Member: € 67
Fee ISCA Student: € 36 

6. Vocal Interactivity in-and-between Humans, Animals and Robots, VIHAR 2024

Date: September 6, 2024

Time: 9:00 – 12:30 or 09:00 – 18:00

Description: Taking place in Kos, Greece on 6 September 2024, VIHAR-2024 aims to bring together researchers studying vocalization and speech-based interaction in-and-between humans, animals and robots from a variety of different fields. VIHAR-2024 will provide an opportunity to share and discuss theoretical insights, best practices, tools and methodologies, and to identify common principles underpinning vocal behavior in a multi-disciplinary environment.

Web page: https://vihar-2024.vihar.org/
Email address for contact: vihar2024@earthspecies.org

 

 

 

Regular registration: € 20

7. 7th International Workshop on Speech, Music and Mind 2024 (SMM24)

(Detecting and Influencing Mental States with Audio)

Date: September 9, 2024

Time: TBC, Virtual

Description: After six successful editions of SMM (SMM18 to SMM23) we are pleased to announce the 7th Edition of the International Workshop on Speech, Music and Mind 2024 (SMM24): Detecting and Influencing Mental States with Audio.

This one-day, satellite workshop of Interspeech 2024 will bring together, from both academia and industry, communities interested in understanding the interplay between the mind and sound/music/speech. The focus of this workshop is to further the state of the art in detecting and influencing mental states with audio. Applications to diverse contexts, with emphasis on multi-modal approaches with diverse applications across culture, languages and music genres, are of special interest.

The workshop will be held as a virtual event on (Monday) 09th September 2024, a few days after Interspeech 2024 commences. Details of the virtual event platform and registration (free, but necessary) will be announced a fortnight before the workshop date. We invite you to submit relevant contributions to the workshop before the deadlines.

Important Dates
• July 12, 2024 (Friday): Submission Deadline
• August 02, 2024: Acceptance Notification
• August 30, 2024: Camera Ready Submission
• September 05, 2024: Registration Deadline
• September 09, 2024 (Monday): Workshop Date

Contact Us
To learn more, don’t hesitate to get in touch
• Meghna Pandharipande (meghna.pandharipande@tcs.com)
• Vinoo Alluri (vinoo.alluri@iiit.ac.in)
• Subhrojyoti Chaudhuri (subhrojyoti.c@tcs.com)

Web page: https://smm24.iiit.ac.in/index.html

8. ASVspoof workshop

Date: August 31, 2024

Time: Half-day workshop

Description:

The automatic speaker verification spoofing and countermeasures (ASVspoof) challenge series is a community-led initiative that aims to promote the consideration of spoofing and speech deepfakes in addition to the development of countermeasures. ASVspoof 5 is the fifth edition in a series of previously biennial, competitive challenges. While previous challenge editions have incorporated distinct logical access (LA), physical access
(PA), and speech deepfake (DF) scenario, ASVspoof 5 has evolved to encompass: (i) speech deepfake detection, and (ii) spoofing-robust automatic speaker verification. With the emergence of advanced techniques like zero-shot and few-shot speech synthesis and voice conversion, the need for robust detection methods has never been greater. The ASVspoof Workshop at Interspeech 2024 will provide an excellent platform for technical discussions, challenge analysis, and engage with experts and peers to discuss the latest advancements and challenges in the field.

Web page: https://www.asvspoof.org/

Email address for contact: info@asvspoof.org

Regular registration: € 80 
Student registration: € 65

* More details will follow