Juliette Millet (LLF, Université de Paris and CoML team, LSCP, ENS Paris) and Ewan Dunbar (Université Paris Diderot)
In this paper, we present a dataset and methods for comparing speech processing models and humans on a phone discrimination task. We provide Perceptimatic, an open dataset consisting of French and English speech stimuli, together with the results of 91 English- and 93 French-speaking listeners. The stimuli test a wide range of French and English contrasts and are extracted directly from corpora of natural running read speech used in the 2017 Zero Resource Speech Challenge. We provide a method for comparing humans' perceptual space with models' representational spaces, and we apply it to models previously submitted to the challenge, as well as to several reference systems. We show that the topline used for the challenge, an HMM-GMM phone recognition system, while discriminating phones well, does not produce a representational space close to humans' perceptual space.