Today, AI and deep learning is a major research area. Microsoft, Google, Facebook, and Amazon are a few names that are focusing on building deep learning systems. AI assistants including Alexa, Siri, Cortana, and Google Home have already become a household name today. In the past couple of years, Amazon has sold more than 11 million Amazon Echo devices and the numbers are growing.
To build and test AI-assistant systems, apps, and devices, developers must have a good source of voice sample databases and companies like Google, Apple, Amazon, and Microsoft have the advantage of having a large collection of recorded voice samples.
Google Brain Team announced via a blog post that it has open-sourced its command dataset for anyone to download it. From the blog post:
“The TensorFlow and AIY teams have created the Speech Commands Dataset, and used it to add training* and inference sample code to TensorFlow. The dataset has 65,000 one-second long utterances of 30 short words, by thousands of different people, contributed by members of the public through the AIY website. It’s released under a Creative Commons BY 4.0 license, and will continue to grow in future releases as more contributions are received. The dataset is designed to let you build basic but useful voice interfaces for applications, with common words like “Yes”, “No”, digits, and directions included. The infrastructure we used to create the data has been open sourced too, and we hope to see it used by the wider community to create their own versions, especially to cover underserved languages and applications.”
You may download the dataset sample here:
Download: Speech Commands Dataset download (1.4GB)