Audio input and voice recognition on ESP8266 via Google

Audio input and voice recognition on ESP8266 via Google #53521

By Oleg Gerasimov - Sun Aug 21, 2016 5:02 pm

User mini profile
View full profile

Oleg Gerasimov

Posts: 7
Joined: Thu Aug 18, 2016 11:01 am

Status: Off-line

- Sun Aug 21, 2016 5:02 pm #53521 Hi!

At the end of 2014, then esp8266 has been just arrived, i decided to make universal IoT device with speech recognition, speaker.

Unfortunately the esp8266 hardware is not friendly for microphone connection. I've tried to use internal ADC, but no way. There are option to use internal I2S, but it is multiplexed with UART, and there are no working code example till now is available.

The next step - use external MCU with good sigma-delta ADC. I've tried use MSP430 for audio capture, and streaming samples to ESP8266 via SPI. In this config i've recorded some audio in first time. But MSP430 is too slow, and i've faced to serious performance problems with SPI protocol. Also quality of sound was poor. And if WiFi transmit occurs, then voice is hided by high amplitude noise.

Finally, i managed to use STM32F105 and PDM microphone for audio capture, and then stream audio via spi to ESP8266. The schematics https://github.com/wiieva/schematics

This setup give good sound quality, and pretty stable voice recognition by Google.

Here is sample video

STM32 code are do all hard work. It's captures PDM signal, filter it to aquire PCM, and then encode it to SPEEX format, which is suitable for Google voice recognition. I also tried RAW WAV audio, it works to, but it's less stable, due to requirement bigger buffer sizes and sensible to network delays.

The ESP8266 sources:
Arduino sketch: https://github.com/wiieva/examples/blob ... ro.cpp#L77
SPI protocol implementation: https://github.com/wiieva/wiieva-varian ... Wiring.cpp

Here is STM32 sources
Audio capture and encoding: https://github.com/wiieva/stm32aio/blob ... audio_in.c
SPI protocol implementation: https://github.com/wiieva/stm32aio/blob ... o_server.c

Here is schematics:
https://github.com/wiieva/schematics

PS, If you are interested in UI: uGFX library is used: http://www.ugfx.io . It's really amazing too. As you can see it's 100% compatible with esp8266 and Arduino environment.

ESP8266 Community Forum

Audio input and voice recognition on ESP8266 via Google

Audio input and voice recognition on ESP8266 via Google #53521

THESE FORUMS ARE CLOSED

Need to improve battery life of my ESP8266 Temp Hum sensor.

Sonoff Basic R2 unresponsive after upgrade

Best 3g module for IoT

NodeMCU ESP 8266 - 340G Driver issue

Lolin D1 Mini V4 problems with analog input reading

AP: Limited or no Connectivity.

Communication ESP 8266

ESP8266 web view

Firware ESP 8266

Speed up connection to WiFi after power on reset

NodeMCU: Failed uploading: uploading error: exit status 2

How to Set an ESP8266 NodeMCU Access Point for a Web Server

How to Post on Twitter using an ESP8266

Build a Water Level Control System Using ESP8266 NodeMCU

ESP32 [LOLIN WEMOS D1 32 Weak WiFi

Running CPP code using ESP8266_RTOS_SDK

Is my Wemos D1 Mini busted? Why is it doing this?

NODEMCU (ESP-12E) SOFT AP FAILURE?

WEMOS D1 Mini / esptool / Failed to connect - Timed out

Follow on Twitter @ESP8266COM